Show HN: ColBERT Build from Sentence Transformers (opens in new tab)

(github.com)

66 pointsraphaelty2y ago18 comments

18 comments

15 comments · 6 top-level

tinyhouse2y ago· 2 in thread

Looks cool. A couple of questions: 1. Does it support fine tuning with different losses? For example, where you don't need to provide negatives and it uses the other examples in the batch as negatives 2. Can you share inference speed info? I know that Colbert should be slow since it creates many embeddings per passage

raphaeltyOP2y ago

Hi, there is a single loss right now, but I plan to add some Sentence Transformers losses. ColBERT is slow as a retriever, but is quite efficient as a Ranker on GPU (way faster than cross-encoder). I plan to release pre-trained checkpoints on HuggingFace with benchmarks using BEIRand inference speed info.

aashu_dwivedi2y ago

Do you mean it's faster when the embeddings are pre-computed or is it faster when the embeddings are computed on the fly as well. Also, what's the recommended way to store the colbert embeddings as, because of the 2d nature of the embeddings it's not practical to store in a vector database.

1 more reply

barefeg2y ago· 2 in thread

Do you need to have the same number of positive and negatives? Is there any meaning of pairing a positive an a negative in the triplet?

raphaeltyOP2y ago

It's because of the loss of the model. I ask the model to produce a higher similarity between the query and the positive document rather than between the query and the negative document. I'll add more losses soon so there are more choices

alexmolas2y ago

is the loss the usual lambdarank?

vorticalbox2y ago· 2 in thread

Is a negative document one that doesn't match the query?

raphaeltyOP2y ago

Yes exactly

vorticalbox2y ago

Does that help much in terms of training?

2 more replies

ramoz2y ago· 1 in thread

Anecdote: neural-cherche seems useful as I have analysts creating positive & negative feedback data (basically thumbs-up/down signals) that we will use to fine tune retrieval models.

Assuming not much effort is required to make this work for similar models? (i.e. BGE)

raphaeltyOP2y ago

Nice, it might already be compatible with BGE, I'll try it and add it to the documentation soon

kamranjon2y ago· 1 in thread

What sort of high level user facing feature could you build with this?

raphaeltyOP2y ago

You could recommend content based on user query, tag content produced by the user, use colbert as part of a ChatBot to show evidences to the user questions

espadrine2y ago· 1 in thread

I like the inclusion of both positive and negative examples!

Do you have advice for how to measure the quality of the finetuning beyond seeing the loss drop?

raphaeltyOP2y ago

In the documentation there is an evaluation module with detailed informations. The idea is to gather relevant pairs of queries and documents that are not part of the training set. Then the idea is to measure, using various metrics, how your model can retrieve accurate documents.

j / k navigate · click thread line to collapse

18 comments

15 comments · 6 top-level

tinyhouse2y ago· 2 in thread

raphaeltyOP2y ago

aashu_dwivedi2y ago

1 more reply

barefeg2y ago· 2 in thread

Do you need to have the same number of positive and negatives? Is there any meaning of pairing a positive an a negative in the triplet?

raphaeltyOP2y ago

alexmolas2y ago

is the loss the usual lambdarank?

vorticalbox2y ago· 2 in thread

Is a negative document one that doesn't match the query?

raphaeltyOP2y ago

Yes exactly

vorticalbox2y ago

Does that help much in terms of training?

2 more replies

ramoz2y ago· 1 in thread

Anecdote: neural-cherche seems useful as I have analysts creating positive & negative feedback data (basically thumbs-up/down signals) that we will use to fine tune retrieval models.

Assuming not much effort is required to make this work for similar models? (i.e. BGE)

raphaeltyOP2y ago

Nice, it might already be compatible with BGE, I'll try it and add it to the documentation soon

kamranjon2y ago· 1 in thread

What sort of high level user facing feature could you build with this?

raphaeltyOP2y ago

You could recommend content based on user query, tag content produced by the user, use colbert as part of a ChatBot to show evidences to the user questions

espadrine2y ago· 1 in thread

I like the inclusion of both positive and negative examples!

Do you have advice for how to measure the quality of the finetuning beyond seeing the loss drop?

raphaeltyOP2y ago

j / k navigate · click thread line to collapse