simonw · 2y ago · 0 points · 0 comments

I have an explanation of RAG in the context of embeddings here: https://simonwillison.net/2023/Oct/23/embeddings/#answering-...

Grimburger · 2y ago · 2 in thread

You could just sum it up for us all rather than do a divert to your blog?

It's Retrieval Augmented Generation btw.

To quote:

> The key idea is this: a user asks a question. You search your private documents for content that appears relevant to the question, then paste excerpts of that content into the LLM (respecting its size limit, usually between 3,000 and 6,000 words) along with the original question.

> The LLM can then answer the question based on the additional content you provided.
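The flow in that quote can be sketched in a few lines of Python. This is only a rough illustration under stated assumptions: `retrieve` ranks documents by plain word overlap as a stand-in for the embedding search the linked post describes, `build_prompt` and its `max_words` budget are hypothetical names, the sample documents are made up, and no actual LLM is called; the sketch stops at building the prompt you would send.

```python
import re


def tokenize(text):
    """Lowercase a string and return its set of word tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(question, documents, top_k=2):
    """Rank documents by shared words with the question.

    Assumption: word overlap stands in for embedding similarity here.
    """
    q = tokenize(question)
    scored = [(len(q & tokenize(doc)), doc) for doc in documents]
    scored = [pair for pair in scored if pair[0] > 0]  # drop irrelevant docs
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in scored[:top_k]]


def build_prompt(question, excerpts, max_words=3000):
    """Paste excerpts until the word budget is spent, then add the question."""
    budget = max_words - len(question.split())
    kept = []
    for excerpt in excerpts:
        n = len(excerpt.split())
        if n > budget:
            break  # respect the model's size limit
        kept.append(excerpt)
        budget -= n
    context = "\n\n".join(kept)
    return f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"


# Made-up "private documents" for illustration only.
documents = [
    "The office wifi password is rotated on the first Monday of each month.",
    "Expense reports are due by the fifth business day of the month.",
    "The cafeteria serves lunch from 11:30 to 14:00.",
]
question = "When are expense reports due?"
excerpts = retrieve(question, documents)
prompt = build_prompt(question, excerpts)
print(prompt)
```

The resulting `prompt` string is what gets sent to the LLM, which then answers from the pasted context rather than from its training data alone.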

simonw (OP) · 2y ago

> You could just sum it up for us all rather than do a divert to your blog?

Why? Have links gone out of fashion?

I even linked directly to the relevant section rather than linking to the top of the page.

The paper that coined the term used the hyphen, though I think I prefer it without: https://arxiv.org/abs/2005.11401

Grimburger · 2y ago

> Have links gone out of fashion?

Yes.

You wrote far more words than needed to answer the comment; I did it for you instead.
