Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
spott
1y ago
0 comments
Save
Share
They aren’t using the prompt caching on the query side, only on the embedding side… so you cache the document in the context window when ingesting it, but not during retrieval.
0 comments
1 comments · 1 top-level
top
newest
oldest
KTibow
1y ago
It seems a little odd to make multiple requests instead of using one request to create all the context for all the chunks.
j
/
k
navigate · click thread line to collapse