Skip to content
Better HN
Speculative sampling: LLMs writing a lot faster using smaller LLMs | Better HN