Count tokens used by GPT-4 and Llama for large texts (> 50k characters)
(huggingface.co)
2 points
xenova
2y ago
1 comment
xenova
OP
2y ago
This web app fixes the two main problems with OpenAI's tokenizer playground: (1) it is capped at 50k characters, and (2) it does not support the GPT-4/GPT-3.5 tokenizers.
Everything runs in-browser thanks to Transformers.js.