undefined | Better HN

0 pointsInsanity2mo ago0 comments

That's an interesting idea. But IMO the real 'token saver' isn't in the language keywords but it's in the naming of things like variables, classes, etc.

There are languages that are already pretty sparse with keywords. e.g in Go you can write 'func main() string', no need to define that it's public, or static etc. So combining a less verbose language with 'codegolfing' the variables might be enough.

0 comments

danielvaughn2mo ago

I'm not an expert in LLMs, but I don't think character length matters. Text is deterministically tokenized into byte sequences before being fed as context to the LLM, so in theory `mySuperLongVariableName` uses the same number of tokens as `a`. Happy to be corrected here.

fragmede2mo ago

Running it through https://platform.openai.com/tokenizer "mySuperLongVariableName" takes 5 tokens. "a", takes 1. mediumvarname is 3 though. "though" is 1.

coderenegade2mo ago

You're more likely to save tokens in the architecture than the language. A clean, extensible architecture will communicate intent more clearly, require fewer searches through the codebase, and take up less of the context window.

gf0002mo ago

Go is one of the most verbose mainstream programming languages, so that's a pretty terrible example.

InsanityOP2mo ago

Maybe not a perfect example but it’s more lightweight than Java at least haha

gf0002mo ago

If by lightweight you mean verbosity, then absolutely no.

In go every third line is a noisy if err check.

LtWorf2mo ago

Well LLMs are made to be extremely verbose so it's a good match!

nineteen9992mo ago

I think there's a huge range here - ChatGPT to me seems extra verbose on the web version, but when running with Codex it seems extra terse.

Claude seems more consistently _concise_ to me, both in web and cli versions. But who knows, after 12 months of stuff it could be me who is hallucinating...

giancarlostoro2mo ago

To you maybe, but Go is running a large amount of internet infrastructure today.

gf0002mo ago

How does that relate to Go being a verbose language?

giancarlostoro2mo ago

Its not verbose to some of us. It is explicit in what it does, meaning I don't have to wonder if there's syntatic sugar hiding intent. Drastically more minimal than equivalent code in other languages.

1 more reply

j / k navigate · click thread line to collapse

0 comments

danielvaughn2mo ago

fragmede2mo ago

Running it through https://platform.openai.com/tokenizer "mySuperLongVariableName" takes 5 tokens. "a", takes 1. mediumvarname is 3 though. "though" is 1.

coderenegade2mo ago

gf0002mo ago

Go is one of the most verbose mainstream programming languages, so that's a pretty terrible example.

InsanityOP2mo ago

Maybe not a perfect example but it’s more lightweight than Java at least haha

gf0002mo ago

If by lightweight you mean verbosity, then absolutely no.

In go every third line is a noisy if err check.

LtWorf2mo ago

Well LLMs are made to be extremely verbose so it's a good match!

nineteen9992mo ago

I think there's a huge range here - ChatGPT to me seems extra verbose on the web version, but when running with Codex it seems extra terse.

Claude seems more consistently _concise_ to me, both in web and cli versions. But who knows, after 12 months of stuff it could be me who is hallucinating...

giancarlostoro2mo ago

To you maybe, but Go is running a large amount of internet infrastructure today.

gf0002mo ago

How does that relate to Go being a verbose language?

giancarlostoro2mo ago

1 more reply

j / k navigate · click thread line to collapse