
spywaregorilla · 2y ago · 0 points

It is not computing the right ones. There is no debate there. These algorithms do not map inputs to outputs reliably.


3 comments · 1 top-level

famouswaffles · 2y ago · 2 in thread

The point is that this is just blatantly wrong.

>Asking a token prediction model to do math is like asking a human to do math without doing the math. What's 9 times 9? I can tell you it's 81 from sheer memorization. I can probably invoke 9 x 9 + 1 = 82 without needing to do any calculation either. But if you ask me 32 * 64, that's very difficult to do without doing calculations. Implicitly doing math is not sensible.

It's not about memorization. It's trivial to test on instances that would not appear in training and see that GPT-4 does better than a human attempting the same problem without a tool or pad.
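A rough sketch of what such a test could look like. Everything here is an assumption for illustration: `ask_model` is a hypothetical callable that takes a prompt string and returns the model's text reply (not a real API), and random 6-digit operands are only a cheap way to make verbatim training-set overlap unlikely, not a guarantee.

```python
import random

def make_problems(n, digits=6, seed=0):
    """Generate n random multiplication problems with `digits`-digit operands."""
    rng = random.Random(seed)
    lo, hi = 10 ** (digits - 1), 10 ** digits - 1
    return [(rng.randint(lo, hi), rng.randint(lo, hi)) for _ in range(n)]

def score(problems, ask_model):
    """Return the fraction of problems where the reply equals the exact product."""
    correct = 0
    for a, b in problems:
        # `ask_model` is hypothetical: any function from prompt text to reply text.
        reply = ask_model(f"What is {a} * {b}? Answer with the number only.")
        # Drop surrounding whitespace and thousands separators before comparing.
        cleaned = reply.strip().replace(",", "")
        if cleaned == str(a * b):
            correct += 1
    return correct / len(problems)

def exact_stub(prompt):
    """Stand-in 'model' that computes exactly; only shows that the harness runs."""
    expr = prompt.split("What is ")[1].split("?")[0]
    a, b = expr.split(" * ")
    return str(int(a) * int(b))
```

Swapping `exact_stub` for a function that calls an actual model, and giving a human the same problems without a pad, would yield two accuracy numbers to compare.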

The biggest problem with LLM arithmetic is tokenization. https://arxiv.org/abs/2310.02989
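To illustrate the kind of problem meant here, a toy sketch. It assumes a hypothetical tokenizer that splits digit strings into chunks of up to three digits; real tokenizers differ, and none of this is taken from the linked paper.

```python
def chunk_left(number, size=3):
    """Split the digits into chunks of `size`, starting from the left."""
    s = str(number)
    return [s[i:i + size] for i in range(0, len(s), size)]

def chunk_right(number, size=3):
    """Split the digits into chunks of `size`, aligned to the right (place value)."""
    s = str(number)
    head = len(s) % size
    chunks = [s[:head]] if head else []
    chunks += [s[i:i + size] for i in range(head, len(s), size)]
    return chunks

print(chunk_left(1234567))   # ['123', '456', '7']
print(chunk_right(1234567))  # ['1', '234', '567']
print(chunk_left(1000))      # ['100', '0']
print(chunk_right(1000))     # ['1', '000']
```

With left-aligned chunks, the same token (say `'123'`) stands for a different place value depending on how many digits follow it, so digit-wise carrying has no stable alignment to latch onto. Right-aligned chunks keep each token tied to a fixed power of ten.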

Other than that, the algorithms it uses for arithmetic will continue to converge during training until they are correct.

spywaregorilla (OP) · 2y ago

I'm not claiming it's about memorization. I have not memorized 9 x 9 + 1 is 82. I can simply do it without invoking any calculation. That is a close equivalent to trying to do math with token prediction.

A human working "without a pad" can still be doing mental math, and doing mental math is not the same thing as trying to intuit an answer.

Token predictions will not converge on correct arithmetic algorithms.

famouswaffles · 2y ago

>I'm not claiming it's about memorization. I have not memorized 9 x 9 + 1 is 82. I can simply do it without invoking any calculation.

>What's 9 times 9? I can tell you it's 81 from sheer memorization. I can probably invoke 9 x 9 + 1 = 82 without needing to do any calculation

Performing math the way you suggest is little more than memorization, except you've memorized a different chain in the process.

>That is a close equivalent to trying to do math with token prediction.

This isn't going to become true no matter how much you repeat it. You've consistently made assertions that are trivially proven false.

>Token predictions will not converge on correct arithmetic algorithms.

Yes they will. They literally will. This isn't some debate. This is something that has been researched.

https://www.alignmentforum.org/posts/N6WM6hs7RQMKDhYjB/a-mec...

You think you have an understanding of Language Models and token prediction. Unfortunately you don't.
