undefined | Better HN

0 pointsseba_dos13d ago0 comments

Genuine question - what else did you expect?

0 comments

For it to follow the instructions I had for it. Call me naive and stupid for thinking the 1M context window on the brand new model would actually, y'know, work.

quesera3d ago

That's a bit anthropomorphic though.

When LLMs become able to reflectively examine their own premises and weight paths, they will exceed the self-awareness of ordinary humans.

hgoel3d ago

Just dealt with this last night with Claude repeatedly risking a full system crash by failing to ensure that the previous training run of a model ended before starting the next one.

It's a pretty strange issue, makes me feel like the 1M context model was actually a downgrade, but it's probably something weird about the state of its memory document. I wasn't even very deep into the context.

Natfan3d ago

why would further chance at context pollution be a good thing? i feel like it is easier for data to get lost in a larger context

grey-area3d ago

It doesn’t reason or explicitly follow instructions, it generates plausible text given a context.

j / k navigate · click thread line to collapse