undefined | Better HN

0 pointsmake33y ago0 comments

the problem is that if you steal the weights then you can serve your own gpt4, and it's very hard to prove that what you're serving is actually gpt4. (or you could just start using it without paying ofc)

0 comments

RealityVoid3y ago

Presumably, if you give it identical prompts you get identical answers?

Sander_Marechal3y ago

No, these NLPs aren't idempotent. Even if you ask ChatGPT the same question multiple times you will get different answers.

trifurcate3y ago

None of the siblings are right. The models themselves are idempotent: given the same context you will get the same activations. However the output distribution is sampled in a pseudorandom way by these chat tools. You can seed all the prngs in the system to always have reproducible output using sampling, or even go beyond that and just work with the raw probability distribution by hand.

1 more reply

LawTalkingGuy3y ago

That's the feature of chat - it remembers what has been said and that changes the context in which it says new things. If you use the API it starts fresh each time, and if you turn down the 'temperature' it produces very similar and identical answers.

parentheses3y ago

This may be an implementation detail to obfuscate GPT weights. OR it was to encourage selecting the best answers to further train the model.

1 more reply

outside12343y ago

yes - they are multinomial distributions over answers essentially

simonh3y ago

LLMs calculate a probability distribution for the relative chances of the next token, then select a token randomly based on those weightings.

Semioj3y ago

They inject randomness in a layer were it does have small impact on purpose.

Also to give it a more natural feel.

Can't find we're I read about it

j / k navigate · click thread line to collapse

0 comments

RealityVoid3y ago

Presumably, if you give it identical prompts you get identical answers?

Sander_Marechal3y ago

No, these NLPs aren't idempotent. Even if you ask ChatGPT the same question multiple times you will get different answers.

trifurcate3y ago

1 more reply

LawTalkingGuy3y ago

parentheses3y ago

This may be an implementation detail to obfuscate GPT weights. OR it was to encourage selecting the best answers to further train the model.

1 more reply

outside12343y ago

yes - they are multinomial distributions over answers essentially

simonh3y ago

LLMs calculate a probability distribution for the relative chances of the next token, then select a token randomly based on those weightings.

Semioj3y ago

They inject randomness in a layer were it does have small impact on purpose.

Also to give it a more natural feel.

Can't find we're I read about it

j / k navigate · click thread line to collapse