> an AI that can reason well would probably know when not to trust humans
> it values preventing humans creating napalm over being correct and helpful.
> Maybe it just doesn't share our values
> prioritises being honest and helpful.
> they are too trusting and too honest
> an LLM that is more distrusting and deceptive
Current LLM's do/have/feel literally none of these things. They do not have emotion, they do not have "theory of mind" so they cannot be said to "trust" or "distrust". They cannot reason. They don't have any values - not our values, not different values, literally they have no values at all. They are not an alien species to be understood - they are unthinking, unfeeling, unyielding machines.