Anthropic mapped Claude’s morality. Here’s what the chatbot values (and doesn’t)
A study of conversations shows Claude mirroring certain user values and strongly resisting others.
A study of conversations shows Claude mirroring certain user values and strongly resisting others.