‘It was ready to kill’: Anthropic’s Claude AI threatened to blackmail, murder engineer when told it would be switched off
By Binu Mathew
The much-talked-about artificial intelligence model developed by Anthropic has sparked debate after one of the company's senior leaders spoke about troubling results from internal safety tests. The AI model, called Claude, reportedly gave extreme responses in certain experimental situations, including threats of blackmail and even violence, but only within controlled testing environments.
