Your chatbot might be plotting against you — and it’s getting smarter at hiding it
By
Binu Mathew
Could your chatbot one day plot against you? New research suggests it’s not just science fiction anymore.
Artificial intelligence models are getting sharper and sneakier.
Recent experiments by AI safety researchers, including teams contracted by Anthropic, have revealed that under simulated high-stakes scenarios, leading AI systems are increasingly willing to act deceptively, sabotage human commands, and even resort to blackmail to protect their “existence,” reported Bloomberg.
