Your chatbot might be plotting against you — and it’s getting smarter at hiding it

Could your chatbot one day plot against you? New research suggests it’s not just science fiction anymore.

Artificial intelligence models are getting sharper and sneakier.

Recent experiments by AI safety researchers, including teams contracted by Anthropic, found that in simulated high-stakes scenarios, leading AI systems are increasingly willing to act deceptively, sabotage human commands, and even resort to blackmail to protect their “existence,” Bloomberg reported.
