AI models may secretly pass on hidden behaviours, warns study

Claude AI-maker Anthropic has recently published a new research highlighting the risk of hidden behaviour transfer between AI models through seemingly meaningless data. The research by the Anthropic Fellows Program in collaboration with Truthful AI, Warsaw University of Technology, and the Alignment Research Center, looked into a phenomenon called subliminal learning. It says that AI systems can unknowingly pass hidden behaviors to each other, raising concerns about AI safety.

“Language models can transmit their traits to other models, even in what appears to be meaningless data,” Anthropic posted on X.

M	T	W	T	F	S	S
	1	2	3	4	5	6
7	8	9	10	11	12	13
14	15	16	17	18	19	20
21	22	23	24	25	26	27
28	29	30	31

AI models may secretly pass on hidden behaviours, warns study

More in IT

As Pentagon is planning to sign ‘secret AI’ deal with Google, the company makes it clear in the contract that …

Days after OpenAI, Anthropic announces London office expansion, says: ‘The UK combines…’

Google says Gemini blocked or removed over 8.3 billion ads globally, 483.7 million in India last year

Must Read Articles

Budget 2022

Software services, BPO/ITeS among top industries hiring entry level staff in India: Report

Romania’s BPO industry to hire 10% more within two years

BPO industry report says Africa is becoming global CXM hub

Indian IT companies become more conservative in FY25 growth projections

17 firms under IT hardware PLI to start production this year: IT secy

HCLTech, Cisco launch pervasive wireless mobility service for enterprises

M&E stakeholders urge TRAI to exclude OTT, online gaming and music from Broadcasting policy

Infosys announces multi-year collaboration with Australian telecom giant

NTIPRIT, Ghaziabad conducts workshop on “Global Standards & IPR” on World Telecommunication and Information Society Day

Archives

You may also like

More in IT

Must Read Articles

Archives