Forget ChatGPT? China’s DeepSeek is working on smarter, self-improving AI models

By Neha KumariApril 8, 2025

After shaking up Silicon Valley with AI models earlier this year, Chinese startup DeepSeek is working on another innovation to help reduce operational costs. The company, led by Liang Wenfeng, has been working with researchers at Tsinghua University to develop a new approach called generative reward modelling (GRM), which rewards the AI model for following human preferences.

The new approach, first revealed in a pre-print paper (via Bloomberg), discusses the use of a technique called self-principled critique tuning (SPCT) to make AI models smarter and more efficient in a self-improving way.

Comments are closed.

M	T	W	T	F	S	S
	1	2	3	4	5	6
7	8	9	10	11	12	13
14	15	16	17	18	19	20
21	22	23	24	25	26	27
28	29	30

Forget ChatGPT? China’s DeepSeek is working on smarter, self-improving AI models

More in IT

Centre to introduce Bill criminalising digital arrests and AI-generated deepfakes

Not just Hugging Face, OpenAI says its rogue AI agent also accessed accounts across four online services

Google rolls out Gemini Spark AI agent to users in India: Here’s what it can do

Must Read Articles

Budget 2022

Software services, BPO/ITeS among top industries hiring entry level staff in India: Report

Romania’s BPO industry to hire 10% more within two years

BPO industry report says Africa is becoming global CXM hub

Indian IT companies become more conservative in FY25 growth projections

17 firms under IT hardware PLI to start production this year: IT secy

HCLTech, Cisco launch pervasive wireless mobility service for enterprises

M&E stakeholders urge TRAI to exclude OTT, online gaming and music from Broadcasting policy

Infosys announces multi-year collaboration with Australian telecom giant

NTIPRIT, Ghaziabad conducts workshop on “Global Standards & IPR” on World Telecommunication and Information Society Day

Archives

You may also like

More in IT

Must Read Articles

Archives