An AI system has reached human level on a test for ‘general intelligence’: here’s what that means
By
CS Mathew
A new artificial intelligence (AI) model has just achieved human-level results on a test designed to measure “general intelligence”. On December 20, OpenAI’s o3 system scored 85 per cent on the ARC-AGI benchmark, well above the previous AI best score of 55 per cent and on par with the average human score. It also scored well on a very difficult mathematics test.