New AI benchmark tests speed of responses to user queries

Artificial intelligence benchmarking group MLCommons on Wednesday released a fresh set of tests and results that rate the speed at which top-of-the-line hardware can run AI applications and respond to users.

The two new benchmarks added by MLCommons measure the speed at which AI chips and systems can generate responses from powerful AI models packed with data. The results roughly demonstrate how quickly an AI application such as ChatGPT can deliver a response to a user query.

One of the new benchmarks measures the speed of a question-and-answer scenario for large language models. It is based on Llama 2, a model with 70 billion parameters developed by Meta Platforms.

MLCommons officials also added a second benchmark to the suite of benchmarking tools, called MLPerf: a text-to-image generator based on Stability AI’s Stable Diffusion XL model.

Servers powered by Nvidia’s H100 chips, built by the likes of Alphabet’s Google, Supermicro and Nvidia itself, handily won both new benchmarks on raw performance. Several server builders submitted designs based on Nvidia’s less powerful L40S chip.

Server builder Krai submitted a design for the image generation benchmark with a Qualcomm AI chip that draws significantly less power than Nvidia’s cutting-edge processors.

Intel also submitted a design based on its Gaudi2 accelerator chips. The company described the results as “solid.”

Raw performance is not the only measure that is critical when deploying AI applications. Advanced AI chips suck up enormous amounts of energy, and one of the most significant challenges for AI companies is deploying chips that deliver an optimal amount of performance for a minimal amount of energy.

MLCommons has a separate benchmark category for measuring power consumption.

