How Google is using its latest Gemini AI model to train robots to navigate the world
By Isha Sharma
At I/O 2024, Google showcased the multimodal capabilities of its Gemini 1.5 AI model. This means the large language model can accept photos, videos, and audio, along with text, as inputs, process that information, and generate responses. The company's AI unit is now leveraging this capability to train robots to navigate their surroundings.