Microsoft launches 3 AI models for transcription, image, and speech generation

By Binu Mathew, April 4, 2026

Microsoft on Thursday announced three new models from its Microsoft AI (MAI) model family for transcription, image, and speech generation.

The three models are MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2, and they are part of Microsoft's push to expand multimodal artificial intelligence (AI) capabilities for developers.

The models are available from today on Microsoft Foundry and the MAI Playground. Foundry, formerly Azure AI Studio, is a unified AI platform for building, customising, and scaling generative AI (GenAI) applications and agents. The MAI Playground is a public testing environment where users can experiment with features and provide feedback.
