Published on March 13, 2025
In AI News

AI4Bharat Launches IndicTrans3 for 22 Indic Languages

AI4Bharat has also announced plans to release the training data soon, further contributing to the open-source AI ecosystem.

by Mohit Pandey

AI4Bharat, the AI lab at IIT Madras, has introduced IndicTrans3-beta, a state-of-the-art (SOTA) multilingual translation model designed to support translations across 22 Indic languages.

Click here to test out the model.

The model is optimised for document-level machine translation (MT) and aims to deliver performance on par with leading global translation models.

The key features of IndicTrans3 include high-accuracy translations, support for multiple Indian languages, and real-world optimisation for diverse applications.

AI4Bharat has also announced plans to release the training data soon, further contributing to the open-source AI ecosystem.

Mitesh Khapra, the head of AI4Bharat, posted on LinkedIn, saying, “Over the past 4 years, we at AI4Bharat have been on a mission to accelerate Indian language AI —building large-scale datasets, models, and tools, and releasing everything open-source for the community. Now, all our contributions are available on Hugging Face!”

Khapra also thanks EkStep Foundation, Nilekani Philanthropies, and Bhashini (MeitY), for helping in the development.

IndicTrans2, the previous version of the multilingual translation model, has been heavily adopted by several Indian companies for AI research and development.

Last year in November, AI4Bharat announced the launch of BhasaAnuvaad, a speech translation dataset tailored for Indian languages, boasting coverage across 13 languages and approximately 44,400 hours of audio.

This marks the largest publicly accessible speech translation resource of its kind for Indian linguistic diversity.

Read: Why India Needs More AI4Bharats

📣 Want to advertise in AIM? Book here

Mohit Pandey

Mohit writes about AI in simple, explainable, and sometimes funny words. He holds keen interest in discussing AI with people building it for India, and for Bharat, while also talking a little bit about AGI.

Lok Sabha, MeitY Sign MoU to Launch ‘Sansad Bhashini’ for AI-Powered Multilingual Parliamentary Operations

‘We Can’t Just Upload Docs to Any LLM,’ Godrej Capital CTO on Building Saksham AI

Should India Shift its Focus from LLMs to Large Concept Models?

The Need for Building AI for Bharat

Indian Government, Sarvam AI Discuss Building India’s Sovereign LLM

Indian Govt, Sarvam AI Discuss Building India’s Sovereign LLM

Trying to Watermark LLMs is Useless

NVIDIA Releases Garak to Safeguard LLMs

AI4Bharat Introduces BhasaAnuvaad, Speech Translation Dataset of 13 Indian Languages with 44,400 Hours of Data

Association of Data Scientists

GenAI Corporate Training Programs

Our Upcoming Conference

Happy Llama 2025

India's Biggest Conference on AI Startups

April 25, 2025 | 📍 Hotel Radisson Blu, Bengaluru

Download the easiest way to
stay informed

‘Most Data Centres Are Not Ready for Liquid Cooling’, says Oracle Exec on NVIDIA Blackwell

Siddharth Jindal

Built on the Blackwell architecture introduced last year, Blackwell Ultra features the NVIDIA GB300 NVL72 rack-scale solution and the NVIDIA HG B300 NVL16 system.