Project Indus: Tech Mahindra’s Initiative to Challenge OpenAI

Tech Mahindra head CP Gurnani recently took to Twitter to request speakers of Indic languages to contribute to the project

Indian IT giant Tech Mahindra is working on an indigenous Large Language Model (LLM) that would have the ability to speak in many Indic languages, most notably Hindi.

Called Project Indus, the model will have the ability to speak in 40 different Indic languages, to begin with. More languages that have originated in the country will also be added subsequently. 

Tech Mahindra head CP Gurnani recently took to Twitter to request speakers of these languages to contribute to the project with their expressions, vocabulary, and conversations.

Building an LLM needs a big dataset, and the scarcity of Indic language datasets is a challenge. The approach taken by the IT giant is similar to that of Bhashini, a project launched by Narendra Modi to build datasets on Indic languages. 

Speakers of languages such as Dongri (Jammu & Kashmir), Kinnauri, Kangri, Chambeli, Garhwali, (Himachal), Kumaoni, Jaunsari ( Uttar Pradesh), Bhojpuri, Maithili,  and Magahi ( Bihar), among others can contribute to the project.

Previously, Gurnani, responding to a Sam Altman tweet, confirmed that Tech Mahindra is building an LLM specifically for India.

📣 Want to advertise in AIM? Book here

Picture of Pritam Bordoloi

Pritam Bordoloi

I have a keen interest in creative writing and artificial intelligence. As a journalist, I deep dive into the world of technology and analyse how it’s restructuring business models and reshaping society.
Related Posts
Association of Data Scientists
GenAI Corporate Training Programs
Our Upcoming Conference
India's Biggest Conference on AI Startups
April 25, 2025 | 📍 Hotel Radisson Blu, Bengaluru
Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.