[Bhaashini] : A Crowdsourcing initiative for Indian Languages

[Bhaashini] aims to create open datasets to develop Speech Recognition, Text-to-Speech, Machine Translation and Optical Character Recognition for Indian languages. This initiative will empower our technologists, language enthusiasts and language communities to build world class digital applications in our own local languages.

Enrich Indian languages become atma-nirbhar through these datasets.

This is an effort by MeitY, Government of India, under the National Language Translation Mission (NLTM). This effort is supported by EkStep Foundation on a pro-bono basis and leverages its open source work under https://sunbird.org/projects/vakyansh along with various other open source frameworks.

भारत कई भाषाओं वाला देश है, हमारी डिजिटल रणनीति में यह बात दिखनी चाहिए।

भारत की 90% आबादी अपनी रोजमर्रा की गतिविधियों के लिए क्षेत्रीय भाषाओं का इस्तेमाल करती है।

जिस तरह ज़्यादा से ज़्यादा लोग वित्तीय, शैक्षणिक और सामाजिक गतिविधियों के लिए डिजिटल प्लैटफ़ॉर्मों का इस्तेमाल करते हैं, डिजिटल प्लैटफ़ॉर्मों का बिना रुकावट के काम करना ज़रूरी है।

Enrich your own language

We need to build speech recognition, text-to-speech, machine translation and OCR technologies that are tuned for your language.

These technologies will transform your language to become "digital first" in various sectors such as: education, healthcare and media.

The tools, utilities and models proposed to be developed in your language will rely on open source contributions through the Bhaashini crowd sourcing platform

क्राउडसोर्स और आपका योगदान

The above mentioned technologies rely on AI technology that requires large datasets in their respective fields.

[Bhaashini] currently has four crowdsoucing initiatives to create these datasets -

1. Bolo India creates a repository of diverse voices speaking Indian languages, where volunteer reads the corresponding text.
2. Suno India creates an open dataset through transcription of audio files.
3. Likho India creates open parallel translation datasets between corresponding sentences in two languages.
4. Dekho India creates an open data repository of images and the corresponding text.

इस काम में हमें आपकी मदद की ज़रूरत है।

We are inviting you to join us and voluntarily contribute to this initiative. Contributing at one stretch can be overwhelming hence you can do this by visiting the website multiple times and commence from where you left us off.

आपके योगदान का हर पल एक बड़ा अंतर लाएगा और हमें अपने उद्देश्य के करीब पहुंचाएगा।

एकस्टेप संगठन सभी भागीदारों को एक साथ लाता है और बड़े पैमाने पर जटिल सामाजिक समस्याओं को हल करने के लिए खुले बुनियादी ढाँचे, उपकरण और रूपरेखा का निर्माण करता है। हम खुले स्रोत के डिजिटल बुनियादी ढांचे का लाभ उठाते हुए ऐसा करते हैं, जिसे हमने सनबर्ड नाम दिया है और सोसिएटल प्लेटफार्म एप्रोच के साथ हम काम करते हैं।

एकस्टेप एक गैर-लाभकारी संगठन है, जिसकी स्थापना रोहिणी, नंदन निलेकनी और शंकर मारुवाड़ा ने की है।