Bengali.AI: Democratising AI research in Bangla | The Business Standard

Panorama

Imran Hossain
23 December, 2022, 09:00 am
Last modified: 23 December, 2022, 09:15 am

More than 6,000 researchers from different parts of the globe are now working with Bengali.AI, a commendable feat given that it was only in 2017 that a group of university graduates took the road less travelled to push AI research in the Bangla language.

Illustration: TBS

Have you ever tried to translate a line or paragraph in Bangla on Google Translate?

You were most likely frustrated, as the system does not translate Bangla well. In fact, some of the translations end up being so hilarious that they can crack you up.

Google's artificial intelligence clearly cannot grasp the nuances and quirks of the Bangla language. Google's voice keyboard is a bit hit or miss in Bangla, but when you try it in English, it works flawlessly.

Bangla is harder to incorporate into these types of systems partly because it has many dialects that differ considerably from standard Bangla. 'Dhakaiya', 'Chatgaiya', 'Sylheti' and 'Barishailla' all have their own distinctive tones. Also, Google simply does not have enough voice data to build a perfect voice-to-text system in Bangla.

Bengali.AI, a non-profit organisation consisting of a large number of researchers and volunteers from different institutions at home and abroad, has stepped in to address this issue.

In December 2017, a group of students who had just completed their bachelor's degrees at different universities in Bangladesh started a passion project and named it Bengali.AI. As the name suggests, the vision was clear: to push AI research in the Bangla language.

Ahmed Imtiaz Humayun, along with some of his peers, started the project with a dream of democratising Bangla AI research.

The founder of Bengali.AI said, "A lot is happening in the AI research arena. But almost all of them are in English. AI research in Bangla was almost nonexistent. My dream was to create a platform for those who dream. For those who will work passionately for AI research in Bangla."

The journey

In early 2018, Bengali.AI launched a project named NumtaDB, aimed at optical character recognition (OCR) of handwritten Bangla numerals. For the project, they collected over 85,000 samples of handwritten numbers from over 2,700 people. The same year, they launched an OCR competition on Kaggle in collaboration with Google.

The following year, the organisation collaborated with Google once again and launched another competition, this time on Bangla graphemes (linguistic segments of word formation). The competition drew a total of 7.5 million hours of research work from data scientists at companies such as Google, Nvidia and H2O.AI.

Readers familiar with Murad Takla texts (transliterated Bangla texts) may appreciate a 2020 project that made it possible to transcribe such texts. The same year, they started another project that takes a graphemic approach to Bangla handwritten OCR.

The next year, a team of 40 students from different universities, volunteer researchers from both academia and industry, and volunteer annotators launched a GPU-based algorithm that can detect typing mistakes in Bangla text with a high degree of accuracy. It works like Grammarly (an American cloud-based typing assistant that reviews spelling, grammar, punctuation and clarity in English-language text), but in Bangla. The spell-checker application is on its way to being launched for public use.

Current project: Voice recognition in Bangla

At the beginning of this year, they started an even larger project, "Bangla Speech Recognition." The purpose is to make machines understand the Bangla language. For this, they are running a social media campaign named the "Bok Bok Campaign."

Illustration: TBS

In this campaign, Bangla-speaking people from all over the world have donated their voice data to enrich Bengali.AI's voice dataset.

Mozilla Common Voice is the platform hosting the campaign. They have already accumulated around 2,000 hours of voice data from more than 23,000 voluntary voice donors. Their goal is much higher: 10,000 hours. They have partnered with 10 universities in Bangladesh to reach it.

Asif Shahriyar Sushmit, one of the masterminds behind this project, said, "2023 is the 71st anniversary of 1952's language movement. 52 and 71: two significant numbers in our history. 52 stands for language and 71 stands for freedom. We are aiming for this iconic moment to fulfil our goal: to release 10,000 hours of voice data for our AI researchers."

After the successful completion of the project, a new era will open for our researchers, not to mention the Bangla-speaking community. We might get better voice assistants in Bangla and a much smarter voice keyboard that understands different dialects of Bangla. Businesses will be able to increase their productivity with better voice recognition.

People with special needs will also benefit, as they will be able to operate smart devices in their mother tongue.

All these projects were completely voluntary. Bengali.AI does not seek any monetary benefit from its work.

"Many private companies tried to do what we are doing but they failed. People donated their time and data to AI research just because the data will be used for nonprofit purposes. The mission and vision of Bengali.AI remain the same. We will continue working to democratise AI research in Bangla. We are planning to launch another competition very soon to get more data on the Bangla voice recognition project," said the founder.

At this point, more than 6,000 researchers from different parts of the globe are working with Bengali.AI. Bengali.AI aims to continue working on AI research in Bangla to bring the best of artificial intelligence to the masses.

To contribute to the campaign, visit this link: https://commonvoice.mozilla.org/bn/speak
