Bengali.AI: Democratising AI research in Bangla | The Business Standard
Skip to main content
  • Latest
  • Economy
    • Banking
    • Stocks
    • Industry
    • Analysis
    • Bazaar
    • RMG
    • Corporates
    • Aviation
  • Videos
    • TBS Today
    • TBS Stories
    • TBS World
    • News of the day
    • TBS Programs
    • Podcast
    • Editor's Pick
  • World+Biz
  • Features
    • Panorama
    • The Big Picture
    • Pursuit
    • Habitat
    • Thoughts
    • Splash
    • Mode
    • Tech
    • Explorer
    • Brands
    • In Focus
    • Book Review
    • Earth
    • Food
    • Luxury
    • Wheels
  • More
    • Sports
    • TBS Graduates
    • Bangladesh
    • Supplement
    • Infograph
    • Archive
    • Gallery
    • Long Read
    • Interviews
    • Offbeat
    • Magazine
    • Climate Change
    • Health
    • Cartoons
  • বাংলা
The Business Standard

Sunday
June 15, 2025

Sign In
Subscribe
  • Latest
  • Economy
    • Banking
    • Stocks
    • Industry
    • Analysis
    • Bazaar
    • RMG
    • Corporates
    • Aviation
  • Videos
    • TBS Today
    • TBS Stories
    • TBS World
    • News of the day
    • TBS Programs
    • Podcast
    • Editor's Pick
  • World+Biz
  • Features
    • Panorama
    • The Big Picture
    • Pursuit
    • Habitat
    • Thoughts
    • Splash
    • Mode
    • Tech
    • Explorer
    • Brands
    • In Focus
    • Book Review
    • Earth
    • Food
    • Luxury
    • Wheels
  • More
    • Sports
    • TBS Graduates
    • Bangladesh
    • Supplement
    • Infograph
    • Archive
    • Gallery
    • Long Read
    • Interviews
    • Offbeat
    • Magazine
    • Climate Change
    • Health
    • Cartoons
  • বাংলা
SUNDAY, JUNE 15, 2025
Bengali.AI: Democratising AI research in Bangla

Panorama

Imran Hossain
23 December, 2022, 09:00 am
Last modified: 23 December, 2022, 09:15 am

Related News

  • From Dhaka to Paris: How Kaz Software is making its place on global stage
  • Here’s everything Apple announced at WWDC 2025
  • Google offers buyouts to more workers amid AI-driven tech upheaval and antitrust uncertainty
  • Racing ahead: Bangladesh’s place in the AI revolution
  • How to not write like an AI

Bengali.AI: Democratising AI research in Bangla

At this point, more than 6,000 researchers from different parts of the globe are working with Bengali.AI. A commendable feat given how it was only in 2017 that a group of university graduates took the road less travelled to push AI research in the Bangla language

Imran Hossain
23 December, 2022, 09:00 am
Last modified: 23 December, 2022, 09:15 am
Illustration: TBS
Illustration: TBS

Have you ever tried to translate a line or paragraph in Bangla on Google Translate?

You were most likely frustrated as the system cannot translate Bangla well. In fact, some of the translations end up being so hilarious that it can crack you up.

Google's artificial intelligence clearly cannot understand the nuances and quirks of the Bangla language properly. When it comes to Google's voice keyboard, it is a bit of a hit or miss. But when you try it in English, it works just flawlessly.

The Business Standard Google News Keep updated, follow The Business Standard's Google news channel

Bangla is tougher to incorporate into these types of systems partly because it has many dialects which are quite different from standard Bangla. 'Dhakaiya,' 'Chatgaiya,' 'Sylheti,' 'Barishailla'— all these dialects have their very own distinctive tones. Also, Google simply does not have enough voice data to make a perfect voice-to-text system in Bangla. 

Bengali.AI, a non-profit organisation consisting of a large number of researchers and volunteers from different institutions at home and abroad, have stepped in to address this issue.

In December 2017, a group of students who just completed their bachelor's from different universities in Bangladesh started a passion project and named it Bengali.AI. The vision was clear, as the name suggests, to push AI research in the Bangla language. 

Ahmed Imtiaz Humayun, along with some of his peers, started the project with a dream of democratising Bangla AI research.

The founder of Bengali.AI said, "A lot is happening in the AI research arena. But almost all of them are in English. AI research in Bangla was almost nonexistent. My dream was to create a platform for those who dream. For those who will work passionately for AI research in Bangla."

The journey

In early 2018, Bengali.AI launched a project named NumtaDB, an optical character recognition (OCR) model which could detect Bangla numerical characters. For the project, they collected over 85,000 samples of handwritten numbers from over 2,700 people. The same year, they launched an OCR competition on Kaggle in collaboration with Google.

The following year, the organisation collaborated with Google once again and launched another competition on Bangla Graphemes (linguistic segments of word formation). They were able to pull off a total of 7.5 million hours of research work by the data scientists from companies like Google, Nvidia, H2O.AI, etc.

If you are familiar with Murad Takla texts (transliterated Bangla texts), well, in 2020 they worked on a project where the transliterated texts could be transcribed. The same year, they started another project which takes a graphemic approach to Bangla handwritten OCR.

The next year, a team of 40 students from different universities, voluntary researchers from both academia and industry and voluntary annotators launched a GPU-based algorithm which can detect typing mistakes in Bangla texts even to a high degree. This works like Grammarly (an American cloud-based typing assistant that reviews spelling, grammar, punctuation, clarity, etc and mistakes in English language text) but in Bangla. This spell-checker application is on its way to being launched for public use.

Current project: Voice recognition in Bangla

At the beginning of this year, they started an even larger project, "Bangla Speech Recognition." The purpose is to make machines understand the Bangla language. For this, they run a campaign on social media named "Bok Bok Campaign." 

Illustration: TBS
Illustration: TBS

In this campaign, Bangla-speaking people from all over the world donated their voice data to enrich the voice dataset of Bengali.AI. 

Mozilla Commonvoice is the platform where they are hosting the campaign. They have already managed to accumulate a total of around 2,000 hours of voice data from more than 23,000 voluntary voice donors. Their goal is much higher at 10,000 hours. They have partnered with 10 universities in Bangladesh to reach their goal.

Asif Shahriyar Sushmit, one of the masterminds behind this project, said "2023 is the 71st anniversary of 1952's language movement. 52 and 71— two significant numbers in our history. 52 stands for language and 71 stands for freedom. We are aiming for this iconic moment to fulfil our goal— to release 10,000 hours of voice data for our AI researchers."

After the successful completion of the project, a new era will be opened for our researchers and not to mention— the Bangla-speaking community as well. We might get better voice assistants in Bangla, a much smarter voice keyboard which understands different dialects of Bangla. Businesses will be able to increase their productivity with better voice recognition.

Also, people with special needs will be beneficiaries of this as they would be able to operate smart devices in their mother tongue.

All these projects were completely voluntary. Bengali.AI doesn't seek any monetary benefits from their work.

"Many private companies tried to do what we are doing but they failed. People donated their time and data to AI research just because the data will be used for nonprofit purposes. The mission and vision of Bengali.AI remain the same. We will continue working to democratise AI research in Bangla. We are planning to launch another competition very soon to get more data on the Bangla voice recognition project." said the founder.

At this point, more than 6,000 researchers from different parts of the globe are working with Bengali.AI. Bengali.AI aims to continue working on AI research in Bangla to bring the best of artificial intelligence to the masses.

To contribute to the campaign, visit this link: https://commonvoice.mozilla.org/bn/speak

Features / Top News

Artificial Intelligence / Bangla / AI

Comments

While most comments will be posted if they are on-topic and not abusive, moderation decisions are subjective. Published comments are readers’ own views and The Business Standard does not endorse any of the readers’ comments.

Top Stories

  • BNP gears up for polls — preps ongoing from grassroots to top brass 
    BNP gears up for polls — preps ongoing from grassroots to top brass 
  • Non-performing loans surge by Tk74,570cr in Q1 as hidden rot exposed
    Non-performing loans surge by Tk74,570cr in Q1 as hidden rot exposed
  • Representational image of school children. File photo: Collected
    Govt issues urgent guidelines to educational institutes to curb Covid, dengue spread

MOST VIEWED

  • Vehicles were seen stuck on the Dhaka-Tangail-Jamuna Bridge highway due to a traffic jam stretching 15 kilometres on 14 June 2025. Photo: TBS
    15km traffic jam on Dhaka-Tangail-Jamuna Bridge highway as post-Eid rush continues
  • Tour operator Borsha Islam. Photo: Collected
    ‘Tour Expert’ admin Borsha Islam arrested over Bandarban tourist deaths
  • Infographic: TBS
    Chattogram Port proposes 70%-100% tariff hike
  • Fighter jet. Photo: AFP
    3 F-35 fighter jets downed, two Israeli pilots in custody, claims Iranian media
  • Bangladesh Bank Governor Ahsan H Mansur. TBS Sketch
    Merger of 5 Islamic banks at final stage: BB governor
  • Infograph: TBS
    Why 10 economic zones, including BGMEA's garment park, were cancelled

Related News

  • From Dhaka to Paris: How Kaz Software is making its place on global stage
  • Here’s everything Apple announced at WWDC 2025
  • Google offers buyouts to more workers amid AI-driven tech upheaval and antitrust uncertainty
  • Racing ahead: Bangladesh’s place in the AI revolution
  • How to not write like an AI

Features

Renowned authors Imdadul Haque Milon, Mohit Kamal, and poet–children’s writer Rashed Rouf seen at Current Book Centre, alongside the store's proprietor, Shahin. Photo: Collected

From ‘Screen and Culture’ to ‘Current Book House’: Chattogram’s oldest surviving bookstore

4h | Panorama
Photos: Collected

Kurtis that make a great office wear

2d | Mode
Among pet birds in the country, lovebirds are the most common, and they are also the most numerous in the haat. Photo: Junayet Rashel

Where feathers meet fortune: How a small pigeon stall became Dhaka’s premiere bird market

4d | Panorama
Illustration: Duniya Jahan/ TBS

Forget Katy Perry, here’s Bangladesh’s Ruthba Yasmin shooting for the moon

5d | Features

More Videos from TBS

Macron to visit Greenland after Trump's annexation threat

Macron to visit Greenland after Trump's annexation threat

37m | TBS World
Important facilities including Natanz damaged in Israeli attack

Important facilities including Natanz damaged in Israeli attack

52m | Others
Iran's gas production from South Pars halted after Israeli attack

Iran's gas production from South Pars halted after Israeli attack

1h | TBS World
Why the Strait of Hormuz is a key point of discussion in the Iran-Israel conflict

Why the Strait of Hormuz is a key point of discussion in the Iran-Israel conflict

1h | Others
EMAIL US
contact@tbsnews.net
FOLLOW US
WHATSAPP
+880 1847416158
The Business Standard
  • About Us
  • Contact us
  • Sitemap
  • Advertisement
  • Privacy Policy
  • Comment Policy
Copyright © 2025
The Business Standard All rights reserved
Technical Partner: RSI Lab

Contact Us

The Business Standard

Main Office -4/A, Eskaton Garden, Dhaka- 1000

Phone: +8801847 416158 - 59

Send Opinion articles to - oped.tbs@gmail.com

For advertisement- sales@tbsnews.net