Bangla Speech Recognition

🐍 Python
🔥 PyTorch
🗣️ Speech Recognition
📚 Seq2Seq Model
Author

Md Hasan

Published

September 23, 2020

This project implements a Sequence-to-Sequence (Seq2Seq) model for Bangla speech recognition using PyTorch. The project focuses on sequence-to-sequence tasks with utilities for data loading, preprocessing, training, and validation. This project aims to build an efficient speech recognition system for the Bangla language.

Key Features:

  • Seq2Seq Model: A sequence-to-sequence deep learning model is used to transcribe Bangla speech to text.
  • PyTorch Implementation: The entire model is built using the PyTorch framework.
  • Data Loading & Preprocessing: The project includes utilities for loading, preprocessing, and preparing the speech dataset.
  • Training and Validation: Scripts are provided for training and validating the model.

Technologies and Dependencies:

  • Python 3.7
  • PyTorch
  • NumPy
  • Librosa: For audio processing.

You can view the full project and source code on GitHub: Bangla Speech Recognition

Back to top