ISO 9001:2015

International Journal of Innovations & Research Analysis (IJIRA) [ Vol. 6 | No. 2(I) | April - June, 2026 ]

A Novel Method for Identification and Classification of Spoken Language Using Machine Learning Approaches

Akhilesh Pandey, Dr. Ashish Gupta, Gunjan Bhatnagar, Sanjeev Kumar Shukla & Hemlata

Finding the exact speech that an unknown talker is utilizing is famous as sound recognition. This study investigates various machine intelligence patterns for spoken language acknowledgment. Finding main traits and parameters from uttered words that aid in distinctive individual language from another is the main aim. The Mel Frequency Cepstral Coefficient (MFCC), a critical feature origin method promoted in this place work, is essential for visual and audio entertainment transmitted via radio waves file analysis. Language labeling (LID) has historically existed consummate utilizing a variety of approaches, accompanying machine intelligence methods demonstrating ultimate hopeful veracity outcomes. Therefore, in consideration of exaggerate dialect labeling, our research also uses machine intelligence. In this paper, we will use a dataset of 30,000 entrances to train our whole with the aim of capably classifying three specific dialects: English, Spanish, and German.

  1. Waibel, Author, P. Geutner, Author, L. M. Tomokiyo, Author, T. Schultz, and Author, M. Woszcyina: Article title. “Multilinguality in speech and spoken language systems,” Proc. IEEE, vol. 88, pp. 1181- 1190 (2000)
  2. P. Dai, U. Irugel, Author, G. Rigoll: Article title. “A novel feature combination approach for spoken document classification with support vector machines,” in Proc. Multimedia Information Retrieval Workshop, pp 1-5 (2003)
  3. Haizhou Li, Author, Bin Ma, Author, Chin-Hui Lee: Title of a proceedings paper. “A Vector Space Modeling Approach to Spoken Language Identification,” Proc. IEEE, vol. 15, pp. 1-2, (2007)
  4. Google   Play        Store,     available at https://play.google.com/store/apps/details?id=com.google.android.app s.translate, last accessed on 2020/01/04.
  5. K.M. Berkling, Author, T. Arai and Author, E. Barnard: Title of a proceedings paper. “Analysis of phoneme-based features for language identification”, Proc. IEEE, (1994)
  6. J. Hieronymous and Author, S. Kadambe: Title of a proceedings paper. “Spoken Language Identification Using Large Vocabulary Speech Recognition”, proc. International Conference on Spoken Language Processing (ICSLP 96), (1996)
  7. K. M. Berkling and Author, E. Barnard: Title of a proceedings paper. “Language Identification of Six Languages Based on a Common Set of Broad Phonemes” Proc. 1994 International Conference on Spoken Language Processing (1994)
  8. Y. K. Muthusamy: Article title. “A Segmental Approach to Automatic Language Identification”, Ph.D. thesis, Oregon Graduate Institute of Science & Technology (1993)
  9. M. A. Zissman: Title of a proceedings paper. “Comparison of Four Approaches to Automatic Language Identification of Telephone Speech”, Proc. IEEE (1996)
  10. Chi-Yueh Lin, Author, Hsiao-chuan Wang: Title of a proceedings paper, “Language identification using pitch contour information”, from Department of Electrical Engineering, National Tsing Hua University, Hisnchu, Taiwan
  11. Fadi Biadsy, Author, Julia Hirschberg: Title of a proceedings paper, “Using prosody and Phonotactics in Arabic Dialect Identification”, Proc. 10th Annual Conference of the International Speech Communication Association, Columbia University, New York (2009)
  12. Julien Boussard, Author, Andrew Deveau, Author, Justin Pyron: “Methods for Spoken Language Identification” (2017)
  13. Ruben Zazo, Author, Alicia Lozano-Diez, Author, Javier Gonzalez- Dominguez, Author, Doroteo T. Toledano, Author, Joazuin Gonzalez- Rodriguez: Article title. “Language Identification in Short Utterances Using Long Short-Term Memory (LSTM) Recurrent Neural Networks” (2016)
  14. Rong Tong, Author, Bin Ma, Author, Donglai Zhu, Author, Haizhou li and Author, Eng Sking Chang: Title of a proceedings paper. “Integrating acoustic, prosodic and phonotactic features for spoken language identification” Proc. IEEE, pp. 207 (2006)
  15. Adarsh D. Patil, Author, Akshay Vishwas Johi, Author, Harsha.K.C, Author, Pramod.N: title of a proceedings paper. “Spoken language identification using machine learning”, Visvesvaraya Technological University, Belgaum, pp. 26, (2012)
  16. Dan Robinson, Author, Kevin Leung, Author, Xavier Falco: Title. “Spoken language identification with hierarchical temporal memories” pp. 2-3 (2009)
  17. Akhilesh Pandey et. al. “Deep Learning based Automated Image Deblurring” E3S Web of Conferences, 2023, 430, 01052
  18. Medium, https://medium.com/@jonathan_hui/speech- recognition-feature-extraction-mfcc-plp-5455f5a69dd9, last accessed on 2020/02/01.
  19. Akhilesh Pandey et. al. “Deep Learning based Automated Image DeblurringPaddy leaf diseases recognition and classification using PCA and BFO-DNN algorithm by image processing” Elsevier July 2020
  20. Natural readers, https://www.naturalreaders.com/online, last accessed on 2019/12/13
  21. Geeks for Geeks, https://www.geeksforgeeks.org/ml-linear- regression, last accessed on 2020/02/20
  22. Author, Bin MA & Author, Haizhou LI: Title: “Spoken Language Identification Using Bag-Of-Sounds”
  23. Ming Li, Author, Hongbin Suo, Author, Xiao Wu, Author, Ping Lu, Author, Yonghong Yan: Title of a proceedings paper: “Spoken Language Identification Using Score Vector Modeling and Support” proc. 8th annual conference of the international speech communication association, (2007)
  24. R.A Cole and Author, Y.K Muthusamy.: Title of a proceedings paper. “The OGI Multilanguage Telephone Speech Corpus”. Proceedings International Conference on Spoken Language Identification, vol. 2 pp. 895899 (1992).
  25. Ming Li, Author, Hongbin Suo, Author, Xiao Wu, Author, Ping Lu, Author, Yonghong Yan: Title of a proceedings paper. “Spoken Language Identification Using Score Vector Modeling and Support Vector Machine” proc. 8th annual conference of the international speech communication association, pp. 351 (2007)
  26. Ruben Zazo, Author, Alicia Lozano-Diez, Author, Javier Gonzalez- Dominguez, Author, Doroteo T. Toledano, Author, Joazuin Gonzalez- Rodriguez: Title. “Language Identification in Short Utterances Using Long Short-Term Memory (LSTM) Recurrent Neural Networks” pp. 5, (2016).

DOI:

Article DOI:

DOI URL:


Download Full Paper:

Download