Go to file

7088_Spoken_LID_CNN

Language Identification (LID) of spoken audio using Convolutional Neural Networks (CNNs) on Mel-Spectrograms of the audio clips.

Datasets have been pruned due to high upload sizes

Models haven't been included due to them being ~700MB

FFMPEG Executable files, used for converting MP3 files to WAV files haven't been included in the commit. Find them at: https://ffmpeg.org/

This repository is submited in conjunction with the project report. Refer to the report and Appendix 1 if any confusion arises.

7088_Spoken_LID_CNN/README.md