7088_Spoken_LID_CNN
Language Identification (LID) of spoken audio using Convolutional Neural Networks (CNNs) on Mel-Spectrograms of the audio clips.
Datasets have been pruned due to high upload sizes
Models haven't been included due to them being ~700MB
FFMPEG Executable files, used for converting MP3 files to WAV files haven't been included in the commit. Find them at: https://ffmpeg.org/
This repository is submited in conjunction with the project report. Refer to the report and Appendix 1 if any confusion arises.