International Research Journal of Engineering and Technology (IRJET)
e-ISSN: 2395-0056
Volume: 06 Issue: 08 | Aug 2019
p-ISSN: 2395-0072
www.irjet.net
Audio Genre Classification using Neural Networks Gandharva Deshpande1, Sushain Bhat2 1,2 Student,
Dept. of Computer Engineering, St. Francis Institute of Technology, Mumbai, Maharashtra, India ---------------------------------------------------------------------***----------------------------------------------------------------------
Abstract – Audio Classification is a fundamental problem in the field of Audio processing. One of the modern applications of such classification is the Audio Genre Classification. With the music industry soaring high as a form of entertainment, the classification of the genres of music became an important aspect. As a solution to this problem of classification, we built a Neural network model to predict the genre of a music file. The model essentially uses a Neural network architecture to classify the music file into 10 different genres based on various parameters of the audio clip. The model is trained using audio files and then further tested using a set of audio files. Key Words: Neural Networks, Audio Dataset, Genre Classification, Audio files, Classifier model
2. Hardware and Software Requirements Requirements are the minimal configurations of a device and software required for the model to work properly and efficiently. 2.1 Hardware requirements Graphics Processing Unit (GPU). Intel Core i3 processor or above 2.2 Software requirements Windows 7 or above / Linux.
1. INTRODUCTION
Python 2.7 or above.
Audio Genre Classification is a task that essentially requires human intelligence to some level. Determining a genre of music simply by listening to it may require some level of intelligence and experience on the human level too. To ease things down for the music enthusiasts, and help them in the process of determining the genre of a music piece, the model classifies the audio file into the most suitable genre. The objective of the model is predicting the genre of the music file using neural networks with considerable accuracy and respectable speed. This concept paves way for the rising machine learning applications in the field of music industry.
Jupyter Notebook.
1.1 Dataset Description The Dataset for the model is an audio file dataset consisting 1000 audio files of 10 different genres. The file format of the audio files was .mp3. A total of 800 audio files were used for training purpose while the rest 200 files were utilized for testing. The dataset comprised of varied audio files however only a duration of 30 seconds is considered for feature extraction purposes.
3. METHODOLOGY The classifier model aims at providing respectable accuracy and speed in successful classification and prediction of genre based on the input audio file. Initially, training data set is provided to the model. The model learns to classify genres and predict the genre for the input file with the help of training. A Neural network model is used for the same. The node structure is developed based on the various features that help determine genre of a song or a music file. The essential features (features that correlate the maximum with the genre of a song) are utilized to rightly classify the audio track. The workflow of the system is given below:
1.2 Model Selection When it comes to dealing with audio files or image files neural network models are the Go-to models in machine learning applications. With an aim of achieving higher accuracy a 5 layered neural network model was developed to attain efficiency. The model consists of the following: 4 layers with RELU activation function, Output layer with SoftMax activation function Fig -1: Block Diagram
Š 2019, IRJET
|
Impact Factor value: 7.34
|
ISO 9001:2008 Certified Journal
|
Page 1290