TY - GEN
T1 - A Study on Broadcast Networks for Music Genre Classification
AU - Heakl, Ahmed
AU - Abdelgawad, Abdelrahman
AU - Parque, Victor
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Due to the increased demand for music streaming/recommender services and the recent developments of music information retrieval frameworks, Music Genre Classification (MGC) has attracted the community's attention. However, convolutional-based approaches are known to lack the ability to efficiently encode and localize temporal features. In this paper, we study the broadcast-based neural networks aiming to improve the localization and generalizability under a small set of parameters (about 180k) and investigate twelve variants of broadcast networks discussing the effect of block configuration, pooling method, activation function, normalization mechanism, label smoothing, channel interdependency, LSTM block inclusion, and variants of inception schemes. Our computational experiments using relevant datasets such as GTZAN, Extended Ballroom, HOMBURG, and Free Music Archive (FMA) show the state-of-the-art classification accuracies in MGC. Our approach offers insights and the potential to enable compact and generalizable broadcast networks for music classification.
AB - Due to the increased demand for music streaming/recommender services and the recent developments of music information retrieval frameworks, Music Genre Classification (MGC) has attracted the community's attention. However, convolutional-based approaches are known to lack the ability to efficiently encode and localize temporal features. In this paper, we study the broadcast-based neural networks aiming to improve the localization and generalizability under a small set of parameters (about 180k) and investigate twelve variants of broadcast networks discussing the effect of block configuration, pooling method, activation function, normalization mechanism, label smoothing, channel interdependency, LSTM block inclusion, and variants of inception schemes. Our computational experiments using relevant datasets such as GTZAN, Extended Ballroom, HOMBURG, and Free Music Archive (FMA) show the state-of-the-art classification accuracies in MGC. Our approach offers insights and the potential to enable compact and generalizable broadcast networks for music classification.
KW - broadcast networks
KW - convolutional neural networks
KW - music classification
KW - music genre classification
UR - http://www.scopus.com/inward/record.url?scp=85140720439&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85140720439&partnerID=8YFLogxK
U2 - 10.1109/IJCNN55064.2022.9892651
DO - 10.1109/IJCNN55064.2022.9892651
M3 - Conference contribution
AN - SCOPUS:85140720439
T3 - Proceedings of the International Joint Conference on Neural Networks
BT - 2022 International Joint Conference on Neural Networks, IJCNN 2022 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 International Joint Conference on Neural Networks, IJCNN 2022
Y2 - 18 July 2022 through 23 July 2022
ER -