It may be argued that music genre classification (MGC) is one of the most important tasks in music information retrieval; however, it still suffers from being a high-dimensional, highly variable, and noisy audio signal. Most traditional deep learning models require large computational setups and do not fare well in the instances of overfitting and local optima. The paper proposes a new hybridization: SqueezeNet optimized through PIGMM (Promoted Ideal Gas Molecular Motion) for enhanced MGC performance. PIGMM, which is a metaheuristic algorithm with roots in molecular dynamics and is improved by chaos theory and opposition-based learning, was used to optimize the parameters of SqueezeNet for improved convergence and generalization. The model that works on audio spectrograms demonstrates 96% accuracy in feature extraction. Under ten-fold cross-validation on the GTZAN and Extended Ballroom datasets, the method achieves classification accuracies of 91.1% and 93.4%, respectively, both of which outperform state-of-the-art models. The results show the highest precision values of 93.5% and 95.8% as well as recall values of 96.5% and 97.7%, thus confirming the strength and effectiveness of this model. The work presents a lightweight and noise-resilient solution for scalable music classification.
Deep learning model using squeezenet and promoted ideal gas molecular motion for music genre classification from audio spectrograms.
阅读:4
作者:Xue, Mengjin
| 期刊: | Scientific Reports | 影响因子: | 3.900 |
| 时间: | 2025 | 起止号: | 2025 Sep 1; 15(1):32170 |
| doi: | 10.1038/s41598-025-16499-z | ||
特别声明
1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。
2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。
3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。
4、投稿及合作请联系:info@biocloudy.com。
