A frequency analysis of filterbank initialisation and noise augmentation for LEAF

对LEAF滤波器组初始化和噪声增强进行频率分析

阅读:1

Abstract

Differentiable frontends, such as the LEArnable Frontend (LEAF), have drawn increasing interest from the computer audition (CA) community combining the rigour of traditional signal processing techniques with the flexibility and potential of end-to-end deep learning approaches. Concretely, they promise the ability to automatically learn task-specific features, resulting in both higher performance and better interpretability of CA applications. With the adaptability of LEAF’s parameters being questioned in recent literature, we further dig into the reasons why LEAF does not adjust its parameters. We thus perform a detailed analysis investigating the effects of filterbank initialisation for LEAF in a wide, previously unmatched range of computer audition tasks, namely speech recognition, speech emotion recognition, acoustic scene classification, and bird activity detection. In line with literature, we report that performance stays constantly high irrespective of filterbank initialisation, so long as it covers the entire frequency spectrum, in which case adaptation is minimal. Crucially, however, a filterbank initialised with all frequency bands equally does change its centre frequencies and bandwidths, yet remains with a lower performance. This effect is seemingly independent of how information is spread across frequencies, as we confirm in an additional set of experiments with controlled frequency distributions. This points towards the critical role of initialisation and the inductive bias of LEAF and manifests concerns about the adaptability and interpretability of LEAF across many settings. The code for our experiments is publicly available under https://github.com/millinma/LEAFFrequencyAnalysis.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。