Describe Where You Are: Improving Noise-Robustness for Speech Emotion Recognition with Text Description of the Environment

描述你所在的位置：利用环境文本描述提高语音情感识别的抗噪能力

阅读：1

作者：Yi,Li,Wilson,John P,Mason,Tyler B,Habre,Rima,Wang,Shirlene,Dunton,Genevieve F,Leem,Seong-Gyun,Fulford,Daniel,Onnela,Jukka-Pekka,Gard,David,Busso,Carlos

期刊：		影响因子：
时间：	2026	起止号：	2026 Jan-Mar;17(1):656-69
doi：	10.1016/j.healthplace.2019.102226

Abstract

Speech emotion recognition (SER) systems often struggle in real-world environments, where ambient noise severely degrades their performance. This paper explores a novel approach that exploits prior knowledge of testing environments to maximize SER performance under noisy conditions. To address this task, we propose a text-guided, environment-aware training where an SER model is trained with contaminated speech samples and their paired noise description. We use a pre-trained text encoder to extract the text-based environment embedding and then fuse it to a transformer-based SER model during training and inference. We demonstrate the effectiveness of our approach through our experiment with the MSP-Podcast corpus and real-world additive noise samples collected from the Freesound and DEMAND repository. Our experiment indicates that the text-based environment descriptions processed by a large language model (LLM) produce representations that improve the noise-robustness of the SER system. With a contrastive learning (CL)-based representation, our proposed method can be improved by jointly fine-tuning the text encoder with the emotion recognition model. Under the −5dB signal-to-noise ratio (SNR) level, fine-tuning the text encoder improves our CL-based representation method by 76.4% (arousal), 100.0% (dominance), and 27.7% (valence).

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用；引用内容仅为补充信息，不代表本站立场。

2、若认为本页面引用内容涉及侵权，请及时与本站联系，我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容，需注明“来源：[生知库]”并获得授权；使用引用内容的，需自行联系原作者获得许可。

4、投稿及合作请联系：info@biocloudy.com。

肿瘤免疫

炎症

T细胞

凋亡

线粒体

转录调控

巨噬细胞

自噬

传染病

氧化应激

血管生成

磷酸化

肠道菌群

囊泡

3D/类器官

中性粒细胞

单细胞

外泌体

药物研究

DNA甲基化

细胞衰老

miRNA

铁死亡

缺氧低氧

乙酰化

组蛋白修饰

泛素化

炎性小体

代谢重编程

树突状细胞

焦亡

肿瘤微环境

lncRNA

m6A/m5C/m7G

空间多组学

细胞基因治疗

内质网应激

相分离

治疗耐药

Treg

免疫代谢

上皮间质转化

染色质重塑

脂质过氧化

蛋白质稳态

铁代谢

脂代谢

cGAS-STING

肠脑轴

细胞极性

碱基编辑

氨基酸代谢

乳酸化

蛋白降解

翻译调控

circRNA

低氧缺氧

piRNA

肿瘤异质性

NK 细胞

MDSC

氧化脂质

溶酶体功能

NETosis

RNA 编辑

细胞干性

琥珀酰化

CAR-NK

冷应激

器官芯片

Tfh

巴豆酰化

表观遗传记忆

空间代谢组

器官纤维化

线粒体未折叠蛋白反应

铜死亡

自噬流

程序性坏死

肠肝轴

MAIT 细胞

丙酰化