Automated Analysis of Relative Fundamental Frequency in Continuous Speech: Development and Comparison of Three Processing Pipelines

连续语音中相对基频的自动分析:三种处理流程的开发与比较

阅读:2

Abstract

OBJECTIVES: Relative fundamental frequency (RFF) estimates laryngeal tension during speech, providing insights into vocal effort. Current methods to derive RFF from continuous speech require manual processing, hindering large-scale studies with ecologically valid speech productions. This research aimed to develop and evaluate three fully automated pipelines for RFF analysis from continuous speech, addressing this limitation. METHODS: Three pipelines were compared: two modifications of an existing semiautomated approach [automated relative fundamental frequency (aRFF)-AP] and one novel pipeline replicating manual analysis. The pipelines were tested on speech samples containing vowel-consonant-vowel (VCV) utterances from 82 female participants with and without vocal fatigue complaints in the absence of phonotraumatic vocal fold changes. The pipelines automatically segmented VCVs and measured RFF. Manual measurements of a subset provided reliability and validity benchmarks. RESULTS: All pipelines demonstrated good reliability (r ≥ 0.84) and validity when compared with manual analysis. They required minimal manual correction (<4%) for fricative identification. Notably, the novel aRFF-B pipeline rejected the fewest samples (10%-25%) while maintaining reliability and was able to leverage parallel computing. CONCLUSIONS: Three automated pipelines, especially aRFF-B, enabled time-efficient RFF analysis of large continuous speech data sets without manual intervention. This advancement can facilitate large-scale studies using RFF applied to continuous speech, potentially expanding its application in voice research and clinical practice.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。