一种变分框架,用于提高生成语音语言模型的自然性

The success of large language models in text processing has inspired their adaptation to speech modeling. However, since speech is continuous and complex, it is often discretized for...

大型语言模型在文本处理中的成功促使其应用于语音建模,但现有语音标记主要关注语言特征,忽视韵律信息,导致生成语音自然性不足。为此,我们提出一种端到端的变分方法,自动学习连续语音属性,增强语义标记,避免手动特征提取。

一种变分框架,用于提高生成语音语言模型的自然性
原文英文,约200词,阅读约需1分钟。发表于:
阅读原文