BARK - 文本转音频模型
原文英文,约900词,阅读约需4分钟。发表于: 。Introduction to Bark Bark is a state-of-the-art text-to-audio model that is famous for its ability to generate highly realistic, multilingual speech, as well as other audio types including music,...
BarkBark是一个基于变换器架构的文本转音频模型,能够生成多语言的真实语音和非语言音效,如笑声和背景音乐。它支持自动语言识别,适用于多种应用场景。Suno提供预训练模型,促进研究与商业使用。