VoiceCraft

VoiceCraft
★4
软件描述
VoiceCraft is a token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including audiobooks, internet videos, and podcasts.
官方网站
访问软件的官方网站了解更多信息
jasonppy.github.io
安全链接HTTPS
什么是 VoiceCraft?
VoiceCraft is a token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including audiobooks, internet videos, and podcasts. To clone an unseen voice or edit a recording, VoiceCraft needs only a few seconds of the voice.