- Text To Speech
- MARS5 TTS
MARS5 TTS - Open-source, insanely prosodic text-to-speech model
Introdução
MARS5 an opensource TTS model to replicate performances (from 2-3s of audio reference) in 140+ languages, even for extremely tough prosodic scenarios like sports commentary, movies, anime & more. Join our Discord https://discord.gg/4GVdQ28cZC today!
MARS5 TTS's Visão geral
MARS5 is a novel English speech model (TTS) developed by CAMB.AI. It follows a two-stage AR-NAR pipeline with a unique NAR component. With just 5 seconds of audio and a snippet of text, MARS5 can generate speech for diverse scenarios like sports commentary and anime. The model can be steered with punctuation and capitalization to guide the prosody of the generated output. Speaker identity can be specified using an audio reference file. MARS5 supports both shallow and deep cloning, with the latter requiring the prompt transcript. The model can be easily loaded using `torch.hub` and inference can be performed by providing the reference audio and transcript. The default settings provide good results, but the inference settings can be tuned for specific use cases. The checkpoints for MARS5, along with the necessary hardware requirements, are provided on the GitHub repo. Contributions to improve the model are welcome.
MARS5 TTS's Características
Two-stage AR-NAR pipeline
Guided prosody using punctuation and capitalization
Speaker identity specification
Shallow and deep cloning
Easy model loading using `torch.hub`
Inference using reference audio and transcript
Open-source with alternative licensing options
MARS5 TTS's PERGUNTAS E RESPOSTAS
MARS5 TTS's Preços
MARS5 is open-source and available under GNU AGPL 3.0 license. For alternative licensing options, please contact [email protected]
MARS5 TTS's Analítica
Descrição geral do sítio Web
Principais indicadores de desempenho para github.com
Taxa de rejeição
38.34%
Páginas / Visita
6.50
Total de visitas
437,914,238
Tempo no local
7m 18s
Classificação global
#78
Classificação do país
#111
Regiões de topo
Distribuição do tráfego por país
- 1.United States15.94%
- 2.China15.11%
- 3.India9.28%
- 4.Japan3.94%
Total de visitantes
Estatísticas mensais de visitantes dos últimos 3 meses
Fontes de tráfego
Distribuição das fontes de tráfego