MARS5 TTS

GitHub - Camb-ai/MARS5-TTS: MARS5 speech model (TTS) from CAMB.AI

Introduction

MARS5: A novel speech model for incredible prosody. It generates speech from just 5 seconds of audio and a snippet of text, even for challenging scenarios such as sports commentary and anime. Try our demo!


Added on:

14 Jun 2024

Monthly visitors:

437.9M (for github.com as a whole)

Affiliate program:

No

MARS5 TTS Overview

MARS5 is a novel English speech model (TTS) developed by CAMB.AI. It follows a two-stage AR-NAR pipeline with a unique NAR component. With just 5 seconds of audio and a snippet of text, MARS5 can generate speech for diverse scenarios like sports commentary and anime. The model can be steered with punctuation and capitalization to guide the prosody of the generated output. Speaker identity can be specified using an audio reference file. MARS5 supports both shallow and deep cloning, with the latter requiring the prompt transcript. The model can be easily loaded using `torch.hub` and inference can be performed by providing the reference audio and transcript. The default settings provide good results, but the inference settings can be tuned for specific use cases. The checkpoints for MARS5, along with the necessary hardware requirements, are provided on the GitHub repo. Contributions to improve the model are welcome.
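The loading and inference flow described above can be sketched roughly as follows. This is a minimal sketch, not the project's reference code: the hub repo id `Camb-ai/mars5-tts`, the entry point `mars5_english`, the `deep_clone` config field, and the `tts(...)` call shape are assumptions based on the project README and should be checked against the GitHub repo. `CloneRequest` and `synthesize` are hypothetical helper names introduced here for illustration only.

```python
from dataclasses import dataclass
from typing import Optional

# Assumed hub identifiers; verify against the MARS5-TTS GitHub repo.
HUB_REPO = "Camb-ai/mars5-tts"
HUB_ENTRY = "mars5_english"


@dataclass
class CloneRequest:
    """Hypothetical container for one synthesis request."""
    text: str                                   # text to speak; punctuation and capitalization steer prosody
    reference_wav_path: str                     # short reference clip that sets the speaker identity
    reference_transcript: Optional[str] = None  # transcript of the reference clip, if known

    @property
    def deep_clone(self) -> bool:
        # Deep cloning requires the prompt transcript; without one, use shallow cloning.
        return bool(self.reference_transcript and self.reference_transcript.strip())


def synthesize(request: CloneRequest):
    """Load MARS5 through torch.hub and run inference (downloads checkpoints on first use)."""
    # Heavy dependencies are imported lazily so the request logic above works without them.
    import librosa
    import torch

    # Assumption: the hub entry returns the model plus an inference-config class.
    model, config_class = torch.hub.load(HUB_REPO, HUB_ENTRY, trust_repo=True)

    # Resample the reference clip to the model's sample rate (assumed exposed as `model.sr`).
    wav, _ = librosa.load(request.reference_wav_path, sr=model.sr, mono=True)
    wav = torch.from_numpy(wav)

    # Other inference settings keep their defaults; only the cloning mode is set here.
    cfg = config_class(deep_clone=request.deep_clone)
    _, audio = model.tts(request.text, wav, request.reference_transcript or "", cfg=cfg)
    return audio, model.sr
```

For example, `CloneRequest("GOAL! What a finish!", "ref.wav", "transcript of the clip")` selects deep cloning, while omitting the transcript falls back to shallow cloning. Keeping that decision in a small property makes the requirement that deep cloning needs the prompt transcript explicit before any model is loaded.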


MARS5 TTS Features

  • Two-stage AR-NAR pipeline

  • Guided prosody using punctuation and capitalization

  • Speaker identity specification

  • Shallow and deep cloning

  • Easy model loading using `torch.hub`

  • Inference using reference audio and transcript

  • Open-source with alternative licensing options



MARS5 TTS Pricing

MARS5 is open source and available under the GNU AGPL 3.0 license. For alternative licensing options, please contact [email protected]

MARS5 TTS Analytics

Website summary

Key performance indicators for github.com

  • Bounce rate: 38.34%

  • Pages per visit: 6.50

  • Total visits: 437,914,238

  • Time on site: 7m 18s

  • Global rank: #78

  • Country rank: #111

Top regions

Traffic distribution by country

  • 1. United States: 15.94%
  • 2. China: 15.11%
  • 3. India: 9.28%
  • 4. Japan: 3.94%

Total visitors

Monthly visit statistics for the last 3 months

Trending down by 5.3% this month
April - June 2024

Traffic sources

Traffic source distribution

Social: 6.7%
Paid referrals: 0.0%
Mail: 0.9%
Referrals: 11.0%
Search: 30.1%
Direct: 51.3%
Dominant source: Direct (51.3% of total traffic)

MARS5 TTS Alternatives