MARS5 TTS

GitHub - Camb-ai/MARS5-TTS:来自CAMB.AI的MARS5语音模型 (TTS)

导言

MARS5是CAMB.AI推出的新型语音模型,支持多种场景的文本到语音转换,展现出极佳的韵律效果。


更新日期:

2024年6月14日

每月访客数:

SimilarWeb Icon
437.9M

联盟计划:

No

MARS5 TTS's 概述

MARS5 is a novel English speech model (TTS) developed by CAMB.AI. It follows a two-stage AR-NAR pipeline with a unique NAR component. With just 5 seconds of audio and a snippet of text, MARS5 can generate speech for diverse scenarios like sports commentary and anime. The model can be steered with punctuation and capitalization to guide the prosody of the generated output. Speaker identity can be specified using an audio reference file. MARS5 supports both shallow and deep cloning, with the latter requiring the prompt transcript. The model can be easily loaded using `torch.hub` and inference can be performed by providing the reference audio and transcript. The default settings provide good results, but the inference settings can be tuned for specific use cases. The checkpoints for MARS5, along with the necessary hardware requirements, are provided on the GitHub repo. Contributions to improve the model are welcome.


MARS5 TTS's 特点

  • Two-stage AR-NAR pipeline

  • Guided prosody using punctuation and capitalization

  • Speaker identity specification

  • Shallow and deep cloning

  • Easy model loading using `torch.hub`

  • Inference using reference audio and transcript

  • Open-source with alternative licensing options


MARS5 TTS's 问答


MARS5 TTS's 定价

MARS5 is open-source and available under GNU AGPL 3.0 license. For alternative licensing options, please contact [email protected]

MARS5 TTS's 分析

网站概述

关键性能指标 github.com

跳出率

38.34%

页面/访问

6.50

总访问量

437,914,238

现场时间

7m 18s

全球排名

#78

国家排名

#111

顶级地区

按国家分列的交通流量分布情况

  • 1.
    United States15.94%
  • 2.
    China15.11%
  • 3.
    India9.28%
  • 4.
    Japan3.94%

游客总数

过去 3 个月的每月访客统计

趋势向下 by 5.3% 本月
April - June 2024

流量来源

流量来源分布

Social:
6.7%
Paid Referrals:
0.0%
Mail:
0.9%
Referrals:
11.0%
Search:
30.1%
Direct:
51.3%
主要来源: Direct
51.3% 占总流量的百分比

MARS5 TTS's 替代品