# MetaVoice-1B - A 1.2B-Parameter Text-To-Speech Model

> Clean Markdown view of GeekNews topic #13291. Use the original source for factual precision when an external source URL is present.

## Metadata

- GeekNews HTML: [https://news.hada.io/topic?id=13291](https://news.hada.io/topic?id=13291)
- GeekNews Markdown: [https://news.hada.io/topic/13291.md](https://news.hada.io/topic/13291.md)
- Type: news
- Author: [xguru](https://news.hada.io/@xguru)
- Published: 2024-02-10T10:16:01+09:00
- Updated: 2024-02-10T10:16:01+09:00
- Original source: [github.com/metavoiceio](https://github.com/metavoiceio/metavoice-src)
- Points: 12
- Comments: 0

## Topic Body

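- A 1.2-billion-parameter TTS (text-to-speech) model trained on 100,000 hours of speech
- Emotional speech rhythm and tone (English)
- Supports voice cloning via fine-tuning (for Indian speakers, it succeeded with only about 1 minute of speech data)
- Zero-shot cloning of American and British voices from just 30 seconds of reference audio
- Supports long-form speech synthesis
- Released under the Apache 2.0 license, so it can be used without restriction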

## Comments


_No public comments on this page._