# wav2vec-U : Supervision 필요 없는 고성능 음성 인식

> Clean Markdown view of GeekNews topic #4319. Use the original source for factual precision when an external source URL is present.

## Metadata

- GeekNews HTML: [https://news.hada.io/topic?id=4319](https://news.hada.io/topic?id=4319)
- GeekNews Markdown: [https://news.hada.io/topic/4319.md](https://news.hada.io/topic/4319.md)
- Type: news
- Author: [xguru](https://news.hada.io/@xguru)
- Published: 2021-05-24T09:20:05+09:00
- Updated: 2021-05-24T09:20:05+09:00
- Original source: [ai.facebook.com](https://ai.facebook.com/blog/wav2vec-unsupervised-speech-recognition-without-supervision/)
- Points: 4
- Comments: 0

## Topic Body

- 페이스북 AI팀이 만든 음성인식 프레임워크

- 전사(transcribed) 음성 데이터 없이 다양한 언어 인식을 지원

ㅤ→ 1000시간 정도 분량의 음성으로 훈련된 지도학습 모델과 비슷한 성능

ㅤ→ 전사 음성 데이터가 많지 않은 스와힐리어/타타르 언어등으로 테스트

- 레이블링 되지 않은 오디오의 구조를 학습하는 방식

ㅤ→ 음성 녹음을 각각의 사운드에 느슨하게 대응하는 음성 단위로 분할

ㅤ→ cat 은 “/K/”, “/AE/” “/T/“ 세개의 소리가 포함

ㅤ→ generator 와 discriminator 로 구성된 GAN 으로 훈련

- 코드와 논문 공개

## Comments


_No public comments on this page._