ONE-PEACE: 무제한 멀티 모달리티를 위한 일반 표현 모델

xguru · 2023-05-24T10:47:01+09:00

비젼, 오디오, 언어 모달리티를 모두 아우르는 General Represenation Model 사전학습된 모델 없이도 통합된 작업들에 훌륭한 결과를 냄 강력한 Emergent Zero-shot Retrieval로 훈련 데이터에서 페어링 되지 않은 모달리티를 얼라인 가능 Audio-to-Image, Audtio+Text-to-Image, Audio+Image-to-Image

(github.com/OFA-Sys)

11P by xguru 2023-05-24 | ★ favorite | 댓글 1개

비젼, 오디오, 언어 모달리티를 모두 아우르는 General Represenation Model
사전학습된 모델 없이도 통합된 작업들에 훌륭한 결과를 냄
강력한 Emergent Zero-shot Retrieval로 훈련 데이터에서 페어링 되지 않은 모달리티를 얼라인 가능
Audio-to-Image, Audtio+Text-to-Image, Audio+Image-to-Image

dbs0829 2023-05-24 [-]

보니 많은 태스트에서 sota를 갈아치웠네요

답변달기

ONE-PEACE: 무제한 멀티 모달리티를 위한 일반 표현 모델

함께 보면 좋은 글 β

댓글과 토론