# LLaVA: Visual Instruction Tuning

> Clean Markdown view of GeekNews topic #9017. Use the original source for factual precision when an external source URL is present.

## Metadata

- GeekNews HTML: [https://news.hada.io/topic?id=9017](https://news.hada.io/topic?id=9017)
- GeekNews Markdown: [https://news.hada.io/topic/9017.md](https://news.hada.io/topic/9017.md)
- Type: news
- Author: [xguru](https://news.hada.io/@xguru)
- Published: 2023-04-22T10:32:01+09:00
- Updated: 2023-04-22T10:32:01+09:00
- Original source: [llava-vl.github.io](https://llava-vl.github.io/)
- Points: 6
- Comments: 0

## Topic Body

- "LLaVA: Large Language and Vision Assistant"
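- A large multimodal model that combines a vision encoder with Vicuna for general-purpose visual and language understanding
- Aims for multimodal GPT-4-level capabilities and SOTA accuracy on science question answering
- Paper, code, and demo have been released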
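
As a rough illustration of what "combining a vision encoder with Vicuna" can look like, the toy sketch below maps image features into the language model's embedding space and prepends them to the text embeddings. It assumes a single linear projection as the connector, which this summary does not state. Every name here (`project`, `build_input_sequence`, the toy dimensions and values) is hypothetical and is not taken from the LLaVA codebase.

```python
from typing import List

Vector = List[float]


def project(features: List[Vector], weight: List[Vector]) -> List[Vector]:
    """Map each visual feature (dim d_v) into the language embedding dim d_t.

    `weight` has shape d_v x d_t. A linear connector is an assumption here.
    """
    d_t = len(weight[0])
    return [
        [sum(f[i] * weight[i][j] for i in range(len(f))) for j in range(d_t)]
        for f in features
    ]


def build_input_sequence(
    visual_features: List[Vector],
    weight: List[Vector],
    text_embeddings: List[Vector],
) -> List[Vector]:
    """Prepend projected visual tokens to the text token embeddings."""
    visual_tokens = project(visual_features, weight)
    return visual_tokens + text_embeddings


# Toy data: 2 image patches with d_v = 2, language embedding dim d_t = 3.
visual_features = [[1.0, 2.0], [0.0, 1.0]]
weight = [[1.0, 0.0, 1.0], [0.0, 1.0, 1.0]]
text_embeddings = [[0.5, 0.5, 0.5]]

sequence = build_input_sequence(visual_features, weight, text_embeddings)
print(len(sequence), "tokens passed to the language model")
```

In a real system the vision encoder and language model would be pretrained networks and the projection weights would be learned; the sketch only shows the shape of the data flow.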

## Comments


_No public comments on this page._