# ColossalChat - An Open-Source Solution with an RLHF Pipeline for Cloning ChatGPT

> Clean Markdown view of GeekNews topic #8845. Use the original source for factual precision when an external source URL is present.

## Metadata

- GeekNews HTML: [https://news.hada.io/topic?id=8845](https://news.hada.io/topic?id=8845)
- GeekNews Markdown: [https://news.hada.io/topic/8845.md](https://news.hada.io/topic/8845.md)
- Type: news
- Author: [xguru](https://news.hada.io/@xguru)
- Published: 2023-03-31T11:32:02+09:00
- Updated: 2023-03-31T11:32:02+09:00
- Original source: [medium.com/@yangyou_berkeley](https://medium.com/@yangyou_berkeley/colossalchat-an-open-source-solution-for-cloning-chatgpt-with-a-complete-rlhf-pipeline-5edf08fb538b)
- Points: 10
- Comments: 0

## Topic Body

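- Based on the LLaMA model
  - Supervised data collection
  - Supervised fine-tuning
  - Reward model training
  - Reinforcement learning fine-tuning
- What is included
  - An interactive demo that runs online
  - Open-source, complete RLHF training code covering the 7B/13B models
  - A 104k bilingual dataset in Chinese and English
  - 4-bit quantization of the 7B model, which needs only 4GB of GPU memory
  - Model weights included, so results can be reproduced easily on a single server
  - Larger models, datasets, optimizations, and more are planned to be added over time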
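
The 4GB figure for the 4-bit quantized 7B model can be sanity-checked with a rough estimate. The sketch below is only back-of-the-envelope arithmetic, not ColossalChat code: the helper `weight_memory_gb` is a hypothetical name, the parameter count is taken as a nominal 7 billion, and only the weights are counted (activations, the KV cache, and quantization metadata are ignored).

```python
def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Approximate memory for model weights alone, in decimal gigabytes.

    Hypothetical helper for illustration; not part of ColossalChat.
    """
    bytes_total = num_params * bits_per_param / 8
    return bytes_total / 1e9


# Assumption: a nominal 7e9 parameters; the real checkpoint differs slightly.
PARAMS_7B = 7e9

for bits in (16, 8, 4):
    print(f"{bits:>2}-bit: {weight_memory_gb(PARAMS_7B, bits):.1f} GB")
```

Under these assumptions the weights take about 14 GB at 16-bit and about 3.5 GB at 4-bit, which is consistent with the summary's claim that 4GB of GPU memory is enough, with a small margin left for everything else.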

## Comments


_No public comments on this page._