ChatFish, an open-source chatbot trained by fine-tuning Bloom on open-source conversation datasets.

License: Apache License 2.0

ChatFish-Chatbot

ChatFish is an open-source Chinese chatbot trained by fine-tuning Bloom on a blend of conversation datasets. This repo provides a web UI for ChatFish, built with Gradio.

Quick Start

git clone https://github.com/LZYSaltedFish/ChatFish-Chatbot.git
cd ChatFish-Chatbot
pip install -r requirements.txt

cd inference
sh chat.sh

Training Detail

Trained with DeepSpeed-Chat on 8 V100 GPUs (16 GB each). All parameters are fine-tuned with ZeRO stage 2; LoRA is not used.
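
As a rough illustration of what a ZeRO stage 2 setup can look like, the sketch below builds a DeepSpeed-style config dictionary. Only the ZeRO stage and the GPU count come from this README. The fp16 setting, the gradient accumulation value, and the reading of batch_size as a per-GPU micro batch are assumptions, and this is not the configuration file actually used for ChatFish.

```python
import json

# Hypothetical DeepSpeed config sketch; not the repository's actual settings.
ds_config = {
    "train_micro_batch_size_per_gpu": 1,  # assumption: batch_size=1 is per GPU
    "gradient_accumulation_steps": 1,     # assumption: no accumulation
    "zero_optimization": {"stage": 2},    # from the README: ZeRO stage 2
    "fp16": {"enabled": True},            # assumption: V100 has no native bf16
}

# Effective global batch size on 8 GPUs under the assumptions above.
world_size = 8
global_batch = (
    ds_config["train_micro_batch_size_per_gpu"]
    * ds_config["gradient_accumulation_steps"]
    * world_size
)

print(json.dumps(ds_config, indent=2))
print("global batch size:", global_batch)
```

ZeRO stage 2 partitions optimizer states and gradients across the GPUs, which is what makes full fine-tuning of a 1.1B-parameter model plausible on 16 GB cards without LoRA.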

Model

Bloom-1b1: base model of ChatFish
chatfish-1b1-sft: finetuned chatbot model

Data

The training data is extracted from the following open-source datasets.

Dataset	Size	Avg turns	Used
Guanaco	200K	2.7	66K
Vicuna-ShareGPT	6K	5.9	3.5K
GPT4-LLM	49K	1	33K
MOSS-002-SFT	590K	2.9	211K
InstructWild	51K	1	45K

Instances are filtered with simple rules so that:

The response is no shorter than 5 tokens.
The total length of query and response is no shorter than 128 tokens.
Each query has one and only one response.
The data is Chinese.
Multi-turn conversations are split into multiple instances, each with the history context at the beginning of the query.
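
The rules above could be sketched roughly as follows. This is a minimal illustration, not the repository's preprocessing code: `count_tokens`, `is_chinese`, `split_turns`, and `keep` are hypothetical helpers, token counts are approximated by counting non-whitespace characters instead of using the Bloom tokenizer, the Chinese check is a simple CJK-ratio heuristic, and the newline-joined history format is an assumption.

```python
import re

MIN_RESPONSE_TOKENS = 5   # from the README
MIN_TOTAL_TOKENS = 128    # from the README
CJK = re.compile(r"[\u4e00-\u9fff]")

def count_tokens(text):
    """Stand-in for a real tokenizer: counts non-whitespace characters."""
    return len("".join(text.split()))

def is_chinese(text, min_ratio=0.3):
    """Heuristic (assumption): enough of the characters are CJK ideographs."""
    chars = "".join(text.split())
    if not chars:
        return False
    return len(CJK.findall(chars)) / len(chars) >= min_ratio

def split_turns(turns):
    """Turn [(query, response), ...] into one instance per turn,
    with all earlier turns prepended to the query as history."""
    instances = []
    history = ""
    for query, response in turns:
        instances.append({"query": history + query, "response": response})
        history += query + "\n" + response + "\n"  # assumed history format
    return instances

def keep(instance):
    """Apply the length and language rules to a single instance."""
    q, r = instance["query"], instance["response"]
    return (
        count_tokens(r) >= MIN_RESPONSE_TOKENS
        and count_tokens(q) + count_tokens(r) >= MIN_TOTAL_TOKENS
        and is_chinese(q + r)
    )

# Usage: a two-turn conversation becomes two instances; only the long one survives.
conversation = [
    ("你好", "你好！很高兴见到你。"),
    ("介绍一下你自己", "我是一个聊天机器人。" * 20),
]
instances = split_turns(conversation)
kept = [x for x in instances if keep(x)]
print(len(instances), len(kept))
```

Representing each turn as a (query, response) pair enforces the one-response-per-query rule by construction.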

Hyperparams

name	value
batch_size	1
max_seq_len	1024
lr	9.65e-6
epoch	15
lr_scheduler	cosine
warm_up	1000
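
To make the lr, lr_scheduler, and warm_up rows concrete, here is a small sketch of a cosine schedule with linear warmup. It assumes warm_up counts optimizer steps, that warmup is linear from zero, and that the rate decays to zero at the end of training. `lr_at` and `total_steps` are hypothetical names; the real schedule depends on the scheduler implementation used during training.

```python
import math

def lr_at(step, total_steps, base_lr=9.65e-6, warmup_steps=1000):
    """Learning rate at a given step: linear warmup, then cosine decay to zero.

    base_lr and warmup_steps default to the table values; total_steps is
    a hypothetical parameter that depends on dataset size and epochs.
    """
    if step < warmup_steps:
        return base_lr * step / warmup_steps
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return 0.5 * base_lr * (1.0 + math.cos(math.pi * progress))

# Usage with an illustrative total of 10,000 steps.
for s in (0, 500, 1000, 5500, 10000):
    print(s, lr_at(s, total_steps=10000))
```

The peak rate of 9.65e-6 is reached at the end of warmup, after which the rate falls along half a cosine period.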

Cases

Writing

Common Sense

Chatting

Role Playing

Limitations

Lacks mathematical and complex reasoning ability.
Lacks truthfulness and is prone to hallucinations.
Lacks harmlessness.
Chinese only.
Lacks coding ability; generated code usually contains errors.
