taprosoft / llm_finetuning Goto Github PK
View Code? Open in Web Editor NEWConvenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes)
License: Apache License 2.0
Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes)
License: Apache License 2.0
Could you provide a sample template for training a character chat model (like "USER: Hello\nASSISTANT: Hi") except back and forth?
It'd be easy to modify the existing template samples for a single USER input followed by one ASSISTANT output but how can the sharegpt:chat conversation format be implemented (like axolotl supports) where there is an array of chat?
{"conversations": [
{"from": "human", "value": "hi"},
{"from": "gpt", "value": "hi"},
{"from": "human", "value": "how are you"},
{"from": "gpt", "value": "good"}
]}
petf@git+https://github.com/huggingface/peft.git
Should be
peft@git+https://github.com/huggingface/peft.git
requirements.txt has git+https://github.com/huggingface/peft.git
cuda_quant_requirements.txt has peft==0.4.0
when I try and install both of these, I see the following
Collecting git+https://github.com/huggingface/peft.git (from -r requirements.txt (line 1))
Cloning https://github.com/huggingface/peft.git to /tmp/pip-req-build-n5akfjww
Running command git clone --filter=blob:none --quiet https://github.com/huggingface/peft.git /tmp/pip-req-build-n5akfjww
Resolved https://github.com/huggingface/peft.git to commit ad8f7cb59ee7ca4b9ca1c9048711038ac36b31b8
Installing build dependencies ... done
Getting requirements to build wheel ... done
Preparing metadata (pyproject.toml) ... done
ERROR: Cannot install peft 0.11.2.dev0 (from git+https://github.com/huggingface/peft.git) and peft==0.4.0 because these package versions have conflicting dependencies.
The conflict is caused by:
The user requested peft 0.11.2.dev0 (from git+https://github.com/huggingface/peft.git)
The user requested peft==0.4.0
To fix this you could try to:
1. loosen the range of package versions you've specified
2. remove package versions to allow pip attempt to solve the dependency conflict
ERROR: ResolutionImpossible: for help visit https://pip.pypa.io/en/latest/topics/dependency-resolution/#dealing-with-dependency-conflicts
Iโm trying to finetune my gptq pretrained model via your framework, but I found that gpt+lora seems to make the model worse.
During the initialization phase of training, I met the warning:
The safetensors archive passed at gptq_model-4bit-128g.safetensors does not contain metadata. Make sure to save your model with the save_pretrained
method. Defaulting to 'pt' metadata.
hope you can give me some advice, thanks!
I formatted my data as shareGPT but it keeps throwing up this error.
Failed to read file '/workspace/llm_finetuning/datasets/dataset.json' with error <class 'pyarrow.lib.ArrowInvalid'>: Could not convert 'USER' with type str: tried to convert to int64
Traceback (most recent call last):
File "/usr/local/lib/python3.10/dist-packages/datasets/builder.py", line 1879, in _prepare_split_single
for _, table in generator:
File "/usr/local/lib/python3.10/dist-packages/datasets/packaged_modules/json/json.py", line 156, in _generate_tables
raise ValueError(f"Not able to read records in the JSON file at {file}.") from None
ValueError: Not able to read records in the JSON file at /workspace/llm_finetuning/datasets/dataset.json.
Is it possible to do raw-text training (not instruction style - just feeding full text of literature)?
If so could you suggest a template to do this?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.