Giter Club home page Giter Club logo

huanhuan-chat's Introduction

My Visitor Count Visitor.

About Me

  • 🌱 I'm a graduate student, studying in Henan Polytechnic University (HPU), and leaving soon.

  • 💬 My research direction is global discrete grid and method of generation Voronoi diagram

  • ⭐ Datawhale members, InternLM community IOPMC SIG RAG Manager

github contribution grid snake animation

Interest 👨🏽‍💻

  • NLP : Large Language Model, Prompt Engineering and more things about NLP.

  • Language : Python, Pytorch and Markdown.

  • Theory : Voronoi diagram algorithm.

Open Source Experience 👯

As a manager

  • self-llm《开源大模型食用指南》 更适合中国宝宝的大模型部署指南,Github 4.3k star,并多次登顶 Github Trending!

  • huanhuan-chat : A chat robot based on ChatGLM2 likes Zhenhuan。

  • AMchat : AM (Advanced Mathematics) chat 是一个集成了数学知识和高等数学习题及其解答的大语言模型。

  • d2l-ai-solutions-manual : The answers to the exercises after class Dive into Deep Learning. 动手学深度学习 习题解答项目~

  • tiny-universe : 《大模型白盒子构建指南》致力于从零手搓大模型相关任务,如:RAG、Agent、Eval等等。

As a member :

Competition

GitHub Stats 📫

huanhuan-chat's People

Contributors

bald0wang avatar coderdeepstudy avatar gkdgkd avatar kmno4-zx avatar logan-zou avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

huanhuan-chat's Issues

多轮对话数据

你好,我有多轮对话的数据集,请问该怎么处理呀,谢谢

无法使用peftmodel

实在不好意思,我没有发现讨论区,只能占用您issue区的资源了,请见谅!
我尝试在本地部署huanhuanchat,我成功调用了chatglm,但是无法使用微调后的模型,我的代码如下:

from peft import PeftModel
from transformers import AutoTokenizer, AutoModel

model_path = "THUDM/chatglm2-6b"
model = AutoModel.from_pretrained(model_path, trust_remote_code=True).half().cuda()
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
# 到这里都没有问题


#   给你的模型加上嬛嬛LoRA! 
model = PeftModel.from_pretrained(model, "output/sft").half()
model.eval()

报错信息如下:

HTTPError: 401 Client Error: Unauthorized for url: https://huggingface.co/output/sft/resolve/main/adapter_config.json

The above exception was the direct cause of the following exception:

RepositoryNotFoundError                   Traceback (most recent call last)
    176 try:
--> 177     config_file = hf_hub_download(
    178         model_id,
    179         CONFIG_NAME,
    180         **hf_hub_download_kwargs,
    181     )
    182 except Exception:

File [~/anaconda3/envs/deeplearning/lib/python3.8/site-packages/huggingface_hub/utils/_validators.py:120](https://vscode-remote+ssh-002dremote-002b7b22686f73744e616d65223a225461726765744d616368696e65227d.vscode-resource.vscode-cdn.net/home/mguo/dl/llm/~/anaconda3/envs/deeplearning/lib/python3.8/site-packages/huggingface_hub/utils/_validators.py:120), in validate_hf_hub_args.._inner_fn(*args, **kwargs)
    118     kwargs = smoothly_deprecate_use_auth_token(fn_name=fn.__name__, has_token=has_token, kwargs=kwargs)
--> 120 return fn(*args, **kwargs)

File [~/anaconda3/envs/deeplearning/lib/python3.8/site-packages/huggingface_hub/file_download.py:1195](https://vscode-remote+ssh-002dremote-002b7b22686f73744e616d65223a225461726765744d616368696e65227d.vscode-resource.vscode-cdn.net/home/mguo/dl/llm/~/anaconda3/envs/deeplearning/lib/python3.8/site-packages/huggingface_hub/file_download.py:1195), in hf_hub_download(repo_id, filename, subfolder, repo_type, revision, library_name, library_version, cache_dir, local_dir, local_dir_use_symlinks, user_agent, force_download, force_filename, proxies, etag_timeout, resume_download, token, local_files_only, legacy_cache_layout)
...
--> 183         raise ValueError(f"Can't find '{CONFIG_NAME}' at '{model_id}'")
    185 loaded_attributes = cls.from_json_file(config_file)
    186 return loaded_attributes["peft_type"]

ValueError: Can't find 'adapter_config.json' at 'output/sft'

请问这个如何解决呢?

微调后没起作用

hi,我用默认的lora微调脚本微调2400step后,使用gui部署。向他提问“你是谁”,回答是清华大学的聊天机器人。而使用仓库预训练的lora模型则回答是嬛嬛。这看上去是训练没起到作用?

pip install -r requirement.txt 安装需要进行如下的修改

直接下载回来安装的 requirement.txt 会存在多个包依赖的报错;
修改以下几个包的 版本信息可以顺利安装:

diff requirements.txt requirements.txt.bak
59,61c59,61
< mkl-fft
< mkl-random
< mkl-service

mkl-fft==1.3.6
mkl-random==1.2.2
mkl-service==2.4.0
69c69
< numpy==1.24.4


numpy==1.25.2
128,129c128,129
< torchaudio
< torchvision


torchaudio==0.12.1
torchvision==0.13.1

关于huanhuan-chat微调报错的问题

因为刚入门大模型有很多东西不是很懂,我想咨询一下关于微调时候报错NotImplementedError: Loading a dataset cached in a LocalFileSystem is not supported.的问题。您有时间解答一下吗?谢谢!

训练参数没有改,错误如下:
从../dataset/train/lora/huanhuan.json加载数据集失败

NotImplementedError( "Loading a dataset cached in a LocalFileSystem is not supported. ")

搭建一个小程序/网站

哈喽!huanhuan-chat创始人你好!我github用的不多所以不太懂怎么私信,但你感兴趣把这个包装出一个全栈的项目嘛?目前来说让这个模型在本地跑起来还是挺需要对计算机的了解的,而大部分看甄嬛传的估计也不懂。围绕他做一个网站或者小程序可以让他有更大的传播度。如果你感兴趣的话欢迎你联系我!

几个问题

hi,很棒的开源项目,我最近对此也比较感兴趣,有几个问题想请教一下:

  1. 我看训练数据中对于“你是谁”和“你是”两个问题进行了重采样,是强行让模型学到这部分知识吗?我在想可以通过提示词的方式去控制人设吗?
  2. 微调后人物语气生成的场景受问题影响大吗,比如某些问题生成会差一些之类的吗?
  3. sft之后会有灾难性遗忘吗?

多卡微调时报错

使用多卡微调时,提示数据不在同一张显卡上,报错:
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:7 and cuda:0! (when checking argument for argument target in method wrapper_CUDA_nll_loss_forward)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.