Giter Club home page Giter Club logo

nlp-projects-nhv's Introduction

NLP ADVANCED - NHV

NLP A-Z COURSE

Roadmap

Video Link

COURSE

Topic Video Link
NLP A-Z Course Part - 1 Video Link
NLP A-Z Course Part - 2 Video Link

VIDEOS IN ORDER

Serial Number Topic Code Link Video Link
1 Transformers From Scratch Code Link Video Link
2 BERT For Text Classification Code Link Video Link
3 BERT NER Code Link Video Link
4 T5 - All NLP Tasks Code Link Video Link
5 Llama2 Finetuning Code Link Video Link
6 LoRA Paper Explanation and Implementation Code Link Video Link
7 DPO Paper Explanation and Implementation Code Link Video Link
8 Mistral Architecture Explanation Code Link Video Link
9 Mistral Finetuning Code Link Video Link
10 Mistral DPO Finetuning Code Link Video Link
11 LLM Evaluation using Mistral Code Link Video Link
12 Mistral RAG Code Link Video Link
13 LLM Finetuning Crash Course Code Link Video Link
14 LLM For Information Extraction Code Link Video Link
15 Gemma Architecture Explained with Finetuning Code Link Video Link
16 My Best LLM using Model Merging Code Link Video Link
17 Mixture of Experts from Scratch Code Link Video Link
18 LoRA Merging Code Link Video Link
19 Deploy and Serve LLM using Ollama WebUI No Code Video Link
20 Whatsapp Chatbot using Twilio and Open Source LLMs Code Link Video Link
21 Edubot - Llama RAG Application Code Link Video Link
22 AI Girlfriend - Benefits of Prompting Code Link Video Link
23 YTBuddy - Chat with Videos Code Link Video Link
24 Cricbot - Chat with CSV Code Link Video Link
25 Codepal - Chat with Git Repo Code Link Video Link
26 Building your own Copilot in VSCode Realtime Code Link Video Link
27 Realtime Research Agent with Deployment Code Link Video Link
28 AI Database Administrator - Chat with Database Code Link Video Link
29 Building my own AI Startup Code Link Video Link
30 1 Bit LLM Pretraining - Era of 1 Bit LLM Code Link Video Link

nlp-projects-nhv's People

Contributors

vasanthengineer4949 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

nlp-projects-nhv's Issues

Device error DPO part2 notebook

When running the notebooks out of the box, I am getting an device error when trying to train the DPO.
Somehow something is on the cpu while it expects a CUDA device. I am running on the T4 in google colab... do you know what it could be and how to solve? Thanks!

Dataloader error while running DPO part 2 notebook.

Hi,

I keep getting the following dataloader error when I run the DPO code:
Traceback (most recent call last): File "/home/ml/users/---/research/learning_from_preferences/rlhf_starter_code/rlhf/dpo_falcon.py", line 114, in <module> dpo_trainer.train() File "/home/ml/users/---/anaconda3/envs/trl/lib/python3.9/site-packages/transformers/trainer.py", line 1885, in train return inner_training_loop( File "/home/ml/users/---/anaconda3/envs/trl/lib/python3.9/site-packages/transformers/trainer.py", line 2178, in _inner_training_loop for step, inputs in enumerate(epoch_iterator): File "/home/ml/users/----/anaconda3/envs/trl/lib/python3.9/site-packages/accelerate/data_loader.py", line 454, in __iter__ current_batch = next(dataloader_iter) File "/home/ml/users/----/anaconda3/envs/trl/lib/python3.9/site-packages/torch/utils/data/dataloader.py", line 631, in __next__ data = self._next_data() File "/home/ml/users/----/anaconda3/envs/trl/lib/python3.9/site-packages/torch/utils/data/dataloader.py", line 675, in _next_data data = self._dataset_fetcher.fetch(index) # may raise StopIteration File "/home/ml/users/----/anaconda3/envs/trl/lib/python3.9/site-packages/torch/utils/data/_utils/fetch.py", line 54, in fetch return self.collate_fn(data) File "/home/ml/users/----/anaconda3/envs/trl/lib/python3.9/site-packages/trl/trainer/utils.py", line 338, in __call__ to_pad = [torch.LongTensor(ex[k]) for ex in features] File "/home/ml/users/----/anaconda3/envs/trl/lib/python3.9/site-packages/trl/trainer/utils.py", line 338, in <listcomp> to_pad = [torch.LongTensor(ex[k]) for ex in features] TypeError: an integer is required (got type NoneType) 0%| | 0/50 [00:00<?, ?it/s]

Passing Dataset

Hi all,

I'm currently working on fine tuning Mistral using my own dataset. But I'm wondering of how to upload the data. I used dataset = load_dataset('json', data_files='/path to the dataset/.jsonl', split='train')

But this error is showing when I run AutoModelForCausalLM.from_pretrained:
ValueError: You need to pass dataset in order to quantize your model

It looks like it can't see it.

Will appreciate any help.

phi-1.5 finetuning

Hi,
First of all thanks for your contributions and sharing.
I'm having trouble to get inference stopped. Actually, after my fine-tuning, when I'm doing an inference. the model keep generating tokens until it reaches "max_tokens". What should I do (I'm newbie ๐Ÿ˜ƒ) to get rid of this and stop at the end of the answer.

Any insight ?

thanks for your feedback.

ImportError: libcudart.so.12: cannot open shared object file: No such file or directory

Finetuning Zephyr:

You passed quantization_config to from_pretrained but the model you're loading already has a quantization_config attribute and has already quantized weights. However, loading attributes (e.g. use_exllama, exllama_config, use_cuda_fp16, max_input_length) will be overwritten with the one you passed to from_pretrained. The rest will be ignored.
WARNING:auto_gptq.nn_modules.qlinear.qlinear_cuda:CUDA extension not installed.
WARNING:auto_gptq.nn_modules.qlinear.qlinear_cuda_old:CUDA extension not installed.
ERROR:auto_gptq.nn_modules.qlinear.qlinear_exllama:exllama_kernels not installed.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.