Course-ChatBot using Llama-2 7B model

This chatbot is meant to answer all your questions related to BT203, BT204, BT204, trained using all the textbooks of the respective courses.

Architecture

Extracted dats Books PDF
Divided into text Chunks (750 in Llama 2 and 10,000 in Gemini pro)
Using Embeedings
Vector storage (PINECONE for Llama 2 and Chroma for Gemini) then semantic and similarity search (Can use Cosine, Eucledian or any but in my opinion cosine should be used)
Final refined Results using Llama 2 7B model (Y'll can use any model of your choice of any number of parameters) (6. Deployment using Streamlit or Flask)

Update 25/12/2023

Implemented the same thing using Google-Gemini-Pro-API which you can find in Course-ChatBot-using-Gemini-Pro.ipynb file. But Before everything let me drop some notes

For embedding model in this case I implemented a google embeddings model embedding-001 unlike last time as we used a hugging face embedding model
For Vector Database I used Chroma this time but few things to note again, Chroma is efficient for many use cases, it might not match Pinecone’s performance in certain high-throughput real-time scenarios as Pinecone excels at similarity search. ChromaDB is an open-source database that you need to set up and manage yourself. This can be a significant hurdle for users who don't have experience with database administration or who require a plug-and-play solution. Pinecone, on the other hand, is a managed service that takes care of all the infrastructure and maintenance, making it much easier to get started. However, if you're comfortable with managing your own infrastructure, appreciate the flexibility of open-source software, and have budget constraints, ChromaDB could be a viable option.
I just implemented it in jupyter notebook instead of deploying it. But if you are cloning it then the architecture is same for both hence you can just and paste the same part of the code over the files created and edit it.

NOTE!!!

This model is Trained To run over Local CPU hence it might take some time (<2 min definetely) to get response.
Also as I mentioned you can use any model of any number of parameter but do keep in mind that Larger size means smarter, but slower.
I set the temperature value to 0.8. Now what's Temperature? Adjusts randomness of outputs, greater than 1 is random and 0 is deterministic to the point, 0.75 is a good starting value hence I used 0.8.
Also you can change the number of tokens, a word is generally 2-3 tokens.
In the Line 23 of app.py change the index_name based upon your index_name.
I used some randomly AI generated HTML CSS code So I dont take credit for this (Also dont judge me for this :)

Now after all these How to Run?

STEPS:

Clone the repository

Run This Over your terminal : git clone https://github.com/suvraadeep/Course-ChatBot.git

STEP 01- Create an environment after cloning and run the same over your terminal

Change the python version based upon the version over your PC

conda create -n chatbot python=3.10.9 -y

conda activate chatbot

STEP 02- install the Libraries

We will download all the Libs required altogether

pip install -r requirements.txt

Note: The book I used is stored in `Books` folder

Step 03- Create a `.env` file in the root directory and add all your credentials as follows:

We will use Pinecone database for this project but again you can use ChromaDB or any database of your choice You need to generate it yourself

PINECONE_API_KEY = "xxxxxxxxxxxxxxxxxxxxxxxxxxxxx"
PINECONE_API_ENV = "xxxxxxxxxxxxxxxxxxxxxxxxxxxxx"

Step 04- Download the quantized model from the link provided below.

## Download the Llama 2 Model:

The model I used is llama-2-7b-chat.ggmlv3.q8_0.bin
## From the following link:
https://huggingface.co/TheBloke/Llama-2-7B-Chat-GGML/tree/main

## For other parameter models you can use
## 13B Quantized model
https://huggingface.co/TheBloke/Llama-2-13B-chat-GGML

## You can also use up the model originally from mets site using
https://ai.meta.com/llama/

STEP 05- Run `helper.py`, `prompt.py`, `setup.py`, `store_index.py`

# Finally run the following file 
app.py

Step 06- Running it Locally

# Finally paste it over your browser
localhost:8069

cheers ;)

suvraadeep / course-chatbot-using-llama-2-7b-model-and-gemini-pro Goto Github PK