PDFBot 📚

PDFBot is a personal learning project that uses the OpenAI GPT (Generative Pre-trained Transformer) API to answer questions about uploaded PDF documents. You upload a PDF, ask a question, and PDFBot answers from the document's content, which makes it useful for research, document analysis, and similar tasks.

How it Works

PDFBot is built using Python and leverages several libraries and technologies to achieve its functionality:

  • Streamlit: The user interface is created using Streamlit, which provides a simple and interactive way to upload PDF files and ask questions.

  • PyPDF2: PyPDF2 is used to extract text from the uploaded PDF files, making their content accessible for analysis.

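  • LangChain: LangChain handles the text processing behind question answering. It splits the extracted text into chunks, creates embeddings for them, and stores the embeddings in a vector store so that relevant chunks can be looked up for each question.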

  • OpenAI GPT API: The OpenAI GPT API is the heart of PDFBot, powering the question-answering capabilities. It processes the user's questions and generates meaningful responses based on the content of the uploaded PDF.
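The split, embed, and retrieve flow described above can be illustrated with a small standard-library sketch. This is not PDFBot's actual code: the function names (`split_text`, `embed`, `top_chunks`) and the parameters are assumptions made for illustration, and a simple word-count vector stands in for the real embeddings and vector store that LangChain and the OpenAI API would provide.

```python
import math
import re
from collections import Counter


def split_text(text, chunk_size=200, overlap=50):
    """Split text into overlapping character windows (hypothetical helper)."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]


def embed(text):
    """Toy 'embedding': lowercase word counts. A stand-in for a real model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def top_chunks(question, chunks, k=2):
    """Return the k chunks most similar to the question."""
    q = embed(question)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]


if __name__ == "__main__":
    chunks = [
        "The invoice total is 420 dollars.",
        "Shipping takes five business days.",
        "Contact support by email.",
    ]
    print(top_chunks("What is the invoice total?", chunks, k=1))
```

In a real pipeline the overlap between chunks helps keep sentences that straddle a chunk boundary retrievable, and only the top-ranked chunks are sent to the model instead of the whole PDF.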

Usage

To use PDFBot:

  1. Upload a PDF File: Click the "Upload a PDF file" button to select and upload a PDF document.

  2. Ask Questions: Once the PDF is uploaded, you can enter your questions in the "Ask pdfBOT 🧠" input field.

  3. Get Answers: PDFBot will process your question and provide you with relevant answers based on the content of the PDF.
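One plausible way the "Get Answers" step could ground the model in the PDF is to place the retrieved text in the prompt ahead of the question. The sketch below is an assumption, not PDFBot's implementation: `build_prompt`, its `max_chars` budget, and the prompt wording are all hypothetical.

```python
def build_prompt(question, chunks, max_chars=3000):
    """Assemble a context-plus-question prompt (hypothetical helper).

    Chunks are added in order until the character budget would be exceeded.
    """
    context_parts = []
    used = 0
    for chunk in chunks:
        if used + len(chunk) > max_chars:
            break  # stop before exceeding the context budget
        context_parts.append(chunk.strip())
        used += len(chunk)
    context = "\n---\n".join(context_parts)
    return (
        "Answer the question using only the context below. "
        "If the answer is not in the context, say so.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question.strip()}\nAnswer:"
    )


if __name__ == "__main__":
    print(build_prompt("Who wrote chapter 1?", ["Chapter 1 by Ada.", "Chapter 2 by Bob."]))
```

The resulting string would then be sent to the language model. Telling the model to rely only on the supplied context is a common way to reduce answers that are not supported by the document.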

Getting Started

To run PDFBot locally or make changes to the code, follow these steps:

  1. Clone the Repository:
    git clone https://github.com/yourusername/PDFBot.git
    

Contributing

PDFBot is an open-source project, and contributions are welcome! If you have ideas for improvements, bug fixes, or new features, please create a pull request or submit an issue on the GitHub repository.

License

This project is licensed under the MIT License.
