Giter Club home page Giter Club logo

ask-my-pdf's Introduction

Ask my PDF

Thank you for your interest in my application. Please be aware that this is only a Proof of Concept system and may contain bugs or unfinished features. If you like this app you can โค๏ธ follow me on Twitter for news and updates.

Ask my PDF - Question answering system built on top of GPT3

๐ŸŽฒ The primary use case for this app is to assist users in answering questions about board game rules based on the instruction manual. While the app can be used for other tasks, helping users with board game rules is particularly meaningful to me since I'm an avid fan of board games myself. Additionally, this use case is relatively harmless, even in cases where the model may experience hallucinations.

๐ŸŒ The app can be accessed on the Streamlit Community Cloud at https://ask-my-pdf.streamlit.app/. ๐Ÿ”‘ However, to use the app, you will need your own OpenAI's API key.

๐Ÿ“„ The app implements the following academic papers:

Installation

  1. Clone the repo:

    git clone https://github.com/mobarski/ask-my-pdf

  2. Install dependencies:

    pip install -r ask-my-pdf/requirements.txt

  3. Run the app:

    cd ask-my-pdf/src

    run.sh or run.bat

High-level documentation

RALM + HyDE

RALM + HyDE

RALM + HyDE + context

RALM + HyDE + context

Environment variables used for configuration

General configuration:
  • STORAGE_SALT - cryptograpic salt used when deriving user/folder name and encryption key from API key, hexadecimal notation, 2-16 characters

  • STORAGE_MODE - index storage mode: S3, LOCAL, DICT (default)

  • STATS_MODE - usage stats storage mode: REDIS, DICT (default)

  • FEEDBACK_MODE - user feedback storage mode: REDIS, NONE (default)

  • CACHE_MODE - embeddings cache mode: S3, DISK, NONE (default)

Local filesystem configuration (storage / cache):
  • STORAGE_PATH - directory path for index storage

  • CACHE_PATH - directory path for embeddings cache

S3 configuration (storage / cache):
  • S3_REGION - region code

  • S3_BUCKET - bucket name (storage)

  • S3_SECRET - secret key

  • S3_KEY - access key

  • S3_URL - URL

  • S3_PREFIX - object name prefix

  • S3_CACHE_BUCKET - bucket name (cache)

  • S3_CACHE_PREFIX - object name prefix (cache)

Redis configuration (for persistent usage statistics / user feedback):
  • REDIS_URL - Redis DB URL (redis[s]://:password@host:port/[db])
Community version related options:
  • OPENAI_KEY - API key used for the default user
  • COMMUNITY_DAILY_USD - default user's daily budget
  • COMMUNITY_USER - default user's code

ask-my-pdf's People

Contributors

carelfdewaal avatar krishddd avatar mobarski avatar qixiaobo avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.