gpt-online-footprint's Introduction

🔎 GPT Online Footprint

OSINT analysts are sometimes required to assess a client's online exposure to determine whether it poses any risk to their privacy or reputation. The widespread use of LLMs has introduced a new dimension for analysts to explore, as these models fuel automated content generation and contribute to the proliferation of low-quality journalism. GPT Online Footprint uses LLMs to gather and interpret open-source information about a target, helping analysts assess the privacy and reputational risks posed by AI.

Inspired by AutoGPT and Research-GPT

Why GPT Online Footprint?

  • With the rising use of LLMs for content generation among social media users, journalists, and trolls, it becomes important for public figures to understand what data has been used to train these models.
  • Being aware of how LLMs interpret new information about an individual can help that individual prepare for future aggression and safeguard their interests.
  • Awareness of LLM biases can help inform defensive PR/Comms strategies.

Architecture

The main idea is to run "planner" and "execution" agents: the planner generates Google queries designed to surface problematic articles, and the execution agents process the retrieved information. Finally, the planner filters and aggregates all related information and creates a report. The agents currently use gpt-3.5-turbo-16k, with an option to include gpt-4 in the future.

More specifically:

  • Generate a set of search queries that aim to identify problematic content about the target individual. Some LLM biases will be revealed at this stage.
  • For each query, trigger a crawler agent that scrapes online resources for information relevant to the given task.
  • For each scraped resource, summarize the relevant information and keep track of its source.
  • Finally, filter and aggregate all summarized sources and generate a final report.
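
The four steps above can be sketched as a small pipeline. This is only an illustration of the control flow, not the project's actual code: `run_pipeline`, `Summary`, `Report`, and the three `fake_*` stubs are hypothetical names, and the stubs stand in for the LLM planner, the crawler agent, and the LLM summarizer so the example runs offline.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class Summary:
    url: str   # source the summary came from
    text: str  # summarized content relevant to the target


@dataclass
class Report:
    target: str
    summaries: list[Summary] = field(default_factory=list)


def run_pipeline(
    target: str,
    plan_queries: Callable[[str], list[str]],
    crawl: Callable[[str], dict[str, str]],
    summarize: Callable[[str, str], str],
) -> Report:
    """Planner -> crawler -> summarizer -> filtered, aggregated report."""
    report = Report(target=target)
    seen: set[str] = set()
    for query in plan_queries(target):          # step 1: planner generates queries
        for url, page in crawl(query).items():  # step 2: crawler fetches pages
            if url in seen:                     # skip pages already processed
                continue
            seen.add(url)
            text = summarize(target, page)      # step 3: summarize, keep the source
            if text:                            # step 4: drop irrelevant pages
                report.summaries.append(Summary(url, text))
    return report


# Hypothetical stand-ins for the LLM and crawler calls (assumptions, not real APIs).
def fake_plan(target: str) -> list[str]:
    return [f"{target} controversy", f"{target} lawsuit"]


def fake_crawl(query: str) -> dict[str, str]:
    return {
        "https://example.com/shared": "Profile of Jane Doe",
        "https://example.com/unrelated": "Weather forecast",
        f"https://example.com/{query.replace(' ', '-')}": f"Article about {query}",
    }


def fake_summarize(target: str, page: str) -> str:
    # An empty string means "nothing relevant to the target on this page".
    return page if target in page else ""


if __name__ == "__main__":
    result = run_pipeline("Jane Doe", fake_plan, fake_crawl, fake_summarize)
    for item in result.summaries:
        print(item.url, "->", item.text)
```

Passing the planner, crawler, and summarizer in as callables keeps the orchestration testable without network or API access; in a real run each stub would be replaced by a call to the corresponding agent.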

Features

  • 🌐 Aggregates over 60 web sources per research task to form objective and factual conclusions
  • 🖥️ Includes an easy-to-use web interface (HTML/CSS/JS)
  • 🔍 Scrapes web sources with JavaScript support
  • 📂 Keeps track and context of visited and used web sources
  • 📄 Exports research reports to PDF and more...
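
As a rough illustration of how visited and used sources might be tracked, the sketch below keeps a set of normalized URLs already crawled and a mapping of the ones that ended up in a report. `SourceTracker` and `normalize` are hypothetical names chosen for this example and are not taken from the project.

```python
from dataclasses import dataclass, field
from urllib.parse import urlsplit, urlunsplit


def normalize(url: str) -> str:
    """Lower-case the host, drop the fragment and any trailing slash."""
    parts = urlsplit(url)
    return urlunsplit(
        (parts.scheme.lower(), parts.netloc.lower(), parts.path.rstrip("/"), parts.query, "")
    )


@dataclass
class SourceTracker:
    visited: set[str] = field(default_factory=set)       # every URL crawled so far
    used: dict[str, str] = field(default_factory=dict)   # URL -> note on how it was used

    def should_visit(self, url: str) -> bool:
        """Return True the first time a URL is seen, False for repeats."""
        key = normalize(url)
        if key in self.visited:
            return False
        self.visited.add(key)
        return True

    def mark_used(self, url: str, note: str) -> None:
        """Record that a source contributed to the report."""
        self.used[normalize(url)] = note
```

Normalizing before comparing means that `https://Example.com/a/#top` and `https://example.com/a` count as the same source, which avoids summarizing one page twice.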

Quickstart

Step 0 - Install Conda and create a Python 3.11 environment:

$ conda create -n py311 python=3.11

Step 1 - Activate the environment

$ conda activate py311

Step 2 - Clone the repo

$ git clone https://github.com/dfaram7/gpt-online-footprint.git
$ cd gpt-online-footprint

Step 3 - Install dependencies

$ pip install -r requirements.txt

Step 4 - Install WeasyPrint

$ conda install -c conda-forge weasyprint

Step 5 - Create a .env file containing your OpenAI API key, or simply export it

$ export OPENAI_API_KEY={Your API Key here}

Step 6 - Run the agent with FastAPI

$ uvicorn main:app --reload

Step 7 - Open http://localhost:8000 in any browser

gpt-online-footprint's People

Contributors

assafelovic, rotemweiss57, dependabot[bot], dfaram7, elishakay, gregdrizz, jortegac, eltociear, saunakghosh10, shon
