Giter Club home page Giter Club logo

eleutherai-gpt-neo-125m-news_generator's Introduction

News Generation with GPT-Neo 125M

This project demonstrates the fine-tuning and deployment of the EleutherAI GPT-Neo 125M model for generating news articles. The dataset used for training is the "All the News" dataset from Kaggle, covering the period from 2015 to 2020. The project includes a Flask web application that allows users to generate fake news without needing to train the model themselves.

Overview

  • Model: EleutherAI GPT-Neo 125M
  • Dataset: All the News dataset from Kaggle
  • Framework: Hugging Face Transformers
  • Web Framework: Flask
  • Purpose: Generate fake news articles from 2015 to 2020
  • Computational Requirements: Not very intensive, suitable for an introduction to Hugging Face

Features

  1. Model Fine-Tuning: Detailed methodology and code for fine-tuning the GPT-Neo 125M model.
  2. Web Application: A simple Flask web app for generating news articles without training.
  3. Dataset: Focused on news articles from 2015 to 2020.

Methodology

The methodology for fine-tuning the GPT-Neo 125M model on the "All the News" dataset is documented in detail in the included Jupyter notebooks and Python scripts. This includes:

  1. Data Preprocessing: Cleaning and preparing the dataset for training.
  2. Model Training: Fine-tuning the GPT-Neo 125M model using Hugging Face Transformers.
  3. Evaluation: Assessing the model's performance and generating sample outputs.

Getting Started

Prerequisites

  • Python 3.7+
  • Pip
  • Virtual environment (recommended)

Installation

  1. Install Requirements

    pip install -r requirements.txt
    

eleutherai-gpt-neo-125m-news_generator's People

Contributors

daniyalahm avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.