This repository contains a collection of papers on LLM unlearning.

- An introduction to unlearning: Ken Liu's blog
- Rethinking Machine Unlearning for Large Language Models: paper
- Who's Harry Potter? Approximate Unlearning in LLMs: paper
- Knowledge Sanitization of Large Language Models: paper
- TOFU: A Task of Fictitious Unlearning for LLMs: paper
- Can We Edit the Factual Knowledge in LLMs?: paper
- In-Context Unlearning: Language Models as Few-Shot Unlearners: paper
- Gradient Ascent vs. Gradient Descent: blog
- Knowledge Unlearning for Mitigating Privacy Risks in Language Models: paper
- Large Language Model Unlearning: paper
- Detecting and Editing Privacy Neurons in Pre-trained Language Models: paper
- Can Sensitive Information Be Deleted from LLMs? Objectives for Defending Against Extraction Attacks: paper
- Privacy Adhering Machine Un-learning in NLP: paper
- Unlearning Bias in Language Models by Partitioning Gradients: paper
- Controllable Text Generation with Reinforced Unlearning: paper
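For context on the "Gradient Ascent vs. Gradient Descent" entry above: ascending the loss on a forget set is a common baseline in several of the listed unlearning papers. A minimal toy sketch in pure Python (a hypothetical one-parameter linear model and hand-written gradients, not taken from any listed paper) shows the contrast: descent fits an example, ascent pushes the loss on that example back up.

```python
# Toy contrast between gradient descent (learning) and gradient
# ascent (a simple unlearning baseline), on a 1-parameter model.
# The model, data, and learning rate are illustrative assumptions.

def loss(w, x, y):
    # Squared error of the toy linear model w * x.
    return (w * x - y) ** 2

def grad(w, x, y):
    # d(loss)/dw for the squared error above.
    return 2 * x * (w * x - y)

w = 0.0
lr = 0.05

# Gradient descent: fit the example (x=1, y=2), driving its loss down.
for _ in range(50):
    w -= lr * grad(w, 1.0, 2.0)
fitted_loss = loss(w, 1.0, 2.0)

# Gradient ascent on the same example, now treated as "forget" data:
# the sign of the update flips, so its loss climbs back up.
for _ in range(50):
    w += lr * grad(w, 1.0, 2.0)
unlearned_loss = loss(w, 1.0, 2.0)
```

After descent `fitted_loss` is near zero; after ascent `unlearned_loss` is much larger, which is the basic mechanism the blog entry compares. Real unlearning methods add safeguards (e.g. a retain-set term) so that ascent does not destroy overall model utility.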