Giter Club home page Giter Club logo

Hi there 👋

I am Pengyu Cheng, a researcher in NLP and ML. Here are some facts about me:

  • I am currently at Tencent AI Lab, primarily working on LLM training, AI agents, and dialogue systems.
  • I have been experienced in research and projects about controllable generation, interpretability, and fairness of NLP.
  • I am also interested in probabilistic and information-theoretic machine learning methods.
  • I received my Ph.D. degree from Duke University in 2021, advised by Dr. Lawrence Carin.
  • I graduated from the Department of Mathematical Sciences at Tsinghua University in 2017, advised by Dr. Jiwen Lu.

Pengyu Cheng's Projects

apo icon apo

Code for ACL2024 paper - Adversarial Preference Optimization (APO).

awesome-llm-reasoning icon awesome-llm-reasoning

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.

awesome-llm-rl icon awesome-llm-rl

A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.

awesome-llm-robotics icon awesome-llm-robotics

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

awesome-rlhf icon awesome-rlhf

A curated list of reinforcement learning with human feedback resources (continually updated)

binarysentemb icon binarysentemb

Code for ACL 2019 oral paper - Learning Compressed Sentence Representations for On-Device Text Processing.

club icon club

Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information

detgp icon detgp

Code for the AAAI 2020 oral paper - Dynamic Embedding on Textual Networks via a Gaussian Process.

dsp icon dsp

Domain-specific preference (DSP) data and customized RM fine-tuning.

emacs-init icon emacs-init

My emacs init file for python coding in deep learning

linear95.github.io icon linear95.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

megatron-lm icon megatron-lm

Ongoing research training transformer models at scale

rlm icon rlm

Code for the paper - Replacing Language Model for Style Transfer

spag icon spag

Self-playing Adversarial Language Game Enhances LLM Reasoning

tc-estimation icon tc-estimation

Code for AISTATS 2023 paper - Estimating Total Correlation with Mutual Information Estimators

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.