Name: THUNLP-MT
Type: Organization
Bio: Machine Translation Group, Natural Language Processing Lab at Tsinghua University (THUNLP). Please refer to https://github.com/thunlp for more NLP resources.
Location: Tsinghua University, Beijing, China
Blog: https://thumtblog.github.io/
THUNLP-MT's Projects
A Bilingual Lexicon Inducer From Non-Parallel Data
Continual Knowledge Distillation for Neural Machine Translation
Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".
Codebase for ACL 2023 conference long paper Bridging the Gap between Decision and Logits in Decision-based Knowledge Distillation for Pre-trained Language Models.
A Dataset for Direct Quotation Extraction and Attribution in News Articles.
Improving the Transformer translation model with document-level context
This repo contains the codes for our paper "End-to-End Full-Atom Antibody Design"
Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions (EMNLP 2023 Findings)
Learning to Copy for Automatic Post-Editing (EMNLP 2019)
Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021
This repo contains the codes for our paper Conditional Antibody Design as 3D Equivariant Graph Translation.
Official code repo for our work "Meta Ranking: Less Capable Language Models are Capable for Single Response Judgement".
A list machine translation datasets maintained by Tsinghua Natural Language Processing Group
A machine translation reading list maintained by Tsinghua Natural Language Processing Group
A list of machine translation open-source toolkits maintained by Tsinghua Natural Language Processing Group
Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks
Code for our work "MSP: Multi-Stage Prompting for Making Pre-trained Language Models Better Translators" in ACL 2022
Prior Knowledge Integration for Neural Machine Translation using Posterior Regularization
This is the repo for our work “An Extensible Plug-and-Play Method for Multi-Aspect Controllable Text Generation” (ACL 2023).
This repo contains the codes for our paper: Molecule Generation by Principal Subgraph Mining and Assembling.
This repo contains the codes for our work “Restricted orthogonal gradient projection for continual learning”.
Self-Supervised Quality Estimation for Machine Translation
Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)
Speak It Out: Solving Symbol-Related Problems with Symbol-to-Language Conversion for Language Models