Yuqiang Xie's Projects
A simple seq2seq model.
Data and software for building the ACL Anthology.
An open-source NLP research library, built on PyTorch.
Analysis of the Story Cloze Test (SCT) task and its training set (ROCStories): history, developments, and future work.
ASER (activities, states, events, and their relations), a large-scale eventuality knowledge graph extracted from more than 11 billion tokens of unstructured text.
A curated list of NLP resources focused on BERT, attention mechanism, Transformer networks, and transfer learning.
A curated list of awesome embedding model tutorials, projects, and communities.
A curated list of resources dedicated to Natural Language Processing (NLP).
This repo contains a PyTorch implementation of a pretrained BERT model for multi-label text classification.
Code for the paper "Fine-tune BERT for Extractive Summarization"
Tool for visualizing attention in BERT and OpenAI GPT-2
Commonsense Ability Tests
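The ChID dataset for the paper "ChID: A Large-scale Chinese IDiom Dataset for Cloze Test"
The most comprehensive database of classical Chinese poetry: nearly 14,000 poets from the Tang and Song dynasties, close to 55,000 Tang poems plus 260,000 Song poems, and 21,050 ci lyrics by 1,564 ci poets of the Song period.
A database based on the Xinhua Dictionary of Chinese, covering xiehouyu (two-part allegorical sayings), idioms (chengyu), words, and Chinese characters.
Collects, organizes, and publishes corpora and datasets for Chinese natural language processing, to advance Chinese NLP together with like-minded contributors.
Code and data for CLSEG, ICASSP 2022. https://ieeexplore.ieee.org/document/9747435/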
The Third Evaluation Workshop on Chinese Machine Reading Comprehension (CMRC 2019)
Code for ACL 2019 Paper: "COMET: Commonsense Transformers for Automatic Knowledge Graph Construction"
Code for the COLING 2022 paper "COMMA: Modeling Relationship among Motivations, Emotions and Actions in Language-based Human Activities". https://aclanthology.org/2022.coling-1.15.pdf
Code for EMNLP 2018 paper "Commonsense for Generative Multi-Hop Question Answering Tasks"
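Code and data for the EMNLP 2019 paper "Counterfactual Story Reasoning and Generation". https://arxiv.org/abs/1909.04076
"Dive into Deep Learning" (《动手学深度学习》); the English edition is the textbook for Berkeley's deep learning course (STAT 157, Spring 2019). Written for Chinese readers, runnable, and open to discussion.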
The Natural Language Decathlon: A Multitask Challenge for NLP
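"Deep Learning 500 Questions" (深度学习500问): uses a question-and-answer format to explain frequently asked topics in probability, linear algebra, machine learning, deep learning, computer vision, and related areas, to help the author and any readers who need it. The book has 18 chapters and nearly 300,000 characters. The author's expertise is limited, so readers are invited to point out any shortcomings. To be continued. For collaboration, contact [email protected]. All rights reserved; infringement will be pursued. Tan, 2018.06
A Python package built to ease deep learning on graphs, on top of existing DL frameworks.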
A paper list about diffusion models for natural language processing.