Name: Helaine
Type: User
Company: BIT(Beijing Institute of Technology)
Bio: A student in Beijing Institute of Technology.
Major in Data Mining, Nature Language Processing and Social Computing.
Location: Beijing, Haidian, Zhongguancun South Street, 5
Helaine's Projects
Bidirectional LSTM-CRF for Sequence Labeling. Easy-to-use and state-of-the-art performance.
第一次参加大数据比赛
2018年春季研一数据挖掘课程作业项目
Contents for data mining course
Key-phrase extraction for research publications using graph-representation of texts and centrality measures
刷题笔记~持续更新
百万英雄答题助手(汉王/百度OCR, 百度搜索/机器自动决策, Android / IOS手机均支持)
Dataset and code for the task of modeling the role and function of resource citation in scientific literatures.
pyltp: the python extension for LTP
A TensorFlow implementation of Recurrent Neural Networks for Sequence Classification and Sequence Labeling
Dataset and code for "A Context-based Framework for Modeling the Role and Function of On-line Resource Citations in Scientific Literature" (EMNLP 2019)
Named Entity Recognition (LSTM + CRF) - Tensorflow
Python wrapper for Stanford CoreNLP.
自然语言处理相关实验(基于sougou数据集),包含文本特征提取(TF-IDF),文本分类,文本聚类,word2vec训练词向量及同义词词林中文词语相似度计算、文档自动摘要,信息抽取,情感分析与观点挖掘等。
A tool to do entity set expansion on Twitter corpus. From a set of inital seeds the program can return more semantic similar entities.