Chinese-Annotator

Join the chat at https://gitter.im/Chinese-Annotator/Lobby

Annotator for Chinese Text Corpus

Many NLP tasks require large amounts of labelled data. Most existing annotation tools target English. We want to develop a Chinese annotator based on existing open-source technologies.

Everyone is welcome to join the discussion.

Getting Started on OSX

  • Install python3.6
  • Install virtualenv
  • Install mongodb
  • Create a python virtual environment and activate it
    virtualenv --python $path_of_python_3.6 $target_virtual_env_path
    source $target_virtual_env_path/bin/activate
  • Install the dependencies with pip (configuring the Douban or Aliyun mirror is recommended)
    cd $repository_directory
    pip install -r requirement.txt
  • Install this project's python package with pip
    cd $repository_directory
    pip install -e .
  • Start mongodb
    mongod
  • Import the sample data
    cd $repository_directory
    bash scripts/init_db.sh
  • Start the python api service
    cd $repository_directory
    bash scripts/run_webui.sh
  • Install nodejs (the lts version is recommended)
  • Install yarn
    npm install -g yarn
  • Install the npm dependencies
    cd $repository_directory/web
    yarn
  • Start the frontend development server
    yarn start
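
Before importing the sample data, it can help to confirm that mongod is actually accepting connections. The following is a minimal sketch, not part of this repository: `is_port_open` is a hypothetical helper, and it assumes mongod runs on localhost with MongoDB's default port 27017.

```python
import socket


def is_port_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to (host, port) succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable hosts.
        return False


if __name__ == "__main__":
    # Assumption: mongod was started without --port, so it listens on 27017.
    if is_port_open("127.0.0.1", 27017):
        print("mongod is reachable")
    else:
        print("mongod is not reachable; start it before importing the sample data")
```

If mongod was started on a different host or port, pass those values instead.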

Project Structure

.
├── config                  # System config files
├── docs                    # Documentation
├── tests                   # Test cases
│   └── data                # Raw data for tests
├── chi_annotator           # Main backend project folder
│   ├── algo_factory        # Algorithm Factory module containing general algorithms
│   │   ├── preprocess      # Preprocessing code
│   │   ├── online          # Online algorithms for active learning (SVM for now)
│   │   └── offline         # Offline algorithms for higher accuracy (DL models)
│   ├── task_center         # Task Center module (main entry point and logic control)
│   ├── webui               # WebUI module
│   │   ├── apis
│   │   └── static
│   ├── data                # Database module
│   └── user_instance       # User Instance module holding config files for specific tasks
│       └── examples        # User Instance examples
│           ├── classify    # Text classification
│           ├── ner         # Named entity recognition
│           ├── pos_tagger  # POS tagger
│           └── re          # Relation extraction
├── web                     # Main frontend project folder
└── ...

Vision: A Chinese Text Annotation Tool

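Most natural language processing tasks are supervised learning problems. Sequence labelling problems such as Chinese word segmentation and named entity recognition, and classification problems such as relation extraction, sentiment analysis and intent analysis, all need labelled data to train models. Now that deep learning dominates, deep-learning-based NLP models are even more data-hungry.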

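Cutting-edge NLP techniques usually target English corpora first. The English NLP ecosystem is healthy: many large-scale corpora are publicly available for a range of interesting problems, such as Stanford's SQuAD reading comprehension corpus. Open Chinese corpora are far scarcer, and many strong models and frontier techniques from English NLP are hard to transfer because of that scarcity. In addition, vertical domains such as medicine, finance, law and public security have many domain-specific terms and requirements, so a general model, for example one trained on a Wikipedia dump, is hard to use directly.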

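Traditional manual annotation is often tedious and inefficient. You label "联想" (Lenovo) as a company name, then "联想集团" (Lenovo Group) shows up, and after labelling that, "联想集团有限公司" (Lenovo Group Limited) appears. Cases like this fill the annotation process with repetitive work. There is also no easy-to-use annotation UI: annotators often have to edit the raw data directly in software such as WordPad, following a predefined format, so format errors are common.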

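Can we build a Chinese text annotation tool with the following two properties?

  1. Intelligent algorithms behind the annotation process keep repetitive manual work to a minimum;

  2. The annotation interface is obviously friendly, making annotation as simple and intuitive as possible.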

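The answer is yes. In fact, many annotation tools already do this; the most advanced is Prodigy from Explosion.ai. However, Explosion.ai, which developed the well-known open-source NLP package spaCy, chose to keep Prodigy closed source, and spaCy support for Chinese is still nowhere in sight. We hope to build an open-source Chinese text annotation tool, and much of the technical inspiration in this document comes from the Prodigy documentation.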

Intelligent Annotation Algorithms Based on Active Learning

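Workflow:

  1. The user annotates one label.

  2. The active-learning backend is split into an online part and an offline part. The online part updates the model immediately and can use traditional methods that are as fast as possible, such as SVM or bag of words; the offline part updates the model once enough labelled data has accumulated and can use more accurate deep learning models.

  3. After the model is updated, it predicts on as many examples as possible, ranks them by confidence, and takes the example with the lowest confidence as the next one to annotate. The process then repeats from step 1.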
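
Step 3 is commonly known as least-confidence sampling. The sketch below illustrates the ranking under simplifying assumptions: the class probabilities are hard-coded stand-ins for real model output, and the function names are illustrative rather than taken from this project.

```python
from typing import Dict, List, Sequence


def rank_by_confidence(probabilities: Dict[str, Sequence[float]]) -> List[str]:
    """Sort example ids from least to most confident.

    Confidence is taken to be the probability of the top predicted class.
    """
    return sorted(probabilities, key=lambda ex_id: max(probabilities[ex_id]))


def least_confident(probabilities: Dict[str, Sequence[float]]) -> str:
    """Return the id of the example the model is least sure about."""
    return rank_by_confidence(probabilities)[0]


# Hypothetical predictions: example id -> class probabilities.
predictions = {
    "doc1": [0.95, 0.05],
    "doc2": [0.55, 0.45],
    "doc3": [0.80, 0.20],
}

print(rank_by_confidence(predictions))  # ['doc2', 'doc3', 'doc1']
print(least_confident(predictions))     # doc2
```

Here "doc2" sits closest to the decision boundary, so it would be shown to the user next.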

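One can imagine that, if the model is trained well, this process skips the examples with the highest confidence and puts all the focus on the low-confidence examples near the classification boundary. This reduces the user's manual workload as far as the algorithm allows.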

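The online and offline models cooperate with each other and iterate together with the user's manual annotation. Once the annotation task is finished, the offline model can be retrained on all the labelled data to reach the best model performance.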
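
One possible way to coordinate the two models is sketched below. It is only an illustration: `LabelBuffer`, its callbacks and the `offline_every` threshold are hypothetical names, and the two "models" are stand-in functions that merely count how often they are called.

```python
from typing import Callable, Dict, List, Tuple

Example = Tuple[str, str]  # (text, label)


class LabelBuffer:
    """Collect labels, update the online model on every label, and
    retrain the offline model once every `offline_every` labels."""

    def __init__(
        self,
        online_update: Callable[[str, str], None],
        offline_retrain: Callable[[List[Example]], None],
        offline_every: int = 100,
    ) -> None:
        self.labelled: List[Example] = []
        self.online_update = online_update
        self.offline_retrain = offline_retrain
        self.offline_every = offline_every

    def add(self, text: str, label: str) -> None:
        self.labelled.append((text, label))
        self.online_update(text, label)  # fast, incremental update
        if len(self.labelled) % self.offline_every == 0:
            self.offline_retrain(list(self.labelled))  # slower, more accurate

    def finish(self) -> None:
        """Final offline retraining on all labelled data."""
        self.offline_retrain(list(self.labelled))


# Stand-in models that only count calls.
calls: Dict[str, int] = {"online": 0, "offline": 0}


def fake_online(text: str, label: str) -> None:
    calls["online"] += 1


def fake_offline(data: List[Example]) -> None:
    calls["offline"] += 1


buffer = LabelBuffer(fake_online, fake_offline, offline_every=3)
for i in range(7):
    buffer.add(f"text {i}", "pos")
buffer.finish()
print(calls)  # {'online': 7, 'offline': 3}
```

With seven labels and a threshold of three, the offline model is retrained after the third and sixth labels and once more when the task finishes.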

An Obviously Friendly Annotation Frontend

The annotation interface should be as intuitive as possible so that users can focus entirely on the current annotation task.

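Prodigy offers a very good demo in which each annotation step asks the user to resolve just one case. Taking text classification as an example, for a classification proposed by the algorithm the user only needs to click "Correct" to provide a positive sample, "Wrong" to provide a negative sample, "Skip" to filter out irrelevant items, or "Redo" to withdraw the previous action. These four buttons let the user annotate in the simplest possible way.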
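
The four buttons map naturally onto an answer history with an undo operation. The sketch below assumes that design; `AnnotationSession` and the action names are hypothetical and do not describe Prodigy's or this project's actual API.

```python
from typing import List, Optional, Tuple


class AnnotationSession:
    """Record one answer per example and allow the latest one to be withdrawn."""

    ACTIONS = ("correct", "wrong", "skip")

    def __init__(self) -> None:
        self.history: List[Tuple[str, str]] = []  # (example_id, action)

    def answer(self, example_id: str, action: str) -> None:
        if action not in self.ACTIONS:
            raise ValueError(f"unknown action: {action}")
        self.history.append((example_id, action))

    def redo(self) -> Optional[str]:
        """Withdraw the latest answer and return its example id,
        so the example can be shown to the user again."""
        if not self.history:
            return None
        example_id, _ = self.history.pop()
        return example_id

    def positives(self) -> List[str]:
        return [ex for ex, action in self.history if action == "correct"]

    def negatives(self) -> List[str]:
        return [ex for ex, action in self.history if action == "wrong"]


session = AnnotationSession()
session.answer("doc1", "correct")
session.answer("doc2", "wrong")
session.answer("doc3", "skip")
again = session.redo()             # the user changes their mind about doc3
session.answer(again, "correct")
print(session.positives())  # ['doc1', 'doc3']
print(session.negatives())  # ['doc2']
```

Skipped examples stay in the history but contribute neither positive nor negative samples.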

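In real use, there should also be a way for users to add their own annotations, for example highlighting a word and choosing "company", or linking two entities and choosing the relation between them.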
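
User-created annotations of this kind could be stored as character-offset spans plus relations between spans. The following is a sketch under that assumption; the `Span` and `Relation` classes, the label names and the example sentence are all illustrative.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Span:
    start: int  # inclusive character offset
    end: int    # exclusive character offset
    label: str


@dataclass(frozen=True)
class Relation:
    head: Span
    tail: Span
    label: str


def span_text(text: str, span: Span) -> str:
    """Return the substring covered by a span."""
    return text[span.start:span.end]


# Hypothetical sentence: "Lenovo Group Limited is headquartered in Beijing."
text = "联想集团有限公司的总部位于北京。"
company = Span(0, 8, "company")    # the user highlights the company name
city = Span(13, 15, "location")    # the user highlights the city
relation = Relation(company, city, "headquartered_in")

print(span_text(text, company))  # 联想集团有限公司
print(span_text(text, city))     # 北京
```

Character offsets avoid any dependence on a particular word segmenter, which matters for Chinese text where word boundaries are not marked.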

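The above is what I personally see as the main highlights of an intelligent Chinese text annotation tool. The algorithms still have many details to think through, such as the cooperation between online machine learning and offline deep learning algorithms, feature extraction and model building for Chinese NLP, the introduction of regex rules, and the training and use of word embeddings. The system itself also has to consider backend storage (SQLite?) and data import/export, frontend framework selection and development, frontend-backend interaction (django? flask? RestAPI?), and so on. Prodigy's simple architecture diagram is a useful reference here.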

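We want to focus on Chinese text annotation. Initially we plan to implement annotation tools for three Chinese NLP tasks: Chinese named entity recognition, Chinese relation extraction, and Chinese text classification. If more tasks such as Chinese visual question answering or Chinese image captioning come up in the future, we can look into adding image annotation.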

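We hope this tool will be developed through open-source collaboration in the Chinese community and contribute a little to the open-source ecosystem of Chinese NLP.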

chinese-annotator's People

Contributors

burkun, crownpku, dependabot[bot], habout632, jialei123, minimonkey, minminzhong, ppn029012, sjy, xxg1413, zqhzy

chinese-annotator's Issues

[feature wanted] Annotation quality control through multi-annotator cross-validation

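Each task can optionally be cross-validated by multiple annotators.
If different workers' annotations disagree, an owner needs to make the final decision.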

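In addition, once enough data has been used for training, the algorithm's confidence could also be brought in to help judge and question annotation quality.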
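
The disagreement check requested here could look roughly like the sketch below. It assumes one label per worker per example; `resolve` is a hypothetical helper that proposes the majority label and flags any disagreement for the owner.

```python
from collections import Counter
from typing import Dict, Tuple


def resolve(labels_by_worker: Dict[str, str]) -> Tuple[str, bool]:
    """Return (majority_label, needs_owner_review).

    Any disagreement between workers is flagged for the task owner;
    the majority label is only a suggestion in that case.
    """
    if not labels_by_worker:
        raise ValueError("no labels to resolve")
    counts = Counter(labels_by_worker.values())
    majority_label, _ = counts.most_common(1)[0]
    needs_owner_review = len(counts) > 1
    return majority_label, needs_owner_review


print(resolve({"w1": "company", "w2": "company"}))
# ('company', False)
print(resolve({"w1": "company", "w2": "company", "w3": "person"}))
# ('company', True)
```

A model confidence score could later be added as one more signal alongside the worker votes.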

Get error after command "bash scripts/run_webui.sh"

    copying sklearn/__check_build/setup.py -> build/lib.linux-x86_64-3.7/sklearn/__check_build
    copying sklearn/__check_build/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/__check_build
    creating build/lib.linux-x86_64-3.7/sklearn/_build_utils
    copying sklearn/_build_utils/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/_build_utils
    creating build/lib.linux-x86_64-3.7/sklearn/covariance
    copying sklearn/covariance/empirical_covariance_.py -> build/lib.linux-x86_64-3.7/sklearn/covariance
    copying sklearn/covariance/robust_covariance.py -> build/lib.linux-x86_64-3.7/sklearn/covariance
    copying sklearn/covariance/graph_lasso_.py -> build/lib.linux-x86_64-3.7/sklearn/covariance
    copying sklearn/covariance/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/covariance
    copying sklearn/covariance/shrunk_covariance_.py -> build/lib.linux-x86_64-3.7/sklearn/covariance
    copying sklearn/covariance/outlier_detection.py -> build/lib.linux-x86_64-3.7/sklearn/covariance
    creating build/lib.linux-x86_64-3.7/sklearn/covariance/tests
    copying sklearn/covariance/tests/test_graph_lasso.py -> build/lib.linux-x86_64-3.7/sklearn/covariance/tests
    copying sklearn/covariance/tests/test_covariance.py -> build/lib.linux-x86_64-3.7/sklearn/covariance/tests
    copying sklearn/covariance/tests/test_robust_covariance.py -> build/lib.linux-x86_64-3.7/sklearn/covariance/tests
    copying sklearn/covariance/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/covariance/tests
    creating build/lib.linux-x86_64-3.7/sklearn/cross_decomposition
    copying sklearn/cross_decomposition/cca_.py -> build/lib.linux-x86_64-3.7/sklearn/cross_decomposition
    copying sklearn/cross_decomposition/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/cross_decomposition
    copying sklearn/cross_decomposition/pls_.py -> build/lib.linux-x86_64-3.7/sklearn/cross_decomposition
    creating build/lib.linux-x86_64-3.7/sklearn/cross_decomposition/tests
    copying sklearn/cross_decomposition/tests/test_pls.py -> build/lib.linux-x86_64-3.7/sklearn/cross_decomposition/tests
    copying sklearn/cross_decomposition/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/cross_decomposition/tests
    creating build/lib.linux-x86_64-3.7/sklearn/feature_selection
    copying sklearn/feature_selection/variance_threshold.py -> build/lib.linux-x86_64-3.7/sklearn/feature_selection
    copying sklearn/feature_selection/from_model.py -> build/lib.linux-x86_64-3.7/sklearn/feature_selection
    copying sklearn/feature_selection/rfe.py -> build/lib.linux-x86_64-3.7/sklearn/feature_selection
    copying sklearn/feature_selection/univariate_selection.py -> build/lib.linux-x86_64-3.7/sklearn/feature_selection
    copying sklearn/feature_selection/mutual_info_.py -> build/lib.linux-x86_64-3.7/sklearn/feature_selection
    copying sklearn/feature_selection/base.py -> build/lib.linux-x86_64-3.7/sklearn/feature_selection
    copying sklearn/feature_selection/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/feature_selection
    creating build/lib.linux-x86_64-3.7/sklearn/feature_selection/tests
    copying sklearn/feature_selection/tests/test_feature_select.py -> build/lib.linux-x86_64-3.7/sklearn/feature_selection/tests
    copying sklearn/feature_selection/tests/test_mutual_info.py -> build/lib.linux-x86_64-3.7/sklearn/feature_selection/tests
    copying sklearn/feature_selection/tests/test_rfe.py -> build/lib.linux-x86_64-3.7/sklearn/feature_selection/tests
    copying sklearn/feature_selection/tests/test_chi2.py -> build/lib.linux-x86_64-3.7/sklearn/feature_selection/tests
    copying sklearn/feature_selection/tests/test_variance_threshold.py -> build/lib.linux-x86_64-3.7/sklearn/feature_selection/tests
    copying sklearn/feature_selection/tests/test_base.py -> build/lib.linux-x86_64-3.7/sklearn/feature_selection/tests
    copying sklearn/feature_selection/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/feature_selection/tests
    copying sklearn/feature_selection/tests/test_from_model.py -> build/lib.linux-x86_64-3.7/sklearn/feature_selection/tests
    creating build/lib.linux-x86_64-3.7/sklearn/gaussian_process
    copying sklearn/gaussian_process/kernels.py -> build/lib.linux-x86_64-3.7/sklearn/gaussian_process
    copying sklearn/gaussian_process/correlation_models.py -> build/lib.linux-x86_64-3.7/sklearn/gaussian_process
    copying sklearn/gaussian_process/gaussian_process.py -> build/lib.linux-x86_64-3.7/sklearn/gaussian_process
    copying sklearn/gaussian_process/gpc.py -> build/lib.linux-x86_64-3.7/sklearn/gaussian_process
    copying sklearn/gaussian_process/regression_models.py -> build/lib.linux-x86_64-3.7/sklearn/gaussian_process
    copying sklearn/gaussian_process/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/gaussian_process
    copying sklearn/gaussian_process/gpr.py -> build/lib.linux-x86_64-3.7/sklearn/gaussian_process
    creating build/lib.linux-x86_64-3.7/sklearn/gaussian_process/tests
    copying sklearn/gaussian_process/tests/test_kernels.py -> build/lib.linux-x86_64-3.7/sklearn/gaussian_process/tests
    copying sklearn/gaussian_process/tests/test_gaussian_process.py -> build/lib.linux-x86_64-3.7/sklearn/gaussian_process/tests
    copying sklearn/gaussian_process/tests/test_gpc.py -> build/lib.linux-x86_64-3.7/sklearn/gaussian_process/tests
    copying sklearn/gaussian_process/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/gaussian_process/tests
    copying sklearn/gaussian_process/tests/test_gpr.py -> build/lib.linux-x86_64-3.7/sklearn/gaussian_process/tests
    creating build/lib.linux-x86_64-3.7/sklearn/mixture
    copying sklearn/mixture/gaussian_mixture.py -> build/lib.linux-x86_64-3.7/sklearn/mixture
    copying sklearn/mixture/gmm.py -> build/lib.linux-x86_64-3.7/sklearn/mixture
    copying sklearn/mixture/bayesian_mixture.py -> build/lib.linux-x86_64-3.7/sklearn/mixture
    copying sklearn/mixture/base.py -> build/lib.linux-x86_64-3.7/sklearn/mixture
    copying sklearn/mixture/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/mixture
    copying sklearn/mixture/dpgmm.py -> build/lib.linux-x86_64-3.7/sklearn/mixture
    creating build/lib.linux-x86_64-3.7/sklearn/mixture/tests
    copying sklearn/mixture/tests/test_gmm.py -> build/lib.linux-x86_64-3.7/sklearn/mixture/tests
    copying sklearn/mixture/tests/test_bayesian_mixture.py -> build/lib.linux-x86_64-3.7/sklearn/mixture/tests
    copying sklearn/mixture/tests/test_gaussian_mixture.py -> build/lib.linux-x86_64-3.7/sklearn/mixture/tests
    copying sklearn/mixture/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/mixture/tests
    copying sklearn/mixture/tests/test_dpgmm.py -> build/lib.linux-x86_64-3.7/sklearn/mixture/tests
    creating build/lib.linux-x86_64-3.7/sklearn/model_selection
    copying sklearn/model_selection/_search.py -> build/lib.linux-x86_64-3.7/sklearn/model_selection
    copying sklearn/model_selection/_validation.py -> build/lib.linux-x86_64-3.7/sklearn/model_selection
    copying sklearn/model_selection/_split.py -> build/lib.linux-x86_64-3.7/sklearn/model_selection
    copying sklearn/model_selection/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/model_selection
    creating build/lib.linux-x86_64-3.7/sklearn/model_selection/tests
    copying sklearn/model_selection/tests/test_split.py -> build/lib.linux-x86_64-3.7/sklearn/model_selection/tests
    copying sklearn/model_selection/tests/test_validation.py -> build/lib.linux-x86_64-3.7/sklearn/model_selection/tests
    copying sklearn/model_selection/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/model_selection/tests
    copying sklearn/model_selection/tests/test_search.py -> build/lib.linux-x86_64-3.7/sklearn/model_selection/tests
    copying sklearn/model_selection/tests/common.py -> build/lib.linux-x86_64-3.7/sklearn/model_selection/tests
    creating build/lib.linux-x86_64-3.7/sklearn/neural_network
    copying sklearn/neural_network/rbm.py -> build/lib.linux-x86_64-3.7/sklearn/neural_network
    copying sklearn/neural_network/_stochastic_optimizers.py -> build/lib.linux-x86_64-3.7/sklearn/neural_network
    copying sklearn/neural_network/_base.py -> build/lib.linux-x86_64-3.7/sklearn/neural_network
    copying sklearn/neural_network/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/neural_network
    copying sklearn/neural_network/multilayer_perceptron.py -> build/lib.linux-x86_64-3.7/sklearn/neural_network
    creating build/lib.linux-x86_64-3.7/sklearn/neural_network/tests
    copying sklearn/neural_network/tests/test_mlp.py -> build/lib.linux-x86_64-3.7/sklearn/neural_network/tests
    copying sklearn/neural_network/tests/test_stochastic_optimizers.py -> build/lib.linux-x86_64-3.7/sklearn/neural_network/tests
    copying sklearn/neural_network/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/neural_network/tests
    copying sklearn/neural_network/tests/test_rbm.py -> build/lib.linux-x86_64-3.7/sklearn/neural_network/tests
    creating build/lib.linux-x86_64-3.7/sklearn/preprocessing
    copying sklearn/preprocessing/_function_transformer.py -> build/lib.linux-x86_64-3.7/sklearn/preprocessing
    copying sklearn/preprocessing/imputation.py -> build/lib.linux-x86_64-3.7/sklearn/preprocessing
    copying sklearn/preprocessing/label.py -> build/lib.linux-x86_64-3.7/sklearn/preprocessing
    copying sklearn/preprocessing/data.py -> build/lib.linux-x86_64-3.7/sklearn/preprocessing
    copying sklearn/preprocessing/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/preprocessing
    creating build/lib.linux-x86_64-3.7/sklearn/preprocessing/tests
    copying sklearn/preprocessing/tests/test_function_transformer.py -> build/lib.linux-x86_64-3.7/sklearn/preprocessing/tests
    copying sklearn/preprocessing/tests/test_imputation.py -> build/lib.linux-x86_64-3.7/sklearn/preprocessing/tests
    copying sklearn/preprocessing/tests/test_label.py -> build/lib.linux-x86_64-3.7/sklearn/preprocessing/tests
    copying sklearn/preprocessing/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/preprocessing/tests
    copying sklearn/preprocessing/tests/test_data.py -> build/lib.linux-x86_64-3.7/sklearn/preprocessing/tests
    creating build/lib.linux-x86_64-3.7/sklearn/semi_supervised
    copying sklearn/semi_supervised/label_propagation.py -> build/lib.linux-x86_64-3.7/sklearn/semi_supervised
    copying sklearn/semi_supervised/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/semi_supervised
    creating build/lib.linux-x86_64-3.7/sklearn/semi_supervised/tests
    copying sklearn/semi_supervised/tests/test_label_propagation.py -> build/lib.linux-x86_64-3.7/sklearn/semi_supervised/tests
    copying sklearn/semi_supervised/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/semi_supervised/tests
    creating build/lib.linux-x86_64-3.7/sklearn/cluster
    copying sklearn/cluster/affinity_propagation_.py -> build/lib.linux-x86_64-3.7/sklearn/cluster
    copying sklearn/cluster/setup.py -> build/lib.linux-x86_64-3.7/sklearn/cluster
    copying sklearn/cluster/bicluster.py -> build/lib.linux-x86_64-3.7/sklearn/cluster
    copying sklearn/cluster/spectral.py -> build/lib.linux-x86_64-3.7/sklearn/cluster
    copying sklearn/cluster/mean_shift_.py -> build/lib.linux-x86_64-3.7/sklearn/cluster
    copying sklearn/cluster/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/cluster
    copying sklearn/cluster/k_means_.py -> build/lib.linux-x86_64-3.7/sklearn/cluster
    copying sklearn/cluster/hierarchical.py -> build/lib.linux-x86_64-3.7/sklearn/cluster
    copying sklearn/cluster/_feature_agglomeration.py -> build/lib.linux-x86_64-3.7/sklearn/cluster
    copying sklearn/cluster/birch.py -> build/lib.linux-x86_64-3.7/sklearn/cluster
    copying sklearn/cluster/dbscan_.py -> build/lib.linux-x86_64-3.7/sklearn/cluster
    creating build/lib.linux-x86_64-3.7/sklearn/cluster/tests
    copying sklearn/cluster/tests/test_bicluster.py -> build/lib.linux-x86_64-3.7/sklearn/cluster/tests
    copying sklearn/cluster/tests/test_spectral.py -> build/lib.linux-x86_64-3.7/sklearn/cluster/tests
    copying sklearn/cluster/tests/test_k_means.py -> build/lib.linux-x86_64-3.7/sklearn/cluster/tests
    copying sklearn/cluster/tests/test_affinity_propagation.py -> build/lib.linux-x86_64-3.7/sklearn/cluster/tests
    copying sklearn/cluster/tests/test_birch.py -> build/lib.linux-x86_64-3.7/sklearn/cluster/tests
    copying sklearn/cluster/tests/test_hierarchical.py -> build/lib.linux-x86_64-3.7/sklearn/cluster/tests
    copying sklearn/cluster/tests/test_dbscan.py -> build/lib.linux-x86_64-3.7/sklearn/cluster/tests
    copying sklearn/cluster/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/cluster/tests
    copying sklearn/cluster/tests/common.py -> build/lib.linux-x86_64-3.7/sklearn/cluster/tests
    copying sklearn/cluster/tests/test_mean_shift.py -> build/lib.linux-x86_64-3.7/sklearn/cluster/tests
    creating build/lib.linux-x86_64-3.7/sklearn/datasets
    copying sklearn/datasets/mlcomp.py -> build/lib.linux-x86_64-3.7/sklearn/datasets
    copying sklearn/datasets/twenty_newsgroups.py -> build/lib.linux-x86_64-3.7/sklearn/datasets
    copying sklearn/datasets/setup.py -> build/lib.linux-x86_64-3.7/sklearn/datasets
    copying sklearn/datasets/samples_generator.py -> build/lib.linux-x86_64-3.7/sklearn/datasets
    copying sklearn/datasets/california_housing.py -> build/lib.linux-x86_64-3.7/sklearn/datasets
    copying sklearn/datasets/kddcup99.py -> build/lib.linux-x86_64-3.7/sklearn/datasets
    copying sklearn/datasets/base.py -> build/lib.linux-x86_64-3.7/sklearn/datasets
    copying sklearn/datasets/svmlight_format.py -> build/lib.linux-x86_64-3.7/sklearn/datasets
    copying sklearn/datasets/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/datasets
    copying sklearn/datasets/lfw.py -> build/lib.linux-x86_64-3.7/sklearn/datasets
    copying sklearn/datasets/covtype.py -> build/lib.linux-x86_64-3.7/sklearn/datasets
    copying sklearn/datasets/species_distributions.py -> build/lib.linux-x86_64-3.7/sklearn/datasets
    copying sklearn/datasets/mldata.py -> build/lib.linux-x86_64-3.7/sklearn/datasets
    copying sklearn/datasets/rcv1.py -> build/lib.linux-x86_64-3.7/sklearn/datasets
    copying sklearn/datasets/olivetti_faces.py -> build/lib.linux-x86_64-3.7/sklearn/datasets
    creating build/lib.linux-x86_64-3.7/sklearn/datasets/tests
    copying sklearn/datasets/tests/test_kddcup99.py -> build/lib.linux-x86_64-3.7/sklearn/datasets/tests
    copying sklearn/datasets/tests/test_20news.py -> build/lib.linux-x86_64-3.7/sklearn/datasets/tests
    copying sklearn/datasets/tests/test_lfw.py -> build/lib.linux-x86_64-3.7/sklearn/datasets/tests
    copying sklearn/datasets/tests/test_covtype.py -> build/lib.linux-x86_64-3.7/sklearn/datasets/tests
    copying sklearn/datasets/tests/test_rcv1.py -> build/lib.linux-x86_64-3.7/sklearn/datasets/tests
    copying sklearn/datasets/tests/test_base.py -> build/lib.linux-x86_64-3.7/sklearn/datasets/tests
    copying sklearn/datasets/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/datasets/tests
    copying sklearn/datasets/tests/test_svmlight_format.py -> build/lib.linux-x86_64-3.7/sklearn/datasets/tests
    copying sklearn/datasets/tests/test_samples_generator.py -> build/lib.linux-x86_64-3.7/sklearn/datasets/tests
    copying sklearn/datasets/tests/test_mldata.py -> build/lib.linux-x86_64-3.7/sklearn/datasets/tests
    creating build/lib.linux-x86_64-3.7/sklearn/decomposition
    copying sklearn/decomposition/sparse_pca.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition
    copying sklearn/decomposition/online_lda.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition
    copying sklearn/decomposition/dict_learning.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition
    copying sklearn/decomposition/setup.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition
    copying sklearn/decomposition/fastica_.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition
    copying sklearn/decomposition/truncated_svd.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition
    copying sklearn/decomposition/pca.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition
    copying sklearn/decomposition/incremental_pca.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition
    copying sklearn/decomposition/base.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition
    copying sklearn/decomposition/kernel_pca.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition
    copying sklearn/decomposition/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition
    copying sklearn/decomposition/factor_analysis.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition
    copying sklearn/decomposition/nmf.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition
    creating build/lib.linux-x86_64-3.7/sklearn/decomposition/tests
    copying sklearn/decomposition/tests/test_fastica.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition/tests
    copying sklearn/decomposition/tests/test_online_lda.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition/tests
    copying sklearn/decomposition/tests/test_factor_analysis.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition/tests
    copying sklearn/decomposition/tests/test_incremental_pca.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition/tests
    copying sklearn/decomposition/tests/test_truncated_svd.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition/tests
    copying sklearn/decomposition/tests/test_sparse_pca.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition/tests
    copying sklearn/decomposition/tests/test_nmf.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition/tests
    copying sklearn/decomposition/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition/tests
    copying sklearn/decomposition/tests/test_dict_learning.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition/tests
    copying sklearn/decomposition/tests/test_pca.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition/tests
    copying sklearn/decomposition/tests/test_kernel_pca.py -> build/lib.linux-x86_64-3.7/sklearn/decomposition/tests
    creating build/lib.linux-x86_64-3.7/sklearn/ensemble
    copying sklearn/ensemble/voting_classifier.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble
    copying sklearn/ensemble/weight_boosting.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble
    copying sklearn/ensemble/partial_dependence.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble
    copying sklearn/ensemble/setup.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble
    copying sklearn/ensemble/bagging.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble
    copying sklearn/ensemble/iforest.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble
    copying sklearn/ensemble/base.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble
    copying sklearn/ensemble/gradient_boosting.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble
    copying sklearn/ensemble/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble
    copying sklearn/ensemble/forest.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble
    creating build/lib.linux-x86_64-3.7/sklearn/ensemble/tests
    copying sklearn/ensemble/tests/test_forest.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble/tests
    copying sklearn/ensemble/tests/test_voting_classifier.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble/tests
    copying sklearn/ensemble/tests/test_gradient_boosting_loss_functions.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble/tests
    copying sklearn/ensemble/tests/test_iforest.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble/tests
    copying sklearn/ensemble/tests/test_bagging.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble/tests
    copying sklearn/ensemble/tests/test_base.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble/tests
    copying sklearn/ensemble/tests/test_partial_dependence.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble/tests
    copying sklearn/ensemble/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble/tests
    copying sklearn/ensemble/tests/test_gradient_boosting.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble/tests
    copying sklearn/ensemble/tests/test_weight_boosting.py -> build/lib.linux-x86_64-3.7/sklearn/ensemble/tests
    creating build/lib.linux-x86_64-3.7/sklearn/externals
    copying sklearn/externals/odict.py -> build/lib.linux-x86_64-3.7/sklearn/externals
    copying sklearn/externals/setup.py -> build/lib.linux-x86_64-3.7/sklearn/externals
    copying sklearn/externals/test_externals_setup.py -> build/lib.linux-x86_64-3.7/sklearn/externals
    copying sklearn/externals/funcsigs.py -> build/lib.linux-x86_64-3.7/sklearn/externals
    copying sklearn/externals/six.py -> build/lib.linux-x86_64-3.7/sklearn/externals
    copying sklearn/externals/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/externals
    creating build/lib.linux-x86_64-3.7/sklearn/externals/joblib
    copying sklearn/externals/joblib/format_stack.py -> build/lib.linux-x86_64-3.7/sklearn/externals/joblib
    copying sklearn/externals/joblib/numpy_pickle_utils.py -> build/lib.linux-x86_64-3.7/sklearn/externals/joblib
    copying sklearn/externals/joblib/numpy_pickle.py -> build/lib.linux-x86_64-3.7/sklearn/externals/joblib
    copying sklearn/externals/joblib/testing.py -> build/lib.linux-x86_64-3.7/sklearn/externals/joblib
    copying sklearn/externals/joblib/parallel.py -> build/lib.linux-x86_64-3.7/sklearn/externals/joblib
    copying sklearn/externals/joblib/_memory_helpers.py -> build/lib.linux-x86_64-3.7/sklearn/externals/joblib
    copying sklearn/externals/joblib/func_inspect.py -> build/lib.linux-x86_64-3.7/sklearn/externals/joblib
    copying sklearn/externals/joblib/logger.py -> build/lib.linux-x86_64-3.7/sklearn/externals/joblib
    copying sklearn/externals/joblib/pool.py -> build/lib.linux-x86_64-3.7/sklearn/externals/joblib
    copying sklearn/externals/joblib/_compat.py -> build/lib.linux-x86_64-3.7/sklearn/externals/joblib
    copying sklearn/externals/joblib/_parallel_backends.py -> build/lib.linux-x86_64-3.7/sklearn/externals/joblib
    copying sklearn/externals/joblib/disk.py -> build/lib.linux-x86_64-3.7/sklearn/externals/joblib
    copying sklearn/externals/joblib/numpy_pickle_compat.py -> build/lib.linux-x86_64-3.7/sklearn/externals/joblib
    copying sklearn/externals/joblib/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/externals/joblib
    copying sklearn/externals/joblib/_multiprocessing_helpers.py -> build/lib.linux-x86_64-3.7/sklearn/externals/joblib
    copying sklearn/externals/joblib/hashing.py -> build/lib.linux-x86_64-3.7/sklearn/externals/joblib
    copying sklearn/externals/joblib/my_exceptions.py -> build/lib.linux-x86_64-3.7/sklearn/externals/joblib
    copying sklearn/externals/joblib/memory.py -> build/lib.linux-x86_64-3.7/sklearn/externals/joblib
    creating build/lib.linux-x86_64-3.7/sklearn/feature_extraction
    copying sklearn/feature_extraction/dict_vectorizer.py -> build/lib.linux-x86_64-3.7/sklearn/feature_extraction
    copying sklearn/feature_extraction/text.py -> build/lib.linux-x86_64-3.7/sklearn/feature_extraction
    copying sklearn/feature_extraction/stop_words.py -> build/lib.linux-x86_64-3.7/sklearn/feature_extraction
    copying sklearn/feature_extraction/setup.py -> build/lib.linux-x86_64-3.7/sklearn/feature_extraction
    copying sklearn/feature_extraction/image.py -> build/lib.linux-x86_64-3.7/sklearn/feature_extraction
    copying sklearn/feature_extraction/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/feature_extraction
    copying sklearn/feature_extraction/hashing.py -> build/lib.linux-x86_64-3.7/sklearn/feature_extraction
    creating build/lib.linux-x86_64-3.7/sklearn/feature_extraction/tests
    copying sklearn/feature_extraction/tests/test_text.py -> build/lib.linux-x86_64-3.7/sklearn/feature_extraction/tests
    copying sklearn/feature_extraction/tests/test_dict_vectorizer.py -> build/lib.linux-x86_64-3.7/sklearn/feature_extraction/tests
    copying sklearn/feature_extraction/tests/test_feature_hasher.py -> build/lib.linux-x86_64-3.7/sklearn/feature_extraction/tests
    copying sklearn/feature_extraction/tests/test_image.py -> build/lib.linux-x86_64-3.7/sklearn/feature_extraction/tests
    copying sklearn/feature_extraction/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/feature_extraction/tests
    creating build/lib.linux-x86_64-3.7/sklearn/manifold
    copying sklearn/manifold/spectral_embedding_.py -> build/lib.linux-x86_64-3.7/sklearn/manifold
    copying sklearn/manifold/setup.py -> build/lib.linux-x86_64-3.7/sklearn/manifold
    copying sklearn/manifold/isomap.py -> build/lib.linux-x86_64-3.7/sklearn/manifold
    copying sklearn/manifold/mds.py -> build/lib.linux-x86_64-3.7/sklearn/manifold
    copying sklearn/manifold/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/manifold
    copying sklearn/manifold/locally_linear.py -> build/lib.linux-x86_64-3.7/sklearn/manifold
    copying sklearn/manifold/t_sne.py -> build/lib.linux-x86_64-3.7/sklearn/manifold
    creating build/lib.linux-x86_64-3.7/sklearn/manifold/tests
    copying sklearn/manifold/tests/test_isomap.py -> build/lib.linux-x86_64-3.7/sklearn/manifold/tests
    copying sklearn/manifold/tests/test_t_sne.py -> build/lib.linux-x86_64-3.7/sklearn/manifold/tests
    copying sklearn/manifold/tests/test_spectral_embedding.py -> build/lib.linux-x86_64-3.7/sklearn/manifold/tests
    copying sklearn/manifold/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/manifold/tests
    copying sklearn/manifold/tests/test_mds.py -> build/lib.linux-x86_64-3.7/sklearn/manifold/tests
    copying sklearn/manifold/tests/test_locally_linear.py -> build/lib.linux-x86_64-3.7/sklearn/manifold/tests
    creating build/lib.linux-x86_64-3.7/sklearn/metrics
    copying sklearn/metrics/ranking.py -> build/lib.linux-x86_64-3.7/sklearn/metrics
    copying sklearn/metrics/regression.py -> build/lib.linux-x86_64-3.7/sklearn/metrics
    copying sklearn/metrics/scorer.py -> build/lib.linux-x86_64-3.7/sklearn/metrics
    copying sklearn/metrics/setup.py -> build/lib.linux-x86_64-3.7/sklearn/metrics
    copying sklearn/metrics/classification.py -> build/lib.linux-x86_64-3.7/sklearn/metrics
    copying sklearn/metrics/pairwise.py -> build/lib.linux-x86_64-3.7/sklearn/metrics
    copying sklearn/metrics/base.py -> build/lib.linux-x86_64-3.7/sklearn/metrics
    copying sklearn/metrics/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/metrics
    creating build/lib.linux-x86_64-3.7/sklearn/metrics/tests
    copying sklearn/metrics/tests/test_pairwise.py -> build/lib.linux-x86_64-3.7/sklearn/metrics/tests
    copying sklearn/metrics/tests/test_score_objects.py -> build/lib.linux-x86_64-3.7/sklearn/metrics/tests
    copying sklearn/metrics/tests/test_regression.py -> build/lib.linux-x86_64-3.7/sklearn/metrics/tests
    copying sklearn/metrics/tests/test_ranking.py -> build/lib.linux-x86_64-3.7/sklearn/metrics/tests
    copying sklearn/metrics/tests/test_classification.py -> build/lib.linux-x86_64-3.7/sklearn/metrics/tests
    copying sklearn/metrics/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/metrics/tests
    copying sklearn/metrics/tests/test_common.py -> build/lib.linux-x86_64-3.7/sklearn/metrics/tests
    creating build/lib.linux-x86_64-3.7/sklearn/metrics/cluster
    copying sklearn/metrics/cluster/unsupervised.py -> build/lib.linux-x86_64-3.7/sklearn/metrics/cluster
    copying sklearn/metrics/cluster/setup.py -> build/lib.linux-x86_64-3.7/sklearn/metrics/cluster
    copying sklearn/metrics/cluster/bicluster.py -> build/lib.linux-x86_64-3.7/sklearn/metrics/cluster
    copying sklearn/metrics/cluster/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/metrics/cluster
    copying sklearn/metrics/cluster/supervised.py -> build/lib.linux-x86_64-3.7/sklearn/metrics/cluster
    creating build/lib.linux-x86_64-3.7/sklearn/metrics/cluster/tests
    copying sklearn/metrics/cluster/tests/test_bicluster.py -> build/lib.linux-x86_64-3.7/sklearn/metrics/cluster/tests
    copying sklearn/metrics/cluster/tests/test_supervised.py -> build/lib.linux-x86_64-3.7/sklearn/metrics/cluster/tests
    copying sklearn/metrics/cluster/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/metrics/cluster/tests
    copying sklearn/metrics/cluster/tests/test_unsupervised.py -> build/lib.linux-x86_64-3.7/sklearn/metrics/cluster/tests
    creating build/lib.linux-x86_64-3.7/sklearn/neighbors
    copying sklearn/neighbors/regression.py -> build/lib.linux-x86_64-3.7/sklearn/neighbors
    copying sklearn/neighbors/unsupervised.py -> build/lib.linux-x86_64-3.7/sklearn/neighbors
    copying sklearn/neighbors/setup.py -> build/lib.linux-x86_64-3.7/sklearn/neighbors
    copying sklearn/neighbors/graph.py -> build/lib.linux-x86_64-3.7/sklearn/neighbors
    copying sklearn/neighbors/approximate.py -> build/lib.linux-x86_64-3.7/sklearn/neighbors
    copying sklearn/neighbors/kde.py -> build/lib.linux-x86_64-3.7/sklearn/neighbors
    copying sklearn/neighbors/classification.py -> build/lib.linux-x86_64-3.7/sklearn/neighbors
    copying sklearn/neighbors/base.py -> build/lib.linux-x86_64-3.7/sklearn/neighbors
    copying sklearn/neighbors/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/neighbors
    copying sklearn/neighbors/nearest_centroid.py -> build/lib.linux-x86_64-3.7/sklearn/neighbors
    creating build/lib.linux-x86_64-3.7/sklearn/neighbors/tests
    copying sklearn/neighbors/tests/test_kde.py -> build/lib.linux-x86_64-3.7/sklearn/neighbors/tests
    copying sklearn/neighbors/tests/test_dist_metrics.py -> build/lib.linux-x86_64-3.7/sklearn/neighbors/tests
    copying sklearn/neighbors/tests/test_ball_tree.py -> build/lib.linux-x86_64-3.7/sklearn/neighbors/tests
    copying sklearn/neighbors/tests/test_neighbors.py -> build/lib.linux-x86_64-3.7/sklearn/neighbors/tests
    copying sklearn/neighbors/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/neighbors/tests
    copying sklearn/neighbors/tests/test_nearest_centroid.py -> build/lib.linux-x86_64-3.7/sklearn/neighbors/tests
    copying sklearn/neighbors/tests/test_approximate.py -> build/lib.linux-x86_64-3.7/sklearn/neighbors/tests
    copying sklearn/neighbors/tests/test_kd_tree.py -> build/lib.linux-x86_64-3.7/sklearn/neighbors/tests
    creating build/lib.linux-x86_64-3.7/sklearn/tree
    copying sklearn/tree/export.py -> build/lib.linux-x86_64-3.7/sklearn/tree
    copying sklearn/tree/setup.py -> build/lib.linux-x86_64-3.7/sklearn/tree
    copying sklearn/tree/tree.py -> build/lib.linux-x86_64-3.7/sklearn/tree
    copying sklearn/tree/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/tree
    creating build/lib.linux-x86_64-3.7/sklearn/tree/tests
    copying sklearn/tree/tests/test_export.py -> build/lib.linux-x86_64-3.7/sklearn/tree/tests
    copying sklearn/tree/tests/test_tree.py -> build/lib.linux-x86_64-3.7/sklearn/tree/tests
    copying sklearn/tree/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/tree/tests
    creating build/lib.linux-x86_64-3.7/sklearn/svm
    copying sklearn/svm/setup.py -> build/lib.linux-x86_64-3.7/sklearn/svm
    copying sklearn/svm/base.py -> build/lib.linux-x86_64-3.7/sklearn/svm
    copying sklearn/svm/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/svm
    copying sklearn/svm/classes.py -> build/lib.linux-x86_64-3.7/sklearn/svm
    copying sklearn/svm/bounds.py -> build/lib.linux-x86_64-3.7/sklearn/svm
    creating build/lib.linux-x86_64-3.7/sklearn/svm/tests
    copying sklearn/svm/tests/test_svm.py -> build/lib.linux-x86_64-3.7/sklearn/svm/tests
    copying sklearn/svm/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/svm/tests
    copying sklearn/svm/tests/test_bounds.py -> build/lib.linux-x86_64-3.7/sklearn/svm/tests
    copying sklearn/svm/tests/test_sparse.py -> build/lib.linux-x86_64-3.7/sklearn/svm/tests
    creating build/lib.linux-x86_64-3.7/sklearn/linear_model
    copying sklearn/linear_model/ridge.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model
    copying sklearn/linear_model/least_angle.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model
    copying sklearn/linear_model/theil_sen.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model
    copying sklearn/linear_model/omp.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model
    copying sklearn/linear_model/logistic.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model
    copying sklearn/linear_model/perceptron.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model
    copying sklearn/linear_model/setup.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model
    copying sklearn/linear_model/stochastic_gradient.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model
    copying sklearn/linear_model/randomized_l1.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model
    copying sklearn/linear_model/bayes.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model
    copying sklearn/linear_model/passive_aggressive.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model
    copying sklearn/linear_model/base.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model
    copying sklearn/linear_model/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model
    copying sklearn/linear_model/coordinate_descent.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model
    copying sklearn/linear_model/sag.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model
    copying sklearn/linear_model/huber.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model
    copying sklearn/linear_model/ransac.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model
    creating build/lib.linux-x86_64-3.7/sklearn/linear_model/tests
    copying sklearn/linear_model/tests/test_bayes.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model/tests
    copying sklearn/linear_model/tests/test_theil_sen.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model/tests
    copying sklearn/linear_model/tests/test_huber.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model/tests
    copying sklearn/linear_model/tests/test_least_angle.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model/tests
    copying sklearn/linear_model/tests/test_omp.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model/tests
    copying sklearn/linear_model/tests/test_coordinate_descent.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model/tests
    copying sklearn/linear_model/tests/test_sparse_coordinate_descent.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model/tests
    copying sklearn/linear_model/tests/test_randomized_l1.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model/tests
    copying sklearn/linear_model/tests/test_sgd.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model/tests
    copying sklearn/linear_model/tests/test_logistic.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model/tests
    copying sklearn/linear_model/tests/test_base.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model/tests
    copying sklearn/linear_model/tests/test_sag.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model/tests
    copying sklearn/linear_model/tests/test_perceptron.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model/tests
    copying sklearn/linear_model/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model/tests
    copying sklearn/linear_model/tests/test_ransac.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model/tests
    copying sklearn/linear_model/tests/test_passive_aggressive.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model/tests
    copying sklearn/linear_model/tests/test_ridge.py -> build/lib.linux-x86_64-3.7/sklearn/linear_model/tests
    creating build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/arpack.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/bench.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/metaestimators.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/class_weight.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/optimize.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/setup.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/testing.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/mocking.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/random.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/graph.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/stats.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/estimator_checks.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/linear_assignment_.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/multiclass.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/fixes.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/deprecation.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/extmath.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/sparsefuncs.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/_scipy_sparse_lsqr_backport.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    copying sklearn/utils/validation.py -> build/lib.linux-x86_64-3.7/sklearn/utils
    creating build/lib.linux-x86_64-3.7/sklearn/utils/sparsetools
    copying sklearn/utils/sparsetools/setup.py -> build/lib.linux-x86_64-3.7/sklearn/utils/sparsetools
    copying sklearn/utils/sparsetools/_graph_validation.py -> build/lib.linux-x86_64-3.7/sklearn/utils/sparsetools
    copying sklearn/utils/sparsetools/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/utils/sparsetools
    creating build/lib.linux-x86_64-3.7/sklearn/utils/sparsetools/tests
    copying sklearn/utils/sparsetools/tests/test_traversal.py -> build/lib.linux-x86_64-3.7/sklearn/utils/sparsetools/tests
    copying sklearn/utils/sparsetools/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/utils/sparsetools/tests
    creating build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_fast_dict.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_seq_dataset.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_murmurhash.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_sparsefuncs.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_validation.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_stats.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_fixes.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_graph.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_utils.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_metaestimators.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_class_weight.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_estimator_checks.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_extmath.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_linear_assignment.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_random.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_testing.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_bench.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_multiclass.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_optimize.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    copying sklearn/utils/tests/test_shortest_path.py -> build/lib.linux-x86_64-3.7/sklearn/utils/tests
    creating build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/test_multioutput.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/test_kernel_approximation.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/test_pipeline.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/test_isotonic.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/test_naive_bayes.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/test_metaestimators.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/test_dummy.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/test_learning_curve.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/test_init.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/test_check_build.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/test_grid_search.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/test_cross_validation.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/test_discriminant_analysis.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/test_random_projection.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/test_base.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/test_calibration.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/__init__.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/test_multiclass.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/test_kernel_ridge.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    copying sklearn/tests/test_common.py -> build/lib.linux-x86_64-3.7/sklearn/tests
    running build_clib
    customize UnixCCompiler
    customize UnixCCompiler using build_clib
    building 'libsvm-skl' library
    compiling C++ sources
    C compiler: g++ -pthread -B /root/anaconda3/compiler_compat -Wl,--sysroot=/ -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -fPIC
    
    creating build/temp.linux-x86_64-3.7
    creating build/temp.linux-x86_64-3.7/sklearn
    creating build/temp.linux-x86_64-3.7/sklearn/svm
    creating build/temp.linux-x86_64-3.7/sklearn/svm/src
    creating build/temp.linux-x86_64-3.7/sklearn/svm/src/libsvm
    compile options: '-I/root/anaconda3/lib/python3.7/site-packages/numpy/core/include -c'
    g++: sklearn/svm/src/libsvm/libsvm_template.cpp
    ar: adding 1 object files to build/temp.linux-x86_64-3.7/liblibsvm-skl.a
    running build_ext
    customize UnixCCompiler
    customize UnixCCompiler using build_ext
    customize UnixCCompiler
    customize UnixCCompiler using build_ext
    building 'sklearn.__check_build._check_build' extension
    compiling C sources
    C compiler: gcc -pthread -B /root/anaconda3/compiler_compat -Wl,--sysroot=/ -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -fPIC
    
    creating build/temp.linux-x86_64-3.7/sklearn/__check_build
    compile options: '-I/root/anaconda3/lib/python3.7/site-packages/numpy/core/include -I/root/anaconda3/lib/python3.7/site-packages/numpy/core/include -I/root/anaconda3/include/python3.7m -c'
    gcc: sklearn/__check_build/_check_build.c
    gcc -pthread -shared -B /root/anaconda3/compiler_compat -L/root/anaconda3/lib -Wl,-rpath=/root/anaconda3/lib -Wl,--no-as-needed -Wl,--sysroot=/ build/temp.linux-x86_64-3.7/sklearn/__check_build/_check_build.o -Lbuild/temp.linux-x86_64-3.7 -o build/lib.linux-x86_64-3.7/sklearn/__check_build/_check_build.cpython-37m-x86_64-linux-gnu.so
    building 'sklearn.cluster._dbscan_inner' extension
    compiling C++ sources
    C compiler: g++ -pthread -B /root/anaconda3/compiler_compat -Wl,--sysroot=/ -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -fPIC
    
    creating build/temp.linux-x86_64-3.7/sklearn/cluster
    compile options: '-I/root/anaconda3/lib/python3.7/site-packages/numpy/core/include -I/root/anaconda3/lib/python3.7/site-packages/numpy/core/include -I/root/anaconda3/include/python3.7m -c'
    g++: sklearn/cluster/_dbscan_inner.cpp
    In file included from /root/anaconda3/lib/python3.7/site-packages/numpy/core/include/numpy/ndarraytypes.h:1824:0,
                     from /root/anaconda3/lib/python3.7/site-packages/numpy/core/include/numpy/ndarrayobject.h:12,
                     from /root/anaconda3/lib/python3.7/site-packages/numpy/core/include/numpy/arrayobject.h:4,
                     from sklearn/cluster/_dbscan_inner.cpp:306:
    /root/anaconda3/lib/python3.7/site-packages/numpy/core/include/numpy/npy_1_7_deprecated_api.h:17:2: warning: #warning "Using deprecated NumPy API, disable it with " "#define NPY_NO_DEPRECATED_API NPY_1_7_API_VERSION" [-Wcpp]
     #warning "Using deprecated NumPy API, disable it with " \
      ^~~~~~~
    In file included from /root/anaconda3/lib/python3.7/site-packages/numpy/core/include/numpy/ndarrayobject.h:21:0,
                     from /root/anaconda3/lib/python3.7/site-packages/numpy/core/include/numpy/arrayobject.h:4,
                     from sklearn/cluster/_dbscan_inner.cpp:306:
    /root/anaconda3/lib/python3.7/site-packages/numpy/core/include/numpy/__multiarray_api.h:1463:1: warning: ‘int _import_array()’ defined but not used [-Wunused-function]
     _import_array(void)
     ^~~~~~~~~~~~~
    g++ -pthread -shared -B /root/anaconda3/compiler_compat -L/root/anaconda3/lib -Wl,-rpath=/root/anaconda3/lib -Wl,--no-as-needed -Wl,--sysroot=/ build/temp.linux-x86_64-3.7/sklearn/cluster/_dbscan_inner.o -Lbuild/temp.linux-x86_64-3.7 -o build/lib.linux-x86_64-3.7/sklearn/cluster/_dbscan_inner.cpython-37m-x86_64-linux-gnu.so
    building 'sklearn.cluster._hierarchical' extension
    compiling C++ sources
    C compiler: g++ -pthread -B /root/anaconda3/compiler_compat -Wl,--sysroot=/ -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -fPIC
    
    compile options: '-I/root/anaconda3/lib/python3.7/site-packages/numpy/core/include -I/root/anaconda3/lib/python3.7/site-packages/numpy/core/include -I/root/anaconda3/include/python3.7m -c'
    g++: sklearn/cluster/_hierarchical.cpp
    In file included from /root/anaconda3/lib/python3.7/site-packages/numpy/core/include/numpy/ndarraytypes.h:1824:0,
                     from /root/anaconda3/lib/python3.7/site-packages/numpy/core/include/numpy/ndarrayobject.h:12,
                     from /root/anaconda3/lib/python3.7/site-packages/numpy/core/include/numpy/arrayobject.h:4,
                     from sklearn/cluster/_hierarchical.cpp:302:
    /root/anaconda3/lib/python3.7/site-packages/numpy/core/include/numpy/npy_1_7_deprecated_api.h:17:2: warning: #warning "Using deprecated NumPy API, disable it with " "#define NPY_NO_DEPRECATED_API NPY_1_7_API_VERSION" [-Wcpp]
     #warning "Using deprecated NumPy API, disable it with " \
      ^~~~~~~
    sklearn/cluster/_hierarchical.cpp: In function ‘void __Pyx__ExceptionSave(PyThreadState*, PyObject**, PyObject**, PyObject**)’:
    sklearn/cluster/_hierarchical.cpp:22118:21: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_type’; did you mean ‘curexc_type’?
         *type = tstate->exc_type;
                         ^~~~~~~~
                         curexc_type
    sklearn/cluster/_hierarchical.cpp:22119:22: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_value’; did you mean ‘curexc_value’?
         *value = tstate->exc_value;
                          ^~~~~~~~~
                          curexc_value
    sklearn/cluster/_hierarchical.cpp:22120:19: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_traceback’; did you mean ‘curexc_traceback’?
         *tb = tstate->exc_traceback;
                       ^~~~~~~~~~~~~
                       curexc_traceback
    sklearn/cluster/_hierarchical.cpp: In function ‘void __Pyx__ExceptionReset(PyThreadState*, PyObject*, PyObject*, PyObject*)’:
    sklearn/cluster/_hierarchical.cpp:22127:24: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_type’; did you mean ‘curexc_type’?
         tmp_type = tstate->exc_type;
                            ^~~~~~~~
                            curexc_type
    sklearn/cluster/_hierarchical.cpp:22128:25: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_value’; did you mean ‘curexc_value’?
         tmp_value = tstate->exc_value;
                             ^~~~~~~~~
                             curexc_value
    sklearn/cluster/_hierarchical.cpp:22129:22: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_traceback’; did you mean ‘curexc_traceback’?
         tmp_tb = tstate->exc_traceback;
                          ^~~~~~~~~~~~~
                          curexc_traceback
    sklearn/cluster/_hierarchical.cpp:22130:13: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_type’; did you mean ‘curexc_type’?
         tstate->exc_type = type;
                 ^~~~~~~~
                 curexc_type
    sklearn/cluster/_hierarchical.cpp:22131:13: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_value’; did you mean ‘curexc_value’?
         tstate->exc_value = value;
                 ^~~~~~~~~
                 curexc_value
    sklearn/cluster/_hierarchical.cpp:22132:13: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_traceback’; did you mean ‘curexc_traceback’?
         tstate->exc_traceback = tb;
                 ^~~~~~~~~~~~~
                 curexc_traceback
    sklearn/cluster/_hierarchical.cpp: In function ‘int __Pyx__GetException(PyThreadState*, PyObject**, PyObject**, PyObject**)’:
    sklearn/cluster/_hierarchical.cpp:22187:24: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_type’; did you mean ‘curexc_type’?
         tmp_type = tstate->exc_type;
                            ^~~~~~~~
                            curexc_type
    sklearn/cluster/_hierarchical.cpp:22188:25: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_value’; did you mean ‘curexc_value’?
         tmp_value = tstate->exc_value;
                             ^~~~~~~~~
                             curexc_value
    sklearn/cluster/_hierarchical.cpp:22189:22: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_traceback’; did you mean ‘curexc_traceback’?
         tmp_tb = tstate->exc_traceback;
                          ^~~~~~~~~~~~~
                          curexc_traceback
    sklearn/cluster/_hierarchical.cpp:22190:13: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_type’; did you mean ‘curexc_type’?
         tstate->exc_type = local_type;
                 ^~~~~~~~
                 curexc_type
    sklearn/cluster/_hierarchical.cpp:22191:13: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_value’; did you mean ‘curexc_value’?
         tstate->exc_value = local_value;
                 ^~~~~~~~~
                 curexc_value
    sklearn/cluster/_hierarchical.cpp:22192:13: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_traceback’; did you mean ‘curexc_traceback’?
         tstate->exc_traceback = local_tb;
                 ^~~~~~~~~~~~~
                 curexc_traceback
    sklearn/cluster/_hierarchical.cpp: In function ‘void __Pyx__ExceptionSwap(PyThreadState*, PyObject**, PyObject**, PyObject**)’:
    sklearn/cluster/_hierarchical.cpp:22214:24: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_type’; did you mean ‘curexc_type’?
         tmp_type = tstate->exc_type;
                            ^~~~~~~~
                            curexc_type
    sklearn/cluster/_hierarchical.cpp:22215:25: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_value’; did you mean ‘curexc_value’?
         tmp_value = tstate->exc_value;
                             ^~~~~~~~~
                             curexc_value
    sklearn/cluster/_hierarchical.cpp:22216:22: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_traceback’; did you mean ‘curexc_traceback’?
         tmp_tb = tstate->exc_traceback;
                          ^~~~~~~~~~~~~
                          curexc_traceback
    sklearn/cluster/_hierarchical.cpp:22217:13: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_type’; did you mean ‘curexc_type’?
         tstate->exc_type = *type;
                 ^~~~~~~~
                 curexc_type
    sklearn/cluster/_hierarchical.cpp:22218:13: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_value’; did you mean ‘curexc_value’?
         tstate->exc_value = *value;
                 ^~~~~~~~~
                 curexc_value
    sklearn/cluster/_hierarchical.cpp:22219:13: error: ‘PyThreadState {aka struct _ts}’ has no member named ‘exc_traceback’; did you mean ‘curexc_traceback’?
         tstate->exc_traceback = *tb;
                 ^~~~~~~~~~~~~
                 curexc_traceback
    error: Command "g++ -pthread -B /root/anaconda3/compiler_compat -Wl,--sysroot=/ -Wsign-compare -DNDEBUG -g -fwrapv -O3 -Wall -fPIC -I/root/anaconda3/lib/python3.7/site-packages/numpy/core/include -I/root/anaconda3/lib/python3.7/site-packages/numpy/core/include -I/root/anaconda3/include/python3.7m -c sklearn/cluster/_hierarchical.cpp -o build/temp.linux-x86_64-3.7/sklearn/cluster/_hierarchical.o -MMD -MF build/temp.linux-x86_64-3.7/sklearn/cluster/_hierarchical.o.d" failed with exit status 1
    
    ----------------------------------------
  Rolling back uninstall of scikit-learn
  Moving to /root/anaconda3/lib/python3.7/site-packages/scikit_learn-0.20.3.dist-info/
   from /root/anaconda3/lib/python3.7/site-packages/~cikit_learn-0.20.3.dist-info
  Moving to /root/anaconda3/lib/python3.7/site-packages/sklearn/
   from /root/anaconda3/lib/python3.7/site-packages/~klearn
Command "/root/anaconda3/bin/python -u -c "import setuptools, tokenize;__file__='/tmp/pip-install-yfjsrsv_/scikit-learn/setup.py';f=getattr(tokenize, 'open', open)(__file__);code=f.read().replace('\r\n', '\n');f.close();exec(compile(code, __file__, 'exec'))" install --record /tmp/pip-record-3fk13hc4/install-record.txt --single-version-externally-managed --compile" failed with error code 1 in /tmp/pip-install-yfjsrsv_/scikit-learn/
Watching for file changes with StatReloader
Exception in thread django-main-thread:
Traceback (most recent call last):
  File "/root/anaconda3/lib/python3.7/threading.py", line 917, in _bootstrap_inner
    self.run()
  File "/root/anaconda3/lib/python3.7/threading.py", line 865, in run
    self._target(*self._args, **self._kwargs)
  File "/root/anaconda3/lib/python3.7/site-packages/django/utils/autoreload.py", line 53, in wrapper
    fn(*args, **kwargs)
  File "/root/anaconda3/lib/python3.7/site-packages/django/core/management/commands/runserver.py", line 109, in inner_run
    autoreload.raise_last_exception()
  File "/root/anaconda3/lib/python3.7/site-packages/django/utils/autoreload.py", line 76, in raise_last_exception
    raise _exception[1]
  File "/root/anaconda3/lib/python3.7/site-packages/django/core/management/__init__.py", line 357, in execute
    autoreload.check_errors(django.setup)()
  File "/root/anaconda3/lib/python3.7/site-packages/django/utils/autoreload.py", line 53, in wrapper
    fn(*args, **kwargs)
  File "/root/anaconda3/lib/python3.7/site-packages/django/__init__.py", line 24, in setup
    apps.populate(settings.INSTALLED_APPS)
  File "/root/anaconda3/lib/python3.7/site-packages/django/apps/registry.py", line 91, in populate
    app_config = AppConfig.create(entry)
  File "/root/anaconda3/lib/python3.7/site-packages/django/apps/config.py", line 90, in create
    module = import_module(entry)
  File "/root/anaconda3/lib/python3.7/importlib/__init__.py", line 127, in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
  File "<frozen importlib._bootstrap>", line 1006, in _gcd_import
  File "<frozen importlib._bootstrap>", line 983, in _find_and_load
  File "<frozen importlib._bootstrap>", line 965, in _find_and_load_unlocked
ModuleNotFoundError: No module named 'rest_framework'

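[20171128] Current progress and thoughts on next steps

webui + database

The front-end UI and the database are both still under development. The priority is to write solid APIs and matching documentation so that the back-end algorithm modules can be plugged in later. As a next step, we hope to ship a demo with a visible UI as soon as possible.

task_center + user_instance

A minimal offline training pipeline is now in place.

Since we already have the spam email data and labels, the next step, before the front end is finished, is to simulate the user workflow: "fetch the batch of data with the lowest confidence" -> "label that data (in practice, just look up the labels for that subset in the dataset)" -> "retrain and produce a new confidence ranking". This yields a test case for a simulated online training & inference pipeline, which will make it easy to hook up the front end and the database later.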
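The simulated loop described above could look roughly like the sketch below. It is only an illustration: `train`, `predict_with_confidence`, and `simulate_annotation` are hypothetical names, and the character-count model is a toy stand-in for the project's real classifier, not the actual implementation.

```python
from collections import Counter


def train(texts, labels):
    """Toy stand-in for the real classifier: character counts per label."""
    model = {}
    for text, label in zip(texts, labels):
        model.setdefault(label, Counter()).update(text)
    return model


def predict_with_confidence(model, text):
    """Return (label, confidence); confidence is the winning share of all scores."""
    scores = {label: 1 + sum(counts[ch] for ch in text)
              for label, counts in model.items()}
    best = max(scores, key=scores.get)
    return best, scores[best] / sum(scores.values())


def simulate_annotation(texts, true_labels, seed_size=2, batch_size=2, rounds=3):
    """Simulate a user labeling the least confident items, round by round."""
    labeled = list(range(seed_size))                 # indices with "known" labels
    unlabeled = list(range(seed_size, len(texts)))   # labels held back
    batches = []
    for _ in range(rounds):
        if not unlabeled:
            break
        # Retrain on everything labeled so far (full retraining).
        model = train([texts[i] for i in labeled],
                      [true_labels[i] for i in labeled])
        # Rank unlabeled items from least to most confident.
        ranked = sorted(unlabeled,
                        key=lambda i: predict_with_confidence(model, texts[i])[1])
        batch = ranked[:batch_size]
        # "Labeling" simply reveals the held-back labels for this batch.
        labeled.extend(batch)
        unlabeled = [i for i in unlabeled if i not in batch]
        batches.append(batch)
    return labeled, unlabeled, batches


texts = ["win cash now", "meeting at noon", "free cash prize", "lunch tomorrow",
         "claim your prize", "project status", "cash bonus free", "see you at noon"]
true_labels = ["spam", "ham", "spam", "ham", "spam", "ham", "spam", "ham"]
labeled, unlabeled, batches = simulate_annotation(texts, true_labels)
print(len(labeled), len(unlabeled), batches)
```

Because the "user" here is just a lookup into the known labels, the same loop can later be pointed at the front end and database without changing its shape.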

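In addition, for each specific task (such as spam email classification), all config files (.config), text data (.sqlite/mongodb), model data (tensorflow/sklearn/jieba dictionaries), and even state data should be instantiated and stored together in one location under user_instance. The goal is that a user can install our software on another computer, copy over the data package for the task from user_instance, and continue the earlier work with as little configuration as possible.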

algo_factory

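The component- and message-based modules char_tokenizer, sentence_embedding_extractor, and sklearn_classifier, which follow the pipeline framework format, are now complete and can be chained to run an offline training process.

The next step is to implement the interaction with user-labeled data: re-training when newly labeled data arrives (for now implemented as full training on all labeled data, i.e. pseudo active learning), and confidence ranking after inference on unlabeled data, returning the few items with the lowest confidence. Each feature needs a unit test.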
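A minimal sketch of the confidence-ranking step, with a unit-test-style check, might look like this. The function name `least_confident` and the idea of using the top class probability as the confidence are assumptions for illustration; the project may define confidence differently.

```python
import heapq


def least_confident(probabilities, k):
    """Return indices of the k samples whose top class probability is lowest.

    `probabilities` is a list of per-sample class probability lists,
    e.g. the output of a classifier's probability prediction.
    """
    confidences = [(max(p), i) for i, p in enumerate(probabilities)]
    return [i for _, i in heapq.nsmallest(k, confidences)]


# Unit-test style check on hand-made probabilities.
probs = [[0.9, 0.1], [0.55, 0.45], [0.2, 0.8], [0.51, 0.49]]
assert least_confident(probs, 2) == [3, 1]
```

Taking the maximum class probability is only one option; the margin between the two best classes or the entropy could be swapped in without changing the interface.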

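We also need to test whether full SVM retraining like this can still give users active-learning-level feedback speed once the data grows. If it is not fast enough, we should consider adding online batch learning to replace retraining on the full dataset each time, which would make this a true active learning process. This part has plenty of challenges.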
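To make the difference from full retraining concrete, here is a small sketch of an incrementally updated model. It is a bag-of-characters perceptron, deliberately not an SVM, and the class name `OnlinePerceptron` is hypothetical; the method name `partial_fit` mirrors the convention used by scikit-learn's incremental estimators such as `SGDClassifier`.

```python
class OnlinePerceptron:
    """Tiny bag-of-characters perceptron that is updated batch by batch."""

    def __init__(self):
        self.weights = {}   # character -> weight
        self.bias = 0.0

    def score(self, text):
        return self.bias + sum(self.weights.get(ch, 0.0) for ch in text)

    def predict(self, text):
        return 1 if self.score(text) > 0 else -1

    def partial_fit(self, texts, labels):
        """Update on one new batch only; labels are +1 (spam) or -1 (not spam)."""
        for text, label in zip(texts, labels):
            if self.predict(text) != label:
                # Classic perceptron update on a mistake.
                for ch in text:
                    self.weights[ch] = self.weights.get(ch, 0.0) + label
                self.bias += label
        return self


model = OnlinePerceptron()
model.partial_fit(["free cash", "see you"], [1, -1])   # first labeled batch
model.partial_fit(["cash prize"], [1])                 # later batch, no full retrain
print(model.predict("cash prize"), model.predict("see you"))
```

Each call touches only the new batch, so the cost per round stays proportional to the batch size rather than to the whole labeled set.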

That is all I can think of for now. Discussion is welcome!

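If you are hoping to freeload, don't waste your time installing this

I first tried installing with Docker. That did not work.

Then I did a native install. Even after installing, I could not get it to run.

Then I read the issues, and the author has already pointed out that this is a half-finished product that still needs a lot of development.

So if you are reading this post and hoping to freeload, forget it and don't waste your time.

If you want to help the author with development, go ahead, and I hope I get to freeload off whatever you finish.

Thanks to the author for the effort all the same. It would just help to state up front that this is a half-finished product, to save freeloading programmers some time.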

[algo_factory] Spam Email Classification Pipeline and Module Prototype

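On the algorithm side, we now have the email spam data and a classification target.

To implement:

  1. The frontend imports data and writes it into the database, then presents items one by one for the user to label as spam or not, and passes the labels to the algorithm through the API (this part needs task_center for scheduling).
  2. On the algorithm side, start with the online, traditional machine learning part: data preprocessing (chi_annotator/algo_factory/preprocess/), feature extraction (chi_annotator/algo_factory/feature_extraction/), and SVM classification (chi_annotator/algo_factory/online/).
    The whole pipeline should be modular and general, split into separate .py files, so that once the pipeline's modules are defined in the config, the different parts can be called to run the flow.
  3. The algorithm must compute confidence and rank by it, then feed back to the frontend the next item that needs labelling. This part also needs scheduling and an interface.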
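The idea in item 2, naming the pipeline's modules in the config and letting the code assemble them, can be sketched with a small registry. The step names, the `pipeline` config key, and the decorator below are assumptions made for illustration; the real project may wire its modules differently.

```python
from typing import Callable, Dict, List

Step = Callable[[dict], dict]
REGISTRY: Dict[str, Step] = {}  # maps a name used in the config to a function


def register(name: str) -> Callable[[Step], Step]:
    """Decorator that makes a step available to configs under the given name."""
    def decorator(fn: Step) -> Step:
        REGISTRY[name] = fn
        return fn
    return decorator


@register("strip_whitespace")
def strip_whitespace(msg: dict) -> dict:
    msg["text"] = "".join(msg["text"].split())
    return msg


@register("char_bigrams")
def char_bigrams(msg: dict) -> dict:
    text = msg["text"]
    msg["features"] = [text[i:i + 2] for i in range(len(text) - 1)]
    return msg


def build_pipeline(config: dict) -> Step:
    """Look up each configured step by name; an unknown name raises KeyError."""
    steps: List[Step] = [REGISTRY[name] for name in config["pipeline"]]

    def run(msg: dict) -> dict:
        for step in steps:
            msg = step(msg)
        return msg

    return run
```

With this shape, switching preprocessing or feature extraction for a task means editing the list in its config, for example `{"pipeline": ["strip_whitespace", "char_bigrams"]}`, rather than editing code.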

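An SVM example taken from rasa_nlu has been placed in chi_annotator/algo_factory/online for reference.

Please help refine this together, then try to claim one of the pieces and start implementing it. Before writing the actual code, we need to agree on the function definitions, inputs, and outputs.

New functions and new code can be pushed directly; if you are modifying someone else's code, please use a pull request so that the original owner can review and merge.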

How to get started?

After installing the dependencies:

pip install -r requirements.txt

What should I do next? How do I run the service?

How to use the tool

After installing all the packages the project needs and running yarn serve, the web page comes up, but clicking the "导入" (Import) button raises a network error.

image

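There are some problems with the installation docs; please update them

After a fair amount of fiddling I finally got it running. I think the installation docs need some improvement:

  1. The mongod command requires MongoDB to be installed beforehand and added to PATH; users should be told this.
  2. The command to start the web service is yarn serve, not yarn start.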
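A small preflight check could surface the first problem before any service is started. The sketch below only uses the standard library's `shutil.which`; the list of tool names is an assumption based on the tools mentioned in the setup steps, and the helper names are hypothetical.

```python
import shutil
from typing import Dict, Iterable, List, Optional

REQUIRED_TOOLS = ("mongod", "node", "yarn")  # assumed from the setup steps


def find_tools(names: Iterable[str]) -> Dict[str, Optional[str]]:
    """Map each tool name to its resolved path, or None if it is not on PATH."""
    return {name: shutil.which(name) for name in names}


def missing_tools(names: Iterable[str]) -> List[str]:
    """Names of the tools that could not be found on PATH."""
    return [name for name, path in find_tools(names).items() if path is None]


if __name__ == "__main__":
    for tool in missing_tools(REQUIRED_TOOLS):
        print(f"{tool} was not found on PATH; install it before continuing")
```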

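The environment is set up, but how do I use this software?

Hello, project author. It is great to have a pioneer like you addressing the real need for annotation tools. I have some questions about using the project.
I have set up the environment, on macOS.

  1. After starting yarn serve and opening the JS page, the control panel has only two options: (1) export annotation_data.josn (2) import (email_classify_chi_nolablel.txt). Apart from these I cannot find any button for importing documents, nor a button for choosing between text classification and named entity recognition. I want to annotate text for NER; how do I do that on the JS page?
  2. I used the email_classify annotation. After labelling the imported documents, clicking to export the .josn file raises an error: key error : "txt". After I quit the project and reopened the page with yarn serve, both import and export fail with errors. What is going on? How do I choose which files to import and export? How do I choose the path of the exported file?

Looking forward to your reply. Thank you very much.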

Running yarn start, the last step in the README, fails with "错误: 找不到或无法加载主类 start" (Error: Could not find or load main class start)

Chinese Annotator version (e.g. 0.1):

Used backend / pipeline (jieba, sklearn, ...):

Operating system (windows, osx, ...):

Issue:

Proposed New Feature:

Content of configuration file (if used & relevant):

Running yarn start, the last step in the README, produces an error:
yarn start
错误: 找不到或无法加载主类 start

The tail of the log under the web folder shows:
20 error code ELIFECYCLE
21 error errno 1
22 error [email protected] serve: vue-cli-service serve
22 error Exit status 1
23 error Failed at the [email protected] serve script.
23 error This is probably not a problem with npm. There is likely additional logging output above.
24 verbose exit [ 1, true ]

Running yarn with no arguments prints:

Usage: yarn [--config confdir] [COMMAND | CLASSNAME]
  CLASSNAME                             run the class named CLASSNAME
 or
  where COMMAND is one of:
  resourcemanager -format-state-store   deletes the RMStateStore
  resourcemanager                       run the ResourceManager
  nodemanager                           run a nodemanager on each slave
  timelineserver                        run the timeline server
  rmadmin                               admin tools
  sharedcachemanager                    run the SharedCacheManager daemon
  scmadmin                              SharedCacheManager admin tools
  version                               print the version
  jar <jar>                             run a jar file
  application                           prints application(s)
                                        report/kill application
  applicationattempt                    prints applicationattempt(s)
                                        report
  container                             prints container(s) report
  node                                  prints node report(s)
  queue                                 prints queue information
  logs                                  dump container logs
  classpath                             prints the class path needed to
                                        get the Hadoop jar and the
                                        required libraries
  cluster                               prints cluster information
  daemonlog                             get/set the log level for each
                                        daemon

There is no start command.
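The usage text above lists Hadoop YARN subcommands (resourcemanager, nodemanager, and so on), which suggests that the yarn found first on PATH here is Hadoop's launcher rather than the JavaScript package manager. A sketch for checking this follows; the helper names are hypothetical, the marker strings are taken from the usage text above, and the PATH scan assumes a POSIX system where executables have no file extension.

```python
import os
from pathlib import Path
from typing import List, Optional

HADOOP_MARKERS = ("resourcemanager", "nodemanager", "CLASSNAME")


def all_on_path(name: str, path: Optional[str] = None) -> List[str]:
    """Every executable called `name` on PATH, in the order the shell would try them."""
    if path is None:
        path = os.environ.get("PATH", "")
    hits = []
    for directory in path.split(os.pathsep):
        candidate = Path(directory or ".") / name
        if candidate.is_file() and os.access(candidate, os.X_OK):
            hits.append(str(candidate))
    return hits


def looks_like_hadoop_yarn(usage_text: str) -> bool:
    """True if the text printed by `yarn` mentions Hadoop YARN subcommands."""
    return any(marker in usage_text for marker in HADOOP_MARKERS)


if __name__ == "__main__":
    for location in all_on_path("yarn"):
        print(location)  # the first entry is the one that actually runs
```

If more than one location is printed, the first one shadows the others, and reordering PATH or invoking the intended binary by its full path avoids the clash.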

When I run npm run serve, I get:

vue-cli-service serve

INFO Starting development server...
10% building 2/2 modules 0 active
events.js:174
throw er; // Unhandled 'error' event
^

Error: getaddrinfo ENOTFOUND x86_64-apple-darwin13.4.0
at GetAddrInfoReqWrap.onlookup [as oncomplete] (dns.js:56:26)
Emitted 'error' event at:
at GetAddrInfoReqWrap.doListen [as callback] (net.js:1457:12)
at GetAddrInfoReqWrap.onlookup [as oncomplete] (dns.js:56:17)
npm ERR! code ELIFECYCLE
npm ERR! errno 1
npm ERR! [email protected] serve: vue-cli-service serve
npm ERR! Exit status 1
npm ERR!
npm ERR! Failed at the [email protected] serve script.
npm ERR! This is probably not a problem with npm. There is likely additional logging output above.

What happened?
Thanks.

A question about startup

image
Following the README, I installed everything. After running bash scripts/run_webui.sh, I opened a new terminal and installed nodejs, npm, and yarn, but got the following error
image
image
When using npm run serve, I get the following error
image

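User interactions required by the frontend pages

In the early stage we want to build annotation tools for three Chinese NLP tasks: Chinese named entity recognition, Chinese relation extraction, and Chinese text classification.
For each of these three tasks, what kind of interaction does the frontend page need from the user?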

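Installation is far too unfriendly; could you provide a Docker version?

I spent a whole day and simply could not get it installed. There are far too many dependencies.
1. Installing scikit-learn in a Python virtualenv failed no matter what I did. After countless attempts, I finally got it installed by changing the version number in requirements.txt.
2. Downloading the repository from GitHub: 40 MB would not finish in an hour. I gave up and used a VPN, and it finished in seconds.
3. The machine had no node environment, so I installed node; one package after another was missing, and I installed them one by one.
4. yarn start kept failing because it could not find the start argument. I searched Stack Overflow, finally found a solution, and hacked the parameters.
5. With that solved, I ran the build, and src reported errors. At that point I was completely out of ideas.

There are too many dependencies to install; most people will not be able to use this.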

Can this project actually be run?

Chinese Annotator version (e.g. 0.1):

Used backend / pipeline (jieba, sklearn, ...):

Operating system (windows, osx, ...):

Issue:

Proposed New Feature:

Content of configuration file (if used & relevant):
