Giter Club home page Giter Club logo

comment-analysis's Introduction

大一立项——结合用户评价分析的网店假货预警

任何一个评论都可以分为 5 类:与商品质量无关、是水军、情感正向、情感中性、情感负向。于是,我们只需将一个评论分为这 5 类之一,这就是一个文本分类问题。

总流程:

总流程

时间节点

开题时间:2019年12月27日

中期答辩时间:2020年9月12日

结题时间:2020年11月1日

参考资料

[1] 李嘉明.面向跨境电商的高风险商品识别系统

[2] 李文江.基于深度学习的商品评价数据分析系统: https://www.doc88.com/p-2159917854320.html

[3] Python官方文档.csv文件的读写: https://docs.python.org/3/library/csv.html

[4] 简书.Python csv模块的使用: https://www.jianshu.com/p/d2e3d12bcc4d

[5] jieba分词系统官方文档: https://github.com/fxsjy/jieba

[6] 文本分类性能比较: https://mp.weixin.qq.com/s/_xILvfEMx3URcB-5C8vfTw

[7] Armand Joulin, Edouard Grave, Piotr Bojanowski, Tomas Mikolov. Bag of Tricks for Efficient Text Classification: https://arxiv.org/pdf/1607.01759.pdf

[8] FastText官方文档: https://fasttext.cc/docs/en/unsupervised-tutorial.html

[9] fastText原理和文本分类实战,看这一篇就够了: https://blog.csdn.net/feilong_csdn/article/details/88655927

[10] fastText python官方文档: https://pypi.org/project/fasttext/#text-classification-model

[11] 基于FastText进行文本分类: https://zhuanlan.zhihu.com/p/66739066

[12] 使用python+机器学习方法进行情感分析(详细步骤): http://www.360doc.com/content/16/0725/19/15165994_578332920.shtml

[13] 知乎专栏.炼丹笔记一:样本不平衡问题: https://zhuanlan.zhihu.com/p/56882616

[14] 解决样本不平衡问题的奇技淫巧 汇总: https://blog.csdn.net/songhk0209/article/details/71484469

[15] 知乎回答.欠采样(undersampling)和过采样(oversampling)会对模型带来怎样的影响?: https://www.zhihu.com/question/269698662/answer/352279936

[16] Python 使用 NLTK 实现简单情感分析--Twitter(推特)分析: https://www.cnblogs.com/yidada/p/11868870.html

[17] nltk 数据下载: https://github.com/nltk/nltk_data

[18] NLTK最详细功能介绍: https://www.cnblogs.com/chen8023miss/p/11458571.html

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.