Giter Club home page Giter Club logo

semantic's Introduction

Semantic

Abstract

With the development of e-commerce, more and more people browse, collect, purchase, and publish their reviews through the e-commerce websites. The ocean of product reviews can be used to mine user command to guide producter design his product and marketing plan. This paper chooses AliExpress as the research object, and excavates Aliexpress's review data through the construction and reasoning of the knowledge graph.

In order to build knowledge graph of the AliExpress’s review, first we use the crawler to collect data according to AliExpress's second category, and store the text data in a relational database; secondly, we use the CoreNLP provided by Stanford University to name the text data for entity recognition and extract their entity relationship, and form a triple structure data; then we use the data set of Probase provided by Microsoft Asia Research Institute(MSRA) to identify the entity relationship type of the triplet data; and finally we build the reasoning semantic layer by a three-group and triple relationship type.

When obtained the AliExpress review knowledge graph, it is necessary to predict link and complete the the knowledge graph. Because the triplet data is discrete and network-based, it is difficult to data mining, so it needs to be converted into a computable, continuous, distributed triad vector. Firstly, we use the TransE model to train the triple data and obtain the distributed triples vector. Then we use the TransE model to calculate the Euclidean distance between entities in the knowledge map to predict the entity relationship, and then determine the possible entity relationships, and then complete the knowledge graph.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.