Giter Club home page Giter Club logo

gaiic2022-product-title-entity-recognition-baseline's Introduction

GAIIC2022商品标题实体识别Baseline,使用GlobalPointer实现

简介

比赛名称:GAIIC2022商品标题实体识别

测评任务:本赛题要求选手使用模型抽取出商品标题文本中的实体。与传统的实体抽取不同,京东商品标题文本的实体密度高、实体粒度细,赛题具有特色性。

任务类型:命名实体

测评链接:https://gaiic.caai.cn/ai2022

注意事项:本Baseline开源经大赛组委会同意 ,建议大家开源Baseline前先联系主办方

思路

使用苏神提出的GlobalPointer设计,它利用全局归一化的思路来进行命名实体识别(NER),可以无差别地识别嵌套实体和非嵌套实体。

环境

pip install ark-nlp
pip install pandas

ark-nlp开源地址:https://github.com/xiangking/ark-nlp

目录结构

│
├── data                                    # 数据文件夹
│   ├── source_datasets                                                   
│
└── code                                    # 代码

使用说明

下载数据并解压到data/source_datasets中,运行code文件夹中的baseline.ipynb

效果

  • 线上测试:0.80151539

gaiic2022-product-title-entity-recognition-baseline's People

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.