ner's Introduction

NER

NER for medieval encyclopedia for the SourcEncyMe project at IRHT (Paris).

The model has been trained on the basis of the Lewis & Short dictionnary of which only proper names have been maintained (file "data/ls_indexData_capital.json") and the encyclopedia of Marcus d'Orvieto. The proper nouns of the dictionnary have been declined thanks to a script of Dr. WJB Mattingly (https://github.com/wjbmattingly/latin_ner_lesson) in order to recognize the declined forms of the proper nouns in the texts.

The dictionnary of the recognized entities througout the encyclopedia of Marcus d'Orvieto is the file "train/training_set.json". Each entity and their position in the corpus are listed. The only entities with their pattern have been listed in the file "temp/results.txt".

The test is applied on another text. A list of the entities without their position and only once referenced is the file "temp/results_one_ref.txt". Each entity and their position in the text are listed in the file "test.json".

Recommend Projects

etienneferrandi / ner Goto Github PK

ner's Introduction

NER

ner's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent