wikidoc's Introduction

WikiDocs - A dataset of Wikipedia pages in a consumable format for NLP projects.

Abstract:- Wikipedia is one of the major free online encyclopedias; it is created and edited by volunteers around the world and hosted by the Wikimedia Foundation. The corpora provided by Wikipedia are of significant use to researchers across many areas of natural language processing. Chatbots are no exception: they, too, need data for better modelling. Here we make available a Wikipedia corpus that can be used for natural language processing projects. Each document generally consists of a definition and a brief elaboration of the topic in question.

Other Details:-

Link to the Paper:- https://drive.google.com/file/d/1zlY8KGN0Fu00_WvWtul34C1wD44gLlti/view?usp=sharing

Link to the Presentation:- https://drive.google.com/file/d/1rgUOQHLdzYI7nqr9x4es_hgcDED5RpA9/view?usp=sharing

Link to the Dataset:- https://drive.google.com/file/d/15OaFfMYA_pvaiSSxWyLAljt-_0aWWig2/view?usp=sharing

Feel free to email the author with any queries or suggestions at [email protected].

In Proceedings of the Competition Track of Workshop W-19, Reasoning and Learning for Human-Machine Dialogues (DEEP-DIAL20), at the Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI), 2020.

Recipient of a travel grant for the workshop from AIJ (Artificial Intelligence Journal). Special thanks to the committee members of DEEP-DIAL20, AAAI 2020 (Dr. Srivastava and team).

Acknowledgement:- Dr. Biplav Srivastava for continuous encouragement.
