Check out the report in this link for more information about this project.
WikiArticleXmlHandler
: This file handles the parsing for the articles.WikiDataXmlHandler
: This file handles the parsing of the wiki data.Utils
: Some utilities function, such as a function to download the wikidata, articles, dictionaries, etc.data_play
: lot of dummy code that visualize the data that was extracted.main
: Driver code to run the parser and the NLP.playground
: Some dummy code to debug and do the base code.ProcessArticle
: It'll parse and process a single article.