- Simple question-answering system using CrunchBase and LinkedIn dataset.
- Wrote a web crawler to scrape and parse the text data for the NLP algorithm.
- Converted the extracted data into RDF format and expanded the vocabulary using Google Word2Vector.
- Filtered the best results using elastic search ranking algorithm.
- Technologies used: Python, Pandas, Natural Language Toolkit, Stanford CoreNLP and Standford NER parser, BeautifulSoup, RDF (Sparql) and ElasticSearch, Word2Vector (GoogleNews)
rajatsharma07 / crunchbase-semantics Goto Github PK
View Code? Open in Web Editor NEWText Mining : Simple question-answering system using CrunchBase & LinkedIn datasets