Built a content extraction and a local search engine using Apache Tika for Employment dataset from DARPA XDATA.
Built this project by cleansing and transforming the data and developing an algorithm for ranking the job postings. Developed a crawler using Tika to run across the employment dataset to show relevant job postings.