budirs86 / airflow-pdf2embeddings Goto Github PK
View Code? Open in Web Editor NEWThis project forked from moj-analytical-services/airflow-pdf2embeddings
NLP tool for scraping text from a corpus of PDF files, embedding the sentences in the text and finding semantically similar sentences to a given search query.
License: MIT License