The project aims to develop a machine learning-based solution for automated resume detection and extraction. With the increasing volume of job applications, it becomes challenging for recruiters to manually review and extract information from each resume. This project seeks to streamline and expedite the resume screening process by leveraging machine learning algorithms.
The system will utilize state-of-the-art natural language processing (NLP) techniques and machine learning models to analyze and extract relevant information from resumes using SpaCy, Pandas, NumPy, Sciki-Learn etc.
pip install -r requirements.txt
=> Just locate your Command Prompt or Powershell to Project Folder
=> Run the following command to activate the flask API server
python app.py
=> Here we have two endpoints "/resumeparser" and "/resumedetection".
=> "/resumeparser" is directly extracting information from PDF file without ensuring that the given PDF file is RESUME or not.
=> "/resumedetection" is checking first that the given PDF file is RESUME or not and after that if it is then it will start to extract information from it else it will simply display that the given PDF file is not RESUME.
=> To use the service of API, you have give the following inputs to PostMan...
-
Create a new request with POST method and http://127.0.0.1:5000/resumedetection
-
In authorization tab, give username and password credentials which are defined in .env file of the project.
-
In Body tab move to form-data tab, enter key as "filePath" and select type of input as File and select file from values column.
-
Also give the information of your desired information like name, email, phone or skills in array withe key value "extractFields" and select type of input as Text and give that array into Values column as ["name", "email", "phone"] if you want to have only name , email and phone from the uploaded PDF file.
-
If you want to have all information from PDF file then no need to follow step 4.