This project contains a set of Python scripts that search LinkedIn for profiles matching the names in an input CSV file, extract specific information from each profile, and store the results in an output CSV file.
The project consists of the following files and directories:
linkedin-profile-crawler/
├── constants.py                  # Contains credentials and constants
├── crawler.py                    # Main script for crawling LinkedIn profiles
├── input_data.csv                # CSV file containing names to search for
├── notfound_data.csv             # CSV file for profiles that couldn't be found
├── output_data.csv               # CSV file containing the crawled data
├── README.md                     # This README file
├── requirements.txt              # List of Python package dependencies
└── utils/
    ├── department_separator.py   # Utility script for department separation
    ├── extracted_content.json    # JSON file with extracted content
    ├── __init__.py               # Makes utils a Python package
    ├── students_separator.py     # Utility script for student separation
    └── teacher_separator.py      # Utility script for teacher separation
- Create a LinkedIn account if you do not already have one.
- Clone this repository to your local machine.
- Create a constants.py file with your LinkedIn credentials and other constants. Example constants.py:

      email = '[email protected]'
      password = 'your_password'

- Install the required Python packages using the requirements.txt file:

      pip install -r requirements.txt

  The most important packages are selenium and beautifulsoup4.
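Before running the crawler, it can help to confirm that the two key packages are actually installed. The following is a minimal sketch of such a check, not part of the project itself. The function name `missing_packages` and the `REQUIRED_MODULES` mapping are hypothetical and introduced only for illustration. Note that the beautifulsoup4 package is imported under the module name bs4.

```python
import importlib.util

# Hypothetical mapping of pip package name -> importable module name.
# beautifulsoup4 is installed under that name but imported as "bs4".
REQUIRED_MODULES = {"selenium": "selenium", "beautifulsoup4": "bs4"}


def missing_packages(required):
    """Return the pip names of packages whose top-level module cannot be found."""
    return [
        pip_name
        for pip_name, module_name in required.items()
        if importlib.util.find_spec(module_name) is None
    ]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED_MODULES)
    if missing:
        print("Missing packages:", ", ".join(missing))
        print("Run: pip install -r requirements.txt")
    else:
        print("All key packages are installed.")
```

The check only looks up whether each module can be located. It does not import the module or verify its version.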
- Ensure you have an input_data.csv file in the root directory with the names you wish to search on LinkedIn.
- Run the crawler.py script:

      python crawler.py

- After the script finishes, check output_data.csv for successfully found profiles and notfound_data.csv for profiles that could not be found on LinkedIn.
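The flow from input_data.csv to the two result files can be sketched roughly as follows. This is an illustration under stated assumptions, not the actual code in crawler.py: the `name` column, the `headline` field, and every function name here are hypothetical, and `fake_lookup` stands in for the real Selenium-based search.

```python
import csv


def read_names(path, column="name"):
    """Read names from a CSV file with a header row (the column name is an assumption)."""
    with open(path, newline="", encoding="utf-8") as f:
        return [row[column].strip() for row in csv.DictReader(f) if row.get(column)]


def route_results(names, lookup):
    """Split names into found and not-found rows.

    `lookup` is any callable that returns a dict of profile fields,
    or None when no profile was found.
    """
    found, not_found = [], []
    for name in names:
        profile = lookup(name)
        if profile is None:
            not_found.append({"name": name})
        else:
            found.append({"name": name, **profile})
    return found, not_found


def write_rows(path, rows, fieldnames):
    """Write a list of dict rows to a CSV file with a header row."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(rows)


def fake_lookup(name):
    """Placeholder for the real search; knows a single hard-coded profile."""
    known = {"Ada Lovelace": {"headline": "Mathematician"}}
    return known.get(name)
```

A possible usage, assuming the same hypothetical column names: call `route_results(read_names("input_data.csv"), fake_lookup)`, then pass the first list to `write_rows("output_data.csv", found, ["name", "headline"])` and the second to `write_rows("notfound_data.csv", not_found, ["name"])`.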
- 📁: Indicates a directory or file
- 🛠️: Represents the setup process
- 🚀: Denotes the action of running the script
If you encounter any problems or have questions, feel free to contact me by email at [email protected] or on Telegram at navid_falah.