Giter Club home page Giter Club logo

ScrapedIn

Generate potential email addresses from LinkedIn

Description

In 2017 scrapedin was created. The tool has been kept internal because there is a fear that linkedin will pursue legal action for publishing a scrapper. To overcome this fear, scrapedin is being published without any credit to the developers. It should be noted that this tool is to be used by professionals performing testing for organizations that they are authorized to work with. Malicious/illegal use of this tool is not the intention. Use at your own risk, as scrapedin does violate linkedins TOS. If this tool is useful to you- give thanks by spreading the word.

ScrapedIn allows you to gather the names of employees on LinkedIn and generate emails in the format of your choosing at runtime (-s allows users to specify format strings). This is useful for penetration tests that include phishing. This scrapper relies on selenium and the gecko webdriver. When you run the scrapper, you will be able to watch the scrapping happening live in a web browser. The reason scrapedin was not made headless is because it makes debugging issues exceptionally easy.

A CSV file with the users name, email, title, location and profie URL is output after a scrape is completed. Scrapedin also provides users with a way to generate email addresses from a specific geopgrahical region, which can help you be more targeted during a red team engagement.

scrapedin running

Example Usage:

# Company Scraping
./scrapedin.py -c "Company Name" -o filename.csv -u [email protected] -s {first:.1}{last}@{domain}.com

# Georegion Scraping - Useful for scraping users from a company within a specific region
./scrapedin.py -c "Giant Global Conglomorate" -o filename.csv -u [email protected] -s
{first}.{last}@{domain}.com -g "New York City"

# Industry Scraping - Useful for multiple companies with same name but different industries
./scrapedin.py -c "Smith Inc." -o filename.csv -u [email protected] -i manufacturing

# Georegion/Industry lookup syntax - View available/valid regions/industries to scrape for
./scrapedin.py -l -g "Somewhere" -i "manufacturing"

Requirements:

  • Selenium
  • Firefox-ESR
  • Gecko webdriver
  • Python 3.3+

Install Dependencies:

Pipenv method (recommended)

Install specific versions of selenium & geckodriver via pipenv for a safer, more pleasant scraping experience. This method will allow you to execute scrapedin via pipenv shell or pipenv run python scrapedin.py and will respect your globally installed external libraries.

Simply install via

git clone https://github.com/scrapedin/scrapedin
cd scrapedin
pipenv install
pipenv run python scrapedin.py 

OR if you prefer not to use a pipenv

pip install .

Shortcut script

For added convenience, simply add the following wrapper script to your /usr/local/bin/ or $HOME/.local/bin/ path.

cat <<EOF >> $HOME/.local/bin/scrapedin
#!/bin/bash
exec pipenv run python $(pwd)/scrapedin.py "\$@"

EOF
chmod +x $HOME/.local/bin/scrapedin
Scrapedin will use the local geckodriver executable path in webdriver/<arch>/geckodriver which is v0.24.0. Manual installation is no longer required.

Manually install geckodriver (optional)

Gecko Webdriver:

Find the current release of the Gecko Webdriver here (https://github.com/mozilla/geckodriver/releases). Once you have the correct web address, follow these instructions:

wget https://github.com/mozilla/geckodriver/releases/download/v0.14.0/geckodriver-YOUR-VERSION.tar.gz
tar -xvzf geckodriver-YOUR-VERSION.tar.gz
sudo cp geckodriver /usr/bin/geckodriver
rm gecko*

Firefox (Debian/Kali):

sudo apt-get install firefox-esr

scrapedin's Projects

scrapedin icon scrapedin

Generate potential email addresses from LinkedIn

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.