Python Script to Scrape Pastebin with Regex. This is by far NOT a 'finished project' and plan to improve this over time. My goal is to make PastaBean as flexible as I can and simple to run with minimal requirements to capture data.
Created script to learn Python and capture data on the popular site https://Pastebin.com.
https://github.com/Tu5k4rr/PastaBean-Samples
- Scrape Pastebin, 100 queries per 60 seconds.
- Write matches to text file in same directory.
- Logging -
pasta.log
- Pastebin PRO account to use the API to scrape and whitelist your Internet IP (https://pastebin.com/doc_scraping_api).
sudo apt-get install python python-pip
pip install requests
- Run on VPS
- Login via SSH
- Run script as background process(Python2.7):
python PastaBean.py &
- Run script as background process(Python3.5.2):
python3 PastaBean.py &
: Tested and works. - To release background process to
init
useexit
to logout of ssh session - Log back into VPS via SSH verify
Pastabean.py
is still running - Happy Hunting :)
- Docker file to create image
- Improve current RegeX
- Add more Regex!!!
Decreased status output to one line.Generate log file for each alarm to replace e-mail alerts
- Twitter @Tu5k4rr
- E-mail: [email protected]