malwarize / webpalm
🕸️ Crawl in the web network
Home Page: https://malwarize.live
License: GNU General Public License v3.0
After installing webpalm, both by cloning the repository and by direct go installation, I tried to run the tool with "webpalm -u https://google.com -l1 --live", but it just doesn't run. I also tried running it in a Docker container, and it still didn't work.
command:
go run main.go --regexes "emails=([a-zA-Z0-9_.+-]+@[a-zA-Z0-9-]+\.[a-zA-Z0-9-.]+),passwords=\bpassword\b\s*.{0,10}$"
output:
Error: invalid argument "emails=([a-zA-Z0-9_.+-]+@[a-zA-Z0-9-]+\\.[a-zA-Z0-9-.]+),passwords=\\bpassword\\b\\s*.{0,10}$" for "--regexes" flag: 10}$ must be formatted as key=value
Upon further investigation, this seems to be caused by how the GetStringToString function (called at line 178) parses its input. Its implementation calls the stringToStringConv function, which treats the string as CSV and therefore splits it on every comma, including the comma inside the {0,10} quantifier. As a result, ss ends up as follows:
ss = []string{
    `emails=([a-zA-Z0-9_.+-]+@[a-zA-Z0-9-]+\.[a-zA-Z0-9-.]+)`,
    `passwords=\bpassword\b\s*.{0`,
    `10}$`,
}
Each element is then parsed as key=value. The last element, 10}$, contains no "=", so it is not a valid pair, and that causes the error.
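The splitting described above can be reproduced with the standard library alone. The following is a minimal sketch, not the actual flag-parsing code: splitLikeCSV only mimics the CSV step, and parseRegexPairs is a hypothetical alternative that assumes ';' as the pair separator (an assumption for illustration, not a webpalm convention).

```go
package main

import (
	"encoding/csv"
	"fmt"
	"strings"
)

// splitLikeCSV mimics the comma splitting described above by reading the
// whole flag value as a single CSV record.
func splitLikeCSV(val string) ([]string, error) {
	return csv.NewReader(strings.NewReader(val)).Read()
}

// parseRegexPairs is a hypothetical alternative parser. It splits pairs on
// ';' (an assumed separator) so that commas inside a regex are preserved,
// then splits each pair on the first '=' only.
func parseRegexPairs(val string) (map[string]string, error) {
	out := make(map[string]string)
	for _, pair := range strings.Split(val, ";") {
		kv := strings.SplitN(pair, "=", 2)
		if len(kv) != 2 || kv[0] == "" {
			return nil, fmt.Errorf("%q must be formatted as key=value", pair)
		}
		out[kv[0]] = kv[1]
	}
	return out, nil
}

func main() {
	in := `emails=([a-zA-Z0-9_.+-]+@[a-zA-Z0-9-]+\.[a-zA-Z0-9-.]+),passwords=\bpassword\b\s*.{0,10}$`

	// The CSV step cuts the second regex at the comma in {0,10}.
	ss, err := splitLikeCSV(in)
	fmt.Println(len(ss), err)
	for _, s := range ss {
		fmt.Println(s)
	}

	// With ';' between pairs, both regexes arrive intact.
	pairs, err := parseRegexPairs(strings.Replace(in, ",passwords=", ";passwords=", 1))
	fmt.Println(pairs["passwords"], err)
}
```

A ';' can itself appear in a regex, so this only moves the problem. Another option would be a repeatable flag that takes one key=value pair per occurrence (pflag's StringArray, for example, does not split its values on commas), at the cost of changing the command-line syntax.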
On macOS Ventura 13.4 (22F66), I'm trying to run:
webpalm -u https://www.nammalonline.co.il/ -l2 -i nammalonline.co.il
If I try to stop the run by pressing ^C, it does not respond.
When I try to run
webpalm -u https://www.nammalonline.co.il/ -l3 -i nammalonline.co.il
after a few seconds I get an error message saying
runtime: failed to create new OS thread
This could be due to the number of pages it is trying to crawl simultaneously, or due to new threads being created faster than the existing ones complete their work. The number of threads shown in top jumped from 12K to 18K in a few seconds, and then webpalm (which was eating up 1800% CPU) crashed.
I think some control over the number of simultaneous threads is required.
I hope my peers and family members do not run -l2 against my domain, or I'll have some explaining to do...lol.
./webpalm -u https://all-around-audit-solutions.com -l2 --live
├── [https://www.bingoporno.com/](200)
Running
webpalm -u https://www.nammalonline.co.il/wp-content/uploads/ -l3 -i nammalonline.co.il
returns:
┌[https://www.nammalonline.co.il/wp-content/uploads/]
│Level: 3
│Live Mode: false
│Export to: nothing
│Regexes:
│ nothing
│Crawl Only :
│ nammalonline.co.il
│Excluded Status: nothing
└
└── [https://www.nammalonline.co.il/wp-content/uploads/](0)
even though the uploads directory is open for browsing and contains multiple URLs to crawl.
We must add a cycle detection algorithm to avoid infinite recursion.
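For a crawler, the simplest form of cycle detection is a set of already visited URLs. A minimal sketch, assuming a hypothetical crawl function and an in-memory links map standing in for fetching a page and extracting its URLs:

```go
package main

import "fmt"

// crawl walks the link graph depth-first from start, at most maxDepth levels
// deep, and returns the pages in visit order. links is a hypothetical
// in-memory stand-in for downloading a page and extracting its URLs.
func crawl(start string, links map[string][]string, maxDepth int) []string {
	visited := make(map[string]bool)
	var order []string

	var visit func(u string, depth int)
	visit = func(u string, depth int) {
		// Stop at the depth limit or when the page was already seen;
		// the visited check is what breaks cycles.
		if depth > maxDepth || visited[u] {
			return
		}
		visited[u] = true
		order = append(order, u)
		for _, next := range links[u] {
			visit(next, depth+1)
		}
	}
	visit(start, 0)
	return order
}

func main() {
	// /a -> /b -> /a and /c -> /a are cycles.
	links := map[string][]string{
		"/a": {"/b", "/c"},
		"/b": {"/a"},
		"/c": {"/b", "/a"},
	}
	fmt.Println(crawl("/a", links, 10)) // prints [/a /b /c]
}
```

In a real crawler the URLs would probably need normalizing (trailing slashes, fragments, scheme) before being used as map keys, otherwise the same page can be visited under several spellings.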
The exported text file maxes out at 650 rows.
I checked with "wc -l" while the tool was running.
Please remove the row cap.
While running the following command, as given in the documentation (README file):
webpalm -u https://google.com -l1 --regexes comments="\<\!--.*?-->",emails="([a-zA-Z0-9_.+-]+@[a-zA-Z0-9-]+.[a-zA-Z0-9-.]+)"
I'm getting the following error:
Error: invalid argument "comments=\\<\\!--.*?-->,emails=([a-zA-Z0-9_.+-]+@[a-zA-Z0-9-]+.[a-zA-Z0-9-.]+)" for "--regexes"
From Kali Linux, I'm in the webpalm folder and the webpalm command is highlighted in green and appears executable. I'm in a Google Cloud Shell. Should I try it in a VM instead?