Comments (4)
That's great and thanks for responding so quickly.
Yeah with two versions it should be good for folks who want to use Python2.7.
Let me know if you need help with testing.
from ioc-finder.
Hey, @fhightower are there any plans to do this?. Was looking into how to speed up the parsing portion.
from ioc-finder.
Yes, I'll start working on this today and hope to have a solid, working version in a couple of weeks.
Implementation details
As far as implementation details are concerned (this is more for my own thoughts than for anyone else), I'm planning on using asyncio. This is pretty easy (I've done similar things before); the only downside is that we will lose python2.7 support (which doesn't have much time remaining anyway: https://pythonclock.org/).
This change will be a major version (v2.x.x
). Version 1.x of ioc-finder will be compatible with python2.x+ . Version 2.x of ioc-finder will be comptabile with python3.4+ .
from ioc-finder.
I just shipped version 2.0.0 to pypi which uses https://docs.python.org/3.7/library/multiprocessing.html to run many of the parsing operations concurrently. Let me know if you have any questions!
Implementation Details (or: Lessons Learned)
I had originally planned on using asyncio. I converted the library to use asyncio and it actually made it slower. After doing some research, I found this article which was very helpful: https://realpython.com/python-concurrency/. Basically, a program can run slowly because of I/O latency or because of CPU usage. asyncio is good for solutions involving I/O latency and multiprocessing is good when CPU usage is the limiting factor.
For this library (ioc-finder), the limitation is the CPU usage, so I went with multiprocessing.
from ioc-finder.
Related Issues (20)
- Consider removing "filler words" from text HOT 1
- Improve documentation around "original_text" vs. "text" variables
- Maybe an ioc type should be added. HOT 1
- Some file paths match to wrong paths. HOT 1
- Review docs
- url parser error HOT 2
- Move large test into separate tests
- parser ssdeeps error HOT 2
- Incorrect parsing of ipv4 cidrs to urls HOT 1
- Improve URL boundary regarding commas (Old name: Wrong URL Pattern being Parsed) HOT 4
- Update lint step to fail properly
- Add documentation about how we handle natural, human language conventions vs. RFC
- Speed parsing by running concurrent processes HOT 11
- Incorrect extract domains from AV signatures HOT 1
- Identify ip or version HOT 1
- v7.3.0 does not work properly in interactive docs HOT 2
- URL path and query params getting to lower case HOT 1
- Add logging
- Feature Request: Give option to return found start and end index of IoC HOT 2
- Drop formal support for python 3.7 and 3.8
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ioc-finder.