Giter Club home page Giter Club logo

icewater's Introduction

Project Icewater

This project provides open-source YARA rules for the detection of malware and malicious files. the anti-virus industry prefers names for a threat. This is my attempt to publish signatures as numbers. Since I find the naming of threats to be confusing and misleading I am attempting to locate threats in a phase-space so that their relationships can be measured, visualized and scientificly described.

Each yara signature in this archive is organized by a prefix and a 64 bit integer. The prefix is an index into file size and file type while the suffix is a 64 bit coordinate in a multi dimintional hyper space. Within a prefix, edit distance may be used to understand how two clusters relate to eachother.

The Starting Problem

The basis of this research and this contribution to internet security is the idea of the Starting Problem which derives itself from Turing complete machines halting problem documented by Allen Turing in 1936. The staring problem I am defining thus: Knowing if a program should be allowed to run without running the program. My solution is to run about 4% of programs and by running them infer if the other 96% should be allowed to run.

Icewater is the project that clusters and sorts things on the interent. Icewater writes these rules in the hope that they are a compact form of transmitting knowlege reguarding programs that should have their evil-bit set :)

How these rules get written

Icewater clusters malicious objects on the internet and when it has enough information about these objects it will publish a yara rule that can be used to detect the threat. Since I am generally annoyed with the state of internet security I am publishing many of the rules Icewater writes.

Each rule leverages the hash module of the yara tools. I provide an offset into a file and the amount of data that you should hash and the hash algorthim. I choose md5 because it is fast and most folks dislike it because of the possability of collision. If you think I should choose a different hashing algorthim please explain over beers.

QA

Each rule is tested against the cluster that it is written from and against part of our larger data set. The rule should fire only for its cluster or a neighbor cluster that both have the same "family." Rules that pass these qualifications are published in this archive. Rules that fail these tests are used to inform the process and debug the systems that generate the rules. If you find a rule that is missing its target please let me know, contact deatiols are bleow.

Is Icewater a form of Artifical Intelligence?

Yes, if you are a VC -- Icewater is based off a kind of mathmatics that is used to describe the physical world, much like the math that we use for training AI. Icewater uses the same algorthims all Eukaryote (any cell that has a nucleas) use to organize their DNA. If you don't think binaries either in PE or COFF fomat are like DNA... Well, they are. You are a robot -- get used to it.

Remember Icewater writes the rules, I just write the part that writes Icewater, but I didn't write the algorthim -- nature did.

Goals

My goal for this project is to place a large quantity of yara rules into the network security community that it measureably effects global cyber security. Please let me know when you think I'm getting close to my goal.

License

Pay close attention to the RIL (Rick's Internet License) is is simular to the BSD with a 3rd clause that requires that if you use these rules and know me in physical space, you may need to acknolage that you use these rules. I do enforce the license at public and private events.

Contact

webpage: http://icewater.io
blog: http://cyberwarhead.com
Twitter: @wessorh

icewater's People

Contributors

wessorh avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.