Giter Club home page Giter Club logo

opendamp's Introduction

OpenDAMP

Operations of OpenDAMP (open Digital Advertising Measurement Platform):

  1. Categorization of websites resources (Advertising, Analytics, Social, 3rd party content, Rest) by using external sources (for now I use Disconnect plugins blacklist I have also the one of Adblocker Plus and Ghostery) and a list of ours that I created after manual inspection.
  2. Web Beacon detection by checking the pixel size of the fetched resources. (it replays each HTTP request - it's not always easy to detect if the req regards image (jsp case) and replay only GIFs or PNGs)
  3. Categorization of Advertisers based on the products they provide (by using g2crowd.com list), separation of Data Management Platforms, DSPs, Ad platforms etc.
  4. Calculation of statistics for each advertiser (number of requests, Total bytes delivered, number of users served, popularity in the dataset etc.)
  5. User Agent analysis to separate mobile Vs desktop related traffic and identification of device and OS (Android, Iphone, Windows phone).
  6. Calculation of statistics: (i) based on the full trace (ii) based on the different Categories e.g. number of Requests, percentage of traffic, average Latency, Total Bytes downloaded, and (iii) based on the different file types retrieved (iv) based on the traffic of different users.
  7. Extraction of user's IPs and calculation of her overall geographic movement (by using offline geoIP database of maxmind)
  8. In case of RTB related traffic, it first filters out possible duplicates from browser retransmissions and by using a list of keywords it extracts bidder (i.e. DSP), bid price, charge price (separation of encrypted and unencrypted ones), publisher, time of day, geolocation of the user, Cookie Synchronizations up to that moment, ad slot size, carrier, ad exchange platform and (rarely) the associated SSP.
  9. Estimation of each publishers type of content.
  10. Estimation of the user's interests by extracting the publishers she has visited along with the type of content they distribute.
  11. Cookie syncronization (CS) event detection by looking for (i) redirections (303 and 200 HTTP statuses) and (ii) cookie/users IDs passed from one host to another through the user's device.
  12. Creation of CS graph by using as metrics (i) the number of CS transactions each cookie ID had participated and the number of cookie IDs loaded in each CS transaction (multiplier of privacy leakage)
  13. Creation of user timelines, creation of time windows of specific width, and calculation of statistics for the traffic of each time window.
  14. Automated production of plots and distributions using the results of the above operations.

opendamp's People

Contributors

panpap avatar

Stargazers

 avatar  avatar Motin avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.