Giter Club home page Giter Club logo

charlotte's Introduction

Charlotte

A java web-spider.

This program takes a given seed URL (in this case rockhopper.us) and conducts either a breadth-first search or depth-first search on the structure of pages linked from the seed site. In this way the structure of the internet in the vicinity of a particular webpage can be understood.

####Database Structure The results gathered by Charlotte are stored in a MySQL database (configurable in the code, but curently scrubbed of login credentials for my own database). In my implementation I used a WAMP stack, although many other common database solutions could be easily implemented into this code. The structure of the internet is stored in graph form using two tables. The first keys an automatically incrementing integer ID to each unique URL encountered, and the second stores tuples of ID numbers which represent an edge in the graph. Together, these two tables can be used to reconstruct the structure of the internet as parsed by Charlotte.

####Application In this video, I describe how I used Charlotte's data in my Web project. Internet Visualization

####Acknowledgments Charlotte's DFS code was learned from ryanlr's excellent tutorial. JSoup was also invaluable in implementing this project. I brainstormed Charlotte's BFS method at the recent PennApps XII in conjunction with Charles Nickerson. Equipment used and code implemented are my own.

charlotte's People

Contributors

timtinkers avatar

Watchers

James Cloos avatar eric o avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.