466labs's Introduction

466Labs

Various Knowledge Discovery and Data Mining algorithms implemented (primarily) in Java.

Information Retrevial: A querying program designed to run on a collection of documents stored in a JSON Array, then use the Vector Space Model to organize these documents and retrieve them based on given queries. Uses stemming and stopword removal to better categorize documents. Then determines keyword weights inside of the documents to calculate Cosine Similarity or Okapi Score (depending what the user has chosen) based on a given query.

PageRank: A basic implementation of Sergey Brin and Larry Page's PageRank algorithm developed at Stanford, described in their paper "The Anatomy of a Large-Scale Hypertextual Web Search Engine". http://infolab.stanford.edu/~backrub/google.html

Our algorithm is designed to run either directed or undirected graphs, represented by the two Node classes. One is a generic Node for general csv parsing, while the other is specialized, designed to work with SNAP datasets from the Stanford Network Analysis Project. http://snap.stanford.edu/

Recommend Projects

gsawers1 / 466labs Goto Github PK

466labs's Introduction

466Labs

466labs's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent