This repo contains assignments work of Search Engine Architecture class. The goal is to build a scalable web search engine.
Assignment 1 is to build a load balancer usign tornado. Assignment 2 is Indexing and Retrieval. Assignment 3 is to build MapReduce framework. Assignment 4 is to build a distributed Indexer based on assignment2 and assignment 3. Assignment 5 is to develop a minimal distributed optimizer. The optimizer is used to train a Word2Vec model in parallel using stochastic gradient descent (SGD).