ssomani7 Goto Github PK
Name: Saurabh Arunkumar Somani
Type: User
Company: Santa Clara University
Bio: Master's in CSE from Santa Clara University, Santa Clara, California, USA
Location: Latur, Maharashtra, India
Name: Saurabh Arunkumar Somani
Type: User
Company: Santa Clara University
Bio: Master's in CSE from Santa Clara University, Santa Clara, California, USA
Location: Latur, Maharashtra, India
exercise for algorithm
A collection of algorithms and data structures
Implemented Spark machine learning Pipeline on AWS EMR for Collaborative Filtering to recommend users which online educational course they should take based on their viewing history. Target audience found using K-Means clustering over 2 billion data rows. • Using Kafka & Spark Structured Streaming simulated the above models as real time events with a window size of 2 minutes. • House price prediction for California residents based on Kaggle’s 2014/15 dataset using Linear Regression. Narrowed down the customers who were likely to purchase using Logistic Regression & Decision Tree Classifier along with Random Forests to choose the best performing model.
A list of awesome beginners-friendly projects.
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
Code Examples For Back To Back SWE Lessons
Analysis of SQL Leetcode and classic interview questions. Common pitfalls, anti-patterns and handy tricks are discussed. Sample databases are provided.
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
End-to-End Machine Learning Project based on the Stalib repository. This dataset is based on data from the 1990 California census. The machine learning models built in this project helps in determining whether to invest or not in the given area. The final prediction is made for a district's median housing price. This data has metrics such as the population, median income, median housing price, and so on for each block group in California. Block groups are the smallest geographical unit for which the US Census Bureau publishes sample data (a block group typically has a population of 600 to 3,000 people). </br> System uses multiple features to make a prediction (it will use the district’s population, the median income, etc.). This classifies for a **multivariate regression problem**. Hence the performance measure chosen for this is **_RMSE_** (Root Mean Square Error).
Online Certification
Java implementation of Checkers
Java implementation of Checkers with MouseListener
Answers to various puzzles and coding interview questions solved in Java for practice
XML schema and Instance
Contains all of the queries used within the Complete Guide to Elasticsearch course.
This is a Python script which will help you to know when a slot is available on Cowin website.
Cheat Sheets
repository to record exercises of data structures and algorithms
Using Pandas, NumPy, Matplotlib, Seaborn, Plotly and Cufflinks, the answers for financial crisis of 2007/08 & the most crime affected US regions are visualized with detailed description leading to those events.
DeepDash, is a deep learning dashboard which builds predictive stats from image data using 2 models: K Nearest Neighbors and VGG19. These stats are then displayed on the front end for user to analyse and develop/choose a strategy to solve the business problem he/her is working on. This first release of DeepDash is intended for people who have access to image datasets (public or private).
A Deep Learning Dashboard
Mini project to implement DGIM Algorithm for estimating number of ones in continuous bit stream. Objective of this project is to estimate number of ones in past K data with a tolerance of 33%. The major challenge with continuous stream of bit is that storing the stream in main memory is not possible because of the continuous flow of bit stream which will start accumulating and eventually exceeds the size of memory. Therefore, using DGIM Algorithm, n number of bits can be stored in log n memory space. Often, it is much more efficient to get an approximate answer to our problem than an exact solution.
Companion repo to a course on Udemy.com
Spring Demo for an E-Commerce Application
Java implementation to find all possible Edit Distances
Apache Kafka and Confluent Platform examples and demos
The GitHub Matrix Screensaver for Mac OSX
A collection of useful .gitignore templates
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.