
Hi there! Welcome to my GitHub profile! 👋

I am Arch, and I am interested in all things data and machine learning systems. I currently work as a Data Scientist in the San Francisco Bay Area, CA. In my free time, I like to read, hike, travel, and explore different music genres over a cup of tea (in no particular order).

Check out my Portfolio for all of my projects, micro-projects, skills, certificates, and achievements.

My Stats

Arch's GitHub Stats

Arch Desai's Projects

codesnippets

This repository contains code snippets I use on a daily basis.

customer-survival-analysis-and-churn-prediction

In this project, I used survival analysis models to see how the likelihood of customer churn changes over time and to calculate customer lifetime value (LTV). I also implemented a Random Forest model to predict whether a customer will churn and deployed it as a Flask web app.
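
As an illustration of the survival-analysis idea, here is a minimal sketch of a Kaplan-Meier estimator in plain Python. It is not the code from the repository: the function name `kaplan_meier` and the toy tenure data are hypothetical, and the project may well use a dedicated library and different models.

```python
from collections import Counter


def kaplan_meier(durations, churned):
    """Return [(time, survival_probability)] at each time with at least one churn.

    durations: observed tenure per customer (e.g. in months)
    churned:   True if the customer churned at that time, False if still
               active (right-censored)
    """
    events = Counter(t for t, c in zip(durations, churned) if c)
    exits = Counter(durations)  # churned or censored: both leave the risk set
    at_risk = len(durations)
    survival = 1.0
    curve = []
    for t in sorted(exits):
        d = events.get(t, 0)
        if d:
            # Multiply by the fraction of at-risk customers who did not churn at t
            survival *= 1 - d / at_risk
            curve.append((t, survival))
        at_risk -= exits[t]
    return curve


# Hypothetical toy data, for illustration only
durations = [1, 2, 2, 3, 4, 5]
churned = [True, True, False, True, False, True]
curve = kaplan_meier(durations, churned)
for t, s in curve:
    print(f"month {t}: estimated survival {s:.3f}")
```

Censored customers (those still active) never count as churn events, but they do shrink the at-risk group for later time points, which is what distinguishes this estimate from a plain churn rate.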

drone-flights-analysis

The objective of this project is to perform independent exploratory data analysis & visualization of drone flights data in order to find hidden trends, patterns, and anomalies.

ds-challenges

This repository contains code for the online Python/ML/AI/statistics challenges I have solved.

hourly-energy-consumption-prediction

In this project, I used models such as XGBoost and Prophet (fbprophet) on hourly energy consumption data to predict future energy usage. Features are extracted from timestamps to capture daily, weekly, monthly, quarterly, and yearly trends, and the Prophet model's performance is improved by incorporating public holidays into the analysis.
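
The timestamp feature extraction described above might look roughly like the following sketch. It uses only the standard library; the function name `timestamp_features`, the feature names, and the sample holiday set are assumptions for illustration, not the project's actual code. The holiday flag here is a plain feature and does not reproduce how holidays are passed to the forecasting model.

```python
from datetime import date, datetime


def timestamp_features(ts, holidays=frozenset()):
    """Derive calendar features from a single timestamp."""
    return {
        "hour": ts.hour,
        "day_of_week": ts.weekday(),           # Monday = 0
        "week_of_year": ts.isocalendar()[1],   # ISO week number
        "month": ts.month,
        "quarter": (ts.month - 1) // 3 + 1,
        "year": ts.year,
        "is_holiday": ts.date() in holidays,
    }


# Hypothetical holiday calendar containing a single date
holidays = {date(2018, 7, 4)}
features = timestamp_features(datetime(2018, 7, 4, 15, 0), holidays)
print(features)
```

Features like these let a tree-based model such as XGBoost pick up seasonality that a raw timestamp does not expose.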

instacart-market-basket-analysis

The objective of this project is to analyze 3 million grocery orders from more than 200,000 Instacart users and predict which previously purchased items will be in a user's next order. Customer segmentation and affinity analysis are used to study customer purchase patterns and to support better product marketing and cross-selling.

learnings

Resources and notebooks that I used to learn cool stuff

loan-default-prediction

In this project, I applied classification models such as Logistic Regression, Random Forest, and LightGBM to identify consumers who will default on a loan. SMOTE is used to combat class imbalance, and LightGBM achieved the highest accuracy (98.89%) with an F1 score of 0.99.

lstm-best-practices

This repository contains a method for developing an LSTM model for any task more efficiently using rules of thumb.

machine-predictive-maintenance-pdm

In this project, I apply predictive maintenance techniques to more than 100 MB of historical data from twenty of a company's units that failed in the field. My objective is to see whether the units with the longest or shortest lives share similar characteristics and to predict which active units will fail soon.

multivariate-phase-1-analysis

The objective of this project is to identify in-control data points and eliminate out-of-control data points in order to set up distribution parameters for manufacturing process monitoring. I used PCA for dimension reduction and Hotelling's T² and m-CUSUM control charts to establish the mean and variance matrices.

news-articles-recommendation

The objective of this project is to build a hybrid-filtering, personalized news article recommendation system that suggests articles from popular news providers based on the reading history of Twitter users who share similar interests (collaborative filtering) and on the content similarity between an article and the user's tweets (content-based filtering).

portfolio

This portfolio is a compilation of all the data science and data analysis projects I have done for academic, self-learning, and hobby purposes. It is updated regularly.

predicting-gdp-of-india

The objective of this project is to perform a predictive assessment of India's Gross Domestic Product through an inferential analysis of various socio-economic factors, in order to find out which predictors contribute most to GDP. Various models are compared, and a Stepwise Regression model is implemented, which resulted in a 5.7% test MSE.

pyspark

This repository contains all the files I have worked with to learn PySpark.

ranking-of-nfl-teams-using-markov-method

In this project, I implemented and compared three approaches based on the stationary distribution of a Markov chain to rank the 32 NFL (National Football League) teams from "Best" to "Worst" using the scores from the 2007 NFL regular season.
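
To make the idea concrete, here is a minimal sketch of one possible Markov-chain ranking, written in plain Python. It is an assumption-laden illustration, not any of the three approaches from the repository: the function name `markov_rank`, the damping factor, and the three-team schedule are hypothetical, and this version uses only wins and losses rather than game scores.

```python
def markov_rank(teams, games, damping=0.85, iters=200):
    """Rank teams by the stationary distribution of a 'loser votes for winner' chain.

    games: list of (winner, loser) pairs.
    """
    idx = {t: i for i, t in enumerate(teams)}
    n = len(teams)
    votes = [[0.0] * n for _ in range(n)]
    for winner, loser in games:
        votes[idx[loser]][idx[winner]] += 1.0

    # Row-normalize into transition probabilities; an unbeaten team casts
    # no votes, so its row is spread uniformly over all teams.
    P = []
    for row in votes:
        total = sum(row)
        P.append([v / total for v in row] if total else [1.0 / n] * n)

    # Power iteration; damping keeps the chain irreducible so it converges.
    r = [1.0 / n] * n
    for _ in range(iters):
        r = [
            (1 - damping) / n + damping * sum(r[i] * P[i][j] for i in range(n))
            for j in range(n)
        ]
    return sorted(zip(teams, r), key=lambda pair: -pair[1])


# Hypothetical three-team schedule, for illustration only
teams = ["A", "B", "C"]
games = [("A", "B"), ("A", "C"), ("B", "C")]
ranking = markov_rank(teams, games)
for team, score in ranking:
    print(team, round(score, 3))
```

A score-aware variant could weight each vote by the margin of victory or by points scored instead of adding 1.0 per game.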

stat-689-assignments

This repository contains all the assignments I completed as part of the academic course Stat 689: Statistical Computation with Python.

summary-of-research-papers

This repository contains the summary of various research papers I have read. Feel free to fork and collaborate.

tennis-players-ranking

The objective of this project is to rank all tennis players based on the matches they played in 2018. Statistics for all matches are given, including the scores of every set. This project comprises four approaches to ranking tennis players, each intended to be more robust than the previous one.

wind-turbine-power-curve-estimation

In this project, I used various regression techniques to estimate the power curve of an onshore wind turbine. Nonlinear tree-based ensemble regression methods perform best because the true power curve is nonlinear. I implemented XGBoost and tuned it with GridSearchCV, which yields the lowest test RMSE of 6.404.
