atharva1309 Goto Github PK
Name: Atharva Tipre
Type: User
Bio: UT Dallas graduate student. Business Analytics, Data Analytics, SQL, R, Python, Tableau and ML on my mind.
Location: Dallas, Texas
Name: Atharva Tipre
Type: User
Bio: UT Dallas graduate student. Business Analytics, Data Analytics, SQL, R, Python, Tableau and ML on my mind.
Location: Dallas, Texas
Amazon Shipping is a company which ships a variety of FMCG (Fast Moving Consumer Goods) all over the world. The Shipping Manager doesn't have much visibility and doesn't know how many orders are processed and shipped on a monthly basis. He would like a dashboard of this where he selects a month and can tell how many orders are outstanding a day and where they should be shipped
My Personal Website
This dataset is based on "Bank Marketing" UCI dataset (please check the description at: http://archive.ics.uci.edu/ml/datasets/Bank+Marketing). The data is enriched by the addition of five new social and economic features/attributes (national wide indicators from a ~10M population country), published by the Banco de Portugal and publicly available at: https://www.bportugal.pt/estatisticasweb. For the dataset check the classification file where the link is mentioned to the dataset. Performed data cleaning and imputation of 10% of the data using Numpy & Pandas packages on Jupyter Notebook. Classified whether a client agreed to place deposit or not with a test accuracy of 91%, to improve the marketing campaign. Implemented classification models like KNN, Logistic Regression, Decision Tree and Support Vector Machines
In this project RNN variations are tested on a dataset comprised of not only Bitcoin historical price, but also other influencing factors such as macroeconomic indices, sentiments etc.
This is a part of my college project related to typical machine learning concepts of regression. The data was scraped from several websites in Czech Republic and Germany over a period of more than a year. To access the dataset please click on this - https://www.kaggle.com/mirosval/personal-cars-classifieds. I performed data pre-processing of roughly 3.5 million rows and exploratory data analysis to check the distribution of data in each column. Later, I implemented various regression algorithms to find out how the factors affect the price of the used car and the resale value. Finally, determined the optimal algorithm suited for this data which has the highest R^2 value as deciding parameter.
π Stanford CS229: Machine Learning
This is a repository for Instacart Market Basket Analysis. It consists of data cleaning, exploratory data analysis,data visualization and machine learning algorithms developed for the project. Market Basket Analysis is a modeling technique based upon the theory that if you buy a certain group of items, you are more (or less) likely to buy another group of items. In this analysis, a forecasting model is developed using machine learning algorithms to improve the accurate forecasts of product sales.
NeuralProphet - a Neural Network based Time-Series model
The two python files are specific for the pdf file I have uploaded, the 1st file dskew.py I have converted the image into grayscale and checked the orientation of the image which we get after converting the pdf file into image. The second python file segment.py is used to manipulate the image. I have created a seperate co-ordinate variable for each field I wanted to extract from the image. Then these cropped images are converted into text using Pytesseract and appending each extracted field into a dictionary. At the end, I have dumped this dictionary and its key-value pairs in a json variable.
Prosper is a peer-to-peer lending platform that aims to connect people who need money with those people who have the money to invest. In this data analysis project, I have explored the Prosper dataset and used Tableau to create my visualizations.
This is a dataset of news headlines of publicly held organizations. The headlines are preprocessed and converted into TF-IDF numeric vectors using NLTK. TF-IDF is a statistical measure that evaluates how relevant a word is to a document in a collection of documents. TF-IDF (term frequency-inverse document frequency) was invented for document search and information retrieval. It works by increasing proportionally to the number of times a word appears in a document, but is offset by the number of documents that contain the word. So, words that are common in every document, such as this, what, and if, rank low even though they may appear many times, since they donβt mean much to that document in particular. Implemented bag of words approach for vectorization and predicted the sentiment of a stock market of an organization using a RandomForest classifier with an accuracy of 82% to help customers to invest in ideal stock to gain high ROI
Time Series Analysis Concepts Explained with examples
A declarative, efficient, and flexible JavaScript library for building user interfaces.
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. πππ
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google β€οΈ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.