Atharva Tipre's Projects

amazon-shipping-analytics

Amazon Shipping is a company that ships a variety of FMCG (fast-moving consumer goods) all over the world. The Shipping Manager has little visibility into how many orders are processed and shipped each month. He would like a dashboard in which he selects a month and sees how many orders are outstanding each day and where they should be shipped.

bank-marketing-analysis

This dataset is based on the UCI "Bank Marketing" dataset (see the description at http://archive.ics.uci.edu/ml/datasets/Bank+Marketing). The data is enriched with five additional social and economic attributes (nationwide indicators for a country of ~10M people), published by the Banco de Portugal and publicly available at https://www.bportugal.pt/estatisticasweb. The link to the dataset is given in the classification file. I cleaned the data and imputed 10% of it using NumPy and pandas in a Jupyter Notebook. I then classified whether a client agreed to place a deposit, reaching a test accuracy of 91%, to help improve the marketing campaign. The classification models implemented were KNN, Logistic Regression, Decision Tree, and Support Vector Machines.
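
As a rough illustration of one of the models named above, here is a minimal k-nearest-neighbours sketch in plain Python together with the accuracy metric. The two-feature rows, the labels, and the function names `knn_predict` and `accuracy` are made up for this example; the repository itself may rely on a library implementation and different features.

```python
import math
from collections import Counter

def knn_predict(train_X, train_y, point, k=3):
    """Label `point` by majority vote among its k nearest training rows."""
    # Pair every training row's Euclidean distance to `point` with its label.
    distances = sorted(
        (math.dist(row, point), label) for row, label in zip(train_X, train_y)
    )
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]

def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

# Hypothetical, already-scaled feature rows; "yes" means the client placed a deposit.
train_X = [(0.1, 0.2), (0.2, 0.1), (0.15, 0.15), (0.8, 0.9), (0.9, 0.8), (0.85, 0.85)]
train_y = ["no", "no", "no", "yes", "yes", "yes"]
test_X = [(0.2, 0.2), (0.9, 0.9)]
test_y = ["no", "yes"]

predictions = [knn_predict(train_X, train_y, row) for row in test_X]
test_accuracy = accuracy(test_y, predictions)
```

Accuracy is computed on rows the classifier never saw during training, which is what a "test accuracy" figure refers to.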

bitcoin-price-prediction

In this project, RNN variations are tested on a dataset comprising not only historical Bitcoin prices but also other influencing factors such as macroeconomic indices and sentiment.

car-sale-prediction

This is part of my college project on typical machine learning regression concepts. The data was scraped from several websites in the Czech Republic and Germany over a period of more than a year. The dataset is available at https://www.kaggle.com/mirosval/personal-cars-classifieds. I pre-processed roughly 3.5 million rows and performed exploratory data analysis to check the distribution of the data in each column. I then implemented various regression algorithms to find out how the factors affect a used car's price and resale value. Finally, I selected the algorithm best suited to this data, using the highest R^2 value as the deciding criterion.
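
The selection step described above can be sketched as follows: compute R^2 for each model's predictions and keep the highest. The prices, the model names, and the helpers `r_squared` and `best_model` are hypothetical and only illustrate the comparison, not the project's actual results.

```python
def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot

def best_model(actual, predictions_by_model):
    """Return the name of the model with the highest R^2, plus all scores."""
    scores = {
        name: r_squared(actual, preds)
        for name, preds in predictions_by_model.items()
    }
    return max(scores, key=scores.get), scores

# Made-up held-out prices (in thousands) and made-up model predictions.
actual = [10.0, 12.0, 15.0, 20.0]
predictions_by_model = {
    "linear": [11.0, 12.0, 14.0, 19.0],
    "tree": [10.0, 13.0, 15.0, 21.0],
}
best, scores = best_model(actual, predictions_by_model)
```

Comparing scores on held-out rows rather than on the training rows keeps the comparison from rewarding models that simply memorise the training data.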

cs229_ml

🍟 Stanford CS229: Machine Learning

instacart-market-basket-analysis

This is a repository for Instacart Market Basket Analysis. It consists of the data cleaning, exploratory data analysis, data visualization, and machine learning algorithms developed for the project. Market Basket Analysis is a modeling technique based on the theory that if you buy a certain group of items, you are more (or less) likely to buy another group of items. In this analysis, a forecasting model is developed using machine learning algorithms to improve the accuracy of product sales forecasts.
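
The "more (or less) likely" idea is commonly quantified with support, confidence, and lift. The sketch below computes them in plain Python for one rule; the baskets and the function name `association_metrics` are invented for illustration and are not taken from the repository.

```python
def association_metrics(baskets, antecedent, consequent):
    """Return (support, confidence, lift) for the rule antecedent -> consequent.

    Assumes both item sets occur in at least one basket.
    """
    n = len(baskets)
    a, c = set(antecedent), set(consequent)
    count_a = sum(1 for basket in baskets if a <= basket)
    count_c = sum(1 for basket in baskets if c <= basket)
    count_both = sum(1 for basket in baskets if (a | c) <= basket)
    support = count_both / n              # share of baskets containing both sets
    confidence = count_both / count_a     # P(consequent | antecedent)
    lift = confidence / (count_c / n)     # above 1: more likely; below 1: less likely
    return support, confidence, lift

# Made-up baskets, each a set of product names.
baskets = [
    {"bananas", "milk"},
    {"bananas", "milk", "bread"},
    {"bananas", "bread"},
    {"milk"},
    {"bread"},
]
support, confidence, lift = association_metrics(baskets, {"bananas"}, {"milk"})
```

A lift above 1 means that buying the first group makes the second group more likely than its baseline rate, and a lift below 1 means less likely.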

ocr_template

The two Python files are specific to the PDF file I have uploaded. The first file, dskew.py, converts the image obtained from the PDF file into grayscale and checks its orientation. The second file, segment.py, is used to manipulate the image. I created a separate coordinate variable for each field I wanted to extract from the image. The cropped images are then converted into text using Pytesseract, and each extracted field is added to a dictionary. At the end, the dictionary and its key-value pairs are dumped into a JSON variable.
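
A minimal sketch of the crop-then-OCR step might look like the following. The field names, pixel boxes, the file name `page.png`, and the helper `extract_fields` are assumptions for illustration; the actual segment.py may be organised differently. The OCR function is passed in as a parameter so the sketch does not depend on a Tesseract installation.

```python
import json

# Hypothetical field names and pixel boxes as (left, upper, right, lower).
FIELD_BOXES = {
    "name": (100, 200, 600, 260),
    "date": (100, 300, 400, 360),
}

def extract_fields(image, field_boxes, ocr):
    """Crop each box from `image`, run `ocr` on the crop, and return JSON text."""
    fields = {}
    for name, box in field_boxes.items():
        crop = image.crop(box)            # Pillow's Image.crop accepts a 4-tuple box
        fields[name] = ocr(crop).strip()  # drop surrounding whitespace and newlines
    return json.dumps(fields, indent=2)

# With Pillow and pytesseract installed, a call might look like:
#   from PIL import Image
#   import pytesseract
#   print(extract_fields(Image.open("page.png"), FIELD_BOXES,
#                        pytesseract.image_to_string))
```

Keeping the boxes in one dictionary means that supporting a different template only requires changing the coordinates, not the extraction loop.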

prosper-loan-data-complete-analysis

Prosper is a peer-to-peer lending platform that aims to connect people who need money with people who have money to invest. In this data analysis project, I explored the Prosper dataset and used Tableau to create my visualizations.

stock-market-sentiment-analysis

This is a dataset of news headlines about publicly held organizations. The headlines are preprocessed and converted into TF-IDF numeric vectors using NLTK. TF-IDF (term frequency-inverse document frequency) is a statistical measure of how relevant a word is to a document in a collection of documents; it was invented for document search and information retrieval. A word's score increases in proportion to the number of times it appears in a document but is offset by the number of documents that contain the word. So words that are common in every document, such as "this", "what", and "if", rank low even though they may appear many times, since they don't mean much to that document in particular. I implemented a bag-of-words approach for vectorization and predicted the stock market sentiment for an organization using a RandomForest classifier with an accuracy of 82%, to help customers invest in the ideal stock for a high ROI.
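
To make the weighting concrete, here is a small plain-Python sketch of the textbook (unsmoothed) TF-IDF formula. The headlines and the function name `tf_idf` are made up, and libraries often use smoothed variants, so the project's actual numbers may differ.

```python
import math
from collections import Counter

def tf_idf(documents):
    """Return one {term: weight} dict per tokenized document."""
    n_docs = len(documents)
    # Document frequency: the number of documents that contain each term.
    doc_freq = Counter()
    for tokens in documents:
        doc_freq.update(set(tokens))
    weights = []
    for tokens in documents:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            # Term frequency times inverse document frequency.
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights

# Made-up headlines, for illustration only.
headlines = [
    "the company shares rise".split(),
    "the company shares fall".split(),
    "the market rallies".split(),
]
scores = tf_idf(headlines)
```

In this toy corpus "the" appears in every headline and so gets a weight of zero, while "rise" appears in only one headline and outweighs "company", which appears in two.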
