Giter Club home page Giter Club logo

Vaddhiparthy's Portfolio

Data Professional

Welcome to my GitHub profile! I am a data professional specializing in natural language processing (NLP) and machine learning (ML) techniques.

I have extensive experience working with various programming languages, including Python, SQL, R, and MATLAB, as well as proficiency in utilizing databases like Amazon Redshift, Snowflake, and MySQL. My knowledge extends to data integration and management techniques such as ETL and AWS Snowflake.

In terms of data visualization, I use tools like Tableau, Microsoft Excel, and PowerBI to present insights.

My GitHub portfolio showcases a diverse array of projects. Sentiment analysis on blog authorship and play store reviews, fine-tuning GPT models, and building sophisticated text classification systems. Additionally, you'll find projects in other domains, such as location selection using k-means clustering, credit risk evaluation using machine learning, and forest fire estimation using machine learning.

I welcome the opportunity to contribute to your projects, collaborate, or discuss new ventures.

My Tableau Profile

My Leetcode Profile

Languages and Tools:

c cplusplus d3js kibana mongodb mssql mysql opencv pandas postgresql python pytorch scikit_learn seaborn tensorflow

vaddhiparthy

 vaddhiparthy

Surya Vaddhiparthy's Projects

blogsentimentanalysis icon blogsentimentanalysis

This Python code analyzes a dataset of blog posts, focusing on the polarity and subjectivity of the text. It cleans the text data, visualizes word frequency using word clouds, and explores the sentiment of the text based on age groups and blog topics. The results show differences in sentiment, subjectivity, and word usage among age groups and blog

covid-data-analysis icon covid-data-analysis

Leveraging advanced data analytics methodologies and time series forecasting with ARIMA modeling, this project delivers a comprehensive analysis of COVID-19 trends and metrics in the United States, providing crucial insights for informed decision-making in pandemic management.

credit_risk icon credit_risk

This project employs alternative data sources and machine learning techniques, specifically the Extreme Gradient Boosting (XGBoost) algorithm, to evaluate credit risk for individuals lacking traditional credit history. By incorporating diverse data points and addressing class imbalance through Synthetic Minority Oversampling Technique (SMOTE).

forestfireestimation icon forestfireestimation

Using advanced machine learning techniques, this project successfully predicted the burned area of forest fires in the northeast region of Portugal based on meteorological and other data. By employing linear and polynomial regression models, the developed solution effectively captures the complex relationships between variables.

ghg icon ghg

This project employed advanced data wrangling techniques in R to analyze and visualize greenhouse gas emissions from countries party to the UNFCCC. Utilizing the dplyr and ggplot2 libraries, the transformed data provided valuable insights into emissions trends, serving as an essential resource for decision-makers and stakeholders.

gpt icon gpt

Fine-tuning GPT-2 models with custom text corpora, utilizing Hugging Face's Transformers library and advanced training techniques for sophisticated text generation applications.

imageclassfication icon imageclassfication

A Python implementation for image classification using a pre-trained ResNet-18 model from torchvision. The input image undergoes a series of transformations, including resizing, center cropping, tensor conversion, and normalization, before being fed into the model. The model then predicts the class label for the input

ims icon ims

Utilizing cutting-edge machine learning techniques and advanced telematic data, a highly accurate predictive model was developed for insurance claim assessment, leading to more informed risk evaluation and optimized decision-making in the insurance industry.

location-selection-using-k-means-clustering icon location-selection-using-k-means-clustering

Utilizing geospatial data and sophisticated machine learning algorithms, specifically K-Means Clustering, the project successfully pinpointed an optimal location for a novel Indian restaurant in New York City's dynamic landscape by evaluating competitive density.

reviewsentiments icon reviewsentiments

Utilizing advanced NLP techniques and SentimentIntensityAnalyzer from the NLTK library, this script analyzes Google Play Store app reviews to extract and visualize user sentiments based on pre-defined topics, such as app interface and load time, offering valuable insights into user experience.

securelog icon securelog

This project successfully integrates AWS Simple Queue Service (SQS) with a local PostgreSQL database, processing and securely storing user login data. The implementation demonstrates a streamlined approach to data ingestion, masking, and storage, resulting in an efficient and secure data pipeline.

stock-data-analysis-tool icon stock-data-analysis-tool

The final code is a Python script that retrieves and analyzes financial metrics and growth data for a list of stock symbols from the Seeking Alpha API, and combines this information into a DataFrame for further analysis and visualization.

synthscorenet icon synthscorenet

project that utilizes Generative Adversarial Networks (GANs) to generate synthetic credit data for individuals with limited or no credit history, and predicts custom credit scores based on this data using machine learning models. This project aims to improve the accuracy and inclusivity of the credit assessment process.

vaddhiparthy icon vaddhiparthy

This repository is for my personalized GitHub profile introduction page, utilizing HTML and CSS to create a visually informative layout.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.