Giter Club home page Giter Club logo

Hi there! 👋 I'm Midhun

I'm a passionate data science enthusiast with a background in engineering and projects and operations management (PMP certified). I'm excited about leveraging data to gain insights, solve complex problems, and drive informed decision-making.

🔭 What I'm Currently Working On

  • Exploring machine learning algorithms and techniques to solve real-world problems.
  • Building data pipelines and conducting exploratory data analysis (EDA) on diverse datasets.
  • Expanding my knowledge in natural language processing (NLP) and deep learning.

⚡ My Skills

  • 📊 Data Science

Data Science Machine Learning Data Analysis Statistical Modeling Predictive Analytics

  • 💻 Programming Languages

Python NumPy Pandas Scikit-learn TensorFlow Keras R SQL Microsoft Excel Jupyter Notebook PySpark

  • 📊 Data Visualization

Data Visualization Matplotlib Seaborn Tableau Lucid Chart Amazon QuickSight

  • 🧰 Data Engineering

AWS Lambda AWS S3 AWS Glue Studio AWS CLI AWS Glue DataBrew AWS Glue DynamicFrame Mage.ai VM Machines SSH

  • 🌐 Big Data Tools

Big Data Tools Apache Spark Hadoop Google Cloud Platform BigQuery Google Cloud Storage AWS Glue Amazon Athena

  • 🚀 Project Management

Project Management PMP Certified Agile Methodologies AIMS Grid Kanban

🚀 My Projects

Here are a few noteworthy projects I've worked on:

  • ETL / Data Science Project on AWS Platform - Youtube Data : AWS | Python | S3 | ETL | AWS Glue | Athena | Lambda | Tableau | Data Acquisition | Data Cleaning & Transformation | Data Model | Data Engineering | Data Analysis | Data Visualization | Data Science | Communication

  • This project focuses on performing YouTube data analytics using the AWS platform. It involves steps such as data processing, conversion, cleaning, and building an analytics pipeline. The raw data is uploaded to S3 buckets, and an IAM role is set up. The AWS Glue Catalog is used to generate a data catalog, and the JSON files are serialized into a tabular format. AWS Lambda functions are created to convert JSON to Apache Parquet. Athena is used for querying and cleaning the data. An ETL job is developed using PySpark scripts, and a trigger is added to the Lambda function for continuous processing. AWS Glue Studio is utilized to build an analytics report layer, and the data is stored in a new bucket in Parquet format. A Tableau dashboard is created by connecting Tableau to AWS Athena. The final result is an interactive dashboard for YouTube data analysis.

  • Unveiling Insights and Trends in NYC Taxi Trips Using Python, GCP, and Tableau : Python | SQL | GCP | ETL | VM | BigQuery | Lucid | Mage | Tableau | Data Acquisition | Data Cleaning & Transformation | Data Model | Data Engineering | Data Analysis | Data Visualization | Data Science | Communication

  • In this project, I utilized Python, Google Cloud Platform (GCP), and Tableau to conduct a comprehensive analysis of NYC taxi trips. By leveraging ETL techniques, including data ingestion, transformation, and loading, I extracted valuable insights from the TLC Trip Record Data. Using Lucid Chart, I created a data model to visualize the relationships between entities. With GCP Storage and Compute Instances, I established a robust infrastructure for efficient data processing. The Mage Data Pipeline Tool played a crucial role in orchestrating the data pipeline and loading the transformed data into BigQuery. Finally, I harnessed Tableau's powerful visualization capabilities to create interactive reports and gain actionable insights from the data.

Take a ride through the world of NYC taxi data and discover hidden patterns with my Uber Data Analytics Project.

  • UAE Residential Properties Analysis: Python | Tableau | Webscraping | Geocoding | Data Acquisition | Data Cleaning & Transformation | Data Analysis | Data Visualization | Data Science | Communication

    • Provided insights into the residential property market through the analysis of rental and sale property data. Collected data by leveraging web scraping techniques, to provide valuable information to individuals seeking to make informed decisions regarding property investment or rental choices. The project utilized visualization tool - Tableau, to present the findings in an interactive and informative manner.
  • Predict Online Shoppers Intention: Python | Correlation | Clustering | Modelling | ML | Data Acquisition | Data Cleaning & Transformation | Data Analysis | Data Visualization | Data Science | Communication

    • To determine if visiting sessions close to special days have an impact on finalized transactions, and also to identify patterns related to browsing period activity and pages visited.
  • Amazon online reviews analysis: Microsoft Excel | Pivot Tables | Dashboard | Pivot Charts | Correlation | Data Acquisition | Data Cleaning & Transformation | Data Analysis | Data Visualization | Data Science | Communication

    • Insight was gained into the product reviews on Amazon, and looked out for any interesting patterns that could have helped in better product placement and strategy for a potential seller

Feel free to explore my GitHub repositories for more projects and code examples.

📚 Education and Certifications

  • Certificate Programme in Data Science (Indian Institute of Management - Kozhikode)
  • Project Management Professional (PMP) Certification (Project Management Institute)
  • Master of Business Administration (ESERP Business School, Barcelona)
  • Diploma in Maritime Studies (Singapore Polytechnic)

📫 How to Reach Me

You can connect with me on LinkedIn to discuss collaborations, job opportunities, or simply chat about data science!

Looking forward to connecting with fellow data enthusiasts and contributing to the exciting world of data science! Let's explore the possibilities together!

Midhun's Projects

sql-server-samples icon sql-server-samples

Azure Data SQL Samples - Official Microsoft GitHub Repository containing code samples for SQL Server, Azure SQL, Azure Synapse, and Azure SQL Edge

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.