I'm a passionate data science enthusiast with a background in engineering and projects and operations management (PMP certified). I'm excited about leveraging data to gain insights, solve complex problems, and drive informed decision-making.
- Exploring machine learning algorithms and techniques to solve real-world problems.
- Building data pipelines and conducting exploratory data analysis (EDA) on diverse datasets.
- Expanding my knowledge in natural language processing (NLP) and deep learning.
Here are a few noteworthy projects I've worked on:
-
ETL / Data Science Project on AWS Platform - Youtube Data : AWS | Python | S3 | ETL | AWS Glue | Athena | Lambda | Tableau | Data Acquisition | Data Cleaning & Transformation | Data Model | Data Engineering | Data Analysis | Data Visualization | Data Science | Communication
-
This project focuses on performing YouTube data analytics using the AWS platform. It involves steps such as data processing, conversion, cleaning, and building an analytics pipeline. The raw data is uploaded to S3 buckets, and an IAM role is set up. The AWS Glue Catalog is used to generate a data catalog, and the JSON files are serialized into a tabular format. AWS Lambda functions are created to convert JSON to Apache Parquet. Athena is used for querying and cleaning the data. An ETL job is developed using PySpark scripts, and a trigger is added to the Lambda function for continuous processing. AWS Glue Studio is utilized to build an analytics report layer, and the data is stored in a new bucket in Parquet format. A Tableau dashboard is created by connecting Tableau to AWS Athena. The final result is an interactive dashboard for YouTube data analysis.
-
Unveiling Insights and Trends in NYC Taxi Trips Using Python, GCP, and Tableau : Python | SQL | GCP | ETL | VM | BigQuery | Lucid | Mage | Tableau | Data Acquisition | Data Cleaning & Transformation | Data Model | Data Engineering | Data Analysis | Data Visualization | Data Science | Communication
-
In this project, I utilized Python, Google Cloud Platform (GCP), and Tableau to conduct a comprehensive analysis of NYC taxi trips. By leveraging ETL techniques, including data ingestion, transformation, and loading, I extracted valuable insights from the TLC Trip Record Data. Using Lucid Chart, I created a data model to visualize the relationships between entities. With GCP Storage and Compute Instances, I established a robust infrastructure for efficient data processing. The Mage Data Pipeline Tool played a crucial role in orchestrating the data pipeline and loading the transformed data into BigQuery. Finally, I harnessed Tableau's powerful visualization capabilities to create interactive reports and gain actionable insights from the data.
Take a ride through the world of NYC taxi data and discover hidden patterns with my Uber Data Analytics Project.
-
UAE Residential Properties Analysis: Python | Tableau | Webscraping | Geocoding | Data Acquisition | Data Cleaning & Transformation | Data Analysis | Data Visualization | Data Science | Communication
- Provided insights into the residential property market through the analysis of rental and sale property data. Collected data by leveraging web scraping techniques, to provide valuable information to individuals seeking to make informed decisions regarding property investment or rental choices. The project utilized visualization tool - Tableau, to present the findings in an interactive and informative manner.
-
Predict Online Shoppers Intention: Python | Correlation | Clustering | Modelling | ML | Data Acquisition | Data Cleaning & Transformation | Data Analysis | Data Visualization | Data Science | Communication
- To determine if visiting sessions close to special days have an impact on finalized transactions, and also to identify patterns related to browsing period activity and pages visited.
-
Amazon online reviews analysis: Microsoft Excel | Pivot Tables | Dashboard | Pivot Charts | Correlation | Data Acquisition | Data Cleaning & Transformation | Data Analysis | Data Visualization | Data Science | Communication
- Insight was gained into the product reviews on Amazon, and looked out for any interesting patterns that could have helped in better product placement and strategy for a potential seller
Feel free to explore my GitHub repositories for more projects and code examples.
- Certificate Programme in Data Science (Indian Institute of Management - Kozhikode)
- Project Management Professional (PMP) Certification (Project Management Institute)
- Master of Business Administration (ESERP Business School, Barcelona)
- Diploma in Maritime Studies (Singapore Polytechnic)
You can connect with me on LinkedIn to discuss collaborations, job opportunities, or simply chat about data science!
Looking forward to connecting with fellow data enthusiasts and contributing to the exciting world of data science! Let's explore the possibilities together!