
⚡Benito⚡

"It's not about data, it's all about tangible business insights 📈"

๐Ÿ‘จโ€๐Ÿ’ป My Profile

Innovative and dynamic Data Scientist with a proven track record of leveraging AI and Machine/Deep Learning techniques to develop impactful data-driven solutions. My skill set covers:


  • ✅ Data Visualization/Analytics: Power BI, Looker, Tableau, Matplotlib, Seaborn, Plotly, Streamlit
  • ✅ Data Science: PyTorch, TensorFlow, Scikit-learn, Hugging Face, Transformers, OpenCV, NLTK, SpaCy
  • ✅ Web Scraping: BeautifulSoup, Scrapy, Selenium
  • ✅ Maths and Statistics: Statsmodels, SciPy
  • ✅ Domains: Regression, Classification, NLP, LLM, RAG, Computer Vision, Time Series, Neural Networks, Ensemble Methods, PCA, Clustering, Dimensionality Reduction, Anomaly Detection
  • ✅ Data Engineering: dbt, Terraform, SQL, PySpark
  • ✅ MLOps: MLflow, Prefect, Mage
  • ✅ APIs: Flask, FastAPI
  • ✅ Cloud Platforms: GCP, AWS, Azure


Currently working as a Teaching Assistant for the Data Analytics and Data Science & AI Bootcamps @ Le Wagon and as an AI Course Developer and Technical Editor @ Towards AI, and open to further cooperation opportunities!

👉 CONTACT ME! 👉 Fill in this form or reach out on LinkedIn!


📄 Projects Portfolio

  • 🌱 As project leader, developed the Computer Vision MLOps project FoodScore (summary) and its Website during the last 2 weeks of the Le Wagon Data Science Bootcamp (March 2023)

  • 🔭 After graduating, I volunteered at Omdena on the following Data Science projects: NLP and GIS project (Website)

  • 💰 My latest personal Data Science/Engineering/Analysis and ML projects can be found in these repositories (feel free to click ⭐ if you like them 😎):

    • MLOps:
    • Data Analysis + Modeling:
      • Cryptocurrencies Analysis: EDA and Modeling project: comparison of ARIMA, XGBoost, LSTM, and Prophet
      • News Classification: EDA, Modeling and Deployment project: comparison of several Neural Networks (CNN, RNN, feedforward) and Multinomial Naive Bayes models and deployment in Streamlit (see app)
      • Breast Cancer Classification: EDA and Modeling project: comparison of Random Forest using Sklearn and Spark, as part of the Advanced Data Science with IBM Specialization
      • Bank Churn Classification: EDA and Modeling project including univariate/bivariate analysis, feature engineering, baseline model selection, and a voting classifier (LGBMClassifier, XGBClassifier, and CatBoostClassifier)
    • Machine Learning & LLM:
      • Birds Classification: Computer Vision project using PyTorch EfficientNet models and deployment in Gradio (see app)
      • Q&A and Summarization: LLM project for audio and text extraction using Whisper and Langchain with app deployment using Streamlit (locally)
      • RAG Llama Index: RAG (Retrieval-Augmented Generation) project for QA retrieval using LlamaIndex and Deep Lake
      • RAG LangChain Ragas: RAG (Retrieval-Augmented Generation) project for QA retrieval using LangChain and evaluation with RAGAS
    • Data Engineering:
      • Hotel Reviews: Data Engineering project using Prefect, Spark, SQL, dbt, Terraform, Looker, CI/CD and GCP
      • Air Quality Switzerland: Data Engineering project using Mage, dbt, Terraform, Looker, CI/CD and GCP
  • 💸 Additionally, you can find my Power BI projects:

  • :basecamp: Last but not least, I also have a Tableau portfolio using groups, sets, blends, joins, table calculations, storylines, parameters, animations, and other advanced functions


🧮 Tech Stack

Data Science/Engineering/Analysis and ML

Visual Studio Code HTML5 CSS3 Jupyter Notebook MySQL SQLite PostgreSQL Tableau PowerBI Looker Studio Python Pandas NumPy Plotly Matplotlib scikit-learn SciPy TensorFlow PyTorch OpenCV OpenAI FastAPI Flask Docker Anaconda Linux Ubuntu Databricks Google Cloud AWS Azure Grafana Terraform Apache Spark Prefect dbt MLflow GitHub Actions Git Streamlit

Cloud Services

GCP

Cloud Storage BigQuery Cloud Run VM Vertex AI Dataproc Earth Engine Container Registry

Azure

Azure Databricks Data Lake Gen2 Data Factory Container Registry

AWS

S3 EC2 ECR Kinesis Lambda RDS


🧮 Let's Connect!

benito benito

Benito Martin's Projects

dotfiles

Default configuration for Le Wagon's students

zenml

ZenML 🙏: Build portable, production-ready MLOps pipelines. https://zenml.io.
