Giter Club home page Giter Club logo

Hello Internet! I am Vincent, a Software Engineer based in Switzerland. I have been focusing on Data Engineering and Machine Learning Operations (MLOps) over the past few years: data pipelines, CI/CD frameworks, workflow management, APIs..

So far, I have worked in several domains, namely public health, financial services and consumer electronics. I conducted a wide variety of projects involving tools like containers (e.g. Docker, Kubernetes), cloud providers (AWS, GCP), distributed computing (e.g. Spark), databases (e.g. Snowflake) and Machine Learning (e.g PyTorch, XGBoost, pandas, scikit-learn...). I use Linux shells, Git/GitHub, Python and SQL on a daily basis.

I created this GitHub account to archive projects and code that might be useful to other people, including future me. All the content is under MIT license. Feel free to use anything you may find interesting. You can find more digestible versions of most of these repositories on my blog, blog.vlgdata.io. I am also on LinkedIn.

Thanks for reading!

Links

Vincent Le Goualher's Projects

asymmetric_loss icon asymmetric_loss

Implement a custom asymmetric loss to train and drive a regression model towards underestimation or overestimation

datatrigger icon datatrigger

My blog about data engineering and machine learning operations

nlp_hugging_face icon nlp_hugging_face

Text classification with the transformers library from Hugging Face, by fine-tuning DistilBERT or using summarization + Zero-Shot classification.

scaling icon scaling

How to properly split and scale a dataset using Python, Spark & R modules.

school_projects icon school_projects

Here are some projects I did during my master of science in statistics and data science at ISUP, Paris.

shiny_apps icon shiny_apps

Run the CLT Shiny app : https://datatrigger.shinyapps.io/CLT_Visualization/

subtotals icon subtotals

Adding totals and subtotals rows with pandas / the tidyverse

sum_random_variables icon sum_random_variables

Contour plots to answer this question: if the sum of two random variables is large, are they likely to be both large ?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.