Giter Club home page Giter Club logo

Portfolio link: https://sites.google.com/view/sourav9827/home

Linkedin Link: www.linkedin.com/in/sourav-mandal-390064210

mail id: [email protected]

#Data scientist

What is this and how do I use it?

  • This is a curated list of delightful resources for everything you need to develop Machine Learning solutions.
  • Each item in this list will teach you at least one distinct and significant skill or piece of information.
  • There are three content levels:
    1. πŸ₯ Essential reading for all ML engineers
    2. 🐍 Advanced reading for professional ML engineers
    3. πŸ¦„ Expert material for expert ML engineers
  • Descriptions are written to complete the sentence "After reading this article you will have learned ...".

Contents

Communication

Software Engineering

API design

Workflow

Python patterns

Typing

  • 🐍 The Comprehensive Guide to mypy - How to write type annotations in Python (1 hour)
  • 🐍 Pydantic overview - How to write type annotations for complex types instead of a meaningless Dict[str, Any] (1 hour)
  • 🐍 Magic number - Why magic values are an anti-pattern (15 min)
  • 🐍 Enums - How to write Enums in Python instead of type-unsafe magic values (15 min)
  • πŸ¦„ Mypy generics - How to use TypeVars to write generic types such as List[T] (30 min)
  • πŸ¦„ Mypy protocols - How to use Protocols to define interfaces such as Iterable (30 min)

Curated Python packages

Workflow

Code quality

  • πŸ₯ black - Automatically format your code
  • πŸ₯ isort - Automatically sort your import statements
  • 🐍 pre-commit - Automatically run code quality checks on commit
  • 🐍 bandit - Find common security issues
  • 🐍 darglint - Check that your docstrings match your function signature
  • 🐍 flake8 - Check your code for bugs and that your code style is PEP8-compliant
  • 🐍 flake8 extensions - An awesome list of Flake8 extensions
  • 🐍 mypy - Check the type-correctness of your code
  • 🐍 pre-commit hooks - A collection of pre-commit hooks that check file quality
  • 🐍 pydocstyle - Check that your code is documented
  • 🐍 pygrep hooks - A collection of pre-commit hooks that check for common Python code smells
  • 🐍 pytest-recording - Record and play back HTTP requests in your pytest tests
  • 🐍 pyupgrade - Check that your code is written using the latest Python language features
  • 🐍 safety - Check that your dependencies don't have any known security vulnerabilities
  • 🐍 shellcheck - Check the quality of your shell scripts
  • 🐍 coverage.py - Check your code's test coverage
  • πŸ¦„ hypothesis - Write tests that automatically look for edge cases that break your code
  • πŸ¦„ hypothesis-auto - Automate generate Hypothesis tests based on your code's type annotations

Application development

  • 🐍 fastapi - Create RESTful APIs based on type annotations
  • 🐍 typer - Create CLIs based on type annotations
  • 🐍 streamlit - Create web apps with a single Python file

Utilities

  • 🐍 bump2version - Release a new version of your package
  • 🐍 coloredlogs - Increase your logs' readability with colour
  • 🐍 hvplot - Create interactive plots from pandas dataframes
  • 🐍 mkdocs - Create developer documentation for your project
  • 🐍 pdoc - Generate API documentation for your code
  • 🐍 birdseye - Graphically debug your Python code
  • 🐍 scalene - Profile your code's CPU and memory usage by line
  • 🐍 viztracer - Vizualize your code's performance with a flamegraph
  • 🐍 tqdm - Easily add progress bars to long-running jobs

Machine Learning

Practical theory

Explainability

Unsupervised

Classification

Regression

Computer Vision

Natural Language Processing

Time Series Analysis

Recommender Systems

Tensor computation libraries

Pandas

Sci-kit learn

Labelling

DevOps

CI/CD

  • 🐍 invoke - How to implement common tasks you run on your project as a CLI (30 min)
  • 🐍 poe - How to implement common tasks you run on your project as a CLI (30 min)

Environment and dependency management

Docker

Data pipelines

  • 🐍 Great Expectations - How to test and document your data and data pipelines (30 min)

Shell

Terraform

Infrastructure

(RADIX-AI)(https://github.com/radix-ai/awesome-machine-learning-engineer)

Sourav Mandal's Projects

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.