dustinpartain's Projects
A system for quickly generating training data with weak supervision
A machine learning approach to investment portfolio composition. The program analyzes the fundamentals of the listed companies on the S&P1500 in order to emit monthly buy signals.
Public facing repository for Data 100, Spring 2021.
Apache Spark - A unified analytics engine for large-scale data processing
Introduction to Data Science
Python module to generate stochastic reduced order models (SROMs)
Learning embeddings for classification, retrieval and ranking.
š The UI component explorer. Develop, document, & test React, Vue, Angular, Web Components, Ember, Svelte & more!
Streamlit ā The fastest way to build data apps in Python
Style guides for Google-originated open-source projects
OpenStack Storage (Swift). Mirror of code maintained at opendev.org.
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
šÆ Curated interview preparation materials for busy engineers
An Open Source Machine Learning Framework for Everyone
The textbook Computational and Inferential Thinking: The Foundations of Data Science
High-performance TensorFlow library for quantitative finance.
Code, Slides, & Materials for our Tensorflow Workshop Series for Spring Quarter 2017
Master the command line, in one page
A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.
Host repository for The Turing Way: a how to guide for reproducible data science
Magnificent app which corrects your previous console command.
The JavaScript Way book
Tidy Data in Python Jupyter Notebook
š¤ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Trick Simulation Environment. Trick provides a common set of simulation capabilities and utilities to build simulations automatically.
Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code. Tuplex has similar Python APIs to Apache Spark or Dask, but rather than invoking the Python interpreter, Tuplex generates optimized LLVM bytecode for the given pipeline and input data set.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
:books: The definitive guide to TypeScript and possibly the best TypeScript book :book:. Free and Open Source š¹