wandb / edu Goto Github PK

View Code? Open in Web Editor NEW

479.0 479.0 236.0 24.01 MB

Educational materials on deep learning by Weights & Biases

Home Page: http://wandb.ai

License: GNU General Public License v2.0

Python 9.56% Jupyter Notebook 90.29% HTML 0.05% JavaScript 0.05% Shell 0.02% Dockerfile 0.03%

edu's Issues

Add README for lightning section

LightingDataModule for MNIST is auto-encoder specific

Need to refactor it into a base class and two dataloaders for classification and for image tasks

Remove this extra space

edu/lightning/utils.py

Line 262 in e2e3347

Fix typos in autoencoder-mnist.ipynb

Choosing an output activate
- Review the logged outputs of your neural network in the Weights & Biases interface and look for issues caused by this choice of activation.
- For example, here they are always between 0 and 1, to match the data they are compared against, while while activation values have no such restriction.
Change Hyperparameters
- Better results can be obtained by tweaking the hyperparameters.
Challenge: Regularization and Learned Weights
- With the default settings, the leanred filters don't look like that at all.

https://github.com/wandb/edu/blob/4a438532a4cd53a025c348ecabed93bcc5dee646/lightning/autoencoder/autoencoder-mnist.ipynb

Add headers for Math4ML notebooks

Based on new art collateral

Add more autograded exercises to calculus notebook

Ideas

Little-o exercises. They'd be multiple choice, basically. And/or: provide a function that is little-o of x^2? I can check this with sympy.
Linearity of the gradient approximation. Write a function to compute gradient approx, given the function and its gradient. Compare to finite differences?

add wandb login to the setup section of all NBs

this way, folks are always logged in before they get to the meat of the code

Test the CNN implementation for robustness to hyperparameter changes

in particular, how much can i change the sizes of the convs before the adaptive pooling layer complains? how much can i change the target size of the adaptive pooling layer? how deep can i go before MNIST images are too small?

improve artifacts integration with profile

better naming and saving
add config metadata

rethink the logging code from the LoggedLitModule

This was written when I had half as much experience with Lighting as I have now, and before the most recent integration.

I should rethink it, with emphasis on the following:

moving material into callbacks
cutting down on the magic parameter inference
improve robustness (see #25)
reorganize and make hierarchical (see #20)
writing some tests, including e2e tests that write runs to wandb and pull metrics

run Colab-specific installs only on colab

right now, the installs are run regardless of environment, but they should only be run in Colab.

just need to move the !pip install commands into the appropriate if branch and then apply %%capture to the cell

Consider creating a nn utilities file

On the one hand, it will reduce code duplication across Colabs, but on the other, it really needs to be done well if it's going to be used everywhere -- have to make sure it's e.g. DDP compatible, the logging could be done better (more callbacks?), and want to use PL best practices as much as possible.

Use .modules in parameter pruning in performance/pruning_cnn

See projects/constrained_emotion_classifier.ipynb for an example. This is better than the name-sensitive style being used elsewhere.

use torchmetrics

beginning to be deprecated

add "run in Colab" badges to Math4ML notebooks

waiting on the branches to be correct

image_size handling in FilterLogCallback

Doesn't need to be provided for convolutional networks.

Should fix and possibly update the docs example.

Organize Math4ML into folders

Adding more content and at the same time de-emphasizing existing content, but we don't want to delete that content, so let's make a nested structure with folders and subfolders

-- 00_{topic}/
  |  exercises.ipynb
  --  extras/
    |  {subtopic}.ipynb
-- 01_{topic}/

Separate out "extras" nbs for calculus

Retain only the components that are easily autograde-able or could be converted to autograding.

Add W&B branding to notebooks

crib the HTML from the notebooks in the examples repo.

Print model during training so that it gets logged to wandb

Should be as easy as calling print(model) inside the right hook, which might be on_train_start, with the LoggedLitModule

Better model saving for PyTorch

Caught on a dilemma with saving PyTorch models for viewing in Netron

OT1H, just saving as a .pt file results in unreliable performance by Netron, who can't really be expected to handle all the possible choices in both major libraries, and so prefers ONNX and lets them handle the conversion, but
OTOH, ONNX can't handle e.g. the AdapativeAvgPool2d layer, important for CNNs that are easy to play with. Seems like a pretty fundamental limitation w.r.t adaptive layers.

In particular, for the MLP and CNN in PyTorch, I want to emphasize reusability and enable easy extension, and so I'm using ModuleLists and custom Modules. Neither is really gonna play nicely with Netron.

For now, I'm sticking with .pt files, which save but aren't visualized well (the ModuleLists aren't expanded, and that's where all the action is!).

Make extras compatible with Colab

Would like to drop reliance on the W&B Hub, and Colab seems like the least-bad choice.

Options:

Local. No lol. Installation issues are a dealbreaker.
Binder. Ephemerality will be frustrating for students, and is a dealbreaker for classes where the HW is done between sessions.
Colab. Avoids ephemerality. The limitation to the matplotlib inline backend for interactive charts is frustrating but not a showstopper.
Gradient. Requires a login, possibly expensive, unclear if better, GPUs not needed for this class.

Switch from `lambda xs: xs` as default activation in `LitMLP`

torch.nn.Identity serves the same purpose, but is less jank.

See cnn.ipynb.

Size mismatch between rand_pmf and range(5) ?

edu/math-for-ml/03_probability/utils/tests/q03.py

Line 23 in 135f4f1

>>> neg_exps = [-1 * np.exp(surprise(rand_pmf, ii)) for ii in range(5)]

Deduplicate projects

Architecture search was put under cnn/ when it fits much better under projects/. For now it's duplicated, but it should only be in projects/.

There are currently two versions of the FER project, but there should only be one. Once the more robust NN utils are in place (#20, #25, #26) can reduce.

add browser compatibility warning to profile notebook

The profiling tool may only be compatible with Chrome.

✅ Chrome
❌ Safari
❔ Firefox
❌ Edge (not fully supported by Colab)

Add top-level README

improve the robustness of the utils

Networks that are being pruned with torch.nn.prune break certain assumptions in e.g. the parameter counting, as do I believe the quantized networks.

These should be resolved (incorporating fixes from the relevant notebooks, when possible) so that the utils are more robust.

add in the nice header images from the video series

linearalgebra
calculus
probability

separate the CLT section into another nb

it's interesting, but not autogradable.

handle graphviz dependency

this needs to be installed locally, but is not a pip package

Make a better test for the array_is_pmf question

Right now, doesn't appropriately test whether there are values above 1 or below 0 because the examples don't add up to 1, which is what most folks test for.

np.array([-1, 2]) and/or np.array([-0.5, 1.1, 0.2, 0.2]) would do it.

fix italics differences in "Note:" between colab and binder

_Note_: renders differently between platforms because Markdown is an incomplete spec, resulting in some very wonky formatting in places; use _Note:_ instead.

Add README for keras section

based off of README in ml-class

fix mixed 2-space and 4-space indentation

Colab defaults to 2-space but much of the code is in 4-space -- and other Jupyter instances don't like 2-space indentation

In calculus exercises: is_little_o, identity, constant

Move SVD material

The SVD material is interesting, but hard to make concrete and compelling with the constraints we have (unless I come up with a slick "LA-as-programming" explanation of kernels and maybe also eigenvalues, which is tougher).

I should move it into a separate notebook.

Remove the doubled "output" in this line

Better DataLoader practices

should make the dataloaders more configurable for the AbstractMNISTDataModule -- can probably fix pin_memory to True, but should allow configuration of num_workers (with a default of 2 or nproc, depending on how far we want to go)

standardize # Headers across notebooks

Should be ## Setup Code, # Section X, etc.

Add more "Linear Algebra as Programming" Exercises

This is the core idea of the lecture slides, but there aren't enough exercises for it. They require a certain amount of creativity, but here's a few possibilities:

Shapes and types. Debug a shape issue? Write code that checks a shape, by analogy with checking a type?
Batch dimensions/broadcasting and for loops. Write a for loop that does the same thing as a broadcasted multiply. Make it a "batch" application.
Convolutions and zips. Harder: write a zip that's equivalent to a convolution.
Parallel composition. Use concat/stack operations to implement applying two "functions" to the same input. Something else with the Kronecker product?
repeat. Use matrix multiplication (an outer product?) to copy the input k times.