Back to portal

Hyperparameters & Tuning

Turn the knobs that separate a 75% model from a 95% model

0% · 0 of 4 steps completed · ~90 min · Learning rate, batch size, and architecture experiments
1
What Are Hyperparameters?
Parameters vs hyperparameters
2
Learning Rate Sensitivity
The most important knob to turn
3
Batch Size Effects
Gradient noise vs training stability
4
Architecture Search
Systematic search over width and depth