Comments (1)
While the original YOLOv1 paper used SGD with momentum and weight decay, it's worth noting that the choice of optimizer can be a hyperparameter and may not be set in stone.
Adam is an adaptive optimizer that can converge faster than SGD with momentum in some cases. Adam adjusts the learning rate for each parameter based on the gradient variance and the historical gradient, which helps in cases where the gradients for different parameters vary significantly.
In contrast, SGD with momentum adjusts the learning rate based on the moving average of the gradient, which can be less effective when the gradient variance is high. Therefore, Adam can be a good choice for neural networks that have many parameters and complex architectures like YOLOv1.
Additionally, while the original YOLOv1 paper used SGD with momentum, subsequent research has shown that Adam can outperform SGD in some cases, especially for deep learning models with complex architectures. Therefore, the choice of optimizer can depend on the specific problem and the architecture of the neural network.
from machine-learning-collection.
Related Issues (20)
- Why aren't you transposing the input in multi-head attention? HOT 1
- YOLO ground truth width and length are not relative to image size but to S
- Question in self-attention from 'transformer from scratch'
- Can you add a cff so your work can be cited? HOT 1
- Error in train.py HOT 1
- add header=None to pd.read_csv
- -
- Tensor tutorial 3: Neural Networks with Sequential and Functional API Issue
- Pretrained weight for semantic segmentation
- Image Captioning gives following error: TypeError: relu(): argument 'input' (position 1) must be Tensor, not InceptionOutputs HOT 1
- Re. Height and Width of image, mask or masks should be equal. You can disable shapes check by setting a parameter is_check_shapes=False of Compose class
- Weights ESRGAN
- YOLO v1 loss
- SelfAttention bug on Scores * V HOT 1
- Pytorch/GANs /CycleGAN/generator_model.py | Test function has a minor issue.
- Issue with YOLOv3 Anchors on Scale HOT 2
- ConvBlock for Discriminator
- type error: Trainer.__init__() got an unexpected keyword argument 'auto_lr_find'
- why is z_dim=64 in simple GAN code
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from machine-learning-collection.