Hi, excellent tutorials! But I have a question. From tutorial 13 onward you change the place where the `zero_grad` method is called, and I don't get why.
Before tutorial 13 it was:
```python
loss = criterion(outputs, labels)
loss.backward()
optimizer.step()
optimizer.zero_grad()
```

After 13:
```python
loss = criterion(outputs, labels)
optimizer.zero_grad()  # Here is the change
loss.backward()
optimizer.step()
```

Now I am wondering: if you set the gradients to zero, how can the optimizer update the parameters without any information about the gradient?
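For reference, here is a minimal runnable sketch of the second ordering (the model, data, and learning rate are made up for illustration, not taken from the tutorial). Inspecting `.grad` between the calls shows that `backward()` repopulates the gradients after `zero_grad()` clears them, so `step()` still sees fresh gradient information:

```python
import torch
import torch.nn as nn

# Hypothetical model and random data, just to make the loop self-contained
model = nn.Linear(4, 1)
criterion = nn.MSELoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)

inputs = torch.randn(8, 4)
labels = torch.randn(8, 1)

outputs = model(inputs)
loss = criterion(outputs, labels)

optimizer.zero_grad()     # clears stale gradients left over from the previous iteration
loss.backward()           # fills .grad with fresh gradients for the current batch
print(model.weight.grad)  # non-zero, because backward() ran after zero_grad()
optimizer.step()          # updates parameters using those fresh gradients
```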