
Somehow my mu never gets a nonzero gradient; here is the code:

import torch
torch.manual_seed(0)
mu = torch.zeros(1, requires_grad=True)
sigma = 1.0
eps = torch.randn(1)
sampled = mu + sigma * eps
logp = -((sampled - mu)**2) / 2 - 0.5 * torch.log(torch.tensor(2 * torch.pi))
loss = -logp.sum()
loss.backward()
print("eps:", eps.item())
print("mu.grad:", mu.grad.item())  # should be -eps.item()

I consistently get a zero gradient. Is this normal?

Comment: mu + sigma * eps - mu will cancel mu – Commented Apr 14 at 20:00

1 Answer


As @Oscar explained in the comments, the gradient is propagating fine, but your logp does not actually depend on mu (and so neither does your loss).

If the gradient were not propagating to mu at all, mu.grad would not be zero; it would be None.
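To illustrate the zero-vs-None distinction, here is a minimal sketch (not from the original answer): a leaf tensor that participates in the graph but contributes nothing gets a zero gradient, while a leaf tensor that never enters the graph keeps grad = None.

```python
import torch

used = torch.zeros(1, requires_grad=True)
unused = torch.zeros(1, requires_grad=True)

# `used` is part of the graph, but multiplied by 0, so its gradient is 0
loss = (used * 0).sum()
loss.backward()

print(used.grad)    # tensor([0.]) -- gradient propagated, and it is zero
print(unused.grad)  # None -- never entered the graph at all
```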

This is because

sampled = mu + sigma * eps
logp = -((sampled - mu)**2) / 2 - 0.5 * torch.log(torch.tensor(2 * torch.pi))

So, inlining sampled into the definition of logp:

logp = -((mu + sigma * eps - mu)**2) / 2 - 0.5 * torch.log(torch.tensor(2 * torch.pi))

i.e., after simplification:

logp = -((sigma * eps)**2) / 2 - 0.5 * torch.log(torch.tensor(2 * torch.pi))

So logp really does not depend on mu, and the gradient with respect to mu is 0.
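If the expected gradient of -eps from the question's comment is what you are after (i.e., a score-function gradient that treats the drawn sample as fixed data rather than as a function of mu), one way to get it is to detach the sample before evaluating the log-density. This is a sketch of that assumption about the intent, not part of the original answer:

```python
import math
import torch

torch.manual_seed(0)
mu = torch.zeros(1, requires_grad=True)
sigma = 1.0
eps = torch.randn(1)

# detach() cuts the sampling path out of the graph, so logp now
# depends on mu only through the density term (sampled - mu)**2
sampled = (mu + sigma * eps).detach()
logp = -((sampled - mu) ** 2) / 2 - 0.5 * math.log(2 * math.pi)
loss = -logp.sum()
loss.backward()

print("eps:", eps.item())
print("mu.grad:", mu.grad.item())  # now equals -eps.item()
```

Alternatively, if you do want the gradient to flow through the sampling path (the reparameterization trick), the sample must be scored against a density whose parameters are not the same tensors used to generate it; as written, the two cancel exactly.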


