When using the `MambaBlock` from the zeta library, I defined an MSE loss to test backpropagation and found the computation very slow: with a sequence length of 1024, the backward pass takes a long time to complete. Code shown below:
```python
import torch
import torch.nn as nn
from zeta.nn import MambaBlock

# Single Mamba block: model dim 512, depth 1
block = MambaBlock(dim=512, depth=1)

# Batch of 1, sequence length 1024, feature dim 512
x = torch.randn(1, 1024, 512)
target = torch.randn(1, 1024, 512)

loss_fn = nn.MSELoss()
y = block(x)
loss = loss_fn(y, target)
loss.backward()  # this is where the long wait occurs

print("Output shape:", y.shape)
print("Loss value:", loss.item())
```
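To help pin down where the time goes, here is a minimal sketch that times the forward and backward passes separately. `time_fwd_bwd` is a hypothetical helper written for this report, and a plain `nn.Linear` stands in for `MambaBlock` so the snippet runs without zeta installed; substitute the real block to reproduce the measurement. (On GPU you would also need `torch.cuda.synchronize()` around each timing point, since CUDA kernels launch asynchronously.)

```python
import time
import torch
import torch.nn as nn

def time_fwd_bwd(block, x, target, loss_fn):
    """Return (forward seconds, backward seconds) for one pass.

    Hypothetical helper for profiling; on CPU, perf_counter around
    each phase is enough because execution is synchronous.
    """
    t0 = time.perf_counter()
    y = block(x)                 # forward pass
    t1 = time.perf_counter()
    loss = loss_fn(y, target)
    loss.backward()              # backward pass (the suspected bottleneck)
    t2 = time.perf_counter()
    return t1 - t0, t2 - t1

# Stand-in module with the same input/output shape as the report's block.
block = nn.Linear(512, 512)
x = torch.randn(1, 1024, 512)
target = torch.randn(1, 1024, 512)

fwd_s, bwd_s = time_fwd_bwd(block, x, target, nn.MSELoss())
print(f"forward: {fwd_s:.4f}s  backward: {bwd_s:.4f}s")
```

Comparing the two numbers for the real `MambaBlock` would show whether the slowdown is specific to backpropagation or already present in the forward pass.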