In the module MambaTransformer/mamba_transformer, you execute the following in class MambaTransformerblock:
# Layernorm
self.norm = nn.LayerNorm(dim)

def forward(self, x: Tensor) -> Tensor:
    for mamba, attn, ffn in zip(
        self.mamba_blocks,
        self.transformer_blocks,
        self.ffn_blocks,
    ):
        x = self.norm(x)
        x = mamba(x) + x
        x = self.norm(x)
        x = attn(x) + x
        x = self.norm(x)
        x = ffn(x) + x
    return x
Since nn.LayerNorm has trainable parameters, you appear to be applying the same layer norm three times per iteration of the forward loop, so all three normalizations (before the Mamba, attention, and FFN sub-layers) share tied parameters. Is that what you really want?
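For comparison, here is a minimal sketch of the untied alternative: one LayerNorm per sub-layer, keeping the same residual structure as the quoted forward. The class name, constructor signature, and the names mamba_norms / attn_norms / ffn_norms are illustrative, not taken from the repository:

import torch.nn as nn
from torch import Tensor

class MambaTransformerblockUntied(nn.Module):
    """Hypothetical sketch: same loop as the quoted block, but each
    sub-layer gets its own independently trained LayerNorm."""

    def __init__(self, dim: int, depth: int, mamba_blocks, transformer_blocks, ffn_blocks):
        super().__init__()
        self.mamba_blocks = mamba_blocks
        self.transformer_blocks = transformer_blocks
        self.ffn_blocks = ffn_blocks
        # One LayerNorm per sub-layer and per depth step, so no parameters are tied
        self.mamba_norms = nn.ModuleList(nn.LayerNorm(dim) for _ in range(depth))
        self.attn_norms = nn.ModuleList(nn.LayerNorm(dim) for _ in range(depth))
        self.ffn_norms = nn.ModuleList(nn.LayerNorm(dim) for _ in range(depth))

    def forward(self, x: Tensor) -> Tensor:
        for mamba, attn, ffn, n_m, n_a, n_f in zip(
            self.mamba_blocks,
            self.transformer_blocks,
            self.ffn_blocks,
            self.mamba_norms,
            self.attn_norms,
            self.ffn_norms,
        ):
            # Same residual pattern as the original, with separate norm parameters
            x = n_m(x)
            x = mamba(x) + x
            x = n_a(x)
            x = attn(x) + x
            x = n_f(x)
            x = ffn(x) + x
        return x

Whether the shared norm is intentional (as a form of weight tying) or an oversight is the question; most pre-norm Transformer-style stacks use a distinct LayerNorm per sub-layer as sketched above.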