I was thinking of trying this Zamba implementation. Reading the code, one loop looks odd. Is this some deliberate global weight-sharing scheme tied to the fractal shared-weight part? I don't understand it: it looks like it feeds the same input to the same layer over and over and drops every output except the last one. Or is it a typo? If it's a typo, the fix would need to be:
```python
out = x
for layer in self.layers:
    out = layer(out)
```
For reference, the forward method as it currently stands:

```python
def forward(self, x) -> Tensor:
    # Embed tokens
    x = self.embed(x)
    if self.post_embed_norm is not False:
        x = self.norm(x)
    for layer in self.layers:
        out = layer(x)  # every iteration uses the original x, not the previous layer's output
    # return OutputHead(self.dim, 1, self.vocab_size)(x)
    if self.output_head_on is not False:
        out = OutputHead(self.dim, 1, self.vocab_size)(x)
    else:
        return out
```
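For completeness, a minimal sketch of what the whole forward could look like with the layers actually chained. This assumes the intent is plain sequential layer application and that the output head should consume the final layer's output rather than the raw embedding; the inline OutputHead construction is kept as in the original snippet.

```python
def forward(self, x) -> Tensor:
    # Embed tokens
    x = self.embed(x)
    if self.post_embed_norm is not False:
        x = self.norm(x)
    # Chain the layers: each layer consumes the previous layer's output
    out = x
    for layer in self.layers:
        out = layer(out)
    if self.output_head_on is not False:
        # OutputHead kept inline as in the original snippet
        return OutputHead(self.dim, 1, self.vocab_size)(out)
    return out
```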