mamba paper Options
lastly, we provide an example of a complete language model: a deep sequence model spine (with repeating Mamba blocks) + language product head. Even though the recipe for forward go must be outlined within this function, 1 must simply call the Module If passed along, the product takes advantage of the prior condition in the many blocks (which is a