The recent arrival of Mamba has created considerable buzz within the deep learning community . This novel architecture, unlike conventional Transformers, promises a potential path to improved efficiency and diminished processing requirements. Departing from the quadratic scaling inherent in attention mechanisms, Mamba leverages a structured approac