- October 28, 2024
Distilling State Space Models from Transformers
- September 18, 2024
Mamba(2) and Transformer Hybrids: An Overview
- August 29, 2024
Hydra a Double Headed Mamba
- August 8, 2024
From Mamba to Mamba-2
- July 12, 2024
Butterflies, Monarchs, Hyenas, and Lightning Fast BERT
- June 12, 2024
BinT5 and HexT5 or T5 and Binary Reverse Engineering
- June 1, 2024
CodeT5 and CodeT5+
- April 29, 2024
Longer Context for T5
- March 6, 2024
T5 the Old New Thing
- February 6, 2023
Paper overview: Hungry Hungry Hippos: Towards Language Modeling with State Space Models