- April 1, 2025
RL Bite: Monotonic Policy Improvement and Deriving Proximal Policy Optimization (PPO)
- March 8, 2025
RL Bite: Policy Gradient and Reinforce
- March 3, 2025
RL Bite: Learning the Q Function
- February 23, 2025
TLDR; Graph Contrastive Learning: Representation Scattering
- February 18, 2025
RL Bite: Computing the Value Function
- February 16, 2025
TLDR; HC-GAE The Hierarchical Cluster-based Graph Auto-Encoder for Graph Representation Learning
- February 12, 2025
RL Bite: Bellmans Equations and Value Functions
- February 10, 2025
TLDR; Duplex: Dual GAT for Complex Embeddings of Directed Graphs
- February 5, 2025
RL Bite: Exploitation vs Exploration
- December 16, 2024
Graph Neural Networks meet Large Language Models
- December 2, 2024
Hymba, a new breed of SSM-Attention Hybrids
- November 25, 2024
Transform any LLMs to a powerful Encoder
- November 20, 2024
2025 Year of Zig
- October 28, 2024
Distilling State Space Models from Transformers
- October 14, 2024
Illusion of State in SSMs like Mamba
- September 18, 2024
Mamba(2) and Transformer Hybrids: An Overview
- August 29, 2024
Hydra a Double Headed Mamba
- August 8, 2024
From Mamba to Mamba-2
- July 12, 2024
Butterflies, Monarchs, Hyenas, and Lightning Fast BERT
- June 12, 2024
BinT5 and HexT5 or T5 and Binary Reverse Engineering