Posts

April 10, 2025 RL Bite: Monte Carlo Search Tree
April 1, 2025 RL Bite: Monotonic Policy Improvement and Deriving Proximal Policy Optimization (PPO)
March 8, 2025 RL Bite: Policy Gradient and Reinforce
March 3, 2025 RL Bite: Learning the Q Function
February 23, 2025 TLDR; Graph Contrastive Learning: Representation Scattering
February 18, 2025 RL Bite: Computing the Value Function
February 16, 2025 TLDR; HC-GAE The Hierarchical Cluster-based Graph Auto-Encoder for Graph Representation Learning
February 12, 2025 RL Bite: Bellmans Equations and Value Functions
February 10, 2025 TLDR; Duplex: Dual GAT for Complex Embeddings of Directed Graphs
February 5, 2025 RL Bite: Exploitation vs Exploration
December 16, 2024 Graph Neural Networks meet Large Language Models
December 2, 2024 Hymba, a new breed of SSM-Attention Hybrids
November 25, 2024 Transform any LLMs to a powerful Encoder
November 20, 2024 2025 Year of Zig
October 28, 2024 Distilling State Space Models from Transformers
October 14, 2024 Illusion of State in SSMs like Mamba
September 18, 2024 Mamba(2) and Transformer Hybrids: An Overview
August 29, 2024 Hydra a Double Headed Mamba
August 8, 2024 From Mamba to Mamba-2
July 12, 2024 Butterflies, Monarchs, Hyenas, and Lightning Fast BERT