Chengshuo Dai

Chengshuo Dai

Self-driven AI Engineer.
Yale University.

Personal Website 🔗

👋 Welcome to my Blog

Hi, this is Chengshuo. I'm documenting my learning notes in this blog. Feel free to email me if there is any mistakes in the notes. 😉

Jan 09, 2026

Breaking the Memory Bottleneck: PagedAttention and RadixAttention

Inference OptimizationLLM Fundamentals

Jan 06, 2026

Optimizing Transformer Attention: From Multi-Head to Grouped-Query and FlashAttention

LLM FundamentalsInference Optimization

Dec 31, 2025

LLM Inference Optimization: Post-Training Quantization Techniques

Inference Optimization

Dec 28, 2025

揭秘KV Cache：为什么大模型推理这么吃显存？

Inference Optimization