Chengshuo Dai

Chengshuo Dai

Self-driven AI Engineer.
Yale University.

Personal Website 🔗

👋 Welcome to my Blog

Hi, this is Chengshuo. I'm documenting my learning notes in this blog. Feel free to email me if there is any mistakes in the notes. 😉

Jan 09, 2026

Breaking the Memory Bottleneck: PagedAttention and RadixAttention

Inference OptimizationLLM Fundamentals
Jan 06, 2026

Optimizing Transformer Attention: From Multi-Head to Grouped-Query and FlashAttention

LLM FundamentalsInference Optimization
Dec 31, 2025

LLM Inference Optimization: Post-Training Quantization Techniques

Inference Optimization
Dec 28, 2025

揭秘KV Cache:为什么大模型推理这么吃显存?

Inference Optimization

Categories

  • All (48)
  • AI Safety (1)
  • AI Search (3)
  • Agent (6)
  • Agentic RAG (1)
  • Alignment (6)
  • DNS (1)
  • Embedding & Vector Database (3)
  • Fine-Tuning (2)
  • GraphRAG (1)
  • Inference Optimization (4)
  • LLM Evaluation (3)
  • LLM Fundamentals (11)
  • LLM Inference (1)
  • Model Architecture (4)
  • Performance Optimization (3)
  • Personal Reflextion (1)
  • Post-Training (1)
  • Prompt Engineering (6)
  • RAG (4)
  • Scaling Laws (1)
  • Search (1)
  • Supervised Fine-tuning (5)
  • System Design (3)
  • Vibe Coding (1)
  • Web Deployment (1)