Xiaotian Han

Max Han

  • home
  • teaching
  • group
  • blog

  • coding
  • 2025-01-22 » Optimizers: math, implementations and efficiency
    2025-01-21 » LLM Tech Report Notes (updated on 01/22/2025)
    2024-12-12 » Cross-entropy loss and its optimization [WIP]
    2024-10-20 » Attention and its gradient
    2024-10-19 » Softmax and its triton implementation

  • paper
  • 2024-12-30 » Reproduce the inference time scaling exp
    2024-11-20 » Graph Convolution ≈ Mixup

  • LLM
  • 2025-03-07 » [Research Preview] Speculative Thinking: Large Models Mentoring Small Models for Efficient Reasoning
    2025-01-24 » [Research Preview] Thinking Preference Optimization



Copyright © 2025 Xiaotian Han; last updated on 01/22/2025