Hope these will be as helpful to you today as they will be for future me.
view by tags:   
coding (5)    
paper (2)    
LLM (1)    
[Research Preview] Thinking Preference Optimization
01/24/2025
[ LLM ]
[ LLM ]
enhance model reasoning by using long/short CoT as preferred/rejected examples in DPO
Optimizers: math, implementations and efficiency
01/22/2025
[ coding ]
[ coding ]
From math to optimized code: implementing optimizers with PyTorch comparisons
LLM Tech Report Notes (updated on 01/22/2025)
01/21/2025
[ coding ]
[ coding ]
reading LLM tech reports.
Reproduce the inference time scaling exp
12/30/2024
[ paper ]
[ paper ]
dive into the minimal experiment to show the inference time scaling.
Cross-entropy loss and its optimization [WIP]
12/12/2024
[ coding ]
[ coding ]
dive into cross-entropy loss and its optimization.
Graph Convolution ≈ Mixup
11/20/2024
[ paper ]
[ paper ]
one of my most liked papers.
Attention and its gradient
10/20/2024
[ coding ]
[ coding ]
dive into attention and its gradient.
Softmax and its triton implementation
10/19/2024
[ coding ]
[ coding ]
implementing softmax using triton