1:12:18Diffusion Transformers (DiT) Explained Replacing U-Nets with TransformersColby豆布斯12 views·1 week ago
8:58Reinforcement Learning 105 RLHF & Reinforcement Fine-Tuning ExplainedColby豆布斯19 views·2 weeks ago
35:22Reinforcement Learning Explained (RL 101 Intuition, MDP, Policy, Value)Colby豆布斯54 views·1 month ago