Latest Videos
What Are Prompt Engineering, Context Engineering, and Harness Engineering?
程序员老王
13.4K views·3 weeks ago
Deploy large models locally! Run DeepSeek-R1 with the Transformers library
程序员老王
4.2K views·2 months ago
Training Principles of Large Models: Gradient Descent, Starting with a Straight Line
程序员老王
8.6K views·5 months ago
Anaconda, Miniconda, conda-forge, Miniforge & Mamba Explained in 15 Minutes!
程序员老王
24.5K views·9 months ago
Understand Python Project Structure and Packaging in 15 Minutes with build + hatchling
程序员老王
18.7K views·9 months ago
From pip to uv: A Complete Guide to the Modern Python Project Management Workflow
程序员老王
103.5K views·10 months ago


![What Are Attention Residuals? [Papers in Plain Language]](https://i.ytimg.com/vi/pGYrWsNQ8A0/hqdefault.jpg?sqp=-oaymwEcCNACELwBSFXyq4qpAw4IARUAAIhCGAFwAcABBg==&rs=AOn4CLBvD0XLwN2zUhaPvg0_e0ZsyPJ5TA)






![Training a Handwritten Digit Recognition Model with 30 Lines of Code [PyTorch in Action]](https://i.ytimg.com/vi/qL6ca-mIeMI/hqdefault.jpg?sqp=-oaymwEcCNACELwBSFXyq4qpAw4IARUAAIhCGAFwAcABBg==&rs=AOn4CLC6phFA9OeY2FbUHU62KBc-WvFkbA)






![Is AI Chain-of-Thought an Illusion? [Papers in Plain Language]](https://i.ytimg.com/vi/ZLDfTwHm56A/hqdefault.jpg?sqp=-oaymwEcCNACELwBSFXyq4qpAw4IARUAAIhCGAFwAcABBg==&rs=AOn4CLChjZBH6TDqQgdxMxw-O5G-vRwB-Q)












