Exponential search collapses to a polynomial sweep. Modern AI's secret.
Bellman's equation: V(s) = max_a [r + γ V(s')]. Optimal solutions are made of optimal sub-solutions. From this one observation comes dynamic programming, every shortest-path algorithm, AlphaGo, and most of modern reinforcement learning.
— Eigen Box · λ —
Math · Physics · Engineering
▸ Subscribe: https://youtube.com/@EigenBox
♫ Music: "Name The Time And Place" by Telecasted (YouTube Audio Library)
#eigenbox #math #science #engineering #physics
Download
0 formats
No download links available.
The Recursion That Powers AlphaGo (Bellman) | NatokHD