Dimitri Bertsekas

7.68K subscribers

84 videos

View on YouTube

Latest Videos

1:03:43

Reinforcement Learning, Model Predictive Control, and the Newton Step for Solving Bellman's Equation

Dimitri Bertsekas

8.5K views·11 months ago

1:25:33

Lecture 12, 2025; Training of cost functions, approximation in policy space, policy gradient methods

Dimitri Bertsekas

1.4K views·1 year ago

1:08:20

Abstract Dynamic Programming, Reinforcement Learning, Newton's Method, and Gradient Optimization

Dimitri Bertsekas

3.2K views·1 year ago

1:15:27

Lecture 11, 2025; Adversarial Problems, Minimax Rollout, Use of MPC Methods, Computer Chess

Dimitri Bertsekas

281 views·1 year ago

1:32:12

Lecture 10, 2025; Aggregation Methods for Off-Line Training, Applications to POMDP and Cybersecurity

Dimitri Bertsekas

484 views·1 year ago

1:39:51

Lecture 9, 2025; Rollout and Its Variants for Stochastic and Adaptive Control, Application to Wordle

Dimitri Bertsekas

350 views·1 year ago

1:25:35

Lecture 8, 2025; GPT, HMM, and Markov chains Rollout variants for most likely sequence generation

Dimitri Bertsekas

1.2K views·1 year ago

48:52

New Directions in RL TD(lambda), aggregation, seminorm projections, free-form sampling (from 2014)

Dimitri Bertsekas

679 views·1 year ago

2:00:51

Lecture 7, 2025, Case studies Multi-robot warehouse, data association

Dimitri Bertsekas

474 views·1 year ago

1:24:41

Lecture 6, 2025, Multistep Approximation in Value Space, Constrained Rollout, Multiagent Rollout

Dimitri Bertsekas

757 views·1 year ago

1:58:08

Lecture 5, 2025, Deterministic Rollout and Animations

Dimitri Bertsekas

673 views·1 year ago

1:50:32

Lecture 4, 2025, POMDP, Systems with Changing Parameters, Adaptive Control, Model Predictive Control

Dimitri Bertsekas

982 views·1 year ago

1:25:24

Lecture 3, 2025, LQ Problems, Approximation in Value Space, VI, and PI, Newton's Method, Examples

Dimitri Bertsekas

1.2K views·1 year ago

36:17

Computer chess with model predictive control and reinforcement learning

Dimitri Bertsekas

1.7K views·1 year ago

2:06:50

Lecture 2, 2025, Stochastic finite and infinite horizon DP, approximation in value and policy space

Dimitri Bertsekas

2.3K views·1 year ago

2:04:16

Lecture 1, 2025, Course overview RL and DP, AlphaZero, deterministic DP, examples, applications

Dimitri Bertsekas

7.9K views·1 year ago

54:31

Plenary lecture at IFAC Nonlinear MPC, 2024; Model Predictive Control and Reinforcement Learning

Dimitri Bertsekas

5.1K views·1 year ago

1:59:20

Lecture 1, 2024, course overview RL and DP, AlphaZero, discrete and continuous applications

Dimitri Bertsekas

5.2K views·2 years ago

1:21:08

Lecture 13 2024 Approximate LP. Approximation in policy space, policy gradient methods. Epilogue

Dimitri Bertsekas

623 views·2 years ago

1:29:29

Lecture 12 2024; Off-line training with neural nets for approximate VI and PI. Aggregation

Dimitri Bertsekas

405 views·2 years ago

1:38:04

Lecture 11, 2024 On-line training, neural networks, and other approximation architectures

Dimitri Bertsekas

585 views·2 years ago

1:43:23

Lecture 10, 2024; GPT, HMM, and Markov chains Rollout variants for most likely sequence generation

Dimitri Bertsekas

2.1K views·2 years ago

1:10:09

Lecture 9, 2024, Bayesian optimization and adaptive control with a POMDP approach. Wordle case study

Dimitri Bertsekas

3.0K views·2 years ago

16:42

Acceptance remarks by Dimitri Bertsekas for 2014 Khachiyan Prize of the INFORMS Optimization Society

Dimitri Bertsekas

481 views·2 years ago

1:32:39

Lecture 8, 2024, Rollout for stochastic DP. Value space approx for infinite state and control spaces

Dimitri Bertsekas

529 views·2 years ago

2:07:39

Lecture 7, 2024, Case studies Multi-robot warehouse, multiagent routing, data association

Dimitri Bertsekas

642 views·2 years ago

1:27:03

Lecture 6, 2024, Multistep Approximation in Value Space, Constrained Rollout, Multiagent Rollout

Dimitri Bertsekas

587 views·2 years ago

1:30:28

Lecture 5, 2024, Deterministic Rollout, cost improvement, sequential improvement, multiagent rollout

Dimitri Bertsekas

722 views·2 years ago

1:46:26

Lecture 4, 2024, POMDP, Systems with Changing Parameters, Adaptive Control, Model Predictive Control

Dimitri Bertsekas

797 views·2 years ago

48:15

Polyhedral Approximation Algorithms for Convex Optimization, NIPS 2008

Dimitri Bertsekas

508 views·2 years ago

Load More Videos