Mainak's PMRF Tutorials

612 subscribers

81 videos

This channel contains tutorial sessions in Mathematics, Artificial Intelligence, and Science.

Latest Videos

1:46:51

Session 4 Diffusion Denoising Probabilistic Model DDPM

32 views·18 hours ago

1:53:12

Session 3 Beta VAE, Info VAE, Implementing-VAE

122 views·3 days ago

1:44:50

Session 2 Variational Inference, Autoencoders, Variational Autoencoders

68 views·5 days ago

2:05:43

Session 1 GenAI, Latent Variable, EM, GMMs

159 views·5 days ago

1:48:51

Session 21 Actor Critic based Policy Gradient, Safe RL, Planning, DYNA, Curriculum Learning

268 views·11 months ago

1:50:03

Session 20 Deep Neural Networks, MLP, Backpropagation, Policy Gradient, REINFORCE

99 views·11 months ago

1:54:17

Session 19 Asynchronous Q learning, Classification in ML, MLE, Logistic and Softmax Regression

266 views·11 months ago

1:57:08

Session 18 Synchronous Q-learning, Model-free, based, tabular, with Linear Fn. Approx., Convergence

54 views·11 months ago

1:39:10

Session 17 Off-Policy Evaluation of TD0 with linear function Approximation, Emphatic TD0

42 views·11 months ago

1:42:49

Session 16 γ contraction, Banach's Fixed Point Theorem, How far is it far from the intended optimal

56 views·11 months ago

1:52:56

Session 15 TD(0) convergence proof (contd), Point of Convergence of TD(0) (linear function approx.)

58 views·11 months ago

1:54:39

Session 14 TD0 with linear function approximation, Glimpse at Stochastic Approximation Algorithm(1)

87 views·1 year ago

1:45:21

Session 13 Function Approximation in RL, Policy Evaluation, SGD Monte Carlo, TD(0) Implementation

121 views·1 year ago

1:49:15

Session 12 On Policy vs Off Policy Algorithms, Importance Sampling, Model-free Q learning, SARSA

131 views·1 year ago

1:44:33

Session 11 Model Free Methods, Monte Carlo, Temporal Difference Algorithm, TD(λ) Algorithm

111 views·1 year ago

1:51:52

Session 10 Stochastic Shortest Path, Bellman Operators, Proof of convergence of Policy Evaluation

134 views·1 year ago

1:55:36

Session 9 Policy Iteration & Q learning code, Finite Horizon MDPs, Dynamic Program, Theory and Exmp

141 views·1 year ago

1:48:24

Session 8 Bellman Equation, Optimal Policy, Iterative Policy Evaluation, Policy & Value Iteration

169 views·1 year ago

1:51:33

Session 7 MDPs, Action, Value, Reward functions, Bellman Equations 1, Examples

183 views·1 year ago

1:53:14

Session 6 Random Processes, Markov Chains and Stationary Distribution

165 views·1 year ago

1:50:48

Session 5 ODE Interpretation in Bandits, UCB, Gradient-Based Algorithms, UCB in Python

164 views·1 year ago

1:42:57

Session 4 Introduction to Reinforcement Learning, Multi-armed Bandits Algorithm and Implementation

387 views·1 year ago

1:54:27

Session 3 Recap on Joint Distributions, Conditional Distributions, and Conditional Expectations

146 views·1 year ago

1:48:30

Session 2 Recap - Continuous Distributions, Transformation of random variables

215 views·1 year ago

1:56:11

Session 1 Recap on Random Variables, Exemplar Discrete Distributions, Expectations

480 views·1 year ago

2:20:38

Session 24 Mixture Models, Expectation Maximization, GMMs, K-means is a specialized GMM

444 views·1 year ago

2:07:14

Session 23 Dimensionality Reduction - Principal Component Analysis, Linear Discriminant Analysis

373 views·1 year ago

2:02:29

Session 22 Unsupervised Learning, Clustering algorithms, K-means, K-medoids, and Hierarchical

268 views·1 year ago

1:41:05

Session 21 Backpropagation, Dropout, Bias-variance tradeoff, Prevent overfitting or underfitting

375 views·2 years ago

1:59:31

Session 20 Perceptron, Perceptron Learning Algorithm, Convergence Proof, MLPs, Forward Propagation

257 views·2 years ago
