Reinforcement Learning LLM - Search Videos

Distributed RL training for LLM explained part 1

MSN - Deep Learning with Yacine

An introduction to distributed reinforcement learning for large language models covering core concepts, training setup, and why scaling matters. #AI #MachineLearning #LLM

Deep Reinforcement Learning

Lecture 14 | Deep Reinforcement Learning

YouTube - Stanford University School of

386.1K views - Aug 11, 2017

Grokking Deep Reinforcement Learning - Miguel Morales

Understanding Reinforcement Learning Environment and Rewards

47.1K views - Apr 1, 2019

Top videos

Reinforcement Learning for LLM Reasoning. RL / RLHF / RLAIF.

YouTube - Byte Goose AI.

185 views - 6 months ago

Proximal Policy Optimization (PPO) - How to train Large Language Models

YouTube - Luis Serrano Academy

83.3K views - Jan 24, 2024

Reinforcement Learning (RL) for LLMs

YouTube - Natasha Jaques

13.9K views - Mar 12, 2025

Reinforcement Learning Tutorial

What Is Reinforcement Learning | Types of Reinforcement Learning

simplilearn.com

Reinforcement Learning Tutorial | Reinforcement Learning Example Using Python | Edureka

YouTube - edureka!

133.7K views - Jan 10, 2019

Python Reinforcement Learning Tutorial for Beginners in 25 Minutes

YouTube - Nicholas Renotte

68.1K views - Mar 10, 2021

Reinforcement Learning with Human Feedback (RLHF) - How to train and fine-tune Transformer Models

34.8K views - Feb 12, 2024

YouTube - Luis Serrano Academy

[UCLA RL-LLM] Chapter 3.2: Reinforcement learning with verifiable rewards (RLVR)

3.8K views - 10 months ago

YouTube - Ernest Ryu

Aligning Enterprise LLMs: A Practical Guide to Reward Design and Reinforcement Learning

YouTube - AIM Media House

LLMs explained (Part 6): Smarter AI through Reinforcement Learning

[UCLA RL-LLM] Chapter 0: Course outline and prologue

13K views - 10 months ago

YouTube - Ernest Ryu

A new path for LLM fine-tuning — without gradients or Reinforcement Learning

New Course: Reinforcement Fine-Tuning LLMs with GRPO! Learn to use reinforcement learning to improve your LLM performance in this short course, built in collaboration with Predibase, and taught by Travis Addair, its Co-Founder and CTO, and Arnav Garg, its Senior Engineer and Machine Learning Lead. Reasoning models have been one of the most important developments in LLMs. Reinforcement Fine-Tuning (RFT) uses rewards to encourage LLMs to find solutions to multi-step reasoning tasks such as solving

38.8K views - May 21, 2025

Facebook - Andrew Ng

New course with Google Cloud: Reinforcement Learning from Human Feedback (RLHF)

9.9K views - Dec 13, 2023

YouTube - DeepLearningAI

Reinforcement Learning in the Era of LLMs

1.8K views - Mar 13, 2024

YouTube - Arize AI

Reinforcement learning (for llms) for beginners

5K views - 3 weeks ago

YouTube - IndividualKex

Reinforcement Learning for LLMs in 2025

15.6K views - Feb 10, 2025

YouTube - Trelis Research

Master LLM Training with Reinforcement Learning

13 views - 1 month ago

YouTube - Github Signals

I Trained an LLM to Think Deeper (Here's How)

12.6K views - Feb 24, 2025

YouTube - Adam Lucek

Get Started with Reinforcement Learning on Azure Machine Learning

Microsoft - markdefalco

Free Course: Training & Finetuning LLMs

97K views - Oct 5, 2023

YouTube - Weights & Biases

Deep Dive into LLMs like ChatGPT

6.4M views - Feb 5, 2025

YouTube - Andrej Karpathy

Why Reinforcement Learning Unlocks Reasoning in LLMs (Aha Moments Explained)

2.5K views - 4 months ago

YouTube - AI Papers Academy

ERL: Improving LLM Training via Self-Reflection

44 views - 3 months ago

YouTube - AI Research Roundup

Deep Reinforcement Learning

deepmind.google

Learning to Reason with LLMs

15.8K views - Sep 26, 2024

YouTube - Simons Institute for the Theory of Computing

Reinforcement Learning in Finance: Resources and Expert Advice from Paul Bilokon

Direct Preference Optimization (DPO) - How to fine-tune LLMs directly without reinforcement learning

33.4K views - Jun 21, 2024

YouTube - Luis Serrano Academy

What are RLVR environments for LLMs? | Policy, rollouts & rubrics explained

MSN - Deep Learning with Yacine

GRPO 2.0? DAPO LLM Reinforcement Learning Explained

6.2K views - Mar 25, 2025

YouTube - AI Papers Academy

Large Language Models explained briefly

5.9M views - Nov 20, 2024

YouTube - 3Blue1Brown

LLM: Pretraining, Instruction fine-tuning and RLHF

6.4K views - Jul 31, 2023

YouTube - YanAITalk
