All
Search
Images
Videos
Shorts
Maps
News
More
Shopping
Flights
Travel
Notebook
Report an inappropriate content
Please select one of the options below.
Not Relevant
Offensive
Adult
Child Sexual Abuse
Length
All
Short (less than 5 minutes)
Medium (5-20 minutes)
Long (more than 20 minutes)
Date
All
Past 24 hours
Past week
Past month
Past year
Resolution
All
Lower than 360p
360p or higher
480p or higher
720p or higher
1080p or higher
Source
All
Dailymotion
Vimeo
Metacafe
Hulu
VEVO
Myspace
MTV
CBS
Fox
CNN
MSN
Price
All
Free
Paid
Clear filters
SafeSearch:
Moderate
Strict
Moderate (default)
Off
Filter
substack.com
Direct Preference Optimization (DPO) explained
A Simpler Way to Fine-Tune Language Models than with RLHF
100 views
Dec 27, 2024
Direct Preference Optimization Tutorial
論文紹介:Direct Preference Optimization: Your Language Model is Secretly a Reward Model
speakerdeck.com
Aug 19, 2024
7:52
21. Direct Preference Optimization (DPO) (Rafailov et al., 2023)
YouTube
LOADING_
1 views
3 months ago
2:45
Direct Preference Optimization (DPO) Explained: AI Alignment
YouTube
VLR Software Training
7 views
2 months ago
Top videos
12:16
Direct Preference Optimization (DPO) explained + OpenAI Fine-tuning example
YouTube
Simeon Emanuilov
786 views
Dec 26, 2024
21:15
Direct Preference Optimization (DPO) - How to fine-tune LLMs directly without reinforcement learning
YouTube
Serrano.Academy
30K views
Jun 21, 2024
Mastering LLM Alignment & Preference Optimization Llama3 LLM
git.ir
Jun 27, 2024
Direct Preference Optimization Applications
14:16
Diffusion Model Alignment Using Direct Preference Optimization
bilibili
dalaska的欢愉
43 views
1 month ago
1:05
DeepLearning.AI on Instagram: "Our course recommendation of the day is “Post-training of LLMs, ” where you’ll learn how to customize pre-trained language models using Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and Online Reinforcement Learning (RL). You'll learn when to use each method, how to curate training data, and implement them in code to shape model behavior effectively. Enroll at the link in bio or comment "LLM" to receive the link in your inbox."
Instagram
deeplearningai
8.1K views
4 months ago
Direct Preference Optimization is one of the most significant advances in AI over the last six months. It provides a simpler and more efficient way to align a model's preferences. You can try out in packages like TRL. Direct Preference Optimization (DPO) - A Simplified Explanation: https://medium.com/@joaolages/direct-preference-optimization-dpo-622fc1f18707 Direct Preference Optimization: Your Language Model is Secretly a Reward Model - https://arxiv.org/pdf/2305.18290.pdf DPO Trainer: https://
TikTok
rajistics
4.8K views
Jan 26, 2024
12:16
Direct Preference Optimization (DPO) explained + OpenAI Fine-tu
…
786 views
Dec 26, 2024
YouTube
Simeon Emanuilov
21:15
Direct Preference Optimization (DPO) - How to fine-tune LLMs dir
…
30K views
Jun 21, 2024
YouTube
Serrano.Academy
Mastering LLM Alignment & Preference Optimization Llama3 L
…
Jun 27, 2024
git.ir
Direct Nash Optimization: Teaching language models to self-improve
…
Sep 3, 2024
Microsoft
8:55
Direct Preference Optimization: Your Language Model is Secretly
…
39.1K views
Dec 22, 2023
YouTube
AI Coffee Break with Letitia
Federated Fine-Tuning of Large Language Models: Kahneman-Tve
…
9 months ago
acm.org
16:57
Direct Preference Optimization (DPO) | Paper Explained
1.4K views
2 months ago
YouTube
Outlier
12:30
How does DPO improve the LLM's performance? | Simple Explanation
207 views
Jan 29, 2025
YouTube
MLWorks
論文紹介:Direct Preference Optimization: Your Language Mod
…
Aug 19, 2024
speakerdeck.com
48:46
Direct Preference Optimization (DPO) explained: Bradley-Terry m
…
33.7K views
Apr 14, 2024
YouTube
Umar Jamil
7:52
21. Direct Preference Optimization (DPO) (Rafailov et al., 2023)
1 views
3 months ago
YouTube
LOADING_
37:16
Hands-on 10: Large Language Model Alignment with Direct Prefe
…
3.7K views
7 months ago
YouTube
BrainOmega
36:25
Direct Preference Optimization (DPO): Your Language Model is S
…
19.1K views
Aug 10, 2023
YouTube
Gabriel Mongaras
1:27:21
RLHF, PPO and DPO for Large language models
3.6K views
Feb 18, 2024
YouTube
Arvind N
33:26
ORPO: Monolithic Preference Optimization without Reference M
…
25.4K views
May 1, 2024
YouTube
Yannic Kilcher
41:28
LLMs | Alignment of Language Models: Contrastive Learning | Le
…
1.6K views
Sep 26, 2024
YouTube
LCS2
58:07
Aligning LLMs with Direct Preference Optimization
34K views
Feb 8, 2024
YouTube
DeepLearningAI
7:55
[Paper Review] Direct preference optimization(DPO) : Your languag
…
8 views
5 months ago
YouTube
LOADING_
8:05
DPO : L'Alternative RLHF qui Révolutionne l'Alignement IA
26 views
2 months ago
YouTube
Deep Learner, One Step at a Time
1:06:31
UMass CS685 S24 (Advanced NLP) #12: Direct preference optimizatio
…
3K views
Mar 13, 2024
YouTube
Mohit Iyyer
53:03
DPO - Part1 - Direct Preference Optimization Paper Explanation |
…
2K views
Aug 12, 2023
YouTube
Neural Hacks with Vasanth
【勉強メモ】直接優先最適化 (DPO): 言語モデルは密かに報酬モデルで
…
Aug 11, 2023
note(ノート)
だいち
Direct Preference Optimization is one of the most significant advanc
…
4.8K views
Jan 26, 2024
TikTok
rajistics
31:18
VPX研讨会 21 | Direct Preference Optimization 论文讲解
352 views
May 26, 2024
bilibili
VPX_Lab
31:05
Direct Preference Optimization Your Language Model is Secretly a Rew
…
953 views
Jun 20, 2023
bilibili
mardinff
59:40
Direct Preference Optimization (DPO) in 1 hour
2.1K views
5 months ago
YouTube
Zachary Huang
42:49
Direct Preference Optimization (DPO)
7.3K views
Nov 13, 2023
YouTube
Trelis Research
40:55
Fast Fine Tuning and DPO Training of LLMs using Unsloth
5.8K views
Mar 25, 2024
YouTube
AI Anytime
2:45
Direct Preference Optimization (DPO) Explained: AI Alignment
7 views
2 months ago
YouTube
VLR Software Training
See more videos
More like this
Feedback