AI Wiki

Direct Preference Optimization (DPO)

Last reviewed: Jun 10, 2026

Sources: 14 citations

Review status: Source-backed

Revision: v6 · 3,592 words
