AI Wiki

Reinforcement Learning from Human Feedback (RLHF)

Last reviewed

Apr 26, 2026

Sources

31 citations

Review status

Source-backed

Revision

v5 ยท 8,021 words

Improve this article

Add missing citations, update stale details, or suggest a clearer explanation.

Suggest edit