Back to Reinforcement Learning from Human Feedback
Learner Reviews & Feedback for Reinforcement Learning from Human Feedback by DeepLearning.AI
31 ratings
About the Course
Large language models (LLMs) are trained on human-generated text, but additional methods are needed to align an LLM with human values and preferences.
Reinforcement Learning from Human Feedback (RLHF) is currently the main method for aligning LLMs with human values and preferences. RLHF is also used for further tuning a base LLM to align with values and preferences that are specific to your use case.
In this course, you will gain a conceptual understanding of the RLHF training process, and then practice applying RLHF to tune an LLM. You will:
1. Explore the two datasets that are used in RLHF training: the 鈥減reference鈥 and 鈥減rompt鈥 datasets.
2. Use the open source Google Cloud Pipeline Components Library, to fine-tune the Llama 2 model with RLHF.
3. Assess the tuned LLM against the original base model by comparing loss curves and using the 鈥淪ide-by-Side (SxS)鈥 method.
Top reviews
Filter by:
1 - 6 of 6 Reviews for Reinforcement Learning from Human Feedback
By Ahmad A
鈥Jun 19, 2025
better to be expanded a bit, but overall, it is super course
By Neil L
鈥Aug 18, 2025
Very nice overview about how RLHF works.
By sajjad s
鈥May 14, 2025
great
By Fady A S
鈥Dec 12, 2024
The content is amazing, the instructor is great and the flow is well structured. I did learn a lot, however, I wish the notebooks were structured so that I can write some of the code on my own as opposed to everything being ready and already verified working.
By Manideep R E
鈥Jan 12, 2025
Overall worth a shot. Not in depth but good overview
By Alessandro V
鈥Aug 28, 2024
andrew is always a guarantor