Last week, we released an interview with Nathan Lambert (Research Scientist at Hugging Face) on RLHF and LLM evaluations. It has already become one of our most-watched interviews, and it’s clearly a topic that many people are interested in. I recommend watching the whole interview, but I also wanted to share some takeaways from the conversation.
Perfect timing. What if RLCF truly captured human preferences?