logo
xLog
HomeAboutgithub stars
FeaturedShortsLatestHottestxLogWeb3AIJournalFictionCodingPodcast
RLHF
cover
cover
cover
cover
cover
cover
cover
cover
cover
cover
cover
cover
cover
cover
cover
cover
cover
cover
cover

From RL to RLHF

This article is mainly based on Umar Jamil's course ^{[1]} for learning and recording. Our goal is to align the behavior of LLMs with our…
深度学习17 min
Nagi-ovoNagi-ovo
·11 days ago
An open-source creative community written on the blockchain.
Current Block Height
9
7
,
0
3
8
,
0
6
3

Suggested creators for you

    Show more
© xLog