RLHF

From ACT Wiki

Latest revision as of 15:02, 21 April 2023

Information technology - software - natural language processing - artificial intelligence - chatbots - training.

Reinforcement Learning from Human Feedback (RLHF) is a machine learning training process in which human feedback on a model's outputs is used to guide its further training.
