RLHF: Difference between revisions
From ACT Wiki
Jump to navigationJump to search
imported>Doug Williamson (Create page. Sources: Linked pages.) |
imported>Doug Williamson (Add link.) |
||
Line 12: | Line 12: | ||
* [[Machine learning]] | * [[Machine learning]] | ||
* [[Natural language]] | * [[Natural language]] | ||
* [[Natural language processing]] | * [[Natural language processing]] (NLP) | ||
* [[Reinforcement Learning from Human Feedback]] | * [[Reinforcement Learning from Human Feedback]] | ||
* [[Software]] | |||
[[Category:The_business_context]] | [[Category:The_business_context]] |
Latest revision as of 15:02, 21 April 2023
Information technology - software - natural language processing - artificial intelligence - chatbots - training.
Reinforcement Learning from Human Feedback, a training process for machine learning.