A fine-tuning technique used in ChatGPT in which the model generates multiple candidate responses to the same prompt, and the candidates are then ranked to select the best one.
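The generate-then-rank step can be illustrated with a minimal sketch. It is not ChatGPT's actual implementation: the names `rank_responses`, `select_best`, and `toy_score` are hypothetical, the candidate responses are hard-coded rather than sampled from a model, and the scoring function is a toy stand-in for whatever ranking signal is really used (typically human raters or a learned scoring model).

```python
from typing import Callable, List, Tuple


def rank_responses(
    responses: List[str], score: Callable[[str], float]
) -> List[Tuple[str, float]]:
    """Return (response, score) pairs sorted from highest to lowest score."""
    scored = [(response, score(response)) for response in responses]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


def select_best(responses: List[str], score: Callable[[str], float]) -> str:
    """Return the top-ranked response."""
    if not responses:
        raise ValueError("at least one candidate response is required")
    return rank_responses(responses, score)[0][0]


def toy_score(response: str) -> float:
    # Hypothetical scorer (an assumption for illustration only):
    # counts distinct whitespace-separated tokens, ignoring case.
    return float(len(set(response.lower().split())))


# Hard-coded candidates standing in for several sampled model outputs.
candidates = [
    "Paris.",
    "The capital of France is Paris.",
    "France is in Europe.",
]

best = select_best(candidates, toy_score)
print(best)
```

In a fine-tuning setting, the selected or ranked responses would then serve as training signal; the sketch stops at the selection step, which is all the definition above describes.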
