Quantitative measures, such as perplexity, accuracy, F1 score, or BLEU score, used to gauge how well a language model performs.
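As a rough illustration, three of the metrics named above can be computed in a few lines of plain Python. This is a minimal sketch, not a reference implementation: the function names, the binary-label setup for F1, and the toy inputs are all assumptions made for the example. BLEU is left out because it depends on n-gram matching and a brevity penalty that are better taken from an established library.

```python
import math


def accuracy(y_true, y_pred):
    """Fraction of predictions that exactly match the reference labels."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    if tp == 0:
        # No true positives: precision and recall are both zero (or undefined).
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def perplexity(token_probs):
    """Exponential of the average negative log-probability per token.

    token_probs holds the probability the model assigned to each
    actual token in the evaluated text (hypothetical values here).
    """
    avg_nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(avg_nll)


# Toy data, invented purely for illustration.
y_true = [1, 0, 1, 1, 0]
y_pred = [1, 0, 0, 1, 1]
print(accuracy(y_true, y_pred))
print(f1_score(y_true, y_pred))
print(perplexity([0.25, 0.25, 0.25, 0.25]))
```

Higher accuracy and F1 indicate better performance, while lower perplexity indicates that the model finds the evaluated text less surprising.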

Authority Links

A ChatGPT Glossary

Related Terms