#evaluation #multi-metric #llm
FLASK defines four primary abilities which are divided into 12 fine-grained skills to evaluate the performance of language models comprehensively.
#evaluation #multi-metric #llm
FLASK defines four primary abilities which are divided into 12 fine-grained skills to evaluate the performance of language models comprehensively.