domain,metric,value,samples code,pass@1,0.73,164 math,exact_match,0.42,500 translation,bleu,28.5,500 data_to_text,rouge_l,0.65,500