Tags: tensorflow machine-learning deep-learning stanford-nlp reinforcement-learning
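The SQuAD Challenge ranks results by F1 and EM scores. There is plenty of information about the F1 score (a function of precision and recall). But what is the EM score?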
Answer 0 (score: 1)
Exact match. This metric measures the percentage of predictions that match any one of the ground truth answers exactly.
According to here.
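To make the definition concrete, here is a minimal Python sketch of how EM could be computed. It is not the official evaluation script. The function names (`normalize_answer`, `exact_match`, `em_score`) and the sample data are made up for illustration, and the normalization step (lowercasing, stripping punctuation and the articles a/an/the, collapsing whitespace) is an assumption modeled on what the SQuAD evaluation is generally understood to do before comparing strings.

```python
import re
import string


def normalize_answer(text):
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: this approximates the normalization applied before comparison.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, ground_truths):
    """Return 1 if the prediction equals ANY reference answer after normalization, else 0."""
    pred = normalize_answer(prediction)
    return int(any(pred == normalize_answer(gt) for gt in ground_truths))


def em_score(predictions, references):
    """Percentage of questions whose prediction exactly matches one of its references."""
    hits = sum(exact_match(p, refs) for p, refs in zip(predictions, references))
    return 100.0 * hits / len(predictions)


# Hypothetical data: one prediction per question, one or more references each.
predictions = ["The Eiffel Tower", "in 1990", "Paris, France"]
references = [["Eiffel Tower"], ["1989", "1989 AD"], ["Paris, France", "Paris"]]
print(em_score(predictions, references))
```

In this sample, the first and third predictions match a reference after normalization and the second does not, so two of the three questions count as exact matches. Unlike F1, EM gives no partial credit: a prediction that overlaps a reference but is not identical to it scores 0 for that question.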