From 3563e59abe6366981ae5eb7c3333efcd64225d04 Mon Sep 17 00:00:00 2001 From: Naman Goyal Date: Fri, 9 Aug 2019 09:43:16 -0700 Subject: [PATCH] added superglue dev set results to readme Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/815 Differential Revision: D16733633 fbshipit-source-id: 0a5029e41b6dbb9fb28e9703ad057d939d489d90 --- examples/roberta/README.md | 7 +++++++ 1 file changed, 7 insertions(+) diff --git a/examples/roberta/README.md b/examples/roberta/README.md index 537c55f3fa..5f3be7941d 100644 --- a/examples/roberta/README.md +++ b/examples/roberta/README.md @@ -24,6 +24,13 @@ Model | MNLI | QNLI | QQP | RTE | SST-2 | MRPC | CoLA | STS-B `roberta.large` | 90.2 | 94.7 | 92.2 | 86.6 | 96.4 | 90.9 | 68.0 | 92.4 `roberta.large.mnli` | 90.2 | - | - | - | - | - | - | - + +##### Results on SuperGLUE tasks (dev set, single model, single-task finetuning) + +Model | BoolQ | CB | COPA | MultiRC | RTE | WiC | WSC +---|---|---|---|---|---|---|--- +`roberta.large` | 86.9 | 98.2 | 94.0 | 85.7 | 89.5 | 75.6 | 91.3 + ##### Results on SQuAD (dev set) Model | SQuAD 1.1 EM/F1 | SQuAD 2.0 EM/F1