From 1d3dd0f26ac70e09b2ee0e662f6b007cf5309bfe Mon Sep 17 00:00:00 2001
From: blackbeelabs
Date: Thu, 6 Jun 2024 16:33:27 +0800
Subject: [PATCH] add benchmarks

---
 README.md | 5 +++++
 1 file changed, 5 insertions(+)

diff --git a/README.md b/README.md
index 34a7d85..fd50d58 100644
--- a/README.md
+++ b/README.md
@@ -405,6 +405,11 @@ Recommended reading order:
 
 ### Benchmarks
 
+**Tier 1**
+- ✨ [Know What You Don’t Know: Unanswerable Questions for SQuAD](https://arxiv.org/pdf/1806.03822)
+- ✨ [SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems](https://arxiv.org/pdf/1905.00537)
+- ✨ [GLUE: A multi-task benchmark and analysis platform for natural language understanding](https://arxiv.org/pdf/1804.07461)
+
 **Tier 2**
 - ✨ [GPQA: A Graduate-Level Google-Proof Q&A Benchmark](http://arxiv.org/abs/2311.12022)