@songying
2018-10-20T15:29:19.000000Z
Reading Comprehension Datasets
deep-learning
Cloze-style
- CNN/Daily Mail: Teaching machines to read and comprehend
- Children’s Book Test (CBT): The Goldilocks principle: Reading children’s books with explicit memory representations
- Who Did What: Who did what: A large-scale person-centered cloze dataset
- LAMBADA: The LAMBADA dataset: Word prediction requiring a broad discourse context
- CLOTH: Large-scale Cloze Test Dataset Created by Teachers
Question-answering style
- SQuAD: 100,000+ questions for machine comprehension of text
- TriviaQA: A large scale distantly supervised challenge dataset for reading comprehension
- NewsQA: A machine comprehension dataset
- MS MARCO: A human generated machine reading comprehension dataset
Others
- RACE: Large-scale reading comprehension dataset from examinations