Skip to content

Latest commit

 

History

History
 
 

cmu_wiki_qa

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
dataset_info license task_categories language tags pretty_name size_categories
features splits download_size dataset_size
name dtype
INSTRUCTION
string
name dtype
RESPONSE
string
name dtype
SOURCE
string
name dtype
METADATA
string
name num_bytes num_examples
train
410246
1610
105516
410246
mit
question-answering
summarization
en
Carnegie Mellon University
University of Pittsburgh
Wikipedia
Q&A
Question-Answer Dataset
1K<n<10K

Dataset Card for "cmu_wiki_qa"

A filtered / cleaned version of the http://www.cs.cmu.edu/~ark/QA-data/ Q&A dataset, which provides manually-generated factoid questions from Wikipedia articles.

Acknowledgments

These data were collected by Noah Smith, Michael Heilman, Rebecca Hwa, Shay Cohen, Kevin Gimpel, and many students at Carnegie Mellon University and the University of Pittsburgh between 2008 and 2010.

Their research project was supported by NSF IIS-0713265 (to Smith), an NSF Graduate Research Fellowship (to Heilman), NSF IIS-0712810 and IIS-0745914 (to Hwa), and Institute of Education Sciences, U.S. Department of Education R305B040063 (to Carnegie Mellon).

More Information needed