Language Data

L27 - Yahoo Answers Factoids Queries, version 1.0 (3.5MB)

The dataset includes English queries that were submitted to a search engine in 2012-2014 and identified as "factoid" queries, i.e., queries that refer to a short fact (filtered by requiring the answer to be no longer than 3 words). These queries were identified based on English questions on Yahoo Answers that have a short best answer and a link to English Wikipedia. Each entry includes the query, its corresponding question title, the best answer, a number indicating the occurrence frequency of the query, the link(s) to English Wikipedia, and the URL of the Yahoo Answers page.
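
As a rough illustration of the fields listed above, the sketch below models one entry and the 3-word answer filter in Python. The on-disk format of the dataset is not described here, so the class name, field names, and whitespace-based word counting are all assumptions made for illustration, not the dataset's actual schema or the filtering method used to build it.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class FactoidQuery:
    """One dataset entry. Field names are hypothetical, chosen to mirror the description."""
    query: str                  # the search-engine query
    question_title: str         # title of the matching Yahoo Answers question
    best_answer: str            # the short best answer
    frequency: int              # occurrence frequency of the query
    wikipedia_links: List[str] = field(default_factory=list)  # link(s) to English Wikipedia
    answers_url: str = ""       # URL of the Yahoo Answers page


def is_short_answer(answer: str, max_words: int = 3) -> bool:
    """Return True if the answer has between 1 and max_words words.

    Assumption: words are separated by whitespace. The tokenization
    actually used for the dataset is not specified.
    """
    return 0 < len(answer.split()) <= max_words
```

With these definitions, `is_short_answer("one two three")` holds while a four-word answer is rejected, which matches the "no longer than 3 words" criterion under the whitespace assumption.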