Language Data

L26 - Yahoo! Answers consisting of questions asked in French, version 1.0 (3.8Gb) (Hosted on AWS)

Yahoo! Answers is a website where people post questions and answers, all of which are public to any web user willing to browse or download them. The data we have collected is a subset of the Yahoo! Answers corpus from 2006 to 2015, consisting of 1.7 million questions posed in French and their corresponding answers. We include only questions that have been resolved, that is, questions that have received one or more answers. The dataset may serve as a testbed for multilingual question answering systems as well as for research into user behavior on community question answering sites in other