PLOS comment data
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
This data set consists of all comments (and associated metadata) left on PLOS articles since the launch of the very first PLOS journal (PLOS Biology) in 2003. The data set provded by PLOS included article ID, comment ID, the title of the comment, the comment itself, and the date and time the comment was created. PLOS required that the ‘display name’ (the username created by the individual making the comment) associated with comments be anonymised, and each comment author’s ‘display name’ was therefore converted by them to a unique numeric ID. These data were supplemented in a number of ways. In some cases, additional data (e.g. article title and publication date) were collected using a web scraping tool, while in others, information could be extracted or inferred from the original PLOS data (e.g. publishing journal, and whether the comment was original or a reply to an existing comment).Full details of the data set and how it was derived can be found in the accompanying "Guide to data".