The University of Sheffield
Table A4.docx (1.02 MB)

MatSciBERT software results when applied to a magnetic paper corpus

posted on 2023-01-10, 08:32, authored by Nicola Morley


This document contains the analysis of the results obtained when the MatSciBERT software was applied to the magnetic paper corpus, as part of the "Feasibility study to assess Natural Language Processing for Functional Magnetic Materials" project funded by Royce Materials 4.0. Because MatSciBERT assigns a label to every word within the abstract, its output is not as simple as that of the other two data sets: the raw data output is hundreds of lines long. The raw data output for the first corpus is therefore found here.

The analysed data is given in the document, along with the highlighted expected results and a comment on how well the software worked.
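
To illustrate why per-word labelling produces such long raw output, and how it can be condensed for analysis, the following is a minimal sketch rather than the procedure used in this study. It assumes the raw output is a sequence of (word, label) pairs using a BIO-style tagging scheme; the label names (`B-MAT`, `I-MAT`, `B-PRO`, `I-PRO`, `O`), the example sentence, and the helper `collapse_bio` are all hypothetical and do not come from the deposited files.

```python
# Hypothetical per-word output: one (word, label) pair per raw output line.
# Label names are illustrative assumptions, not taken from the data set.
tagged = [
    ("The", "O"),
    ("Fe", "B-MAT"),
    ("-", "I-MAT"),
    ("Co", "I-MAT"),
    ("film", "O"),
    ("has", "O"),
    ("high", "O"),
    ("saturation", "B-PRO"),
    ("magnetisation", "I-PRO"),
    (".", "O"),
]


def collapse_bio(pairs):
    """Merge consecutive B-/I- tagged words into (entity_text, entity_type) spans.

    Words tagged "O" and stray I- tags that do not continue an open
    entity of the same type are dropped.
    """
    entities = []
    current_words, current_type = [], None
    for word, label in pairs:
        if label.startswith("B-"):
            # A new entity starts: store any entity that was still open.
            if current_words:
                entities.append((" ".join(current_words), current_type))
            current_words, current_type = [word], label[2:]
        elif label.startswith("I-") and current_type == label[2:]:
            # Continuation of the currently open entity.
            current_words.append(word)
        else:
            # Outside any entity: close whatever was open.
            if current_words:
                entities.append((" ".join(current_words), current_type))
            current_words, current_type = [], None
    if current_words:
        entities.append((" ".join(current_words), current_type))
    return entities


entities = collapse_bio(tagged)
for text, kind in entities:
    print(f"{kind}\t{text}")
```

Under these assumptions, ten labelled words reduce to two entity spans, which is the kind of condensed view that can be compared against the expected results for each abstract.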



  • There is no personal data or any that requires ethical approval


  • The data complies with the institution's and funders' policies on access and sharing

Sharing and access restrictions

  • The data can be shared openly

Data description

  • The file formats are open or commonly used

Methodology, headings and units

  • Headings and units are explained in the files


    Department of Materials Science and Engineering