Multimedia Tools and Applications - Experiments results for paper "Lightly supervised alignment of subtitles on multigenre broadcasts"

dataset

posted on 2016-10-05, 12:06 authored by Oscar Saz TorralbaOscar Saz Torralba, Thomas HainThomas Hain, Salil Deena, Mortaza Doulaty BashkandMortaza Doulaty Bashkand, Bilal KhaliqBilal Khaliq, Wai Man NgWai Man Ng, Rosanna MilnerRosanna Milner, Madina HasanMadina Hasan, Julia Olcoz Martinez

The files in the dataset correspond to results that have been generated for the Multimedia Tools and Applications (Springer ISSN: 1380-7501 / 1573-7721) article: "Lightly supervised alignment of subtitles on multigenre broadcasts".

The files in the zip file are of three types:
- .ctm, which correspond to the output of the automatic speech recognition system or lightly supervised alignment system.
- .rttm, which correspond to the output of the speech segmentation system.
- .sys, which correspond to scoring of the speech segmentation, automatic speech recognition or lightly supervised alignment system.

The following is a description about the naming convention of the files:

TableX-LineY-[ser|wer|f1]: This is the output and scoring results corresponding to Line Y of Table X in the article in terms of SER, WER or F1 score.

All three file types are standard outputs that are recognised by the speech technology community and can be opened using any text editor.

Funding

EPSRC Programme Grant EP/I031022/1 (Natural Speech Technology)

History

Ethics

There is no personal data or any that requires ethical approval

Policy

The data complies with the institution and funders' policies on access and sharing

Sharing and access restrictions

The data can be shared openly

Data description

The file formats are open or commonly used

Methodology, headings and units

Headings and units are explained in the files

Usage metrics

Keywords

Multimedia Data Lightly Supervised Alignment Natural Language Processing Artificial Intelligence and Image Processing Signal Processing

Licence

CC BY 4.0

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM