Data/Code

Schubert Winterreise Dataset (SWD)

Dataset on Zenodo

The Schubert Winterreise Dataset (SWD) is a multimodal dataset comprising various representations and annotations of Franz Schubert's 24-song cycle Winterreise [1]. The primary material (raw data) consists of textual representations of the songs' lyrics, music scores in image, symbolic, and MIDI format, as well as audio recordings of nine performances. The secondary material (annotations) comprises information of musical measure positions in sheet music images and audio recordings as well as analyses of chords, local keys, global keys, and structural parts.

Dagstuhl ChoirSet: A Multitrack Dataset for MIR Research on Choral Singing

Web-based Interface

Dagstuhl ChoirSet (DCS) is a multitrack dataset of a cappella choral music designed to support MIR research on choral singing [2]. The dataset includes recordings of an amateur vocal ensemble performing two choir pieces in full choir and quartet settings. The audio data was recorded during an MIR seminar at Schloss Dagstuhl using different close-up microphones to capture the individual singers' voices.

Cross-Era Dataset

Accompanying Website

The Cross-Era Dataset was designed for studying the analysis and classification of Western classical music recordings. It is compiled from commercial audio recordings, totalling 2000 tracks [3]. We provide annotations including composer- and piece-specific information as well as global key labels for the 1200 pieces of the Baroque, Classical, and Romantic periods. Furthermore, chroma-based audio features and automatically computed chord labels are available.

Cross-Composer Dataset

Accompanying Website

The Cross-Composer Dataset was designed for studying the composer identification task for Western classical music recordings [4]. It is compiled from commercial audio recordings, totalling 1100 tracks. We provide annotations including composer- and piece-specific information as well as album information. Furthermore, chroma-based audio features and automatically computed chord labels are available.


Accompanying Papers

  1. Christof Weiß, Frank Zalkow, Vlora Arifi-Müller, Harald Grohganz, Hendrik Vincent Koops, Anja Volk, and Meinard Müller
    Schubert Winterreise Dataset: A Multimodal Scenario for Music Analysis
    ACM Journal on Computing and Cultural Heritage (JOCCH), 2020, in press. Details
    @article{WeissZAGKVM20_WinterreiseDataset_ACM-JOCCH,
    author  = {Christof Wei{\ss} and Frank Zalkow and Vlora Arifi-M{\"u}ller and Harald Grohganz and Hendrik Vincent Koops and Anja Volk and Meinard M{\"u}ller},
    title   = {{S}chubert {W}interreise Dataset: A Multimodal Scenario for Music Analysis},
    journal = {{ACM} Journal on Computing and Cultural Heritage ({JOCCH})},
    volume  = {},
    number  = {},
    pages   = {},
    year    = {2020, in press},
    url-details = {https://doi.org/10.5281/zenodo.4122060}
    }
  2. Sebastian Rosenzweig, Helena Cuesta, Christof Weiß, Frank Scherbaum, Emilia Gómez and Meinard Müller
    Dagstuhl ChoirSet: A Multitrack Dataset for MIR Research on Choral Singing
    Transactions of the International Society for Music Information Retrieval (TISMIR), 3(1): 98–110, 2020. PDF Details DOI
    @article{RosenzweigCWSGM20_DCS_TISMIR,
    title     = {Dagstuhl ChoirSet: A Multitrack Dataset for MIR Research on Choral Singing},
    author    = {Sebastian Rosenzweig, Helena Cuesta, Christof Wei{\ss}, Frank Scherbaum, Emilia G{\'o}mez and Meinard M{\"u}ller},
    journal   = {Transactions of the International Society for Music Information Retrieval ({TISMIR})},
    volume    = {3},
    number    = {1},
    year      = {2020},
    pages     = {98--110},
    doi       = {10.5334/tismir.48},
    url       = {http://doi.org/10.5334/tismir.48},
    url-pdf   = {https://transactions.ismir.net/articles/10.5334/tismir.48/galley/50/download/},
    url-details = {https://doi.org/10.5281/zenodo.3956666},
    publisher = {Ubiquity Press}
    }
  3. Christof Weiß, Matthias Mauch, Simon Dixon, and Meinard Müller
    Investigating Style Evolution of Western Classical Music: A Computational Approach
    Musicae Scientiae, 23(4): 486–507, 2019. PDF Details DOI
    @article{WeissMDM19_StyleEvolution_MusicaeScientiae,
    author  = {Christof Wei{\ss} and Matthias Mauch and Simon Dixon and Meinard M{\"u}ller},
    title   = {Investigating Style Evolution of {W}estern Classical Music: A Computational Approach},
    journal = {Musicae Scientiae},
    volume  = {23},
    number  = {4},
    pages   = {486--507},
    year    = {2019},
    doi     = {10.1177/1029864918757595},
    url-pdf = {https://doi.org/10.1177/1029864918757595},
    url-details = {https://www.audiolabs-erlangen.de/resources/MIR/cross-era/}
    }
  4. Christof Weiß
    Computational Methods for Tonality-Based Style Analysis of Classical Music Audio Recordings
    PhD Thesis, Ilmenau University of Technology, 2017. PDF
    @PhdThesis{Weiss17_StyleAnalysis_PhD,
    author  = {Christof Wei{\ss}},
    title   = {Computational Methods for Tonality-Based Style Analysis of Classical Music Audio Recordings},
    school  = {Ilmenau University of Technology},
    address = {Ilmenau, Germany},
    year    = {2017},
    url     = {http://www.db-thueringen.de/receive/dbt_mods_00032890},
    url-pdf = {http://www.db-thueringen.de/servlets/MCRFileNodeServlet/dbt_derivate_00039054/ilm1-2017000293.pdf}
    }