This is the accompanying website for the following paper:
@article{KrauseMW21_TowardsLeitmotifDetection_TISMIR, title = {Towards Leitmotif Activity Detection in Opera Recordings}, author = {Michael Krause and Meinard M{\"u}ller and Christof Wei{\ss}}, journal = {Transactions of the International Society for Music Information Retrieval ({TISMIR})}, volume = {4}, number = {1}, year = {2021}, pages = {127--140}, doi = {10.5334/tismir.116}, url = {https://transactions.ismir.net/articles/10.5334/tismir.116/}, url-pdf = {2021_KrauseMW_LeitmotifDetection_TISMIR_ePrint.pdf}, url-details = {https://www.audiolabs-erlangen.de/resources/MIR/2021-TISMIR-TowardsLeitmotifDetection} }
This paper approaches the automatic detection of musical patterns in audio recordings with a particular focus on leitmotifs, which are specific types of patterns associated with certain characters, places, items, or feelings occurring in an opera or movie soundtrack. The detection of such leitmotifs is particularly challenging since their appearance can change substantially over the course of a musical work. In our case study, we consider a self-contained yet comprehensive scenario comprising 16 recorded performances of Richard Wagner’s four-opera cycle Der Ring des Nibelungen, which is a prime example for the use of leitmotifs. Within this scenario, we introduce and formalize the novel task of leitmotif activity detection. Based on a dataset of 200 hours of audio with over 50000 annotated leitmotif instances, we explore the benefits and limitations of deep-learning techniques for detecting leitmotifs. To this end, we adapt two common deep-learning strategies based on recurrent and convolutional neural networks, respectively. To investigate the robustness of the trained systems, we test their sensitivity to different modifications of the input. We find that our deep-learning systems work well in general but capture confounding factors, such as pitch distributions in leitmotif regions, instead of characteristic musical properties, such as rhythm and melody. Thus, our in-depth analysis demonstrates some challenges that may arise from applying deep-learning approaches for detecting complex musical patterns in audio recordings.
The occurrence and instance positions of the twenty leitmotifs considered in this paper are made publicly available as a dataset for further research.
This dataset constitutes a strict superset of the data found on this page.
In this dataset, we use the following naming scheme to refer to different parts of the Ring:
Name | Description |
---|---|
A | Das Rheingold |
B-1 | Die Walküre, Act 1 |
B-2 | Die Walküre, Act 2 |
B-3 | Die Walküre, Act 2 |
C-1 | Siegfried, Act 1 |
C-2 | Siegfried, Act 2 |
C-3 | Siegfried, Act 3 |
D-0 | Götterdämmerung, Vorspiel |
D-1 | Götterdämmerung, Act 1 |
D-2 | Götterdämmerung, Act 2 |
D-3 | Götterdämmerung, Act 3 |
Occurrence positions are found in the .csv-files in the "Occurrences" subfolder of the zip-archive. For example, "Occurrences/B-2.csv" contains all motif occurrences in Die Walküre, Act 2. In these files, each line corresponds to a motif occurrence. Start and end positions are given in measures. For example, the line
Ring;778.5;780.25
in "Occurrences/B-2.csv" signifies that there is an occurrence of the Ring motif starting at 778.5 (a half measure after 778) and ending at 780.25 (a quarter measure after measure 780). Measure numbers correspond to the piano score from Richard Kleinmichel, available at IMSLP.
Instance positions are found in the .csv-files inside the 16 directories of the "Instance" subfolder of the zip-archive. For example, "Instances/P-Ba/B-2.csv" contains all motif instances in the Daniel Barenboim performance of Die Walküre, Act 2. In these files, each line corresponds to a motif instance. Start and end positions are given in seconds (for this, all CD tracks for a particular performance of an act have been cut and concatenated to form one continuous audio file for that act). For example, the line
Ring;2130.6;2137.2
in "Instances/P-Ba/B-2.csv" signifies that there is an instance of the Ring motif starting at second 2130.6 and ending at 2137.2.
A table of the performances used in this study, including ID, conductors, years of recording and length, can be found in the paper. The following table provides helpful information for identifying the exact CD releases:
ID in Paper | Label | Year of release | Conductor, Orchestra, Choir |
---|---|---|---|
P-Ba | WC | 2009 | Daniel Barenboim, Chor und Orchester der Bayreuther Festspiele |
P-Ha | EMI | 2008 | Bernard Haitink, Symphonieorchester und Chor des Bayrischen Rundfunks |
P-Ka | DG | 1998 | Herbert von Karajan, Berliner Philharmoniker, Chor der Deutschen Oper Berlin |
P-Sa | EMI | 2012 | Wolfgang Sawallisch, Bayrisches Staatsorchester, Chor der Bayrischen Staatsoper |
P-So | DECCA | 2012 | Georg Solti, Wiener Staatsopernchor, Wiener Philharmoniker |
P-We | OEHMS | 2013 | Sebastian Weigle, Frankfurter Opern- und Museumsorchester, Chor und Herren des Extrachores der Oper Frankfurt |
P-Bo | PHILIPS | 2006 | Pierre Boulez, Chor und Orchester der Bayreuther Festspiele |
P-Bö | DECCA | 2008 | Karl Böhm, Chor und Orchester der Bayreuther Festspiele |
P-Fu | EMI | 2011 | Wilhelm Furtwängler, Orchestra Sinfonica della Radio Italiana, Coro della Radio Italiana |
P-Ja | SONY | 2012 | Marek Janowski, Staatskapelle Dresden, Männer des Staatsopernchores Leipzig, Staatsopernchor Dresden |
P-Ke | ZYX | 2012 | Joseph Keilberth, Chor und Orchester der Bayreuther Festspiele; Wilhelm Furtwängler, Wiener Philharmoniker |
P-Kr | ORFEO | 2010 | Clemens Krauss, Chor und Orchester der Bayreuther Festspiele |
P-Le | DG | 2012 | James Levine, The Metropolitan Opera Orchestra, The Metropolitan Opera Chorus |
P-Ne | MEMBRAN | 1995 | Günther Neuhold, Badische Staatskapelle, Badischer Staatsopernchor |
P-Sw | PROFIL | 2013 | Hans Swarowsky, Grosses Symphonieorchester mit Mitgliedern der Tschechischen Philharmonie und des Orchesters des Nationaltheaters Prag, Chor der Wiener Volksoper |
P-Th | DG | 2013 | Christian Thielemann, Chor und Zusatzchor der Wiener Staatsoper, Orchester der Wiener Staatsoper, Bühnenorchester der Wiener Staatsoper |
Thus, the performance by Wolfgang Sawallisch (P-Sa) was released by EMI in 2012.
We thank Julia Zalkow and Vlora Arifi-Müller for their assistance in preparing the data. This work was supported by the German Research Foundation (DFG MU 2686/7-2). The International Audio Laboratories Erlangen are a joint institution of the Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU) and Fraunhofer Institut für Integrierte Schaltungen (IIS).
@incollection{Wagner66_MusicDrama_BOOK, author = {Richard Wagner}, title = {On the Application of Music to the Drama}, booktitle = {Prose Works}, year = {1966}, translator = {William A. Ellis}, publisher = {Broude Brothers, New York}, pages = {175--191} }
@book{Bribitzer15_Leitmotif_BOOK, author = {Matthew Bribitzer-Stull}, title = {Understanding the Leitmotif}, publisher = {Cambridge University Press}, year = {2015} }
@inproceedings{ZalkowWM17_WagnerHarmony_ISMIR, author = {Frank Zalkow and Christof Wei{\ss} and Meinard M{\"u}ller}, title = {Exploring Tonal-Dramatic Relationships in {R}ichard {W}agner's {R}ing Cycle}, booktitle = {Proceedings of the International Conference on Music Information Retrieval ({ISMIR})}, pages = {642--648}, address = {Suzhou, China}, year = {2017}, url-pdf = {http://ismir2017.smcnus.org/wp-content/uploads/2017/10/132_Paper.pdf} }
@inproceedings{ZalkowWPAM17_MeasureTransfer_AES, author = {Frank Zalkow and Christof Wei{\ss} and Thomas Pr{\"a}tzlich and Vlora Arifi-M{\"u}ller and Meinard M{\"u}ller}, title = {A Multi-Version Approach for Transferring Measure Annotations Between Music Recordings}, booktitle = {Proceedings of the {AES} International Conference on Semantic Audio}, pages = {148--155}, address = {Erlangen, Germany}, year = {2017} }
@book{Burghold13_Leitmotive_Schott, title = {{D}er {R}ing des {N}ibelungen. {V}ollständiger {T}ext mit {N}otentafeln der {L}eitmotive}, author = {Richard Wagner}, editor = {Julius Burghold}, address = {Mainz}, publisher = {Schott Music}, note = {Reprint of the original edition from 1913 (Ed. Julius Burghold)}, year = {2013} }