The SM Toolbox has been developed by Meinard Müller, Nanzhu Jiang, Peter Grosche, and Harald G. Grohganz. It contains MATLAB implementations for computing and enhancing similarity matrices in various ways. Furthermore, the toolbox includes a number of additional tools for parsing, navigation, and visualization synchronized with audio playback. Also, it contains code for a recently proposed audio thumbnailing procedure that demonstrates the applicability and importance of enhancement concepts. The MATLAB implementations provided on this website are published under the terms of the General Public License (GPL). A general overview of the SM Toolbox is given in [1].
If you publish results obtained using these implementations, please cite [1]. For technical details, applications, or data please cite [2], [3], [4], [5], [6], [7].
The MATLAB implementations provided on this website are published under the terms of the General Public License (GPL), version 2 or later. If you publish results obtained using these implementations, please cite the references below.
Download SM Toolbox (Version 1.0. Last update: 2013-07-01): [zip]
features_to_SM.m
Computing similarity matrices based on different enhancement strategies such as tempo invariance and transposition invariance.threshSM.m
Application of different thresholding techniques.visualizeSM.m
Visualiation of similarity matrix.visualizeTransIndex.m
Visualization of transposition index matrix.makePlotPlayable.m
Synchronized playback of audio file along with a plotted figure.SSM_to_scapePlotFitness.m
Computation of fitness scape plot from a self-similarity matrix.scapePlotFitness_to_thumbnail.m
Computation of thumbnail segment from fitness scape plot.thumbnailSSM_to_pathFamily.m
Computation of induced segment family from thumbnail.visualizeScapePlot.m
Visualization of fitness scape plot.visualizePathFamilySSM.m
Visualization of self-similarity matrix and path family.visualizeSegFamily.m
Visualization of segment family.The following demo files are provided. These demo files allow you to try out the code and give you a first overview of the toolbox. The necessary audio files to run the demos are also provided by the toolbox.
demoSMtoolbox.m
Demo showing various enhancement functionalities as described in [1].demoSMtoolbox_thumbnailing.m
Demo for thumbnailing application as described in [1].demoSMtoolbox_thumbnailing_otherSettings.m
Demo for thumbnailing application with other settings for various music recordings.MATLAB-Chroma-Toolbox_2.0
of the zip-file provided above. The feature extraction step may replaced using feature extraction functions supplied by other toolboxes.The concept of similarity matrices (SMs) has been widely used for a multitude of music analysis and retrieval tasks including audio structure analysis or version identification. For such tasks, the improvement of structural properties of the similarity matrix at an early state of the processing pipeline has turned out to be of crucial importance. The SM toolbox contains MATLAB implementations for computing and enhancing similarity matrices in various ways.
MATLAB function: features_to_SM.m
Controlling parameter: paramSM.smoothLenSM
Controlling parameters: paramSM.tempoRelMin
, paramSM.tempoRelMax
, paramSM.tempoNum
Controlling parameter: paramSM.forwardBackward
Controlling parameter: paramSM.circShift
MATLAB functions and controlling parameters: threshSM.m
, paramThres.threshTechnique
, paramThres.threshValue
, paramThres.applyBinarize
, paramThres.applyScale
, paramThres.penalty
As an illustrating application, our toolbox also contains the MATLAB code for a recently proposed audio thumbnailing procedure. For this task, the goal is to find the the most representative and repetitive segment of a given audio recording. Based on a suitable self-similarity matrix, the procedure in [4] computes for each audio segment a fitness value that expresses how well the given segment explains other related segments (also called induced segments) in the audio recording. These relations are expressed by a so-called path family over the given segment. The thumbnail is then defined as the fitness-maximizing segment. Furthermore, a triangular scape plot representation is computed, which shows the fitness of all segments and yields a compact high-level view on the structural properties of the entire audio recording.
MATLAB functions: SSM_to_scapePlotFitness.m
, visualizeScapePlot.m
MATLAB function: scapePlotFitness_to_thumbnail.m
MATLAB functions: thumbnailSSM_to_pathFamily.m
, visualizePathFamilySSM.m
MATLAB function: parseAnnotationFile.m
MATLAB function: makePlotPlayable.m
@inproceedings{MuellerJG14_SMToolbox_AES, author = {Meinard M{\"u}ller and Nanzhu Jiang and Harald G. Grohganz}, title = {SM Toolbox: MATLAB Implementations for Computing and Enhancing Similarity Matrices}, booktitle = {Proceedings of 53rd Audio Engineering Society ({AES})}, address = {London, UK}, year = {2014}, }
@inproceedings{MuellerK06_EnhancingSimilarityMatrices_ICASSP, author = {Meinard M{\"u}ller and Frank Kurth}, title = {Enhancing Similarity Matrices for Music Audio Analysis}, booktitle = {Proceedings of the International Conference on Acoustics, Speech and Signal Processing ({ICASSP})}, address = {Toulouse, France}, year = {2006}, pages = {437--440}, }
@inproceedings{MuellerC07_Transposition_ISMIR, author = {Meinard M{\"u}ller and Michael Clausen}, title = {Transposition-Invariant Self-Similarity Matrices}, booktitle = {Proceedings of the International Conference on Music Information Retrieval ({ISMIR})}, address = {Vienna, Austria}, year = {2007}, pages = {47--50}, }
@article{MuellerJG13_StructureAnaylsis_IEEE-TASLP, author = {Meinard M{\"u}ller and Nanzhu Jiang and Peter Grosche}, title = {A Robust Fitness Measure for Capturing Repetitions in Music Recordings With Applications to Audio Thumbnailing}, journal = {IEEE Transactions on Audio, Speech {\&} Language Processing}, volume = {21}, number = {3}, year = {2013}, pages = {531-543}, }
@book{Mueller07_InformationRetrieval_SPRINGER, author = {Meinard M{\"u}ller}, title = {Information Retrieval for Music and Motion}, type = {Monograph}, year = {2007}, isbn = {3540740473}, publisher = {Springer Verlag} }
@inproceedings{MuellerKC05_ChromaFeatures_ISMIR, author = {Meinard M{\"u}ller and Frank Kurth and Michael Clausen}, title = {Audio Matching via Chroma-Based Statistical Features}, booktitle = {Proceedings of the 12th International Conference on Music Information Retrieval ({ISMIR})}, year = {2011}, pages = {}, }
@inproceedings{MuellerKBA11_SMD_ISMIR, author = {Meinard M{\"u}ller and Verena Konz and Wolfgang Bogler and Vlora Arifi-M{\"u}ller}, title = {Saarland Music Data ({SMD})}, booktitle = {Proceedings of the International Society for Music Information Retrieval Conference ({ISMIR}): Late Breaking session}, year = {2011}, }