Separation of monophonic music signal based on user-guided onset information

Jeongsoo Park, Kyogu Lee

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-review

Abstract

In this paper, we present a novel informed source separation (ISS) algorithm for monophonic sound mixtures based on user-guided onset information. Conventional user-guided ISS methods have explored various ways of providing information about the target sound. We consider the case where the information is given via a user-guided audio signal. Conventional algorithms require both spectral and temporal information about the target source to be separated, often provided by means of singing or humming. However, it can be difficult for an unskilled user to provide exact pitch information for the target source, which can severely degrade the performance of spectrogram decomposition algorithms. On the other hand, it is relatively easy for novice users to give onset information, for example by finger- or foot-tapping. We therefore propose an algorithm in which the user supplies only temporal information, in the form of note onsets of the target source. To this end, we use non-negative matrix factorization (NMF) in two steps. In the first step, we estimate a spectral basis of the target source using a sparsity constraint. In the second step, we estimate the corresponding temporal basis of the target source. Finally, we reconstruct the estimated target sound from the results of the two-step NMF. Experiments show that the proposed algorithm can successfully separate the target source using only the onset information from the user, provided there is no significant overlap in onsets between the target and the other sources.
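The two-step procedure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no update rules, so the Euclidean multiplicative updates, the decaying onset envelope, the placement of the sparsity penalty on the non-target activations, the soft-mask reconstruction, and all names (`onset_envelope`, `two_step_nmf`, `n_background`, `sparsity`) are assumptions made for illustration.

```python
import numpy as np


def onset_envelope(n_frames, onset_frames, decay=0.8):
    """Hypothetical activation template: a decaying pulse after each tapped onset."""
    h = np.zeros(n_frames)
    for t0 in onset_frames:
        pulse = decay ** np.arange(n_frames - t0)
        h[t0:] = np.maximum(h[t0:], pulse)
    return h


def two_step_nmf(V, onset_frames, n_background=2, sparsity=0.1, n_iter=200, seed=0):
    """Sketch of onset-guided two-step NMF on a non-negative spectrogram V (freq x time).

    Component 0 is the target; the remaining components model the other sources.
    """
    eps = 1e-9
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V.shape
    n_comp = n_background + 1
    W = rng.random((n_freq, n_comp)) + eps
    H = rng.random((n_comp, n_frames)) + eps
    H[0] = onset_envelope(n_frames, onset_frames) + eps

    # Step 1: keep the target activation fixed to the onset template and
    # learn the spectral bases. The L1 penalty (assumed placement) acts on
    # the background activations only.
    for _ in range(n_iter):
        W *= (V @ H.T) / (W @ H @ H.T + eps)
        H_new = H * (W.T @ V) / (W.T @ W @ H + sparsity + eps)
        H[1:] = H_new[1:]
    w_target = W[:, 0].copy()

    # Step 2: keep the learned target spectrum fixed and re-estimate its
    # temporal activation together with the background model.
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W_new = W * (V @ H.T) / (W @ H @ H.T + eps)
        W[:, 1:] = W_new[:, 1:]

    # Reconstruction: soft mask (target model over full model) applied to V.
    target_model = np.outer(W[:, 0], H[0])
    mask = target_model / (W @ H + eps)
    return mask * V, w_target, H[0]
```

In practice `V` would be the magnitude spectrogram of the mixture and `onset_frames` the frame indices of the user's taps; the masked magnitude would then be combined with the mixture phase and inverted to obtain a waveform.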

Original language: English
Title of host publication: 21st International Congress on Sound and Vibration 2014, ICSV 2014
Publisher: International Institute of Acoustics and Vibrations
Pages: 2483-2490
Number of pages: 8
ISBN (Electronic): 9781634392389
State: Published - 2014
Externally published: Yes
Event: 21st International Congress on Sound and Vibration 2014, ICSV 2014 - Beijing, China
Duration: 13 Jul 2014 to 17 Jul 2014

Publication series

Name: 21st International Congress on Sound and Vibration 2014, ICSV 2014
Volume: 3

Conference

Conference: 21st International Congress on Sound and Vibration 2014, ICSV 2014
Country/Territory: China
City: Beijing
Period: 13/07/14 to 17/07/14
