Balazs, P., Tauböck, G., Rajbamshi, S., & Holighaus, N. (2022). Audio Inpainting. In Proceedings of 24th International Congress on Acoustics - ICA 2022 (pp. 258–261). http://hdl.handle.net/20.500.12708/176529
Proceedings of 24th International Congress on Acoustics - ICA 2022
-
Volume:
A16: Numerical, Computational and Theoretical Acoustics
-
Date (published):
Oct-2022
-
Event name:
24th International Congress on Acoustics - ICA 2022
en
Event date:
24-Oct-2022 - 28-Oct-2022
-
Event place:
Austria
-
Number of Pages:
4
-
Keywords:
Audio Inpainting; Time-Frequency; Gabor Dictionary; Compressive Sensing; Basis Optimization; Convex Optimization; Alternating Direction Method of Multipliers (ADMM)
en
Abstract:
The objective of audio inpainting is to conceal missing or heavily distorted segments in an audio signal. This is ideally done by reconstructing the original signal or, at least, by inferring a meaningful surrogate signal. We present here a novel approach applying sparse modeling in the time-frequency (TF) domain which was recently published.
In particular, we devise a dictionary learning technique which learns the dictionary from reliable parts around the gap with the goal to obtain a signal representation with improved TF sparsity. The presented method is based on a basis optimization technique that deforms a given Gabor frame such that the sparsity of the analysis coefficients of the resulting frame is maximized (with respect to a certain sparsity measure). Furthermore, we modify the SParse Audio INpainter (SPAIN) such that it is able to exploit the improved TF sparsity and—in turn—benefits from dictionary learning. As an alternative, we combine dictionary learning with weighted ℓ1-minimization adapted for audio inpainting to compensate for the loss of energy within the gap after restoration. Our experiments demonstrate that the developed methods achieve significant gains in terms of signal-to-distortion
ratio (SDR) and objective difference grade (ODG) compared with several state-of-the-art audio inpainting techniques.
en
Project title:
Unendlich-dimensionale Signalverarbeitungs-Methoden für akustische Anwendungen: MA16-053 (WWTF Wiener Wissenschafts-, Forschu und Technologiefonds)