8e135d4361b24d8b.tex
1: \begin{abstract}
2: This paper describes a versatile method 
3:  that accelerates multichannel source separation methods 
4:  based on full-rank spatial modeling.
5: A popular approach to multichannel source separation
6:  is to integrate a spatial model with a source model
7:  for estimating the spatial covariance matrices (SCMs) 
8:  and power spectral densities (PSDs) of each sound source
9:  in the time-frequency domain.
10: One of the most successful examples of this approach
11:  is multichannel nonnegative matrix factorization (MNMF) 
12:  based on a full-rank spatial model and a low-rank source model.
13: MNMF, however, is computationally expensive and often works poorly 
14:  due to the difficulty of estimating the unconstrained full-rank SCMs.
15: Instead of restricting the SCMs to rank-1 matrices
16:  with the severe loss of the spatial modeling ability
17:  as in independent low-rank matrix analysis (ILRMA),
18:  we restrict the SCMs of each frequency bin 
19:  to jointly-diagonalizable but still full-rank matrices.
20: For such a fast version of MNMF,
21:  we propose a computationally-efficient 
22:  and convergence-guaranteed algorithm
23:  that is similar in form to that of ILRMA.
24: Similarly,
25:  we propose a fast version of a state-of-the-art speech enhancement method
26:  based on a deep speech model and a low-rank noise model.
27: Experimental results showed that 
28:  the fast versions of MNMF and the deep speech enhancement method
29:  were several times faster and performed even better 
30:  than the original versions of those methods, respectively.
31: \end{abstract}
32: