\begin{abstract}
This paper describes a versatile method
that accelerates multichannel source separation methods
based on full-rank spatial modeling.
A popular approach to multichannel source separation
is to integrate a spatial model with a source model
for estimating the spatial covariance matrices (SCMs)
and power spectral densities (PSDs) of each sound source
in the time-frequency domain.
One of the most successful examples of this approach
is multichannel nonnegative matrix factorization (MNMF)
based on a full-rank spatial model and a low-rank source model.
MNMF, however, is computationally expensive and often performs poorly
because the unconstrained full-rank SCMs are difficult to estimate.
Instead of restricting the SCMs to rank-1 matrices,
at the cost of a severe loss of spatial modeling ability
as in independent low-rank matrix analysis (ILRMA),
we restrict the SCMs of each frequency bin
to jointly diagonalizable but still full-rank matrices.
For such a fast version of MNMF,
we propose a computationally efficient
and convergence-guaranteed algorithm
that is similar in form to that of ILRMA.
Similarly,
we propose a fast version of a state-of-the-art speech enhancement method
based on a deep speech model and a low-rank noise model.
Experimental results showed that
the fast versions of MNMF and the deep speech enhancement method
were several times faster than, and performed even better than,
their respective original versions.
\end{abstract}
