abstract:8e135d4361b24d8b.tex

1: \begin{abstract}

2: This paper describes a versatile method

3:  that accelerates multichannel source separation methods

4:  based on full-rank spatial modeling.

5: A popular approach to multichannel source separation

6:  is to integrate a spatial model with a source model

7:  for estimating the spatial covariance matrices (SCMs)

8:  and power spectral densities (PSDs) of each sound source

9:  in the time-frequency domain.

10: One of the most successful examples of this approach

11:  is multichannel nonnegative matrix factorization (MNMF)

12:  based on a full-rank spatial model and a low-rank source model.

13: MNMF, however, is computationally expensive and often works poorly

14:  due to the difficulty of estimating the unconstrained full-rank SCMs.

15: Instead of restricting the SCMs to rank-1 matrices

16:  with the severe loss of the spatial modeling ability

17:  as in independent low-rank matrix analysis (ILRMA),

18:  we restrict the SCMs of each frequency bin

19:  to jointly-diagonalizable but still full-rank matrices.

20: For such a fast version of MNMF,

21:  we propose a computationally-efficient

22:  and convergence-guaranteed algorithm

23:  that is similar in form to that of ILRMA.

24: Similarly,

25:  we propose a fast version of a state-of-the-art speech enhancement method

26:  based on a deep speech model and a low-rank noise model.

27: Experimental results showed that

28:  the fast versions of MNMF and the deep speech enhancement method

29:  were several times faster and performed even better

30:  than the original versions of those methods, respectively.

31: \end{abstract}

32: