abstract:161993b78f4a1d49.tex

1: \begin{abstract}

2: We aim at segmenting small organs (e.g., the pancreas) from abdominal CT scans.

3: As the target often occupies a relatively small region in the input image,

4: deep neural networks can be easily confused by the complex and variable background.

5: To alleviate this, researchers proposed a coarse-to-fine approach~\cite{Zhou_2017_Fixed},

6: which used prediction from the first (coarse) stage to indicate a smaller input region for the second (fine) stage.

7: Despite its effectiveness, this algorithm dealt with two stages individually,

8: which lacked optimizing a global energy function, and limited its ability to incorporate multi-stage visual cues.

9: Missing contextual information led to unsatisfying convergence in iterations,

10: and that the fine stage sometimes produced even lower segmentation accuracy than the coarse stage.

11:

12: This paper presents a {\bf Recurrent Saliency Transformation Network}.

13: The key innovation is a saliency transformation module,

14: which repeatedly converts the segmentation probability map from the previous iteration as spatial weights

15: and applies these weights to the current iteration.

16: This brings us two-fold benefits.

17: In training, it allows joint optimization over the deep networks dealing with different input scales.

18: In testing, it propagates multi-stage visual information throughout iterations to improve segmentation accuracy.

19: Experiments in the NIH pancreas segmentation dataset demonstrate the state-of-the-art accuracy,

20: which outperforms the previous best by an average of over $2\%$.

21: Much higher accuracies are also reported on several small organs in a larger dataset collected by ourselves.

22: In addition, our approach enjoys better convergence properties, making it more efficient and reliable in practice.

23: \end{abstract}

24: