ae7745f38025c3b9.tex
1: \begin{abstract}
2: 
3: In this work, we analyze the capabilities and practical limitations of \acp{NN} for sequence-based signal processing which can be seen as an omnipresent property in almost any modern communication systems.
4: In particular, we train multiple state-of-the-art \ac{RNN} structures to \emph{learn how to decode} convolutional codes allowing a clear benchmarking with the corresponding \ac{ML} Viterbi decoder.
5: We examine the decoding performance for various kinds of \ac{NN} architectures, beginning with \emph{classical} types like feedforward layers and \ac{GRU}-layers, up to more recently introduced architectures such as \acp{TCN} and \acp{DNC} with external memory.
6: As a key limitation, it turns out that the training complexity increases exponentially with the length of the encoding memory $\nu$ and, thus, practically limits the achievable \ac{BER} performance. 
7: To overcome this limitation, we introduce a new training-method by gradually increasing the number of \emph{ones} within the training sequences, i.e., we constrain the amount of possible training sequences in the beginning until first convergence.
8: By consecutively adding more and more possible sequences to the training set, we finally achieve training success in cases that did not converge before via \emph{naive} training.
9: Further, we show that our network can learn to jointly detect and decode a \ac{QPSK} modulated code with sub-optimal (anti-Gray) labeling in \emph{one-shot} at a performance that would require iterations between demapper and decoder in classic detection schemes.
10: 
11: \end{abstract}
12: