5b29018265152195.tex
1: \begin{abstract}
2: Online approximation of the optimal station keeping strategy for a
3: fully actuated six degrees-of-freedom marine craft subject to an irrotational
4: ocean current is considered. An approximate solution to the optimal
5: control problem is obtained using an adaptive dynamic programming
6: technique. The hydrodynamic drift dynamics of the dynamic model are
7: assumed to be unknown; therefore, a concurrent learning-based system
8: identifier is developed to identify the unknown model parameters.
9: The identified model is used to implement an adaptive model-based
10: reinforcement learning technique to estimate the unknown value function.
11: The developed policy guarantees uniformly ultimately bounded convergence
12: of the vehicle to the desired station and uniformly ultimately bounded
13: convergence of the approximated policies to the optimal polices without
14: the requirement of persistence of excitation. The developed strategy
15: is validated using an autonomous underwater vehicle, where the three
16: degrees-of-freedom in the horizontal plane are regulated. The experiments
17: are conducted in a second-magnitude spring located in central Florida.
18: \end{abstract}
19: