\begin{abstract}
Online approximation of the optimal station-keeping strategy for a
fully actuated six degrees-of-freedom marine craft subject to an irrotational
ocean current is considered. An approximate solution to the optimal
control problem is obtained using an adaptive dynamic programming
technique. The hydrodynamic drift dynamics of the dynamic model are
assumed to be unknown; therefore, a concurrent learning-based system
identifier is developed to identify the unknown model parameters.
The identified model is used to implement an adaptive model-based
reinforcement learning technique to estimate the unknown value function.
The developed policy guarantees uniformly ultimately bounded convergence
of the vehicle to the desired station and uniformly ultimately bounded
convergence of the approximated policies to the optimal policies without
requiring persistence of excitation. The developed strategy
is validated using an autonomous underwater vehicle, where the three
degrees of freedom in the horizontal plane are regulated. The experiments
are conducted in a second-magnitude spring located in central Florida.
\end{abstract}
