\begin{abstract}
	This paper develops a dual control framework for exploration and exploitation (DCEE) to solve a self-optimisation problem in an unknown and uncertain environment. In such problems there is a fundamental conflict between tracking an unknown optimal operational condition and identifying the environmental parameters. Unlike existing adaptive control methods, the proposed DCEE does not need additional perturbation signals, since it naturally embraces an exploration effect that actively probes the uncertain environment to reduce belief uncertainty. An ensemble-based multi-estimator approach is developed to learn the environmental parameters and, at the same time, quantify the estimation uncertainty in real time. The control action is devised with dual effects: it not only minimises the tracking error between the current state and the believed optimal operational condition, but also reduces belief uncertainty by actively exploring the environment. Formal properties of the proposed DCEE framework, such as convergence, are established. A numerical example is used to validate the effectiveness of the proposed DCEE. Simulation results for maximum power point tracking are provided to further demonstrate the potential of this new framework in real-world applications.
\end{abstract}