论文标题
使用模式的控制:D-学习方法
Control with Patterns: A D-learning Method
论文作者
论文摘要
基于学习的控制策略在机器人技术和控制领域的各种任务中广泛使用。但是,很难获得具有非线性动力学系统的基于学习的控制器的正式(Lyapunov)稳定性。我们提出了一种新颖的控制方法,即使用模式(CWP)控制,以解决与非线性动态系统相对应的数据集的稳定性问题。对于此类数据集,我们引入了一个新的定义,即数据集中的指数吸引力,以描述正在考虑的非线性动态系统。基于数据集和参数化的Lyapunov功能,将指数吸引力的问题转化为模式分类问题的问题。此外,提出了D-L-Learning作为执行CWP的方法,而无需了解系统动力学。最后,通过模拟和实际飞行实验证明了基于D-学习的CWP的有效性。在这些实验中,使用实时图像作为反馈来稳定多次驱动器的位置,可以将其视为基于图像的视觉伺服(IBV)问题。
Learning-based control policies are widely used in various tasks in the field of robotics and control. However, formal (Lyapunov) stability guarantees for learning-based controllers with nonlinear dynamical systems are difficult to obtain. We propose a novel control approach, namely Control with Patterns (CWP), to address the stability issue over data sets corresponding to nonlinear dynamical systems. For such data sets, we introduce a new definition, namely exponential attraction on data sets, to describe the nonlinear dynamical systems under consideration. The problem of exponential attraction on data sets is transformed into a problem of pattern classification one based on the data sets and parameterized Lyapunov functions. Furthermore, D-learning is proposed as a method to perform CWP without knowledge of the system dynamics. Finally, the effectiveness of CWP based on D-learning is demonstrated through simulations and real flight experiments. In these experiments, the position of the multicopter is stabilized using real-time images as feedback, which can be considered as an Image-Based Visual Servoing (IBVS) problem.