论文标题
学习神经状态空间模型:我们需要一个状态估计器吗?
Learning neural state-space models: do we need a state estimator?
论文作者
论文摘要
近年来,已经引入了几种针对神经状态空间模型的系统识别算法。大多数提出的方法旨在通过对从较长训练数据集提取的简短子序列进行优化来降低学习问题的计算复杂性。然后在Minibatch中同时处理不同的序列,利用现代的并行硬件进行深度学习。在这些方法中产生的问题是需要为每个子序列分配一个初始状态,这是运行模拟并因此评估拟合损失所必需的。在本文中,我们为基于广泛的实验和对两个公认的系统识别基准进行的大量实验和分析提供了校准神经状态空间训练算法的见解。特别关注的是最初状态估计的选择和作用。我们证明,实际上需要先进的初始状态估计技术才能在某些类别的动态系统上实现高性能,而对于渐近稳定的基本程序(例如零或随机初始化),已经产生了竞争性能。
In recent years, several algorithms for system identification with neural state-space models have been introduced. Most of the proposed approaches are aimed at reducing the computational complexity of the learning problem, by splitting the optimization over short sub-sequences extracted from a longer training dataset. Different sequences are then processed simultaneously within a minibatch, taking advantage of modern parallel hardware for deep learning. An issue arising in these methods is the need to assign an initial state for each of the sub-sequences, which is required to run simulations and thus to evaluate the fitting loss. In this paper, we provide insights for calibration of neural state-space training algorithms based on extensive experimentation and analyses performed on two recognized system identification benchmarks. Particular focus is given to the choice and the role of the initial state estimation. We demonstrate that advanced initial state estimation techniques are really required to achieve high performance on certain classes of dynamical systems, while for asymptotically stable ones basic procedures such as zero or random initialization already yield competitive performance.