论文标题

QM7-X:跨越小有机分子化学空间的量子力学特性的综合数据集

QM7-X: A comprehensive dataset of quantum-mechanical properties spanning the chemical space of small organic molecules

论文作者

Hoja, Johannes, Sandonas, Leonardo Medrano, Ernst, Brian G., Vazquez-Mayagoitia, Alvaro, DiStasio Jr., Robert A., Tkatchenko, Alexandre

论文摘要

我们介绍了QM7-X,这是一个42个物理化学特性的综合数据集,$ \ $ \ $ \ $ 4.2 m的平衡和非平衡结构的小有机分子,最多七个非氢(C,N,O,O,S,Cl)原子。要跨越化学化合物空间(CCS)的根本重要区域,QM7-X包括(元)稳定平衡结构的详尽抽样,由构造/结构异构体和立体异构体组成总计$ \ $ 4.2 m的分子结构。 QM7-X在紧密收敛的量子力学PBE0+MBD理论水平上计算,包含全局(分子)和局部(原子中的原子)特性,范围从基态量化(例如雾化能量和偶极矩)到响应数量(例如极性张力张力张力张力张力张力张力张力张力和分散量)。通过提供量子力学计算的物理化学特性的系统,广泛且紧密结合的数据集,我们预计QM7-X将在基于机器学习的下一代基于机器学习模型的开发中起关键作用,以探索更大的CCS和具有目标特性的分子设计的更大的CCS和表演。

We introduce QM7-X, a comprehensive dataset of 42 physicochemical properties for $\approx$ 4.2 M equilibrium and non-equilibrium structures of small organic molecules with up to seven non-hydrogen (C, N, O, S, Cl) atoms. To span this fundamentally important region of chemical compound space (CCS), QM7-X includes an exhaustive sampling of (meta-)stable equilibrium structures - comprised of constitutional/structural isomers and stereoisomers, e.g., enantiomers and diastereomers (including cis-/trans- and conformational isomers) - as well as 100 non-equilibrium structural variations thereof to reach a total of $\approx$ 4.2 M molecular structures. Computed at the tightly converged quantum-mechanical PBE0+MBD level of theory, QM7-X contains global (molecular) and local (atom-in-a-molecule) properties ranging from ground state quantities (such as atomization energies and dipole moments) to response quantities (such as polarizability tensors and dispersion coefficients). By providing a systematic, extensive, and tightly-converged dataset of quantum-mechanically computed physicochemical properties, we expect that QM7-X will play a critical role in the development of next-generation machine-learning based models for exploring greater swaths of CCS and performing in silico design of molecules with targeted properties.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源