论文标题
使用基于模型的森林的观察数据的异质治疗效果估计
Heterogeneous Treatment Effect Estimation for Observational Data using Model-based Forests
论文作者
论文摘要
异质治疗效应(HTE)的估计引起了许多学科的浓厚兴趣,最重要的是医学和经济学。到目前为止,当代研究主要集中在连续和二元响应上,其中传统上通过线性模型估算了HTE,该模型即使在某些模型误差下也可以估算恒定或异构效应。生存,计数或顺序结果的更复杂的模型需要更严格的假设,以可靠地估计治疗效果。最重要的是,非碰撞性问题需要对治疗和预后影响进行联合估计。基于模型的森林允许同时估计协变量依赖性治疗和预后作用,但仅用于随机试验。在本文中,我们建议对基于模型的森林进行修改,以解决观察数据中的混杂问题。特别是,我们评估了最初由Robinson(1988,ConaticleTrica)提出的正交策略,其基于模型的森林针对广义线性模型和转换模型中的HTE估计。我们发现,该策略在具有各种结果分布的模拟研究中降低了混杂效应。我们通过评估Riluzole对肌萎缩性侧面硬化症进展的潜在异质作用来证明HTE估计的生存和顺序结局的实际方面。
The estimation of heterogeneous treatment effects (HTEs) has attracted considerable interest in many disciplines, most prominently in medicine and economics. Contemporary research has so far primarily focused on continuous and binary responses where HTEs are traditionally estimated by a linear model, which allows the estimation of constant or heterogeneous effects even under certain model misspecifications. More complex models for survival, count, or ordinal outcomes require stricter assumptions to reliably estimate the treatment effect. Most importantly, the noncollapsibility issue necessitates the joint estimation of treatment and prognostic effects. Model-based forests allow simultaneous estimation of covariate-dependent treatment and prognostic effects, but only for randomized trials. In this paper, we propose modifications to model-based forests to address the confounding issue in observational data. In particular, we evaluate an orthogonalization strategy originally proposed by Robinson (1988, Econometrica) in the context of model-based forests targeting HTE estimation in generalized linear models and transformation models. We found that this strategy reduces confounding effects in a simulated study with various outcome distributions. We demonstrate the practical aspects of HTE estimation for survival and ordinal outcomes by an assessment of the potentially heterogeneous effect of Riluzole on the progress of Amyotrophic Lateral Sclerosis.