论文标题

具有不同样本选择偏见和分散参数的广义heckman模型

A Generalized Heckman Model With Varying Sample Selection Bias and Dispersion Parameters

论文作者

Bastos, Fernando de S., Barreto-Souza, Wagner, Genton, Marc G.

论文摘要

许多建议已成为Heckman选择模型的替代方法,主要是为了解决其正常假设的非舒适性。 2001年医疗支出小组调查数据通常用于说明Heckman模型的这种不舒适性。在本文中,我们通过允许样品选择偏差和分散参数依赖于协变量来提出对Heckman样本选择模型的概括。我们表明,Heckman模型的不稳定性可能是由于假设恒定样品选择偏置参数而不是正态性假设所致。我们提出的方法使我们能够了解哪些协变量对于解释样本选择偏见现象很重要,而不仅仅是仅形成有关其存在的结论。我们探讨了我们提出的广义Heckman模型的最大似然估计器(MLE)的推论方面。更具体地说,我们表明该模型满足某些规律性条件,从而确保MLE的一致性和渐近态性。提供了针对样本选择模型的适当得分残差,并解决了模型是否足够。提出了模拟结果以检查估计器的有限样本行为,并验证不考虑不同样本选择偏见和分散参数的后果。我们表明,分析医疗支出数据的正常假设是合适的,并且使用我们的方法得出的结论与先前文献的发现是一致的。此外,我们确定哪些协变量与解释该重要数据集中样本选择偏差的存在相关。

Many proposals have emerged as alternatives to the Heckman selection model, mainly to address the non-robustness of its normal assumption. The 2001 Medical Expenditure Panel Survey data is often used to illustrate this non-robustness of the Heckman model. In this paper, we propose a generalization of the Heckman sample selection model by allowing the sample selection bias and dispersion parameters to depend on covariates. We show that the non-robustness of the Heckman model may be due to the assumption of the constant sample selection bias parameter rather than the normality assumption. Our proposed methodology allows us to understand which covariates are important to explain the sample selection bias phenomenon rather than to only form conclusions about its presence. We explore the inferential aspects of the maximum likelihood estimators (MLEs) for our proposed generalized Heckman model. More specifically, we show that this model satisfies some regularity conditions such that it ensures consistency and asymptotic normality of the MLEs. Proper score residuals for sample selection models are provided, and model adequacy is addressed. Simulated results are presented to check the finite-sample behavior of the estimators and to verify the consequences of not considering varying sample selection bias and dispersion parameters. We show that the normal assumption for analyzing medical expenditure data is suitable and that the conclusions drawn using our approach are coherent with findings from prior literature. Moreover, we identify which covariates are relevant to explain the presence of sample selection bias in this important dataset.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源