首页|断点回归的两大分析框架:我们究竟该用哪一个?

断点回归的两大分析框架:我们究竟该用哪一个?

扫码查看
作为最重要的准实验因果推断方法之一,断点回归设计有两大分析框架,二者无论在前提假定、带宽选择还是推断方法上均有相当差异。其中,基于连续性的框架(continuity-based framework)假定潜在结果的条件期望连续,在实证研究中广泛应用。局部随机化的框架(local randomization framework)则为后起之秀,该框架假定在断点附近的小窗口,驱动变量可视为随机分配。本文详细介绍了这两大框架的原理与技术细节,包括识别、估计、推断与证伪,并通过蒙特卡罗模拟与经典案例深入比较了二者的差异。文献中一般认为基于连续性的框架所依赖的假定更弱,但本文发现,一方面,该框架隐含假定驱动变量在带宽内为外生变量,在实践中未必满足;另一方面,局部随机化的框架所选窗口一般更窄,故驱动变量的外生性条件更易满足,且适用于离散驱动变量的情形。由于局部随机化框架的有效样本容量一般较小,易受离群值影响,故本文提出使用"留一估计"(leave-one-out estimation)作为稳健性检验。
On the Two Frameworks for Regression Discontinuity Design:Which One Should We Use?
Regression discontinuity(RD)design is one of the most popular quasi-experimental approaches to causal inference in economics.There are two major frameworks for RD design,which differ in their assumptions,choice of bandwidth,and inference.The continuity-based framework assumes that the conditional expectations of potential out-comes are continuous at the cutoff.The local average treatment effect at the cutoff is identified and nonparametric local polynomial regression can be used for estimation and inference.The local randomization framework assumes that the run-ning variable is as good as randomly assigned in a small window around the cutoff.The average treatment effect in this window is identified,and estimation and inference can be implemented using methods from analysis of experiments.An important question is how to choose between the two RD frameworks.The continuity framework appeared ear-lier in literature and is widely used in empirical studies.The local randomization framework is a latecomer,currently used mostly as an alternative approach or a robustness check.However,this paper views the local randomization framework as a promising framework,which is expected to play an increasingly important role in the future for the following reasons.Firstly,although the continuity assumption is weaker than the local randomization assumption,the continuity frame-work implicitly assumes that the running variable is exogenous within the chosen optimal bandwidth while implementing local polynomial regression.However,the optimal bandwidth is usually chosen to minimize mean squared error(MSE),with no regard of guaranteeing the exogeneity of the running variable.Empirical researchers often tacitly view RD as local randomized experiments and take this exogeneity condition for granted.The local randomization framework chooses the bandwidth by a series of covariate balance tests,often resulting in a much smaller window such that the exogeneity can be more easily satisfied.However,a consequence of choosing a small window is that the effective observations within this window may be small,and one may have to rely on an exact finite-sample Fisherian approach for inference.Secondly,the continuity framework assumes that the running variable is continuous with a positive density at the cut-off,which is not applicable with a discrete running variable,unless additional assumptions are introduced.The local ran-domization framework is applicable with a continuous or discrete running variable.Thirdly,the external validity of the continuity framework is weak since it can only identify the local average treat-ment effect at the cutoff.The local randomization framework enjoys a stronger external validity since it can identify the av-erage treatment effect in a window around the cutoff,although the window is usually narrow.In summary,the continuity framework and the local randomization framework have respective advantages and disad-vantages,but the latter is expected to become more important in practice.Furthermore,this paper compares their relative performance via Monte Carlo simulations.The results of simulations show that if the running variable is exogenous in the wider MSE-optimal bandwidth,then the estimators of the continuity framework are more efficient.However,if the run-ning variable is only exogenous in the narrower window chosen under the local randomization framework,the estimators of the continuity framework are inconsistent,while the estimator of the local randomization framework remains consis-tent.Additionally,this paper demonstrates the detailed implementation of the two RD frameworks in Stata using the clas-sic example of U.S.Senate elections(Cattaneo et al.,2015).A major disadvantage of the local randomization framework is that it often chooses a very narrow window.This typi-cally results in a significant loss of observations such that the statistical inference has to rely on the exact finite-sample Fisherian approach.Since the effective sample size under the local randomization framework may be small and subject to the influence of outliers,this paper proposes leave-one-out estimation as a robustness check.

RDContinuity-Based FrameworkLocal Randomization Framework

陈强、齐霁、颜冠鹏

展开 >

山东大学经济学院,250100

山东财经大学经济学院,250002

断点回归 连续性框架 局部随机化框架

2024

经济学动态
中国社会科学院经济研究所

经济学动态

CSTPCDCSSCICHSSCD北大核心
影响因子:1.125
ISSN:1002-8390
年,卷(期):2024.(11)