Communication-efficient surrogate quantile regression for non-randomly distributed system
Elsevier
Distributed systems have been widely used to solve massive data analysis tasks. This article targets quantile regression on distributed systems with non-randomly distributed massive data, and proposes a new communication-efficient surrogate quantile regression. Specifically, based on a small random pilot sample collected from different worker machines, we approximate the global quantile regression as a surrogate one on the master machine, which relates to the local datasets only through their gradient vectors, and can overcome the non-randomly distributed nature of the data. The resulting estimator can then be obtained on the master, and the communication cost is greatly reduced, since the pilot sample and local gradients are cheap to transfer. In theory, without any restrictive assumption about randomness, the established asymptotic results show that the proposed method performs as well as if the data were stored on a single machine. Synthetic and real-world data evaluations are also used to illustrate the proposed method. © 2021 Elsevier Inc. All rights reserved.
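
The abstract does not state the exact form of the surrogate, so the following is only a minimal sketch of the general idea, under stated assumptions. It assumes a gradient-corrected surrogate loss: the check loss on the pooled pilot sample, shifted by a linear term so that its subgradient at an initial estimate matches the global subgradient aggregated from the workers. The article's actual surrogate may differ. The simulated data, the function names (`check_loss`, `check_subgrad`, `subgradient_descent`, `demo`), and the use of plain subgradient descent in place of a linear-programming solver are all illustrative choices, not taken from the source.

```python
import random

def residual(beta, xi, yi):
    return yi - sum(b * v for b, v in zip(beta, xi))

def check_loss(beta, X, y, tau):
    """Average check loss rho_tau(r) = r * (tau - 1{r < 0})."""
    rs = (residual(beta, xi, yi) for xi, yi in zip(X, y))
    return sum(r * (tau - (r < 0)) for r in rs) / len(y)

def check_subgrad(beta, X, y, tau):
    """Average subgradient of the check loss with respect to beta."""
    g = [0.0] * len(beta)
    for xi, yi in zip(X, y):
        w = (residual(beta, xi, yi) < 0) - tau  # 1{r < 0} - tau
        for j, v in enumerate(xi):
            g[j] += w * v
    return [gj / len(y) for gj in g]

def subgradient_descent(loss, grad, start, steps=200, lr=0.5):
    """Plain subgradient descent that returns the best iterate seen."""
    beta, best, best_val = list(start), list(start), loss(start)
    for t in range(1, steps + 1):
        step = lr / t ** 0.5
        beta = [b - step * g for b, g in zip(beta, grad(beta))]
        val = loss(beta)
        if val < best_val:
            best, best_val = list(beta), val
    return best

def demo(seed=0, tau=0.5, workers=5, n_local=400, m_pilot=40):
    rng = random.Random(seed)
    # Hypothetical non-random allocation: worker k only holds x in [k, k + 1).
    local = []
    for k in range(workers):
        X = [[1.0, k + rng.random()] for _ in range(n_local)]
        y = [1.0 + 2.0 * x[1] + rng.gauss(0.0, 1.0) for x in X]
        local.append((X, y))

    # Each worker sends a small random pilot subsample to the master.
    pX, py = [], []
    for X, y in local:
        for i in rng.sample(range(n_local), m_pilot):
            pX.append(X[i])
            py.append(y[i])

    # Master: initial estimate from the pilot sample alone.
    beta0 = subgradient_descent(lambda b: check_loss(b, pX, py, tau),
                                lambda b: check_subgrad(b, pX, py, tau),
                                [0.0, 0.0])

    # Workers: each sends its local subgradient at beta0; master averages them.
    n_total = workers * n_local
    g_global = [0.0, 0.0]
    for X, y in local:
        g_k = check_subgrad(beta0, X, y, tau)
        g_global = [G + n_local / n_total * g for G, g in zip(g_global, g_k)]

    # Master: assumed gradient-corrected surrogate built from pilot data only.
    g_pilot = check_subgrad(beta0, pX, py, tau)
    shift = [a - b for a, b in zip(g_pilot, g_global)]

    def surrogate(b):
        return check_loss(b, pX, py, tau) - sum(s * v for s, v in zip(shift, b))

    def surrogate_grad(b):
        return [g - s for g, s in zip(check_subgrad(b, pX, py, tau), shift)]

    beta_hat = subgradient_descent(surrogate, surrogate_grad, beta0)
    return {
        "beta0": beta0,
        "beta_hat": beta_hat,
        "g_global": g_global,
        "surrogate_grad_at_init": surrogate_grad(beta0),
        "surrogate_at_init": surrogate(beta0),
        "surrogate_at_hat": surrogate(beta_hat),
    }

if __name__ == "__main__":
    print(demo())
```

In this sketch the master never sees the full local datasets: each worker transmits `m_pilot` observations and one subgradient vector, which is the kind of communication pattern the abstract describes. A production implementation would more likely solve the shifted check-loss problem with a linear-programming or dedicated quantile regression solver rather than subgradient descent.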