Roadside collection of training data for cropland mapping is viable when environmental and management gradients are surveyed

Waldner, Francois Bellemans, Nicolas Hochman, Zvi Newby, Terence Veron, Santiago R. Bartalev, Sergey Lavreniuk, Mykola Kussul, Nataliia Le Maire, Guerric Simoes, Margareth Skakun, Sergii de Abelleyra, Diego Defourny, Pierre

Roadside collection of training data for cropland mapping is viable when environmental and management gradients are surveyed

Waldner, Francois 1Bellemans, Nicolas 2Hochman, Zvi 1Newby, Terence 3Veron, Santiago R. 4Bartalev, Sergey 5Lavreniuk, Mykola 6Kussul, Nataliia 6Le Maire, Guerric 7Simoes, Margareth 8Skakun, Sergii 9de Abelleyra, Diego 4Defourny, Pierre2
扫码查看

作者信息

  • 1. CSIRO Agr & Food, Queensland Biosci Precinct, 306 Carmody Rd, St Lucia, Qld 4067, Australia
  • 2. Catholic Univ Louvain, Earth & Life Inst, Environm, Louvain La Neuve, Belgium
  • 3. Agr Res Council, Private Bag X79, ZA-0001 Pretoria, South Africa
  • 4. INTA, Inst Clima & Agua, Buenos Aires, DF, Argentina
  • 5. Russian Acad Sci IKI, Space Res Inst, Moscow, Russia
  • 6. NAS & SSA SRI, Space Res Inst, Kiev, Ukraine
  • 7. CIRAD, UMR Eco & Sols, Campinas, SP, Brazil
  • 8. Rio de Janeiro State Univ UERJ, Rua Sao Francisco Xavier 524, BR-20550900 Rio De Janeiro, RJ, Brazil
  • 9. Univ Maryland, Dept Geog Sci, College Pk, MD 20742 USA
  • 折叠

Abstract

Cropland maps derived from satellite imagery have become a common source of information to estimate food production, support land use policies, and measure the environmental impacts of agriculture. Cropland classification models are typically calibrated with data collected from roadside surveys which enable the sampling of large areas at a relatively low cost. However, there is a risk of providing biased data as environmental and management gradients may not be fully captured from road networks, thereby violating the assumption of representativeness of calibration data. Despite being widely adopted, the potential biases of roadside sampling have so far not been thoroughly addressed. In this study, we looked for evidence of these biases by comparing three sampling strategies: Random sampling, Roadside sampling, and Transect sampling - a spatially constrained variant of Roadside sampling. In these three strategies, non-cropland data are randomly distributed as they can be photo-interpreted. Based on reference maps at 30 m in four study sites, we followed a Monte Carlo approach to generate multiple realizations of each sampling strategy for ten sample sizes. The effect of the sampling strategy was then assessed in terms of representativeness of the data set collected and accuracy of the resulting maps. Results showed that data sets obtained from Roadside sampling were significantly less representative than those obtained from Random sampling but the resulting maps were only marginally less accurate (2% difference). Transect sampling captured systematically less variability than Random or Roadside sampling which led to differences in accuracy as large as 15%. The effect of sample size on accuracy varied across sites but generally leveled off after reaching 3000 pixels. Augmenting the size of Transect samples improved the classification accuracy but not sufficiently to match the performance of the other sampling strategies. Finally, we found that Random and Roadside training sets with similar representativeness yield comparable accuracy. Therefore, we conclude that roadside sampling can be a viable source of training data for cropland mapping if the range of environmental and management gradients is surveyed. This underlines the importance of survey planning to identify those routes that capture most variability.

Key words

Agriculture/Sampling/Classification/Representativeness/Accuracy/Sample size

引用本文复制引用

出版年

2019
International journal of applied earth observation and geoinformation

International journal of applied earth observation and geoinformation

SCI
ISSN:0303-2434
被引量9
参考文献量69
段落导航相关论文