分布式存储与计算方法在水利地理空间大数据中的应用

Application of Distributed Storage and Computing Method in Water Conservancy Geospatial Big Data

袁泽文 ¹周国成 ²周胜洁 ²邸国辉²

扫码查看

作者信息

1. 武汉大学测绘学院,湖北武汉 430079
2. 湖北省水利水电规划勘测设计院,湖北武汉 430070
折叠

摘要

为高效存储管理水利地理空间大数据,本文提出一种基于Hadoop和Spark的分布式存储与计算架构,同时合理运用Spark的内存与缓存机制,设计实现了针对空间大数据的分布式存储与计算方法.利用某水电站库区DEM数据进行库容计算分析,验证了本文所提出的分布式存储和计算方法具有更高的计算效率,并具有良好的可扩展性.

Abstract

This article presents a Hadoop&Spark-based distributed storage and computing framework for saving and managing water conservancy geospatial big data efficiently. By reasonably using Spark's memory and cache mechanism, the distributed storage and computing method is designed and achieved. It calculates and analyzes the reservoir capacity by using DEM data of a hydroelectric sta-tion reservoir and the results show that the proposed distributed computing method can improve computing efficiency of large geospatial data effectively. Meanwhile, this distributed computing method exhibits good adaptability.

关键词

Hadoop/Spark/空间大数据/分布式计算/库容计算

Key words

Hadoop/Spark/geospatial big data/distributed computing/reservoir capacity calculation

引用本文复制引用

基金项目

湖北省水利重点科研项目(HBSLKY202112)

出版年

2024

测绘与空间地理信息

黑龙江省测绘学会

测绘与空间地理信息

影响因子：0.788

ISSN：1672-5867

参考文献量7

段落导航