Title: A Versatile Data-Intensive Computing Platform for Information Retrieval from Big Geospatial Data
Authors: SOILLE PIERREBURGER ARMINDE MARCHI DAVIDEKEMPENEERS PIETERRODRIGUEZ ASERETTO ROQUE DARIOSYRRIS VASILEIOSVASILEV VESELIN
Citation: FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE vol. 81 p. 30-40
Publisher: ELSEVIER SCIENCE BV
Publication Year: 2018
JRC N°: JRC105787
ISSN: 0167-739X
URI: https://www.sciencedirect.com/science/article/pii/S0167739X1730078X
http://publications.jrc.ec.europa.eu/repository/handle/JRC105787
DOI: 10.1016/j.future.2017.11.007
Type: Articles in periodicals and books
Abstract: The increasing amount of free and open geospatial data of interest to major societal questions calls for the development of innovative data-intensive computing platforms for the efficient and effective extraction of information from these data. This paper proposes a versatile petabyte-scale platform based on commodity hardware and equipped with open-source software for the operating system, the distributed file system, and the task scheduler for batch processing as well as the containerization of user specific applications. Interactive visualization and processing based on deferred processing are also proposed. The versatility of the proposed platform is illustrated with a series of applications together with their performance metrics.
JRC Directorate:Joint Research Centre Corporate Activities

Files in This Item:
There are no files associated with this item.


Items in repository are protected by copyright, with all rights reserved, unless otherwise indicated.