On the Design and Implementation of Easy Access to External Spatiotemporal Datasets in NFDI
DOI:
https://doi.org/10.52825/cordi.v1i.360Keywords:
Spatiotemporal data access, Workflow platform, Geo EngineAbstract
Across many scientific domains, the ability to process large amounts of heterogeneous spatiotemporal data from various sources is crucial for solving challenging research questions. For example, in NFDI4Biodiversity, researchers need to combine observation data with satellite images to correlate the loss of biodiversity with climate change variables. In general, large data sets are not available on the system (called consumer) where the processing is performed, but first have to be retrieved from one or multiple external systems (called providers) that offer a corresponding service. Moreover, a consumer is often unaware of the datasets the providers offer. Ideally, a provider follows FAIR principles and thus supports mechanisms to greatly simplify the data exchange. However, in practice, there are multiple providers with valuable datasets that are not as FAIR as desired or lack spatiotemporal-specific support for data exchange. Instead of improving each potential provider at the source, we propose an intermediary spatiotemporal data exchange layer (SDExL) that helps simplify data exchange so that domain experts easily gain access to valuable data with little technical know-how.
Downloads
References
“NFDI4Biodiversity.” (2023), [Online]. Available: https://www.nfdi4biodiversity.org/ (visited on 04/25/2023).
M. D. Wilkinson, M. Dumontier, I. J. Aalbersberg, et al., “The fair guiding principles for scientific data management and stewardship,” Scientific data, vol. 3, no. 1, pp. 1–9, 2016.
C. Beilschmidt, J. Dr¨onner, M. Mattig, P. Schweitzer, and B. Seeger, “Geo engine: Workflow-backed geo data portals,” in BTW 2023, Bonn: Gesellschaft f ¨ ur Informatik e.V.,
, pp. 837–849, ISBN: 978-3-88579-725-8.
“FAIR Data Spaces.” (2023), [Online]. Available: https://www.nfdi.de/fair- dataspaces/ (visited on 04/25/2023).
“Web Coverage Service.” (2023), [Online]. Available: https://www.ogc.org/standard/wcs/ (visited on 04/25/2023).
“SpatioTemporal Asset Catalogs.” (2023), [Online]. Available: https://stacspec.org/ (visited on 04/25/2023).
“Aruna Object Storage.” (2023), [Online]. Available: https://www.uni-giessen.de/de/fbz/fb08/Inst/bioinformatik/software/aruna (visited on 04/25/2023).
“Global Biodiversity Information Facility.” (2023), [Online]. Available: https://www.gbif.org/ (visited on 04/25/2023).
“PostgreSQL.” (2023), [Online]. Available: https://www.postgresql.org/ (visited on 04/25/2023).
“PostGIS.” (2023), [Online]. Available: https://postgis.net/ (visited on 04/25/2023).
“GDAL PostgreSQL Driver.” (2023), [Online]. Available: https://gdal.org/drivers/vector/pg.html (visited on 04/25/2023).
“Open Geospatial Consortium.” (2023), [Online]. Available: https://www.ogc.org/ (visited on 04/25/2023).
“Data and Information Access Services.” (2023), [Online]. Available: https://www.copernicus.eu/en/access-data/dias (visited on 04/25/2023).
“The Gaia-X Ecosystem - A Sovereign Data Infrastructure for Europe.” (2023), [Online]. Available: https://www.bmwk.de/Redaktion/EN/Dossier/gaia- x.html (visited on
/25/2023).
“NFDI - Nationale Forschungsdateninfrastruktur.” (2023), [Online]. Available: https://www.nfdi.de/ (visited on 04/25/2023).
Downloads
Published
How to Cite
Conference Proceedings Volume
Section
License
Copyright (c) 2023 Christian Beilschmidt, Dominik Brandenstein, Johannes Drönner, Nikolaus Glombiewski, Michael Mattig, Bernhard Seeger
This work is licensed under a Creative Commons Attribution 4.0 International License.
Accepted 2023-06-29
Published 2023-09-07
Funding data
-
Deutsche Forschungsgemeinschaft
Grant numbers 442032008 -
Bundesministerium für Bildung und Forschung
Grant numbers FAIRDS10 -
Bundesministerium für Wirtschaft und Technologie
Grant numbers O3EUPHE069;50EE2303B