Navigating data lakes for Earth and marine science: FAIR Data management and Service Interoperability in practice
This workshop is envisioned as an addition to the Blue-Cloud Training Academy, which aims to stimulate the uptake of FAIR data practices in marine science and neighbouring disciplines, through contributions by key actors such as EuroGOOS and IEEE. The workshop also has a strong overlap with the FAIR-IMPACT project regarding development and uptake of FAIR service descriptions (expansion of FAIR software) and methods for assessment.
This workshop is envisioned as an addition to the Blue-Cloud Training Academy, which aims to stimulate the uptake of FAIR data practices in marine science and neighbouring disciplines, through contributions by key actors such as EuroGOOS and IEEE. The workshop also has a strong overlap with the FAIR-IMPACT project regarding development and uptake of FAIR service descriptions (expansion of FAIR software) and methods for assessment.
To manage and provide access to a large amount of data, a federated or distributed Data Lake can be a solution but it presents some technical bottlenecks and sustainability constraints. For the federation of services it is crucial to achieve technical and semantic interoperability between the services for providing added value to users. “Just providing API’s” is not sufficient. As a basis each data access service (being a “plain” dataset access service or a more advanced subsetting service) needs a FAIR service description which includes e.g. the expected input, output, processing capacity, data policy (CC-BY), etc.
Both Blue-Cloud 2026 and FAIR-EASE run into this complex data lake challenge which could be well supported by FAIR service descriptions. Blue-Cloud 2026 is active in the marine domain developing virtual labs, work benches for Essential Ocean Variables (EOVs) and a Virtual Research Environment (VRE) on top of Blue Data Infrastructure services, and, FAIR-EASE develops similar services on top of data access services in the multi-disciplinary domain. Both projects (in coordination with other projects (e.g.,EuroSciencesGateway) contribute to the implementation of the EOSC interoperability framework.
Presentations are available here.
Details
-
DATE:27 September 2023
-
ROOM:Sala Ciudad Úbeda
Organisers
Speakers
Rita Giuffrida
Patricia Cabrera
Peter Thijsse
Maria Luisa Chiusano
Katrina Exter
Massimiliano Assante
Marie Jossé
Short Bios
Rita Giuffrida
Patricia Cabrera
Peter Thijsse
Maria Luisa Chiusano
Katrina Exter
Massimiliano Assante
Marie Jossé
Agenda
11:30 Welcome – _Rita Giuffrida (Trust-IT)
- Interdisciplinary user environments
- 11:35 _Patricia Cabrera, VLIZ (Blue-Cloud 2026)
- 11:50 _Maria Luisa Chiusano, UNINA (FAIR-EASE)
12:05 The Mythical Data Lake - _Katrina Exter, VLIZ (FAIR-EASE)
12:20 Interactive session with the audience – _Rita Giuffrida (Trust-IT)
12:30 Blue-Cloud Virtual Research Environment services for Open
Science practices - Massimiliano Assante, CNR (Blue-Cloud 2026)
12:45 Virtual Research Environment (process data As FAIR As
Possible) - Marie Jossé, CNRS (FAIR-EASE)
13:00 FAIR Data Discovery and Access -Peter Thijsse, MARIS
(Blue-Cloud 2026/FAIR-EASE)
13:15 Q&A and reflections