EUDAT 2020

EUDAT 2020 is the second phase of EUDAT (short for European Data), which addresses the collaborative data handling requirements of European researchers and practitioners from all fields of science. EUDAT has set up a Collaborative Data Infrastructure (CDI), an e-infrastructure that enables users to find, access and process scientific data in a trusted environment. It provides integrated services developed and supported by EUDAT partners. As the number of services grows and the development and maintenance of the existing suite of integrated services continues, EUDAT partners have adopted a development process that can be characterized as follows:

  • A pan-European effort with close collaboration of over 50 research communities from various scientific disciplines
  • A software engineering process that involves scientific communities at all stages of design and implementation, so that their needs and requirements shape the development of the EUDAT CDI

The network of EUDAT CDI partners currently consists of 36 European high-performance computing centers, national data centers and research institutions. Within EUDAT, MPI-M and the Deutsches Klimarechenzentrum (DKRZ), along with other partners, represent the climate community as ENES, the European Network for Earth System Modelling.

EUDAT 2020 is funded under grant agreement No. 654065 through the European Union's Horizon 2020 programme (DG Connect e-Infrastructure & Open Science Cloud Unit), which aims to advance technological development and innovation across Europe. Many EUDAT partners are playing a fundamental role in the creation of the European Open Science Cloud, which is to be realized by 2020. Information on EUDAT, its service catalogue, pilot projects and much more can be found on the official EUDAT website.

Rationale for our Participation
The rationale for our participation in EUDAT is rooted in our collaboration with DKRZ. DKRZ's services address the data management needs of our institute and other members of the climate community, so DKRZ cannot define solutions on its own; it has to engage with end users to assess those needs. EUDAT gives DKRZ a means of engaging with a subset of its user community, organized as ENES, and serves as a funding mechanism for developments that meet some of the climate community's data handling requirements. Our contribution ensures that our requirements feed into the EUDAT development process, so that any future services developed through EUDAT and operated by DKRZ, or by any other data center, are tailored to our needs. The development process has many agile aspects and can accommodate moderately changing requirements through most of the project lifetime.

Towards Tighter Integration with E-infrastructures
We are seeking tighter integration with e-infrastructures such as the Earth System Grid Federation (ESGF) and the European Grid Infrastructure (EGI) Federated Cloud. ESGF maintains a global system of federated data centers that allows access to the largest archive of climate data in the world, including CMIP data. One of the ESGF data nodes is operated by DKRZ in Hamburg. The EGI Federated Cloud is a network of academic private clouds and virtualized resources built around open standards. Harnessing these resources in a user-friendly way will provide us with valuable computing power for post-processing model data. To integrate more tightly with these e-infrastructures, we coordinate the core development of the Generic Execution Framework (GEF), which will be integrated with both of them in order to showcase post-processing of CMIP5 data from ESGF with the Climate Data Operators (CDOs) on the EGI Federated Cloud. The GEF development effort will also raise the profile of the CDOs at the pan-European level, as they are the primary use case example of our contribution.

Nature of our Participation
MPI-M is mainly involved in two strands of work within EUDAT 2020, each organized as a separate work package:

Work Package 5 (WP5): Service Building
Work Package 8 (WP8): Data Life Cycle across Communities

For more details on our work in EUDAT, please contact Asela Rajapakse or Reinhard Budich.