This topic covers two complementary areas of e-infrastructures very closely related with the objective to make research data discoverable, accessible, assessable, intelligible, useable, and wherever possible interoperable – c.f. G8 principles on research data:
(a) Secure and agile data and distributed computing e-infrastructures: fostering the integration of a secure, permanent, on-demand service-driven, privacy-compliant and sustainable e-infrastructure incorporating distributed databases, computing resources and software.
The European data and computing e-infrastructure landscape remains very fragmented which is an obstacle for research collaboration at European and global levels and introduces additional complexity for achieving sustainable governance. The challenge is to integrate at European level the geographically and disciplinary dispersed resources to achieve economies of scale and efficiency gains in providing the best data and computing capacity and services to the research and education communities. This action is interrelated to INFRADEV-04-2016, “European Open Science Cloud for Research”.
(b) Access and preservation platforms for scientific information: supporting the integration and consolidation of e-infrastructure for reliable and permanent open access to digital scientific records, based on existing initiatives across Europe (institutional and thematic repositories, aggregators, etc.).
The European infrastructures need to respond to the emerging requirements for seamless and reliable access to publications, research data and software. These requirements are complemented by the need for long term preservation and curation of scientific information to fully support data and computing intensive science. The challenge is to support the integration at European level of a robust and sustainable e-infrastructure, based on existing initiatives across Europe (institutional and thematic publishing platforms, aggregators, etc.) and services supporting European Open Access policies. An additional challenge is the building of capacity to link all kinds of digital research objects in order to enable a more transparent evaluation of research and reproducibility of results, enabling trust and facilitating access by innovative business actors.Scope:
Grants awarded under this topic will be complementary between them. The respective options of Article 2, Article 31.6 and Article 41.4 of the Model Grant Agreement will be applied. The main purpose of the collaboration agreements referred to in Article 41.4 of the Model Grant Agreement is to work on potential synergies, overlaps and gaps in the overall service offering. In addition, links should also be established with projects selected under topic INFRADEV-04-2016, to collaborate, exploit potential synergies and ensure complementarity.
Proposals will address part (a) or (b), but not both. At least one proposal for each part will be selected:
(a) Secure and agile data and distributed computing e-infrastructures (proposals should address all points below):
(1) integration of computing, software and storage resources exposing them through a dynamic registry and catalogue of services supporting European research and education communities in their tasks related with data and computing intensive science. This integration should be done by means of open and flexible architectures and include institutional, regional, national and European capabilities, packaging them in the optics of end-user needs
(2) seamless operation of highly scalable and agile data and computing platforms and services dedicated to analytics including hardware and software components, database, compilers, analytics software, supported to easy user entry points for the community of users
(3) reliably address the aspects of privacy, cybersecurity and information assurance supporting multiple compartments with private, public or industrial corpus of data, protected from unauthorized access by secure interfaces
(4) adoption of standards-based common interfaces, open source components enabling access and processing of underlying data collected/stored in different platforms and formats. Empowering users to customise application and services tailoring them to specific requirements, which will differ across disciplines, applications etc
(5) work closely with user communities (from different disciplines) to foster the use of digital infrastructures, promote the values of open science and support their data management plans. Engage and train users (researchers, educators and students) to contribute to the dynamic registry and catalogue of services improving quality of data, software and computing infrastructure that become available for re-use
(6) foster interoperability of pan-European thematic/community-driven e-infrastructures providing cost-effective and interoperable solutions for data management. The data and computing e-infrastructure should be able to interoperate with resources based on different technologies which are operated/owned by public and or private organisations
(7) support the preservation and curation of data and associated software so that the reproducibility and accuracy of the data can be verified
(8) enable seamless transition and e-infrastructure upgrades, exploiting economies of scale and promoting interoperability with similar infrastructures across and beyond Europe and operate user-friendly and comprehensive repositories of software components for research and education
The Commission considers that proposals requesting a contribution from the EU of between EUR 10 and 15 million would allow this specific challenge to be addressed appropriately. Nonetheless, this does not preclude submission and selection of proposals requesting other amounts.
(b) Access and preservation platforms for scientific information (proposals will address all points below):
(1) Deployment and maintenance of service-driven knowledge e-infrastructure responding to general and specific requirements of researchers and research organisations for open access to research digital objects, their registration and preservation. This e-infrastructure will further develop the research capacity through a coordinated and participatory architecture linking institutional and thematic repositories across Europe. It will support publishing platforms by providing essential services for scientific information that can be used by humans and machines. Such target platforms can be generic, specific for a research field or specialised on quality assurance, discoverability, archiving etc. Essential functions of this service-driven approach will include helpdesks, training and guidance to support producers and users of scientific information, community building to support research data sharing and management, as well as implementation of Open Access policies in Europe. Relevant indicators on the take-up of open access in Europe including publications and data should be elaborated and reported regularly. The project will promote a limited set of biblio- and webometrics that reflect open access policies. It will collect bibliometric data on publications, citations, data citations, etc. on all Horizon 2020 scientific output (including on the Open Research Data Pilot) and produce both standard and on-demand statistics.
(2) Supporting global interoperability of open access data e-infrastructures and linking with similar initiatives across the globe to complement the physical access to research facilities with data access and to ensure that Europe plays a leading role in international collaborations.
The Commission considers that proposals requesting a contribution from the EU of between EUR 8 and 10 million would allow this specific challenge to be addressed appropriately. Nonetheless, this does not preclude submission and selection of proposals requesting other amounts. It is expected that one proposal will be selected.Expected Impact:
(a) the operation of a federated European data and distributed computing infrastructure for research and education communities will optimise the access to IT equipment and services and will put all European researchers and educators in equal footing to access essential resources to express their talent and creativity. Establishing partnerships with industrial and private partners the e-infrastructure will train people in research and academic organisations preventing lack of skilled and specialised infrastructure operators. It will avoid the locking-in to particular hardware or software platforms that would jeopardise the long-term planning for capacity upgrades. With such an operational infrastructure more scientific communities will use storage and computing infrastructures with state-of-the-art services for their research and education activities. The open nature of the infrastructure will allow scientists, educators and students to improve the service quality by interacting with data, software and computing resources. It will increase the incentives for scientific discovery and collaboration across disciplinary and geographical boundaries, putting Europe in the driving seat at global level. It will further develop the European economic innovation capacity and provide stability to the e-infrastructure.
(b) a reliable operation of e-infrastructure services for access and preservation of scientific information will make the intellectual capital of Europe available to researchers, business and citizens at large. It will generate economic and scientific advances now and in the future as that capital is safely preserved for further exploitation by future generations. Open Access publications resulting from Horizon 2020 funded research are available and easily findable online. Data needed to validate published results is linked to the publications and publicly shared whenever possible. Accurate science metrics for Horizon 2020 can be produced with almost no effort. Most of the European institutional repositories (at least 95%) as well as the principal thematic repositories are part of the same interoperable repository network.