Skip to main content

Open Science Knowledge Graphs: Transforming the Way we Manage, Explore, and Analyze Scientific Knowledge

In this workshop, SciLake representatives will elaborate on the mission of building a comprehensive scholarly communication graph and the technical solutions under development. Representatives from key ESFRIs will present their ongoing work on the creation and maintenance of domain-specific SKGs, their current needs and identified challenges. By exploring the initiatives of these projects, we aim to provide insights into the usage of SKGs in Open Science activities and the impact they have on research output and collaborations.


In this workshop, SciLake representatives will elaborate on the mission of building a comprehensive scholarly communication graph and the technical solutions under development. Representatives from key ESFRIs will present their ongoing work on the creation and maintenance of domain-specific SKGs, their current needs and identified challenges. By exploring the initiatives of these projects, we aim to provide insights into the usage of SKGs in Open Science activities and the impact they have on research output and collaborations.

In recent years, a variety of both domain-specific and cross-domain Science Knowledge Graphs (SKGs) have been created in the context of academic or industrial activities. These resources contain valuable information and are catalyzing the creation and provision of advanced knowledge extraction and exploration services, which are intended to increase research productivity, shorten the time between hypotheses and results, and provide the basis for a powerful toolbox that supports important workflows for researchers and other stakeholders by allowing informed and fact-based decisions. As a result, SKGs are of great value to the research community at large and various research communities have started building their own domain-specific SKGs and linking them with cross-domain knowledge graphs, like the OpenAIRE Graph (https://graph.openaire.eu/).

In this context, researchers participating in the five thematic clusters of the European Strategy Forum on Research Infrastructures (ESFRI) have given significant effort to develop their own SKGs and useful added-value services on top of them to facilitate knowledge management and exploration in the respective disciplines. At the same time, building SKGs and making them interoperable to unlock their full potential is a challenging task. There are various challenges that should be addressed, most of them being related to the complexity of the domains to be represented, the multilinguality of scientific texts, the interconnectivity of the involved entities, the lack of structure in organizing the respective knowledge (e.g., in text), and the heterogeneity of the available metadata formats. SciLake (https://scilake.eu/) is a Horizon Europe project that aims to alleviate the previous issues. More specifically, it aims to extend the technical work in the field of SKGs leveraging them as the foundation to establish the concept of the scientific lake: a research ecosystem to facilitate creating, combining, and querying cross-domain and domain-specific SKGs. This ecosystem, among others, will comprise tools that are capable to extract knowledge from unstructured (e.g., textual) information, facilitate the interoperability among SKGs, support various types of knowledge transformation, unify and simplify the way SKGs can be queried, and accelerate graph processing and analysis for SKGs. Finally, SciLake will build a prototype of this concept also delivering, on top of it, a series of EOSC-onboarded services to assist researchers in discovering scientific knowledge and improving research reproducibility.

In this workshop, representatives from key ESFRIs will present their ongoing work on the creation and maintenance of domain-specific SKGs, their current needs and identified challenges. SciLake representatives will elaborate on the mission of building a comprehensive scholarly communication graph and the technical solutions which are under development. By exploring the initiatives of these projects we aim to provide insights into the usage of the SKGs in Open Science activities and the impact they have on research output and collaborations.

In conclusion, the session will investigate the potential cooperation and common goals of these initiatives in the creation of multidisciplinary interlinked SKGs. In addition, the importance of the scientific lake concept will be discussed, as well as its impact on different research communities. The session will provide participants with an opportunity to engage with the speakers and provide feedback on their work, as well as to examine how SKGs will impact their research outputs and assist them in responding to the Open Science requirements.

Key learning outcomes:

  • Learn about the creation of Scientific Knowledge Graphs, specifically how to convert data into knowledge and how to use them to promote Open Science to improve navigation and discovery.
  • Discover the challenges associated with delivering high-quality SKGs in terms of coverage, timeliness, accuracy, and consistency, where and how good curation practices and AI come into play.
  • Learn about the various needs and efforts coming from different disciplines: Research Infrastructures (neuroscience, biodiversity, photon/neutron), national infrastructures (cancer), and emerging research communities (transport, energy)
  • .Explore how diverse SKGs and projects can work together to create interconnected multidisciplinary SKGs.
  • Learn about the concept of a scientific lake, the services it can provide to the SKG community, and the impact it has on various research communities.

Presentations are available here

Details

  • DATE:
    26 September 2023
  • ROOM:
    Main Auditorium

Organisers


Speakers

Stefania Amodeo

OpenAIRE

Leily Rabbani

Karolinska Institutet

Thanasis Vergoulis

OpenAIRE - Athena RC

Ingrid Reiten

University of Oslo

Max Novelli

European Spallation Source

Joaquín López Lérida

LifeWatch ERIC

Short Bios

Stefania Amodeo

Stefania Amodeo is a researcher with academic experience. She has a Ph.D. in Astronomy and Astrophysics from the Paris Observatory. She has worked as a postdoctoral researcher at Cornell University and the Observatory of Strasbourg, specializing in astronomical data science and open science. Her role in OpenAIRE is to implement engagement and training activities for the research community. Her work involves identifying communities' needs and developing cohesive activities that offer solutions through OpenAIRE services, while also improving OpenAIRE services through the addition of new features, data, and methodologies.

Leily Rabbani

Leily Rabbani is a Bioinformatician at the Department of Molecular Medicine and Surgery in Karolinska Institute. She completed a Ph.D. in Bioinformatics at Max Planck Institute for Biology, Tübingen. Afterwards, she worked several years at the Max Planck Institute of Immunobiology and Epigenetics in Freiburg im Breisgau as a core facility Bioinformatician where she was involved in developing tools and next-generation sequencing (NGS) data analyses. In her current job, she analyzes large-scale short/long-read NGS data and she implements and tests new technologies with the purpose of studying chronic lymphocytic leukemia (CLL).

Thanasis Vergoulis

Thanasis Vergoulis is the Development and Operation Director of OpenAIRE AMKE and a scientific associate at IMSI, Athena RC in Greece. He received his Ph.D. in Computer Science from NTU of Athens, under the supervision of Prof. Timos Sellis in 2014. Since then, he has been involved in several EU and national ICT projects involving big data management, scholarly knowledge representation and management, scientometrics, research analytics, and bioinformatics. He has served as a member of the program or organising committee for several CS conferences and workshops, and as a guest editor in special issues of well-established journals, while he has co-edited a book titled Predicting the Dynamics of Research Impact'' (Springer, 2021). He has been teaching courses in undergraduate and postgraduate levels in academic institutions in Greece and Cyprus.

Ingrid Reiten

Ingrid Reiten is a curation scientist at the Institute of Basic Medical Sciences at the University of Oslo (UiO). She completed a Msc in Neuroscience at NTNU, Trondheim, and is currently doing her PhD at Neural Systems Laboratory, UiO. In her current position, she develops the workflows and related procedures for FAIR data sharing using the EBRAINS Research Infrastructure, contributing with her expertise in neuroanatomy and neuroinformatics in the cross-cultural and multidisciplinary team behind the EBRAINS Data & Knowledge Services. With EBRAINS she is working towards a more collaborative and transparent neuroscientific research culture.

Max Novelli

Max Novelli is a software engineer, data architect, and data manager. He currently works as Data Curation Scientist at the Copenhagen office of the European Spallation Source, where is responsible for preparing the scientific data catalog for production, setting the foundation for data FAIRness and define the set of useful metadata. He is interested in everything related to data: Big Data, data management, data structure, data visualization, and data curation. When not lost in "computer land", he enjoy spending time with his family and friends, exercising and teaching yoga.

Joaquín López Lérida

Joaquín López Lérida has a PhD in Computer Science and Telecommunications Engineering. He holds the position of Data e-Science Management Plans and Blockchain Officer - DLBO - at LifeWatch ERIC. He holds a Master in Executive Coaching from AECOP/EOI and worked as a researcher in the PhD program at Stanford University, California, Computer Science Department. Since then he has been involved in the development and implementation of projects related to data management and telecommunications infrastructures at Spanish and European level, particularly with topics related to big data management, data management and curation and blockchain.

    Agenda

    Open Science Knowledge Graphs: Transforming the Way we Manage, Explore, and Analyze Scientific Knowledge

    • Welcome & introduction [5 min.] - Stefania Amodeo 
    • Presentations: 
    • OpenAIRE Graph: status and enrichments via the SciLake project - Thanasis Vergoulis [15 min] 
    • Domain-specific SKGs: requirements, models, interactions: 
    • EBRAINS Knowledge Graph [10min] Ingrid Reiten 
    • LifeWatch Knowledge Graph [10 min] Joaquín López Lérida
    • PaNOSC Data Catalogue [10 min] Max Novelli 
    • Roadmap for a Cancer Knowledge Graph [10 min] Leily Rabbani 
    • Round table :  Challenges for interoperability, exchange and way forward [30 min] 

    Moderator:  Stefania Amodeo 

    Panelists:  All the speakers, with interaction from the audience