TDM: unlocking a goldmine of information

Organisers: OpenMinTeD: Natalia Manola and Stelios Piperidis (Athena Research & Innovation Centre), Thomas Margoni (Univ. of Glasgow), Martine Oudenhoven (LIBER)

Duration: 3 hours

Text and Data Mining (TDM) is a natural ‘next step’ in open science. It can lead to new and unexpected discoveries and increase the impact of publications and repositories. This workshop showcases examples of successful TDM and infrastructural solutions for researchers. We will also discuss what is needed to make the most of these infrastructures and how publishers and repositories can open up their content.

Workshop abstract

In open science, the first step is to make publications and data openly available. Once they are truly open, they can be mined for new discoveries. Scholarly content comprises a huge amount of text, containing many patterns and unexpected connections that are still hidden. Uncovering these connections and patterns could lead to new discoveries, for instance in medicine, agriculture, social sciences or humanities. Text and data mining can reveal these treasures.

The workshop is divided into two parts; you can attend both or pick one. The first part showcases successful TDM initiatives and infrastructures and includes a demo of a TDM platform. The second part consists of campfire sessions and focuses on how to open up content for TDM. Settle down in front of the fire and talk about overcoming differences between publishers and repositories, the steps for connecting content to a TDM platform, and legal issues and assistance.

Agenda

Part 1 (14:00-15:30): Applications of TDM

The first half of the workshop will address text and data mining (TDM) platforms, both public and commercial, for research-related results. Speakers will present architectures, solutions, barriers and lessons learnt, at both the technical and legal levels.

Part 2 (16:00-17:30): Content and legal TDM issues

The second part will initiate the conversation about incentives for content providers to hook up to emerging TDM platforms/infrastructures, the concrete steps they need to take, and the legal questions related to opening up content.

The workshop aims to raise awareness of the potential of TDM and of particular efforts in the research domain, to provide knowledge on how to overcome or bypass legal barriers, and to draft the first steps in a roadmap towards widely adopted implementations.

SPEAKERS

Prof. Sophia Ananiadou - NaCTeM, University of Manchester, United Kingdom
Robert Bossy - French National Institute for Agricultural Research (INRA)
Haris Papageorgiou - Athena Research and Innovation Centre, Greece
Manuel Noya - Linknovate, Spain
Stelios Piperidis - Athena Research and Innovation Centre, Greece
Natalia Manola - Athena Research and Innovation Centre, Greece
Petr Knoth - Open University, United Kingdom
Thomas Margoni - University of Glasgow, United Kingdom

TARGET AUDIENCE

Content providers (publishers, libraries/repositories), TDM service providers, researchers, industry, policy makers.

WHEN

DAY 2 - 14:00 PARALLEL SESSION 4 (14:00) & 5 (16:00)

TDM, Text and Data Mining, open content, Open Data Services, Platform, EC copyright reform, innovation