Data2Semantics

Data2Semantics


Project homepage: http://data2semantics.github.io


From Data to Semantics for Scientific Data Publishers

Data2Semantics aims to provide essential semantic infrastructure for bringing e-Science to the next level. A core task for scientific publishers is to speed up scientific progress by improving the availability of scientific knowledge. This holds both for dissemination of results through traditional publications, as well as through the publication of scientific data. The Data2Semantics project focuses on a key problem for data management in e-Science: How to share, publish, access, analyse, interpret and reuse data?


Data2Semantics is a collaboration between the VU University Amsterdam, the University of Amsterdam, Data Archiving and Networked Services (DANS) of the KNAW, Elsevier Publishing and Synerscope, and is funded under the COMMIT programme of the NL Agency of the Dutch Ministry of Economic Affairs, Agriculture and Innovation.

The discovery of new knowledge is the heart of scientific progress; the generation, support and maintenance of knowledge form the foundation of the scientific endeavour. e-Science is ultimately about discovering and sharing knowledge in the form of experimental data, theory-rich vocabularies, publications and re-usable services that are meaningful to the working scientist.

The complexity and abundance of data resources in an e-Science environment requires support for knowledge and metadata management: data is notoriously hard to share, find, access, interpret and reuse. This project targets scientific data publishers as primary facilitators of the e-Science process.

Scientists need tools to better understand the complexity characteristics of their data and its ability to answer scientific questions. They must be able to equip data with meaning and to generate a surrounding semantic context in which data can be meaningfully interpreted. Scientists must be given the means to make their data speak for itself, to move from data to semantics.

The targets of this project are to

  • increase the ease with which scientists can share their datasets with others,
  • increase the ease with which scientists can access, analyse and interpret datasets, and thereby
  • increase the reuse of such datasets.