What data is exposed
The CORE project exposes data about similarities between papers in the Open Access domain. The similarities are calculated using Natural Language Processing techniques based on the full-text. This distinguishes CORE from other systems, such as Mendeley or MarcXimiL. The similarities are provided only for research articles with an accessible and machine readable full-text. At the moment we expose more than 3 million RDF triples describing similarities calculated on a set of more than 50,000 full-text articles harvested from British Open Access repositories.
The data about the similarities are represented using the MuSIM ontology (http://kakapo.dcs.qmul.ac.uk/ontology/musim/0.2/musim.html) and BIBO ontologies (http://bibliontology.com/) with links to the OAI (RKBExplorer) repository available in the Linked Data cloud. Have a look on example queries to see what is available (http://core.kmi.open.ac.uk:8081/COREWeb/example-queries).
Data Schema
Core exposes RDF data according to the schema demonstrated below:

Data License
All data from CORE (unless otherwise specified) are available under the a Creative Commons Attribution 3.0 Unported License.
Contact us
Petr Knoth
Knowledge Media institute, The Open University
Milton Keynes
United Kingdom

