ΕΚΤ Access to Knowledge ΕΚΤ Access to Knowledge Implementation of CERIF-based infrastructure...

23
ΕΚΤ Access to Knowledge ΕΚΤ Access to Knowledge Implementation of CERIF-based infrastructure for enabling CRIS adoption in the Greek academia Vasilis Bonis Nikos Pougounias, Dimitris Karaiskos National Documentation Centre / National Hellenic Research Foundation, Greece 1

Transcript of ΕΚΤ Access to Knowledge ΕΚΤ Access to Knowledge Implementation of CERIF-based infrastructure...

ΕΚΤ Access to KnowledgeΕΚΤ Access to Knowledge

Implementation of CERIF-based infrastructure for enabling CRIS adoption in the Greek academia

Vasilis Bonis

Nikos Pougounias, Dimitris Karaiskos

National Documentation Centre / National Hellenic Research

Foundation, Greece

1

ΕΚΤ Access to Knowledge

Outline• Current Research Infrastructure System in Greece• Native CERIF and CERIF related systems• CERIF Tools implemented by EKT• Lessons learned• Future Steps

2

ΕΚΤ Access to Knowledge

Greece - Research Community Demographics• 22 Universities, 14 Technological Educational Institutes, 13 Research

Centres in total

• 52.358 researchers, 29.054 Full-Time Equivalents (2013)

• Source: «Research and Development Expenditure and Personnel in Greece in 2013 - Main indicators, National Documentation Centre, 2015» (http://metrics.ekt.gr/sites/metrics/files/RDstatistics_2013final_Greece_en.pdf)

3

ΕΚΤ Access to Knowledge

Greek CRIS infrastructure• Each organisation operates its own institutional CRIS system

o Policies, procedures and workflows at the institutional level

• All institutional CRIS systems are based on the same CRIS software platform

• CRIS software developed by EKT (Native CERIF database)• A central aggregator will provide a unified point of access to the entire

information• CERIF XML/REST is used for interoperability

4

ΕΚΤ Access to Knowledge

Support to institutions• CRIS software – including documentation for installation, configuration,

usage• Online helpdesk for support

o http://helpdesk.cris.ekt.gr

• Software distribution systemo Including certification procedure, notifications for new releases, versioning

• Semantics – vocabularies• Tools for data import to the CERIF database

o CERIF XMLo Excel / CSV and Dublin Core through an open source library (Biblio-transformation-

engine, BTE)

5

ΕΚΤ Access to Knowledge

The CRIS software platform – overview (1)• Native CERIF database - support all CERIF entities• Java, Spring, JPA, Elastic Search, Javascript/jQuery• Based on the open source CERIF-JPA library – used for the CERIF

reference implementation (http://github.com/EKT/CERIF-Tools)

• Search, faceted browsing• User interface (web forms) for editing all entities and semantic layer• Advanced support for data management / approval workflows

6

ΕΚΤ Access to Knowledge

The CRIS software platform – overview (2)• Sophisticated user roles / authorization

o Role-based access control (RBAC) in CERIF tableso Roles aligned with institutions’ main needs

• Users stored in DB or logging in through LDAP. Single Sign-On

• Reporting• Review institution’s hierarchical structure• Highly configurable (e.g. fields order, facets,

permissions, etc.)

7

ResearcherResearcher

Project ManagerProject Manager

IndexerIndexer

Content ManagerContent Manager

Research Support StaffResearch Support Staff

Research Administration Staff

Research Administration Staff

Authorized SystemAuthorized System

AdministratorAdministrator

ΕΚΤ Access to Knowledge

Aggregator - Architecture

8

AggregatorAggregator

HarvesterHarvester

OAI ServerOAI Server

CRIS RESTCRIS REST

CRIS DB

•Augmented CRIS•CERIF DB + aggregation related tables

•Generic functionality•Relays messages•System agnostic•Accepts OAI-PMH data

•CERIF XML payload•Per entity retrieval

•/{entity}?identifiersOnly=false&limit=X&pageSize=Y

• Direct access

Institution

EKT

ΕΚΤ Access to Knowledge

Aggregator/Harvester

9

ΕΚΤ Access to Knowledge

Roadmap• 1st Phase – Early Adoption

o Currently running until end of 2015o To participate, 12 Universities, 3 TEIs and 1 Research Centreo Installations in 7 Universities, 1 TEIs and 1 Research Centreo Aggregator deployment and operationo CRIS current version 0.8.2 – next version to be released at the end of May 2015

• 2nd Phase – Complete Adoptiono All institutions will install and operate CRISo Aggregator fully operational (aggregation and reporting)

10

ΕΚΤ Access to Knowledge

EKT’s Native CERIF Systems (1)• R&D Projects Registry (in progress)

o Catalogs R&D related national and European projects and their associated organisations

o Sources of data: E-Corda (FP7 and Horizon2020) Greek Monitoring Information System for National Strategic Reference Framework

(NSRF 2007-2013) RDI national organisations registry (supported by EKT) Eurostat/UNESCO

o More than 30.000 organisationso More than 10.000 NSRF projects and 20.000 FP7 projects

11

ΕΚΤ Access to Knowledge

EKT’s Native CERIF Systems (2)• R&D Data Management System (DMS)

o Stores, interconnects and manages Greek R&D Expenditure and Personnel data as collected via online data collection system

(Eurostat - Commission Regulation 995/2012) Organisations’ profiles stored in the RDI national registry Indicators as produced from primitive data External R&D data sources as managed by EKT (e.g. FP7 projects etc.)

o Goals Single management point Preservation and archiving (time series) Data validation & estimation workflow Benchmarking using external sources Real time automated generation of indicators Data exports (SDMX, CSV, Excel, JSON, CERIF-XML) Reporting

12

ΕΚΤ Access to Knowledge

EKT’s CERIF related systems (1)• Greek Researcher Directory – researchers.gr

o Database of Greek Researchers - Researchers in Greek organisationso Access only to researchers – registration only by invitationo Pre-loaded information for permanent personnel of Greek public research

performing organisationso Automatic loading of information from sources:

CrossRef, PubMed, Microsoft Academic Search, DBLP, Greek Books in Print, RepEc (economics), arXiv (Cornel University Library), Google Scholar (bibTex, ris, refman)

Link publications to researchers – automatic mechanismso Automatic recognition of references from researcher CVs – Text analysiso Export profile in CERIF XML

13

ΕΚΤ Access to Knowledge

EKT’s CERIF related systems (2)• Greek Researcher Directory – researchers.gr

o Researcher Profile (exported in CERIF XML) Affiliations Education Research expertise – disciplines (Frascati) and keywords Identifiers (ORCID, MAS ID, Scopus Author ID, ResearcherID, …) Awards / Distinctions (Best paper awards, Journal Editorships, Scientific

Society Fellowships, …) Publications Journals, Conference Proceedings including impact figures, Books and

Monographs Projects (in development)

14

ΕΚΤ Access to Knowledge

The CRIS ecosystem

15

ΕΚΤ Access to Knowledge

CERIF Tools Implemented (1)• CERIF Model

o All CERIF entities as POJOso 100% compliant to the latest CERIF version

• CERIF Serviceso JPA data repositories for all entitieso Common persistence and selection services

• CERIF XMLo Full support of CERIF 1.6 XML Schema

• CERIF APIo RESTful services based on current EuroCRIS draft recommendationo OAI-PMH – per entity retrieval

16

ΕΚΤ Access to Knowledge

CERIF Tools Implemented (2)

17

• Open sourceo http://github.com/EKT/CERIF-Tools

• Matureo 4 years of continuous developmento Deployed on a plethora of production

systems

• Ongoing Maintenanceo Tools aligned to CERIF versionso New features and bug fixing

ΕΚΤ Access to Knowledge

Production Systems Using EKT’s CERIF Libraries• DMS Metrics - Native CERIF DB + Custom tables• EuroRIs-Net+ Observatory - Native CERIF DB, CERIF XML• R&D Business Registry - Native CERIF DB, CERIF XML• Scopus DB - Native CERIF DB• PAERIP Inventory of Research Infrastructures - Native CERIF DB (older

version)• CRIS - Native CERIF DB, CERIF XML, CERIF API• RICH - Native CERIF DB, CERIF XML• OpenAire - CERIF XML, CERIF API (Custom OAI-PMH implementation)• Greek Researcher Directory - CERIF XML

18

ΕΚΤ Access to Knowledge

Data Migration Tool• Biblio-Transformation-Engine

o Provided as Open Source https://github.com/EKT/Biblio-Transformation-Engine o Included in the core distribution of DSpaceo JAVA

19

ΕΚΤ Access to Knowledge

Using BTE for Data Migration1. Identify CRIS related data2. Identify per migration datasets3. Map data to CERIF

1. Identify CERIF entities2. Identify persistent identifiers3. Identify relations4. Identify semantics – Custom semantics may be necessary

4. Process, Clean, Merge source data5. Implement migration (commonly OutputGenerator classes)6. Run7. Test

20

ΕΚΤ Access to Knowledge

Lessons Learned• Central vs. decentralized CRIS – Keep data fresh• Minimizing the need for data migration and manual data entry – Tackle

cold start problem• Horizontal need for persistent identifiers• CERIF semantic entities:

o Difficulties in understanding and usingo Need for Greek translations

• User management – Integration of already existing user directories• Security issues – Data filtering • Policies and workflows• Communicate the value of CRIS

21

ΕΚΤ Access to Knowledge

Future Steps (2015 - 2016)• CRIS Installations in more institutions• Collect more data in Aggregator and produce statistics/metrics• Implement visualizations for Aggregator statistics/metrics• Ongoing Maintenance of CRIS software and all CERIF tools

o Fully aligned to CERIF versionso New features and bug fixing

• Organize workshops and tutorials for institutions’ personnel• Completion of CERIF XML export for Terms Thesaurus• Completion of Projects Registry

22

ΕΚΤ Access to Knowledge

23

Thank you for your attentionQ&A

• More info:

o bonisv AT ekt.gro http://metrics.ekt.gr/cris o http://helpdesk.cris.ekt.gr o http://github.com/EKT/CERIF-Tools o http://github.com/EKT/Biblio-Transformation-Engine