Michali s Vafopoulos NTUA &

45
Michalis Vafopoulos NTUA & www.publicspending.net www.vafopoulos.org Linked Data in a nutshell summer school NCSR, IRSS-2013

description

Linked Data in a nutshell . Michali s Vafopoulos NTUA & www.publicspending.net www.vafopoulos.org. summer school NCSR, IRSS -2013 . Welcome to the data era. Data: Open, big, linked. Open: access …everyone to use and republish as she wishes Big: scale - PowerPoint PPT Presentation

Transcript of Michali s Vafopoulos NTUA &

Page 1: Michali s Vafopoulos NTUA &

Michalis Vafopoulos NTUA & www.publicspending.net

www.vafopoulos.org

Linked Data in a nutshell

summer school NCSR, IRSS-2013

Page 2: Michali s Vafopoulos NTUA &

Welcome to the data era

Page 3: Michali s Vafopoulos NTUA &

Data: Open, big, linked

Open: access…everyone to use and republish as she wishes

Big: scalehigh volume, velocity and variety

Linked: usePublish once, use as many times

Page 5: Michali s Vafopoulos NTUA &

How is it working?

Linked data in a nutshell

Sources: T. Heath, J. Sequeda, the Web

Page 6: Michali s Vafopoulos NTUA &

The Web of Documents• Analogy: a global file system• Designed for: human consumption• Primary objects: documents• Links between: documents (or sub-parts

of)• Degree of structure in objects: fairly low• Semantics of content and links: implicit-

humans(Tom Heath)

The web = the internet + links + documents

Page 7: Michali s Vafopoulos NTUA &

The Web of Documents• Simple, big and unstructured• Organized in Silos

But humans are interested in:• Things, no documents and• these Things might be in documents or elsewhere• Humans: Limited capacity to extract meaning...

Page 8: Michali s Vafopoulos NTUA &

Limited SEARCH capacitySearch for: Football Players who went to the University of Texas at Austin, played for the Dallas Cowboys as Cornerback

(Juan F. Sequeda)

8

Page 9: Michali s Vafopoulos NTUA &

Google, Bing, yahoo! irrelevant

9

Page 10: Michali s Vafopoulos NTUA &

Wikipedia through LD: relevant

10

Page 11: Michali s Vafopoulos NTUA &

The Web of Data• Analogy: a global filesystem ----> global

database• Designed for:human consumption ->machines first-

humans later• Primary objects: documents --> things (or

descriptions of things)

• Links between: documents --> things • Degree of structure in objects: fairly low --->

high• Semantics of content and links: implicit -->

explicit(Tom Heath)11

Page 12: Michali s Vafopoulos NTUA &

The Modigliani Test• Show me all the locations of all

the original paintings of Modigliani

• Daniel Koller (@dakoller) showed that you can find this with a SPARQL query on DBpedia

Thanks Richard MacManus - ReadWriteWeb

Page 13: Michali s Vafopoulos NTUA &
Page 14: Michali s Vafopoulos NTUA &

Results of the Modigliani Test

• Atanas Kiryakov from Ontotext• Used LDSR – Linked Data Semantic

Repository– Dbpedia– Freebase– Geonames– UMBEL– Wordnet

Published April 26, 2010: http://www.readwriteweb.com/archives/the_modigliani_test_for_linked_data.php

Page 15: Michali s Vafopoulos NTUA &
Page 16: Michali s Vafopoulos NTUA &

The Web of Data: why?

16

– encourages reuse– reduces redundancy– maximises its (real and potential) inter-connectedness– enables network effects to add value to data

Page 17: Michali s Vafopoulos NTUA &

The Web of Data: how?

17

– current state on the Web• Relational Databases• APIs• XML• CSV• XLS

Computers can’t consume data because:• Different formats & models• Not inter-connected

Page 18: Michali s Vafopoulos NTUA &

The Web of Data: how?

18

– we need to create a standard way of publishing Data on the Web (like HTML for docs)

This is the Resource Description Framework

(RDF)

Page 19: Michali s Vafopoulos NTUA &

Resource Description Framework (RDF)

• A data model – A way to model data– Inspired form Relational databases and

Logic• RDF is a triple data model• Labeled Graph (semantic networks)• Subject, Predicate, Object<Chios> <is part of> <Greece>

Page 20: Michali s Vafopoulos NTUA &

Example: Document on the Web

Page 21: Michali s Vafopoulos NTUA &

Databases back up documents

Isbn Title Author PublisherID

ReleasedData

978-0-596-15381-6

Programming the Semantic Web

Toby Segaran

1 July 2009

… … … … …PublisherID PublisherNa

me1 O’Reilly

Media… …

This is a THING:A book title “Programming the Semantic Web” by Toby Segaran, …

THINGS have PROPERTIES:A Book as a Title, an author, …

Page 22: Michali s Vafopoulos NTUA &

Data representation in RDF

book

Programming the Semantic

Web

978-0-596-15381-6

Toby Segaran

Publisher O’Reilly

title

name

author

publisher

isbn

Isbn Title Author PublisherID

ReleasedData

978-0-596-15381-6

Programming the Semantic Web

Toby Segaran

1 July 2009

PublisherID

PublisherName

1 O’Reilly Media

Page 23: Michali s Vafopoulos NTUA &

Everything on the web is identified by a URI!

Page 24: Michali s Vafopoulos NTUA &

link the data to other data

http://…/

isbn978

Programming the Semantic

Web

978-0-596-15381-6

Toby Segaran

http://…/

publisher1

O’Reilly

title

name

author

publisher

isbn

Page 25: Michali s Vafopoulos NTUA &

consider the data from Revyu.comhttp://

…/isbn978

http://…/

review1

Awesome Book

http://…/

reviewerJuan

Sequeda

hasReview

reviewerdescription

name

Page 26: Michali s Vafopoulos NTUA &

start to link data

http://isbn978

Programming the Semantic

Web

978-0-596-15381-6

Toby Segaran

http://publisher

1O’Reilly

title

name

author

publisher

isbn

http://isbn978

sameAs

http://review1

Awesome Book

http://reviewe

rJuan

Sequeda

hasReview

hasReviewerdescription

name

Page 27: Michali s Vafopoulos NTUA &

Juan Sequeda publishes data too

http://juansequeda.com/id

livesInJuan Sequedaname

http://dbpedia.org/Austin

Page 28: Michali s Vafopoulos NTUA &

Let’s link more datahttp://

…/isbn978

http://…/

review1

Awesome Book

http://…/

reviewer

Juan Sequed

ahttp://

juansequeda.com/id

hasReview

hasReviewerdescription

name

sameAs

livesInJuan Sequedaname

http://dbpedia.org/Austin

Page 29: Michali s Vafopoulos NTUA &

And more

http://…/

isbn978

Programming the Semantic Web

978-0-596-15381-6

Toby Segaran

http://…/publisher

1 O’Reilly

title

name

author

publisher

isbn

http://…/

isbn978

sameAs

http://…/

review1

Awesome Book

http://…/

reviewer

Juan Sequeda

http://juansequeda

.com/id

hasReview

hasReviewerdescription

name

sameAs

livesIn

Juan Sequedanamehttp://dbpedia.org/Austin

Page 30: Michali s Vafopoulos NTUA &

Linked data = internet + http +

RDF

Page 31: Michali s Vafopoulos NTUA &

Linked Data Principles1. Use URIs as names for things2. Use URIs so that people can

look up (dereference) those names.

3. When someone looks up a URI, provide useful information.

4. Include links to other URIs so that they can discover more things.

Page 32: Michali s Vafopoulos NTUA &

Web as a database• Linked Data makes the web

exploitable as ONE GIANT HUGE GLOBAL DATABASE!

• Is there any query language like sql?SPARQL…

Page 33: Michali s Vafopoulos NTUA &

The LOD cloud: May 2007

Page 34: Michali s Vafopoulos NTUA &

Mar 2008

Page 35: Michali s Vafopoulos NTUA &

Sept 2008

Page 36: Michali s Vafopoulos NTUA &

Mar 2009

Page 37: Michali s Vafopoulos NTUA &
Page 38: Michali s Vafopoulos NTUA &

Fujitsu and DERI Revolutionize Access to Open Data by Jointly Developing Technology for Linked Open Data

Page 39: Michali s Vafopoulos NTUA &

What is a Linked Data application/service?

Software system that makes use of data on the Web from multiple datasets and that

benefits from links between the datasets

Page 40: Michali s Vafopoulos NTUA &

Characteristics of Linked Data Applications

• Consume data that is published on the web following the Linked Data principles: an application should be able to request, retrieve and process the accessed data

• Discover further information by following the links between different data sources

• Combine the consumed linked data with data from sources (not necessarily Linked Data)

• Expose the combined data back to the web following the Linked Data principles

• Offer value to end-users

Page 41: Michali s Vafopoulos NTUA &

the 5 stars of open linked data

★make your stuff available on the Web (whatever format)★★make it available as structured data (e.g. excel instead of image scan of a table)★★★non-proprietary format (e.g. csv instead of excel)★★★★use URLs to identify things, so that people can point at your stuff★★★★★link your data to other people’s data to provide context

http://lab.linkeddata.deri.ie/2010/star-scheme-by-example/

Page 42: Michali s Vafopoulos NTUA &

Ideas for projects1. Think of interesting questions 2. Search for related datasets

And start “playing” with:• Interconnections – links to other

datasets • Statistical analysis• Economic/business analysis• Public policy analysis

Page 43: Michali s Vafopoulos NTUA &

43

• Where public money goes in a specific sector?

• Environment, education?• To which companies?

Interesting questions

Page 44: Michali s Vafopoulos NTUA &

Questions??

Page 45: Michali s Vafopoulos NTUA &

More info• Twitter: @vafopoulos• [email protected]• www.Vafopoulos.org • www.publicspending.net • www.Youtube.com/websciencegr