Semantic Web Company

The Semantic Puzzle

Open World Assumptions

subscribe RSS

Linked Data Flows: A new picture to illustrate the “openness” we mean

October 28, 2009 By: Tassilo Pellegrini Category: Corporate Semantic Web, Linked Data & Open Data 1 Comment →

(Original post taken from “About the Social Semantic Web“)

A lot of activities around Linking Open Data (“LOD”) and the associated data sets which are nicely visualised as a “cloud” are going on for quite a while now. It is exciting to see how the rather academic “Semantic Web” and all the work which is associated with this disruptive technology can be transformed now into real business use cases.

What I have observed in the last few months, especially in business communities, is the following:

  • “Linked Data” sounds interesting for the business people because the phrase creates a lot of associations in a second or two; also the database crowd seems to be attracted by this web-based approach of data integration
  • “Web of Data” is somehow misleading because many people think that this will be a new web which replaces something else. Same story with the “Semantic Web”
  • “Linking Open Data” sounds dangerous and not trustworthy to many companies

For insiders it is clear, that the “openness” of data, especially in commercial settings, can be controlled and has to be controlled in many cases i.e. by defining the right licensing models. But here we are still at the beginning as a workshop at ISWC 2009 has illustrated.

Anyway, looking at the characteristics of Linked Data Flows, they can be one-way or mutual. In some cases data from companies will be put into the cloud, and can be opened up for many purposes, in other use cases it will stay inside the boundaries. In other scenarios only (open) data from the web will be consumed and linked with corporate data, but no data will be exposed to the world (except the fact, that data was consumed by an entity).

And of course: On many other occasions datasets and repositories will be opened up partly depending on the CCs (or similar, not yet defined attributes) and the underlying privacy regulations one wants to use.

This makes clear that LOD / Linking Open Data is just one detail of a bigger picture. Since companies (and governments) play a crucial role to develop the whole infrastructure, we need to draw a new picture that illustrates the various Linked Data Flows in a better way:

linkeddataworld

Concluding from this the best thing would be to talk about Linked Data in general and just refer to Linking Open Data in the right context. Despite better knowledge for business people the term  “open” is still associated with “free” and “dubious provenance”. And given the fact that hardly anybody has given hard evidence on the ROI of open business models the “open argument” does count little in a time of decreasing economic prosperity.

So what would be critical to get the Linked Data thing running is to provide the corresponding business and licensing models for your Linked Data strategy. But this includes having a good understanding of the assets you want to capitalize. Given the fact that metada assets are still a novel and vastly unexplored business field which so far lack a regulated supply and demand structure there are still lots of structural obstacles that hinder the uptake of Linked Data. Providing more of the same in a laissez faire mode – like TimBL critisized at this year’s Web 2.0 Summit – might be inspiring for the in-crowd, but it might not be sufficient to build a linked data business.

Sphere: Related Content

55 people enjoyed the first semantic web meetup in vienna

July 17, 2009 By: Thomas Thurner Category: Conferences & Events No Comments →

dsc_0494Yesterdays first “semantic web meetup” attracted 55 attendees to join in for presenting, talking and socialising. Approximately one year after the series of semantic web meetups started in NYC, there is now also a vital community gathering in vienna. Beside an inside view on brandnew ideas and developments of austrias semweb-labs in presenations and lightning talks, Steve Sandhouse of New York Times joined in via webmeeing to give an insight on NY-Times’s Semantic Web – efforts, which have a back-history of about 100 years now – as he explained.

In conclusion: A good start for the First Vienna Semantic Web Meetup, which may paved the way for a next meeting in the very next future. In the meanwhile some pictures of the venue to amuse those which were there and to inspire new people to join: www.meetup.com

Reblog this post [with Zemanta]
Sphere: Related Content

Session 4: Using the Web of Data [WOD-PD]

October 23, 2008 By: Jana Herwig Category: Conferences & Events, Linked Data & Open Data 2 Comments →

This morning’s first session was dedicated to Using the Web of Data, or, as Alan Dix put it: “In the end, it’s not about data – it’s about use!” Alan and Richard Cyganiak were the keynoters for this session.

Alan Dix is a Professor at the Computing Department of Lancaster University, and author (with Janet Finlay, Gregory Abowd, and Russel Beale) of Human-Computer Interaction.

To start with, Alan pointed to the two sides of achieving the web of data: Firstly generating the web of data (a billion triples, as mighty as this may sound, is actually tiny, says Alan) and then, secondly, accessing the web of data.

Alan Dix giving a talk

With regard to generating the Web of Data, Alan distinguished between top down and bottom up approaches, counting to the former the creation of the web of data from legacy sources (i.e. where you take existing data and semantically lift them, e.g. from structured data) or web scraping such as DBpedia’s extraction of data from Wikipedia.

N.B.: This notion of ‘top-down’ does not imply a hierarchical relationship, but rather means that there is already a plan for what is going to be put on the web of data (e.g. ‘all semi-structured information on Wikipedia’ or ‘dataset XY from project Z’). The bottom-up idea here implies that data is added as the result of an action, or interaction, as the user/s go, e.g. relationships are created as the user expands his or her social network. For instance on Amazon, user interaction is used to generate semantics: People do not tell Amazon what they like, they simply buy it.

Having relationships of course does not imply yet that these relationships are part of the Semantic Web. Or, as Alan put it, “why should I be RDFizing my online presence if none of my friends are?”

Please take a look at the PDF of the Alan’s slides (2,4 MB) – what I cannot reproduce here is a chart he developed, which was very useful for describing current scenarios on the web and which posed a twofold question:

Does a website/platform have the web of data implemented? YES/NO
Is the web of data on ta website/platform apparent to the user? YES/NO

The possible combinations (YES/YES, YES/NO, NO/YES, NO/NO) provide a good heuristic tool for describing what is currently available, with and without the Semantic Web. Take, for instance, the shiny interface of Talis’ Project Cenote: Cenote’s vision is to “make library data visible in many contexts, inside and outside of the library, making the data much more accessible and visible to a wider audience – benefiting current and potential users of library services wherever they are.” On Cenote, the user doesn’t see that it’s got the Web of Dat in it – it is actually implemented, but not in a way that is apparent to the user.

On the other end of the spectrum, you have a platform like Facebook: Alan referred to Facebook as “the user’s own web of data”, i.e. web of relationships: The user is aware of these relationships (they actually shape his interaction and communication with the site), and the (numerous!) apps on Facebook continually add relationships, but, regrettably, insulated from one another and not using RDF (and don’t you try to take data out of Facebook!).

Two examples of public data that Alan cited and that grow as people/institutions add data do them are Freebase (the “open database of the world’s information” – see previous posts on this blog about Freebase) and Swivel. Swivel allows people, institutions, anyone to upload and explore data, also featuring official data sources such as (links go to their Swivel pages): New York Federal Reserve Bank, UNESCO Institute for Statistics, DukeResearch or EUROSTAT. According to Alan, there is already more data on Swivel now than in the whole Linked Data cloud.

Alan also mentioned the Social Graph API – o yesterday evening Luca Hammer (one of the web 2.0 people who had joined the Open Hacking Session) introduced me to the Wordpress Plugin “Meet your commenters” – Meet you commenters uses Social Graph to find social relations on the web, and adds these data to the commenter profiles it creates in Wordpress.

Two Christmas crackersImage via WikipediaOn a different note: I took sometime today to explore Alan’s homepage and found the cute Christmas Cracker’s application which was first developed in 1999 and which is now also available on Facebook. As trivial as it may sound at first – sending virtual Christmas Crackers (with more than 5000 possible combinations!) is a good showcase for developing Human Interaction Scenarios, and a number of papers have been written about the application. Here is the casestudy which Alan recommends to begin with: Designing experience – virtual Christmas Crackers.

The abstract and a list of links to all websites and demos Alan discussed can be found here. Full reference: A. Dix and R. Cyganiak (2008). Using the Web of Data. Keynote at WOD-PD 2008 | Web of Data Practitioners Days, Vienna, Austria – Oct 22-23, 2008. http://www.hcibook.com/alan/papers/WOD-PD-2008/

Even if you have not met Richard Cyganiak in person, you have certainly come across one of his creations: The Linked Data Cloud. Richard is a research assistant at DERI Galway. In his demo, he gave us the opportunity to gain hands on experience, introducing a tool he dubbed Snorql, which is basically an easier to use version of a SPARQL-endpoint, as it already has the required prefixes ‘pre-installed’:

Using the Snorql interface, we could explore the dataset we had created collaboratively during Keith Alexander and Yves Raimond’s session. Writing SPARQL queries manually can be a challenge, but is next to impossible if you (like me) don’t know the syntax. But today we could just copy and paste all the queries from a website Richard had put up prior to his session – thanks a lot for the excellent preparation and demonstration!

Richard also showed a couple of RDF browsers in action, e.g. the Tabulator Plugin (“a Firefox extension which allows Firefox to handle data as well as documents”), or the Marbles Linked Data browser which is running right on beckr.org/marbles; enter, for instance http://api.talis.com/stores/wod-pd-sandbox/items/People/JanaHerwig (learn more about Marbles here).

Thank you, Alan and Richard – the combination of talk and demo was indeed a perfect intro towards using the Web of Data.

Reblog this post [with Zemanta]
Sphere: Related Content

Web of Data Practitioners Days, 1st Session: Tweaking Turtles [WOD-PD]

October 22, 2008 By: Jana Herwig Category: Conferences & Events, Linked Data & Open Data 7 Comments →

Good morning from Vienna:) The Web of Data Practitioners Days really kicked off with a bang today – with Michael Hausenblas doing a strip! Only to expose the Semantic Web t-shirt he wore underneath his smart suit and tie, of course, but he really got the attention of attendees at 9:15 in the morning:)

First session – Web of Data 101 by Yves Raimond and Keith Alexander – explained the implications of the move from a Web of Documents to a Web of Data: With the Semantic Web architecture, data can be made explicit on the web. Data here means not only data contained in documents, but data describing persons, cities, bands, events, finally arriving at the “Web of Things” (see also this presentation by Dave Raggett, W3C, – PDF 2,7 MB). The Web of Data wouldn’t be a Web if the data weren’t interlinked – here is an overview of the principles of Linked Data:

  • always use URIs as names for things
  • more specifically, use HTTP URIs so that people can look up those names on the web
  • when someone looks up an URI, provide useful RDF information (RDF is the data model used for data on the web of data)
  • include RDF statements that link to other URI (otherwise it wouldn’t be a web).

Please also watch out for what is already happening and is going to happen in the future on www.bbc.co.uk/music/beta. This beta site is powered by MusicBrainz, the open content music database that is also part of the Linked Data cloud. Yves is collaborating with the BBC in the Programmes ontology project, the aim of which is to provide a simple vocabulary for describing programmes.

Yves’ intro was followed by a Turtle hacking session led by Keith Alexander. Turtle is a serialisation format for RDF, i.e. a format in which you can write RDF statements. The Turtle session is documented here on Keith’s Talis website. Even though I copied and pasted most of the code, I didn’t manage to produce a piece of valid code in N3 right away (i.e. not valid according to this validator). It only worked after I had removed the statements about who I know or what I am interested in – without these connections, what remains is a bit boring, I guess. But this looks like I managed to post at least something to the test store!

EDIT: Problem was that I had terminated the statements to soon, with a dot where a semicolon should have been; the demo didn’t allow me to overwrite the first post to the store, but here is my FOAF self-description in Turtle:

@prefix foaf:<http://xmlns.com/foaf/0.1/> .
@prefix owl:<http://www.w3.org/2002/07/owl#> .
@prefix people:<http://api.talis.com/stores/wod-pd-sandbox/items/People/> .

people:JanaHerwig a foaf:Person ;
foaf:name “Jana Herwig” ;
foaf:nick “digiom” ;
foaf:homepage <http://digiom.wordpress.com> ;
owl:sameAs <http://dbtune.org/last-fm/jezobeljones> ;
foaf:knows people:MichaelHausenblas, people:YvesRaimond, people:WolfgangHalb ;
foaf:topic_interest <http://dbpedia.org/resource/Semantic_Web>, <http://dbpedia.org/resource/Web>, <http://dbpedia.org/resource/Popular_Culture>, <http://dbpedia.org/resource/Lolcat>.

Achieved with zero Semantic coding skills – the Web of Data cannot be so hard to achieve:)

EDIT: Did do the update, too – just posted my first SPARQL query to this endpoint. Are the results going to be preserved in this link? Here is the query “by foot”:

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX people: <http://api.talis.com/stores/wod-pd-sandbox/items/People/>
DESCRIBE people:JanaHerwig

Sphere: Related Content

Read this: Building Linked Data For Both Humans and Machines

October 06, 2008 By: Jana Herwig Category: Linked Data & Open Data, Literature & Publications No Comments →

Publication recommendation: W. Halb, Y. Raimond, M. Hausenblas: Building Linked Data For Both Humans and Machines. Linked Data on the Web Workshop at the 17th International World Wide Web Conference 2008 (WWW2008), Beijing, China, 2008. 8 pages, download from this page.

Sphere: Related Content

Danny Ayers: “The Semantic Web is the path of least resistance”

October 02, 2008 By: Jana Herwig Category: Conferences & Events, Linked Data & Open Data 2 Comments →

Danny AyersThe Web of Data Practitioners Days are approaching – giving me the opportunity to do an advance interview with Danny Ayers, Semantic Web evangelist, Community Platform manager at Talis, Web of Things everything (I think). I’d just like to extract two or three points here – you can read the whole interview on our website. First something that’s noteworthy to me as it says something about the patterns of technological evolution in general:

Looking back a few years, I don’t think many people working on the Web could have predicted the remarkable rise of blogging, the revival of DHTML and ancient Internet Explorer tricks such as Ajax, online social networks, Wikis, the whole Web 2.0 thing. It’s worth noting that these developments have been consistent with Tim Berners-Lee’s vision of the Web as a system in which people are the key component.

Shifting to the Semantic Web perspective, for a long time I have believed this approach is on track simply because it offers improvements to the Web for which there are no obvious alternative techniques. Personally, I was relatively late to realise what those improvements really were – moving from a Web of Documents to a more general Web of Data. Expressed like that, and looking at existing Web architecture, the Semantic Web is the path of least resistance.

Remember? AJAX, when it cropped up and caused a big buzz in 2005, was nothing new, it was just a new term for an old thing, i.e. the Internet Explorer tricks Danny mentions (see also A Brief History of AJAX: “Browser asynchronous hacks have been possible since 1996, when Internet Explorer introduced the IFRAME tag, passing through a number of techniques such as pixel gifs, Netscape layers, Microsoft Remote Scripting, Java/JavaScript gateways, stylesheet hacks, image/cookies, and most recently the XMLHttpRequest.”)

Sometimes it takes a while until someone (society, industry, what have you) starts to notice that this or that, something, could actually be useful. Sometimes technologies that everybody thinks are silly become a huge sucess – think text messages!

And sometimes you have a great (piece of) technology and it just never really catches on, and if that is the case, then mostly because some forces in the market (trusts, monopolies, corporations who force you to use their software/technology and at ridiculous price, people who would do anyhing they can to undo the natural laws of the digital world) won’t let it happen. What happend to Video 2000 and Betamax? Nixed by JVC’s licensing strategies for VHS. Just wanted to make this point before moving on to the next quote. Danny:

Regarding possible obstacles, there are many ways the Web could suffer, probably most dangerous being interventions from national governments or commercial interests, tilting the table on which we build these systems – such as software patents and threats to net neutrality. The Web works because it’s more or less the same to everyone, everywhere.

So if you think that the Web should continue to be the same to everyone, everywhere, if you would like to liaise with other people interested in the SemWeb and the Web of Data, but most importantly, if you do not know a whole lot about the SemWeb yet but would like to learn more, then please come and do attend the Web of Data Practitioners Days in Vienna, Oct 22-23.

It is going to start with a “Web of Data 101″, i.e. a low-threshold introduction given by Keith Alexander (Talis, UK) and Yves Raimond (Queen Mary University of London, UK) to Semantic Technology in the context of the Web. Here is the full program – please mind that there is a deadline for the registration also (6 Oct 2008!).

Reblog this post [with Zemanta]
Sphere: Related Content

♪♫♪No Milk Today♫♪♪ – New Ways of Finding Music for Vegans

September 11, 2008 By: Jana Herwig Category: Conferences & Events, Linked Data & Open Data, Mashups & Web services 1 Comment →

Shortly before Yves Raimond, a researcher at Queen Mary University of London with a focus on metadata for musical resources, won the 2nd prize in the Triplification Challenge, he talked to us about new ways of finding music using the infrastructure of the web of data. If you ever catch anyone again complaining about the lack of persuasive showcases of the Semantic Web, please direct them to this interview with Yves! Quote:

I think there is something quite frustrating about music recommender systems at the moment though. First, they do not explain how a particular recommendation was derived. I would really like them to tell me “I recommended this track because the harmonies are similar to other tracks you liked according to such and such criteria”. I think I would place more trust in a recommender system that actually explains recommendations, like a friend would do.

Another frustration is that we now have a really huge music-related web of data, created within the scope of the Linking Open Data project, which is not used at all by current recommender systems.

We started some work with Alexandre Passant, driven by these two frustrations. Using all these interlinked data for recommendation purposes allows us to break free from the traditional ‘information barriers’, and use all sorts of data as a basis for a musical recommendation.

For example, using the datasets currently available and interlinked on the web, you can already provide recommendations such as “You’re interested in intentional living and the Beastie Boys? Did you know that B.B. King is a vegetarian, as is Adam Yauch, who is a member of the Beastie Boys?”

Last.fm, are you listening? The full interview can be found here.

Yves is also going to be a keynote speaker at the Web of Data Practitioners Days, Oct 22-23, here in Vienna, where you’ll have the chance to discuss the issue of LOD-based music recommendation with him in greater detail.

Other highlights of the program: Web of Data 101 (interested SemWeb beginners: please attend!), an Open Hacking Session, and keynotes from Danny Ayers and Keith Alexander, Richard Cyganiak, Ansgar Scherp, Alan Dix, Leo Sauerman, Sören Auer and Tassilo Pellegrini. URL of the website is webofdata.info

Other news of the day: Physicists can’t dance, but hasthelargehadroncolliderdestroyedtheworldyet.com?

Reblog this post [with Zemanta]
Sphere: Related Content