Semantic Web Company

The Semantic Puzzle

Open World Assumptions

subscribe RSS

Interview with Juan Sequeda: “I believe Linked Data will enable new killer apps that are only possible thanks to Linked Data.”

April 14, 2010 By: Tassilo Pellegrini Category: Calls & Competitions, Linked Data & Open Data, Semantic Web Applications 1 Comment →

Juan Sequeda, co-chair of the Triplification Challenge 2010 and one of the core figures in the Linked Data movement, gives us his view how the Semantic Web might evolve. His central message: “Once there is an incentive to create quality links, these links will start to show up. And then users will start linking to the data hubs of their interest.”

Linked Data itself has grabbed a lot of attention inside the Semantic Web community recently. But what about the outside perspective? Could linked data be called the killer app for the Semantic Web?

I foresee two things happening with Linked Data. One is from the web development perspective (the so-called Web 2.0 developers) and the other is from the enterprise perspective. The web development community will sooner than later realize that Linked Data will enable easy integration of data and therefore will ease the pain of consuming data from different data sources. Thanks to big organizations such as BBC, New York Times, Reuters, Best Buy, etc. web developers will start paying attention to this “new thing” called Linked Data.

What we need is that the inside Semantic Web community starts to create applications on top of current Linked Data so when the outside web development community starts to pay attention, they have something to chew on. We (the semantic web community) needs to start speaking the web development language. There is still a big gap. I have had personal experiences with people in the web development community who think that RDF is XML and because they hate XML, they will never consider it. This is false and this is something that we need to change.

From the enterprise perspective, Linked Data is another data integration solution. Data integration has been a problem since day one of relational databases. I believe enterprises will be open to consider new solutions with new technologies. I’m hoping to see new startups tackling the enterprise domain. Imagine being able to query “get all my clients from cities whose population is greater than 1 million” even though I don’t have the data about population of cities in my database.

Is Linked Data the killer app for the Semantic Web? Before I answer that, I would like to ask, what was the killer app of the Web? Was it the browser? Was it e-commerce? Was it search? Was it Amazon or Ebay or Google? I believe Linked Data will enable new killer apps, apps that are only possible thanks to Linked Data. The browser was only possible because of HTML. So let’s ask ourselves what is possible because of Linked Data, and there we will find our killer app.

One of the core deficiencies of the young open data cloud is the little amount of interlinks between datasets. Is it just a matter of time to overcome this or are there other measures needed to turn the existing datasets into a true giant global graph?

I like to remind myself that this new wave of semantic web technologies is an extension of the current web. Therefore we should analyze how the web evolved in the beginning. Initially, everything were a bunch of documents on the web in which people manually created links to other documents. When Google started, it created an incentive to offer quality links between documents. This also created data hubs. If you write a blog post about a book, most probably you will link to the web document of that book either on Amazon and/or Wikipedia. I believe that this will happen with Linked Data. Once there is an incentive to create quality links, these links will start to show up. And then users will start linking to the data hubs of their interest.

Open Governmental Data is a big issue at the moment. The US and UK government have started to apply Linked Data principles to turn this vision into reality. Lots of other countries are following. What do you expect from this trend?

I believe that Linked Data will take off thanks to the initiative of governments. We always talk about the chicken and egg problem of the semantic web. Once we have organizations that don’t even think about it and are just interested in putting their data on the web, the semantic web will start to grow. If Bookstore ABC puts their data on the web, it may not be so meaningful. But if the US and UK government puts their data on the web, following the Linked Data principles, then people can wake up and say “ok, so this is for real. Let me start paying attention to this”.

You are one of the chairs of the Triplification Challenge 2010. Can you give us a brief insight what to expect from this year’s challenge? What are the conditions to participate?

The Triplification Challenge this year has grown and is very exciting. For the first time, it is offering two different tracks.

The first track, the Open Track will accept submissions on three areas 1) new datasets that are published following the Linked Data principles and that show potential benefit, 2) generic methods, mechanisms and approaches of creating Linked Data from legacy datasets and 3) applications that make use of Linked Data.

The second track is the New York Times track which will accept submissions of applications that make use of the New York Times Linked Data and one or more government dataset. The objective is to create an application powered by Linked Data that would be of interest to any constituent of that government.

I personally believe that the year 2010 is the year of creating Linked Data applications and the Triplification Challenge is the way to be part of it.

Sphere: Related Content

Interview with Georgi Kobilarov: “I believe that data publishing must happen in a distributed style.”

March 26, 2010 By: Tassilo Pellegrini Category: Linked Data & Open Data, Mashups & Web services, Semantic Web Applications, Tools & Software 1 Comment →

Uberblic.org connects structured data from the web. The Berlin-based inventor Georgi Kobilarov gives a brief insight into the mashup service and talks about the challenges when it comes to build applications upon linked data.

You have recently published the service uberblic.org, a Linked Data mashup editor. What was your motivation to develop this tool?

Uberblic.org provides an integrated view of web data. Our goal is to integrate all the structured data on the web, and give web-developers a single point to access to that reconciled data. More than that, we will open up the tools we use to manage the data sources to the community, so that the people can help us curating that repository of free data. We re-publish all the data we import as Linked Data, under the licenses of the original data publishers.

Some of the data sources we import are available in the Linked Open Data cloud as well, but many are not. Linked Data is an elegant way to publish data in a distributed way on the web, but consuming it from that distributed cloud is – at least – impractical. In every real-world application using linked data from the web I’ve seen, organizations built up internal copies of the cloud, and often even reconcile linked data sources. They build their own Linked Data proxies. Uberblic.org helps those users by providing one public proxy for data from the web. Many of our sources get monitored for data changes, and the according data in uberblic is updated in real-time.

uberblic

Can you give us a brief insight how the tool works? What technology is is built on?

My company, Uberblic Labs, has developed a data integration platform that we use to power uberblic.org. We call it the Uberblic Platform (the name uberblic is derived from the German “Überblick” – English “overview”). This platform enables us to do the full process of “data fusion”: Importing and converting external data sources, mapping the data schemas to a central ontology, filtering out data errors, automatically suggesting duplicates to the user, and merging data from different sources into a single, reconciled representation.

Structured and semi-structured data from the web is an excellent use case for our software platform, since there we come across all the interesting cases of real-world data heterogeneity. But what I think is especially powerful and yet missing in other Linked Data projects I know, is the ability to subscribe to update-feeds. We do that extensively, fetching updates in real-time from Wikipedia and the like.

Our platform is built in Scala and runs a on cluster of machines, with workers communicating through a messaging system. We developed an RDF storage layer on top of a distributed key-values store for storing all provenance information used in the extraction process, currently around 100 million named graphs for uberblic.org. That storage layer does not directly provide SPARQL access, so we push all the output data into a SPARQL endpoint hosted by Talis as well.

What have been the biggest challenges in tackling the integration issues of dispersed data?

It was quite a steep learning curve to do Linked Data not only in an academic environment, but in a reliable, industry-strength set-up. In academia, there was always the excuse that things are just research prototypes. Now that excuse is gone. That’s also where it becomes necessary to manually clean up data. And there are two ways to do that: Either you enable the users to change facts directly in your repository after you have imported the external data (that is what Freebase does), or you facilitate clean-up cycles in the original data source and fetch these updates in real-time. That is what we do.

I believe that data publishing must happen in a distributed style, because then each data source gets taken care of by a specialized group of people using specialized tools. And it’s what you see not only on the web, but also inside organizations and enterprises. But consuming data trough centralized APIs is more than just convenient. We all use Google
or another search engine as a central access point to web pages which are published in a distributed way all over the web, don’t we? Can you imagine today researching a topic on the web without the centralization power of search engines, just by following links across web sites, like in the old days?

When we built the Uberblic Platform, some of the things I imagined to be large headaches, like schema mapping, turned out to work really well. Those pathologic cases you often see in academic “challenges” are – well – pathologic. It’s not necessary to solve them fully automatically through super-intelligent algorithms. Much more important than the sophistication of your algorithms are well designed workflows so that the user becomes a part of the solution. And that’s not about crowd-sourcing or swarm intelligence, the editorial curating of schema mappings and object reconciliation can be done just by a small team of people. If they have the right set of tools.

What are the next plans with uberblic.org? Where will the journey go?

Uberblic.org will continue to integrate more interesting and useful data sources from the web, and we will start making more APIs available to web developers to build their applications on top. We are also looking for partners who are interested in developing applications and have been struggling in the past to get the cross-source data from the web they need.

The work on improving uberblic.org will also benefit our Uberblic Platform, and hence our clients who use that same software for integrating organizational data sources with each other and with the web of data.

About Georgi Kobilarov

Georgi is founder and managing director of Uberblic Labs, a company based in Berlin specialized in Linked Data integration. He worked as a research associate in the Web-based Systems Group at Freie Universität Berlin and as a visiting researcher at Hewlett Packard Labs Bristol. As co-founder and lead developer of DBpedia, he was also a day-one contributor to the Linking Open Data project. Georgi is consulting with the BBC on several Linked Data related projects. He organizes the Web of Data Meetup London, a bi-yearly gathering of the UK Linked Data community. Georgi graduated with a Diplom in business administration from Freie Universität Berlin and has many years of work experience as a software developer. Visit his blog: http://blog.georgikobilarov.com

Sphere: Related Content

Linked Data Flows: A new picture to illustrate the “openness” we mean

October 28, 2009 By: Tassilo Pellegrini Category: Corporate Semantic Web, Linked Data & Open Data 1 Comment →

(Original post taken from “About the Social Semantic Web“)

A lot of activities around Linking Open Data (“LOD”) and the associated data sets which are nicely visualised as a “cloud” are going on for quite a while now. It is exciting to see how the rather academic “Semantic Web” and all the work which is associated with this disruptive technology can be transformed now into real business use cases.

What I have observed in the last few months, especially in business communities, is the following:

  • “Linked Data” sounds interesting for the business people because the phrase creates a lot of associations in a second or two; also the database crowd seems to be attracted by this web-based approach of data integration
  • “Web of Data” is somehow misleading because many people think that this will be a new web which replaces something else. Same story with the “Semantic Web”
  • “Linking Open Data” sounds dangerous and not trustworthy to many companies

For insiders it is clear, that the “openness” of data, especially in commercial settings, can be controlled and has to be controlled in many cases i.e. by defining the right licensing models. But here we are still at the beginning as a workshop at ISWC 2009 has illustrated.

Anyway, looking at the characteristics of Linked Data Flows, they can be one-way or mutual. In some cases data from companies will be put into the cloud, and can be opened up for many purposes, in other use cases it will stay inside the boundaries. In other scenarios only (open) data from the web will be consumed and linked with corporate data, but no data will be exposed to the world (except the fact, that data was consumed by an entity).

And of course: On many other occasions datasets and repositories will be opened up partly depending on the CCs (or similar, not yet defined attributes) and the underlying privacy regulations one wants to use.

This makes clear that LOD / Linking Open Data is just one detail of a bigger picture. Since companies (and governments) play a crucial role to develop the whole infrastructure, we need to draw a new picture that illustrates the various Linked Data Flows in a better way:

linkeddataworld

Concluding from this the best thing would be to talk about Linked Data in general and just refer to Linking Open Data in the right context. Despite better knowledge for business people the term  “open” is still associated with “free” and “dubious provenance”. And given the fact that hardly anybody has given hard evidence on the ROI of open business models the “open argument” does count little in a time of decreasing economic prosperity.

So what would be critical to get the Linked Data thing running is to provide the corresponding business and licensing models for your Linked Data strategy. But this includes having a good understanding of the assets you want to capitalize. Given the fact that metada assets are still a novel and vastly unexplored business field which so far lack a regulated supply and demand structure there are still lots of structural obstacles that hinder the uptake of Linked Data. Providing more of the same in a laissez faire mode – like TimBL critisized at this year’s Web 2.0 Summit – might be inspiring for the in-crowd, but it might not be sufficient to build a linked data business.

Sphere: Related Content

55 people enjoyed the first semantic web meetup in vienna

July 17, 2009 By: Thomas Thurner Category: Conferences & Events No Comments →

dsc_0494Yesterdays first “semantic web meetup” attracted 55 attendees to join in for presenting, talking and socialising. Approximately one year after the series of semantic web meetups started in NYC, there is now also a vital community gathering in vienna. Beside an inside view on brandnew ideas and developments of austrias semweb-labs in presenations and lightning talks, Steve Sandhouse of New York Times joined in via webmeeing to give an insight on NY-Times’s Semantic Web – efforts, which have a back-history of about 100 years now – as he explained.

In conclusion: A good start for the First Vienna Semantic Web Meetup, which may paved the way for a next meeting in the very next future. In the meanwhile some pictures of the venue to amuse those which were there and to inspire new people to join: www.meetup.com

Reblog this post [with Zemanta]
Sphere: Related Content

Session 4: Using the Web of Data [WOD-PD]

October 23, 2008 By: Jana Herwig Category: Conferences & Events, Linked Data & Open Data 2 Comments →

This morning’s first session was dedicated to Using the Web of Data, or, as Alan Dix put it: “In the end, it’s not about data – it’s about use!” Alan and Richard Cyganiak were the keynoters for this session.

Alan Dix is a Professor at the Computing Department of Lancaster University, and author (with Janet Finlay, Gregory Abowd, and Russel Beale) of Human-Computer Interaction.

To start with, Alan pointed to the two sides of achieving the web of data: Firstly generating the web of data (a billion triples, as mighty as this may sound, is actually tiny, says Alan) and then, secondly, accessing the web of data.

Alan Dix giving a talk

With regard to generating the Web of Data, Alan distinguished between top down and bottom up approaches, counting to the former the creation of the web of data from legacy sources (i.e. where you take existing data and semantically lift them, e.g. from structured data) or web scraping such as DBpedia’s extraction of data from Wikipedia.

N.B.: This notion of ‘top-down’ does not imply a hierarchical relationship, but rather means that there is already a plan for what is going to be put on the web of data (e.g. ‘all semi-structured information on Wikipedia’ or ‘dataset XY from project Z’). The bottom-up idea here implies that data is added as the result of an action, or interaction, as the user/s go, e.g. relationships are created as the user expands his or her social network. For instance on Amazon, user interaction is used to generate semantics: People do not tell Amazon what they like, they simply buy it.

Having relationships of course does not imply yet that these relationships are part of the Semantic Web. Or, as Alan put it, “why should I be RDFizing my online presence if none of my friends are?”

Please take a look at the PDF of the Alan’s slides (2,4 MB) – what I cannot reproduce here is a chart he developed, which was very useful for describing current scenarios on the web and which posed a twofold question:

Does a website/platform have the web of data implemented? YES/NO
Is the web of data on ta website/platform apparent to the user? YES/NO

The possible combinations (YES/YES, YES/NO, NO/YES, NO/NO) provide a good heuristic tool for describing what is currently available, with and without the Semantic Web. Take, for instance, the shiny interface of Talis’ Project Cenote: Cenote’s vision is to “make library data visible in many contexts, inside and outside of the library, making the data much more accessible and visible to a wider audience – benefiting current and potential users of library services wherever they are.” On Cenote, the user doesn’t see that it’s got the Web of Dat in it – it is actually implemented, but not in a way that is apparent to the user.

On the other end of the spectrum, you have a platform like Facebook: Alan referred to Facebook as “the user’s own web of data”, i.e. web of relationships: The user is aware of these relationships (they actually shape his interaction and communication with the site), and the (numerous!) apps on Facebook continually add relationships, but, regrettably, insulated from one another and not using RDF (and don’t you try to take data out of Facebook!).

Two examples of public data that Alan cited and that grow as people/institutions add data do them are Freebase (the “open database of the world’s information” – see previous posts on this blog about Freebase) and Swivel. Swivel allows people, institutions, anyone to upload and explore data, also featuring official data sources such as (links go to their Swivel pages): New York Federal Reserve Bank, UNESCO Institute for Statistics, DukeResearch or EUROSTAT. According to Alan, there is already more data on Swivel now than in the whole Linked Data cloud.

Alan also mentioned the Social Graph API – o yesterday evening Luca Hammer (one of the web 2.0 people who had joined the Open Hacking Session) introduced me to the Wordpress Plugin “Meet your commenters” – Meet you commenters uses Social Graph to find social relations on the web, and adds these data to the commenter profiles it creates in Wordpress.

Two Christmas crackersImage via WikipediaOn a different note: I took sometime today to explore Alan’s homepage and found the cute Christmas Cracker’s application which was first developed in 1999 and which is now also available on Facebook. As trivial as it may sound at first – sending virtual Christmas Crackers (with more than 5000 possible combinations!) is a good showcase for developing Human Interaction Scenarios, and a number of papers have been written about the application. Here is the casestudy which Alan recommends to begin with: Designing experience – virtual Christmas Crackers.

The abstract and a list of links to all websites and demos Alan discussed can be found here. Full reference: A. Dix and R. Cyganiak (2008). Using the Web of Data. Keynote at WOD-PD 2008 | Web of Data Practitioners Days, Vienna, Austria – Oct 22-23, 2008. http://www.hcibook.com/alan/papers/WOD-PD-2008/

Even if you have not met Richard Cyganiak in person, you have certainly come across one of his creations: The Linked Data Cloud. Richard is a research assistant at DERI Galway. In his demo, he gave us the opportunity to gain hands on experience, introducing a tool he dubbed Snorql, which is basically an easier to use version of a SPARQL-endpoint, as it already has the required prefixes ‘pre-installed’:

Using the Snorql interface, we could explore the dataset we had created collaboratively during Keith Alexander and Yves Raimond’s session. Writing SPARQL queries manually can be a challenge, but is next to impossible if you (like me) don’t know the syntax. But today we could just copy and paste all the queries from a website Richard had put up prior to his session – thanks a lot for the excellent preparation and demonstration!

Richard also showed a couple of RDF browsers in action, e.g. the Tabulator Plugin (“a Firefox extension which allows Firefox to handle data as well as documents”), or the Marbles Linked Data browser which is running right on beckr.org/marbles; enter, for instance http://api.talis.com/stores/wod-pd-sandbox/items/People/JanaHerwig (learn more about Marbles here).

Thank you, Alan and Richard – the combination of talk and demo was indeed a perfect intro towards using the Web of Data.

Reblog this post [with Zemanta]
Sphere: Related Content

Web of Data Practitioners Days, 1st Session: Tweaking Turtles [WOD-PD]

October 22, 2008 By: Jana Herwig Category: Conferences & Events, Linked Data & Open Data 7 Comments →

Good morning from Vienna:) The Web of Data Practitioners Days really kicked off with a bang today – with Michael Hausenblas doing a strip! Only to expose the Semantic Web t-shirt he wore underneath his smart suit and tie, of course, but he really got the attention of attendees at 9:15 in the morning:)

First session – Web of Data 101 by Yves Raimond and Keith Alexander – explained the implications of the move from a Web of Documents to a Web of Data: With the Semantic Web architecture, data can be made explicit on the web. Data here means not only data contained in documents, but data describing persons, cities, bands, events, finally arriving at the “Web of Things” (see also this presentation by Dave Raggett, W3C, – PDF 2,7 MB). The Web of Data wouldn’t be a Web if the data weren’t interlinked – here is an overview of the principles of Linked Data:

  • always use URIs as names for things
  • more specifically, use HTTP URIs so that people can look up those names on the web
  • when someone looks up an URI, provide useful RDF information (RDF is the data model used for data on the web of data)
  • include RDF statements that link to other URI (otherwise it wouldn’t be a web).

Please also watch out for what is already happening and is going to happen in the future on www.bbc.co.uk/music/beta. This beta site is powered by MusicBrainz, the open content music database that is also part of the Linked Data cloud. Yves is collaborating with the BBC in the Programmes ontology project, the aim of which is to provide a simple vocabulary for describing programmes.

Yves’ intro was followed by a Turtle hacking session led by Keith Alexander. Turtle is a serialisation format for RDF, i.e. a format in which you can write RDF statements. The Turtle session is documented here on Keith’s Talis website. Even though I copied and pasted most of the code, I didn’t manage to produce a piece of valid code in N3 right away (i.e. not valid according to this validator). It only worked after I had removed the statements about who I know or what I am interested in – without these connections, what remains is a bit boring, I guess. But this looks like I managed to post at least something to the test store!

EDIT: Problem was that I had terminated the statements to soon, with a dot where a semicolon should have been; the demo didn’t allow me to overwrite the first post to the store, but here is my FOAF self-description in Turtle:

@prefix foaf:<http://xmlns.com/foaf/0.1/> .
@prefix owl:<http://www.w3.org/2002/07/owl#> .
@prefix people:<http://api.talis.com/stores/wod-pd-sandbox/items/People/> .

people:JanaHerwig a foaf:Person ;
foaf:name “Jana Herwig” ;
foaf:nick “digiom” ;
foaf:homepage <http://digiom.wordpress.com> ;
owl:sameAs <http://dbtune.org/last-fm/jezobeljones> ;
foaf:knows people:MichaelHausenblas, people:YvesRaimond, people:WolfgangHalb ;
foaf:topic_interest <http://dbpedia.org/resource/Semantic_Web>, <http://dbpedia.org/resource/Web>, <http://dbpedia.org/resource/Popular_Culture>, <http://dbpedia.org/resource/Lolcat>.

Achieved with zero Semantic coding skills – the Web of Data cannot be so hard to achieve:)

EDIT: Did do the update, too – just posted my first SPARQL query to this endpoint. Are the results going to be preserved in this link? Here is the query “by foot”:

PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX people: <http://api.talis.com/stores/wod-pd-sandbox/items/People/>
DESCRIBE people:JanaHerwig

Sphere: Related Content

Read this: Building Linked Data For Both Humans and Machines

October 06, 2008 By: Jana Herwig Category: Linked Data & Open Data, Literature & Publications No Comments →

Publication recommendation: W. Halb, Y. Raimond, M. Hausenblas: Building Linked Data For Both Humans and Machines. Linked Data on the Web Workshop at the 17th International World Wide Web Conference 2008 (WWW2008), Beijing, China, 2008. 8 pages, download from this page.

Sphere: Related Content

Danny Ayers: “The Semantic Web is the path of least resistance”

October 02, 2008 By: Jana Herwig Category: Conferences & Events, Linked Data & Open Data 2 Comments →

Danny AyersThe Web of Data Practitioners Days are approaching – giving me the opportunity to do an advance interview with Danny Ayers, Semantic Web evangelist, Community Platform manager at Talis, Web of Things everything (I think). I’d just like to extract two or three points here – you can read the whole interview on our website. First something that’s noteworthy to me as it says something about the patterns of technological evolution in general:

Looking back a few years, I don’t think many people working on the Web could have predicted the remarkable rise of blogging, the revival of DHTML and ancient Internet Explorer tricks such as Ajax, online social networks, Wikis, the whole Web 2.0 thing. It’s worth noting that these developments have been consistent with Tim Berners-Lee’s vision of the Web as a system in which people are the key component.

Shifting to the Semantic Web perspective, for a long time I have believed this approach is on track simply because it offers improvements to the Web for which there are no obvious alternative techniques. Personally, I was relatively late to realise what those improvements really were – moving from a Web of Documents to a more general Web of Data. Expressed like that, and looking at existing Web architecture, the Semantic Web is the path of least resistance.

Remember? AJAX, when it cropped up and caused a big buzz in 2005, was nothing new, it was just a new term for an old thing, i.e. the Internet Explorer tricks Danny mentions (see also A Brief History of AJAX: “Browser asynchronous hacks have been possible since 1996, when Internet Explorer introduced the IFRAME tag, passing through a number of techniques such as pixel gifs, Netscape layers, Microsoft Remote Scripting, Java/JavaScript gateways, stylesheet hacks, image/cookies, and most recently the XMLHttpRequest.”)

Sometimes it takes a while until someone (society, industry, what have you) starts to notice that this or that, something, could actually be useful. Sometimes technologies that everybody thinks are silly become a huge sucess – think text messages!

And sometimes you have a great (piece of) technology and it just never really catches on, and if that is the case, then mostly because some forces in the market (trusts, monopolies, corporations who force you to use their software/technology and at ridiculous price, people who would do anyhing they can to undo the natural laws of the digital world) won’t let it happen. What happend to Video 2000 and Betamax? Nixed by JVC’s licensing strategies for VHS. Just wanted to make this point before moving on to the next quote. Danny:

Regarding possible obstacles, there are many ways the Web could suffer, probably most dangerous being interventions from national governments or commercial interests, tilting the table on which we build these systems – such as software patents and threats to net neutrality. The Web works because it’s more or less the same to everyone, everywhere.

So if you think that the Web should continue to be the same to everyone, everywhere, if you would like to liaise with other people interested in the SemWeb and the Web of Data, but most importantly, if you do not know a whole lot about the SemWeb yet but would like to learn more, then please come and do attend the Web of Data Practitioners Days in Vienna, Oct 22-23.

It is going to start with a “Web of Data 101″, i.e. a low-threshold introduction given by Keith Alexander (Talis, UK) and Yves Raimond (Queen Mary University of London, UK) to Semantic Technology in the context of the Web. Here is the full program – please mind that there is a deadline for the registration also (6 Oct 2008!).

Reblog this post [with Zemanta]
Sphere: Related Content

♪♫♪No Milk Today♫♪♪ – New Ways of Finding Music for Vegans

September 11, 2008 By: Jana Herwig Category: Conferences & Events, Linked Data & Open Data, Mashups & Web services 1 Comment →

Shortly before Yves Raimond, a researcher at Queen Mary University of London with a focus on metadata for musical resources, won the 2nd prize in the Triplification Challenge, he talked to us about new ways of finding music using the infrastructure of the web of data. If you ever catch anyone again complaining about the lack of persuasive showcases of the Semantic Web, please direct them to this interview with Yves! Quote:

I think there is something quite frustrating about music recommender systems at the moment though. First, they do not explain how a particular recommendation was derived. I would really like them to tell me “I recommended this track because the harmonies are similar to other tracks you liked according to such and such criteria”. I think I would place more trust in a recommender system that actually explains recommendations, like a friend would do.

Another frustration is that we now have a really huge music-related web of data, created within the scope of the Linking Open Data project, which is not used at all by current recommender systems.

We started some work with Alexandre Passant, driven by these two frustrations. Using all these interlinked data for recommendation purposes allows us to break free from the traditional ‘information barriers’, and use all sorts of data as a basis for a musical recommendation.

For example, using the datasets currently available and interlinked on the web, you can already provide recommendations such as “You’re interested in intentional living and the Beastie Boys? Did you know that B.B. King is a vegetarian, as is Adam Yauch, who is a member of the Beastie Boys?”

Last.fm, are you listening? The full interview can be found here.

Yves is also going to be a keynote speaker at the Web of Data Practitioners Days, Oct 22-23, here in Vienna, where you’ll have the chance to discuss the issue of LOD-based music recommendation with him in greater detail.

Other highlights of the program: Web of Data 101 (interested SemWeb beginners: please attend!), an Open Hacking Session, and keynotes from Danny Ayers and Keith Alexander, Richard Cyganiak, Ansgar Scherp, Alan Dix, Leo Sauerman, Sören Auer and Tassilo Pellegrini. URL of the website is webofdata.info

Other news of the day: Physicists can’t dance, but hasthelargehadroncolliderdestroyedtheworldyet.com?

Reblog this post [with Zemanta]
Sphere: Related Content