Semantic Web Company

The Semantic Puzzle

Open World Assumptions

Interview with David Huynh: “The user interface design must inform the back-end design”

May 14, 2009 By: Andreas Blumauer Category: Linked Data & Open Data, Semantic Web Applications

Linked Data is evolving fast. A huge amount of RDF data is available and ready for exciting new applications. Unfortunately, the bottleneck is still the availability of Semantic Web user front-ends which demonstrate the power of linked data. To a certain degree, BBC Music beta is the first commercial platform which makes heavy use of linked data. With Parallax, David Huynh has shown that some of the most interesting Semantic Web applications can be built around browsing and search interfaces that offer tools for complex queries.

Andreas Blumauer from Semantic Web Company (SWC) talked with David Huynh, “Interaction Scientist” at Metaweb, the company which developed Freebase, an “open, shared database of the world’s knowledge”.

SWC: David, you have been working for MIT’s Simile Project and now for Metaweb Technologies – two “building blocks” of the Semantic Web. Could you tell us a bit about your ongoing work at Metaweb?

David: My official title at Metaweb is “Interaction Scientist,” and so my main focus is coming up with novel interaction designs for Metaweb’s platform and products, and prototyping them to some extent to evaluate their effectiveness. Parallax was one such prototype that has gathered much excitement within Metaweb and the Semantic Web community at large. And the Freebase query editor 2.0 shows my interaction designs at the other end of the spectrum – targeting developers rather than just end-users.
I’ve also learned that data-centric user interfaces and interaction designs can only be as good as the data allows them to be. So I am also dedicating some of my time toward analyzing the data we have and improving its quality so that I can design even better interactions.

Freebase Query Editor 2.0 from David Huynh on Vimeo.

SWC: With Parallax you have introduced a new way to search and explore data: Could you explain the “set-based browsing paradigm”?

David: In the browsing paradigm of the original Web, while looking at a web page, you can only click on one hyperlink to get to one other web page. But in a lot of cases, the hyperlinks on that web page can be grouped into different groups based on what they mean to the human reader: these are the links that lead to reviews, these are the links that lead to authors, these are the links that lead to vendors, etc.
Now if the computer actually knows what these links mean, then you can tell it to follow several of those links that mean the same thing: follow all the links that lead to authors. Think of it as powered browsing: the computer does the work of following several similar browsing paths at the same time – going from a set of things (web pages or data entries) to a similarly related set of things – and making all of that information available for your perusal in one shot. It is a paradigm shift compared to how we browse the Web today. And it’s only possible when the computer is capable of telling which link is similar to which other link. And that capability, in turn, will be made possible by the Data Web.
(See this unpublished paper which goes into depth about this concept)
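
The idea David describes can be illustrated with a small sketch. This is not Parallax code and does not use the Freebase API; the toy triples, the link names and the helper functions `build_index` and `follow` are all assumptions made up for illustration. It only shows the core move: starting from a set of things and following every link of one type at once to arrive at a related set.

```python
from collections import defaultdict

# Hypothetical toy data: (subject, link type, object) triples.
# None of these names come from Freebase or Parallax.
TRIPLES = [
    ("Book A", "author", "Alice"),
    ("Book A", "vendor", "Shop 1"),
    ("Book B", "author", "Bob"),
    ("Book B", "author", "Alice"),
    ("Book B", "review", "Review 7"),
    ("Alice", "born_in", "Vienna"),
    ("Bob", "born_in", "Boston"),
]


def build_index(triples):
    """Map each (subject, link type) pair to the set of objects it links to."""
    index = defaultdict(set)
    for subject, link, obj in triples:
        index[(subject, link)].add(obj)
    return index


def follow(index, items, link):
    """Follow all links of one type from every item in a set.

    Returns the union of the targets, i.e. the related set of things.
    """
    result = set()
    for item in items:
        result |= index.get((item, link), set())
    return result


index = build_index(TRIPLES)

# One "click" takes us from a set of books to the set of their authors,
# a second one from those authors to the set of their birthplaces.
books = {"Book A", "Book B"}
authors = follow(index, books, "author")
birthplaces = follow(index, authors, "born_in")

print(sorted(authors))
print(sorted(birthplaces))
```

The sketch only works because every link carries a type ("author", "vendor", "review"); on a page of untyped hyperlinks there would be nothing for `follow` to select on, which is the point David makes about the Data Web.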

SWC: Linked Data is evolving fast. A huge amount of RDF data is available and ready for exciting new applications. Unfortunately, one bottleneck is still the availability of Semantic Web user front-ends which demonstrate the power of linked data. Do you think that the Semantic Web is more of a server technology than an end-user experience?

David: I have never thought of the Semantic Web as either a server technology or an end-user experience. I only care about usefulness, and then a matching amount of usability to make that usefulness accessible to people, especially those without Computer Science expertise.
I find that it’s so much easier to explain to people and get them excited about “immediate, personal, local benefits” of a particular technology than about “long-term, communal, global benefits” of a vision. For most people, the former must be experienced and felt often before the latter can appear vaguely appealing enough to call for action. I’m lazy – I don’t like to spend effort convincing people of visions; I only want to entice people into using the tools that I have created.
So if Parallax is considered a success, it is so not just because of its technologies and research contributions, but also because the accompanying screencast explained it in a way that people who cared nothing about the Semantic Web could understand why Parallax would be useful to them. This was achieved by pointing out limitations of existing web technologies as already experienced and understood by a lot of web users, and then illustrating concretely a possible solution enabled by data web technologies.
Perhaps I could venture further and say that the dichotomy of server technologies and end-user experience is what’s holding back Semantic Web user interface efforts. For those who don’t have expertise in design, it is a comfort to think that once the back-end technologies are solid, then it’s just a matter of putting on some polish, a.k.a. user interfaces from their point of view, to make the whole package appealing. This approach is wrong. The user interface design must inform the back-end design. Otherwise, the user interface will almost always reflect the internal system model, and that’s usually very dissonant with how users think and behave. Recall all the Semantic Web interfaces you have seen that force users to think in terms of triples or of raw URIs. Those were made by starting from the data model, not from user needs.

SWC: Quite often I hear people saying: Where is the Semantic Web? – I still can’t “see” it! How could the linking open data community make use of user interfaces such as Exhibit, Piggy Bank or Parallax? Is the set-based browsing paradigm a universal way to browse linked data or just one possible way?

David: My research prototypes embody a number of UI ideas that are quite transferable to other platforms. Most of my code is open source, too. This, by the way, is rarer than it should be: research prototypes often fall apart as soon as, or even sooner than, the relevant research papers get presented at conferences, and research code rots rather than gets offered free for reuse. This is sad, because reusable data needs reusable code to proliferate even more widely, but there is no reward system for making research code reusable, or for keeping research prototypes running. So perhaps people can’t “see” the Semantic Web because research prototypes are not presented in appealing and comprehensible ways, and they break down and disappear too quickly.
Regarding the set-based browsing paradigm, it is most certainly not the only way to browse linked data. It is just the first good one that came to my mind, around 2005. But it wasn’t until 2008 that I actually got around to implementing it for real. One of the most important factors in its feasibility is the quality of data in Freebase, compared to other data sources that I had access to. Even the simple fact that a lot of Freebase topics have images makes Parallax look a lot more interesting and useful. People like to see pictures rather than raw URIs. And the diversity of types of data helps illustrate the browsing paradigm of Parallax – that ability to shift focus from one set of things to another set of things, even across seemingly unrelated domains of information, such as from politicians to their celebrity friends in the movie industry.
So, perhaps one of the main challenges in adopting Parallax ideas on any arbitrary RDF data set is curating the data sufficiently for the purpose of presenting it. In fact, if you don’t know how some data is to be presented and used, there’s no way for you to determine if that data is of sufficient quality. User needs and interface designs drive back-end implementation and data curation, not the other way around. It’s a simple idea, really, but it can be hard to adopt if one is fixated on data alone.

SWC: Do you plan new versions of Parallax? When will it become part of Freebase or of even more Linked Data Sources?

David: I’ve done a few further experiments with the ideas in Parallax, but they are not ready for public use yet. Freebase data makes my job much easier by allowing me to focus mostly on interaction designs rather than on data quality, or rather, on fighting the lack of data quality, for the purpose of presenting it. So I’ll start with Freebase data and we’ll see where it takes me.

SWC: What else are you working on at the moment?

David: As mentioned briefly earlier, reusable data needs reusable code to proliferate widely. That gives you a hint at an effort that I’m involved with.

SWC: Many thanks, David!

About David François Huynh

A good data browser allows you to navigate the knowledge space by car

August 20, 2008 By: Jana Herwig Category: Search Engines, Tools & Software

Or so I would like to paraphrase David Huynh’s words that I read today on the W3C’s Semantic Web mailing list, where he wrote in response to Michiel Hildebrand:

It’s very perceptive of you to ask about the tasks that Parallax is presumed to address, and who the users are. I don’t have a specific answer beside “browsing graph of data more efficiently”.

I tend to think that contemporary graph-based data browsers either fly the user at 50,000 feet and show her the whole world in one window below (render a huge data graph as a huge visual graph), or leave her at the street level to wander around on foot (single resource view). I’m just wishing to provide her a car. Perhaps the good thing is that the car doesn’t come with a destination built in. (It’d be quite bad in real life if you need different cars to go grocery shopping and to go to work, for example.)

I quite like this metaphor he uses to describe the motivation behind Parallax, the UI prototype David designed as a novel way to browse Freebase data. It also ties in nicely with a wish made by Richard Cyganiak in an interview with him we published yesterday:

On the top of my wish list would be a really good data browser. The current crop of data browsers for RDF, such as Tabulator, Disco and the OpenLink browser, are still very basic and geeky. I hope for some sort of “Excel for Web data”, an application that allows me to browse through different datasets, find the bits that are relevant to my problem, and lets me slice and dice and correlate the data in different ways. I think such an app would be key to the kind of serendipitous reuse I mentioned earlier.

In the mailing list post cited above, David pointed to the Spellbound blog, where Jeanne Kramer-Smyth published a showcase of faceted browsing across Olympic Games facts using Freebase Parallax and suggested that Parallax would be particularly useful for exploring connected information:

Now take this idea to the world of archives and libraries, OPACs and finding aids and imagine the sorts of questions you can start asking. Yes – it does depend on the data being connected, but that is happening more and more all the time. The promise of the semantic web is structured data everywhere we turn.

Image from Wiki Commons

Freebase Parallax: Browsing ad infinitum

August 18, 2008 By: Andreas Blumauer Category: Linked Data & Open Data, Search Engines, Tools & Software

With David Huynh’s Freebase Parallax, an inspiring new user interface has come out, which conveys a sense of a future where googling isn’t the ultimate way to find information on the web. David was also strongly involved in some projects of CSAIL at MIT which also dealt with the “simple” question of making the (web of) data more accessible for users who aren’t aware of SPARQL, SQL or OLAP cubes. For instance, Exhibit has become a widely adopted environment for faceted search within a given dataset. A bit more sophisticated is the nested faceted browser – and now there is Parallax, too.

In his screencast about this novel browsing interface, David stresses the advantages over Google or Wikipedia. Is this a fair competition? What is it about Parallax that makes it a choice interface and when?

  1. If you want to learn a little more about Abraham Lincoln – go to Wikipedia!
  2. If you want to know where you can find even more information about Abraham Lincoln – go to Google!
  3. If you know a bit about Abraham Lincoln already, and you want to aggregate or compare some facts of his life to that of other presidents or if you want to visualise data on a time-line or on a map – learn how to use a tool like Parallax (and don’t complain if it isn’t as simple as Google anymore)!

In some respects, Parallax is the building block that has been missing in the web universe: (professional) fact finding on the web could work like this in the near future (although Parallax needs some more servers and – indeed – more data in its database).

But what if Parallax became the graph-based UI on top of Freebase + LinkingOpenData?

Results for Shaka Zulu on Parallax
Refined search results for “Shaka” on Parallax
