Semantic Web Company

The Semantic Puzzle

Open World Assumptions

subscribe RSS

I-Semantics 2010: Relevance of semantic technologies for industry increases fast

July 01, 2010 By: Andreas Blumauer Category: Calls & Competitions, Conferences & Events, Corporate Semantic Web, Linked Data & Open Data 1 Comment →

I-Semantics 2010

I-Semantics will take place for the 6th time this year in September and it will be co-located again with I-Know in Graz/Austria. This year´s programme shows that Semantic Web and semantic technologies in general are increasingly relevant for all kind of industries:

  • Biomedicine
  • Public administration & Public transport
  • Information technology
  • Libraries
  • Media & Content Industry
  • E-commerce
  • Education etc.

450 people in 2009

I-Semantics “Industry Track” with its 3-days programme full of demos is one of the highlights of the congress. With 28 submissions this year´s Triplification Challenge tells a lot about the significance of Linked Data in areas like librarianship, public administration or GIS & environmental planning. Take a look at the 15 nominees – and if you consider to come to I-Semantics 2010 follow the link for registration.

Sphere: Related Content

Adrian Pohl: “We believe the Semantic Web plays an important role for the future of libraries.”

May 20, 2010 By: Tassilo Pellegrini Category: Companies & Institutions, Linked Data & Open Data No Comments →

A group of Cologne-based libraries has taken a big step towards open data. In an concerted action they have relased their catalogue data for reuse on the web. Project manager Adrian Pohl comments on the initiative and what role the Semantic Web will play for libraries in the future.

In March 2010 several Cologne-based libraries have opened their catalogue data under a CC0 license following Tim Berners-Lee’s call for “Raw Data Now!”. What has been the motivation behind this step?

The hbz (“Hochschulbibliothekzentrum des Landes Nordrhein-Westfalen”, english: “North Rhine-Westphalian Library Service Centre”) has come to the conclusion that libraries need to participate in the development of the Semantic Web. The opening of catalog data followed as a necessary first step. Our intention is to show with this first legal-political step how important the legal/licensing dimension is when you publish data on the web, be it Linked Data or not. So for us at the hbz the Open Data initiative primarily is seen as the first step in eventually publishing Linked Open Data just as Tim Berners-Lee had called for.

Other participants in the Cologne Open Data initiative like the Cologne University and City Library focus more on the direct advantages the releasing of raw bibliographic data bings: With other libraries and consortia following this example it will be easy to enrich existing catalog or other bibliographic services with subject headings, classification numbers, tags etc. Also, published raw data is integrated into other web services like Wikipedia which point back to libraries’ services. Indeed, Open Data is an end in itself which should be pursued by more organizations in the library world and beyond it.

The provided data is currently availble in a proprietary but open format. Can you give us some technical description of the published data? Do you have plans in providing more structured datasets in the future?

“Opaque but open” would be the better description of the underlying format because it isn’t proprietary at all. Actually, alongside the data from the hbz union catalog there is data stemming from libraries’ local databases (see http://opendata.ub.uni-koeln.de/ and http://opendata.zbsport.de/). We are using different internal formats. Generally, all the formats are based on the MAB format (an acronym for “Maschinelles Austauschformat für Bibliotheken” which means “Automatic Interchange Format for Libraries”) that is only used in the German and Austrian library world for the data interchange between libraries similar to the better known MARC format (Machine-Readable Cataloging) of the Library of Congress. It was developed in the 1970s for storing data on magnetic tape. The format documentation can be viewed on the German National Library’s webpages.   As the format is nearly 40 years old, the processing of MAB data is very cumbersome on modern computers. Therefore, the hbz provides an encapsulation method called “generic format”, where the historic data records of the library catalogs are unwrapped into a more common, user-friendly scheme. Each record is placed into a Unicode UTF-8 encoded file, containing all the MAB fields, each of them separated by line feeds, and the whole record set of a library is forming a “tar” archive, which is compressed afterwards to save space.   It is possible to dump those archives by a usual unpack tool. This software is available on all known Windows/Linux/Unix platforms. Or you can use a simple Perl helper script provided by hbz. More tools and scripts, even in other programming languages, are in preparation for publication.   The opaqueness and the age of the standards used in the library world (the english standard MARC which is used worldwide doesn’t differ in these respects from MAB) make it necessary to change to a more open and widely adopted standard. That’s where Linked Data comes into play which is based on the accepted and widespread standards HTTP and URIs. The construction of RDF out of the library catalog raw data is a very sophisticated design task. Our plans are to convert the existing data to RDF using proper vocabularies which enable us to lose as little information as possible and giving access to the data by providing a SPARQL endpoint.

Currently the data you provide is open but not yet linked. What are your plans when it comes to contribute to the Linked Data Cloud?

I have to go into greater detail to answer this question properly. Viewed simply, the data of library institutions can be divided into two broad types: authority data and bibliographic data. Authority data splits up in data about people, about corporate entities and about subject headings. In Germany, authority data is maintained centrally by the German National Library in cooperation with the six German library consortia. Bibliographic databases consist of records about books or rather editions of books. Authority data and bibliographic data are already heavily linked, for instance a bibliographic record contains the author’s or editor’s authority number which links to the corresponding authority record.   The German National Library is also working on migrating library data, especially authority data, into the Semantic Web. They recently made their Linked Data prototype for authority data publicly available. We have already taken first steps to cooperate and coordinate our efforts. The colleagues at the German National Library have recently developed a Linked Data prototype for their authority data. As they take care of authority data we focus ourselves on bibliographic data. At the moment we are exploring the technology and vocabularies for publishing bibliographic data as Linked Data. That’s a demanding task because besides the known vocabularies like Dublin Core or the Bibliographic Ontology (Bibo) which don’t fully map to the density and structure of the information in the catalogs, there has been several years’ work on the new comprehensive cataloging standard RDA (Resource Description and Access) for which a RDF representation has been developed. However, RDA in RDF needs to be modified a lot so that it can be applied to our bibliographic data. We are currently working on a vocabulary for the union catalog’s data based on existing vocabularies like Bibo and RDA.   Of course, as soon as we will have published bibliographic data as linked data we will start linking to hubs in the Linked Data Cloud like DBpedia or GeoNames.

Publishing data to the LOD Cloud is one thing. Consuming data is another. Have you plans to integrate data from the LOD Cloud into your systems? Do you have policies for quality assurance?

Of course the possibility to incorporate data from other sources easily is one major reason for us to publish Linked Data besides the goal of making libraries’ data an integral part of the web. Enriching our data with other data and providing new services through and with mashups would be a main reason to link to other data. We are, however, not working on such projects yet, because we first need to convert our legacy data to RDF.

What role will the Semantic Web play for libraries in the future?

We believe the Semantic Web plays an important role for the future of libraries. Discussions about “Next Generation Catalogs” are a recurring theme in the library world since the 1990s. It is time to finally act and move our data enprisoned in opaque formats to a new level by improving its structure and underlying technology and by migrating to formats that can be easily consumed by others who are not part of the library world. Joining the Linked Open Data community seems to us the best way to go.   Also, the production, publication and dissemination of academic literature is subject to ongoing and fundamental changes which have far-reaching implications for the work of academic libraries and their role in research and education. We believe that semantic markup and interlinking will play an important role in the development of knowledge production and thus indirectly will have great impact on libraries. Clearly, the Semantic Web can’t be cancelled out of the future of libraries.

Moreover, turning your question around, libraries could play an important role for the future of the Semantic Web. Libraries are trusted institutions and deeply grounded in our culture. As indicated above libraries have produced linked data (again: lower case) since the time of card catalogs. We undoubtly have some practice in producing and curating linked data which should be worth a lot to the Semantic Web community. We thus think libraries are predestinated for helping to coninuously order the messy place the Semantic Web always will be and ensuring its trustworthiness and stability.

About Adrian Pohl

Adrian Pohl is working at the Cologne-based North Rhine-Westphalian Library Service Center on Open Data, Linked Data and its conceptual, theoretical and legal implications. He regularly writes at Übertext: Blog about the internet, libraries and metadata, Linked Open Data, communication, epistemology and the like. He has studied communication science and philosophy in Aachen and is currently studying Library and Information Science at the Cologne University of Applied Science. You can follow him on Twitter: http://twitter.com/acka47.

Sphere: Related Content

Eric A. Franzon: “Semantic Technologies are becoming mainstream.”

May 19, 2010 By: Tassilo Pellegrini Category: Conferences & Events No Comments →

Started in 2005 the Semantic Technology Conference has become one of the international community hot spots for the commercial application of and trend scouting in semantic technologies. Tassilo Pellegrini talked to the organizer Eric A. Franzon, VP of Wilshire Conferences and Semantic Universe, about what to expect from the upcoming event and how semantic technologies are becoming mainstream.

From June 21 – 25, 2010 the annual Semantic Technology conference will take place for the 6th time. Looking back: what has changed over time? What are the hot topics at this year’s conference?

We launched SemTech in 2005 in San Francisco.  It was a good turnout for a new event, with around 300 attendees.  By 2009, that number had grown to 1100, so audience size has been a significant change, certainly.  However, our interest all along was to grow an industry as well as an event, and I have absolutely seen that growth and maturation.  Ours was the first conference devoted to the commercialization of Semantic Technologies, and at that first conference, there was a predominant academic presence.  That’s not a bad thing – this, like so many technical industries, came out of academia.  Nonetheless, it’s nice to see that by 2010, there is significant adoption by businesses and organizations. I actually feel comfortable saying that Semantic Technologies are becoming mainstream; certainly not ubiquitous, but widely adopted.

The hot topics at the 2010 conference include exciting news in areas we have covered extensively before such as Linked Data, Semantic Search, Healthcare, and Publishing.  But we also are delving much more deeply into new domains that have received a lot of attention recently such as Open Government, Marketing & Advertising, and Social Networks.  There are new standards benchmarks to discuss such as SPARQL 1.1 and the business rules work that is being done with RIF.  Additionally, we are seeing a lot of traction in Semantics in the Enterprise, so SemTech will have quite a bit to offer in that area as well.

While semantic technologies have been around for quite some years now the advent of the Semantic Web added a new spin to the community. What do you expect for the future when it comes to the convergence of semantic technologies and the Semantic Web?

I see Semantic Technologies as a superset of the space that is the Semantic Web.  The Semantic Web is public; the area I call Semantic Technologies includes non-public, closed systems – behind firewalls.  We’ve actually seen this before.  At the same time that the World Wide Web really hit its stride in the mid-1990’s, we saw widespread adoption of portals and corporate intranets.  Even though they did not sit on the public Web, these systems used the technologies of the Web to link documents, enabling organizations to share those documents globally, quickly, and inexpensively.

As the tools become better and we see more use cases in the Semantic Web, I see parallel development of semantically enabled enterprise systems.  In the same way enterprises were using early Web technologies to share documents behind firewalls, they are now using semantic systems to share data globally, quickly, and inexpensively.  At first – and we are seeing this already – in-house systems will consume data from the public Web, essentially mixing public and private data.  This is relatively easy to do when both systems are built on a similar set of technologies, and there are an increasing number of rich data sets for companies to use.  Think of a corporate system that consumes real-time stock data, for example.  The system is not generating that information itself, but it might be using it in a corporate application.

One of the prominent topics at the moment is Linked Data which in connection with Semantic Web might evoke a paradigm shift in data integration issues. How do you experience this trend? How should companies react?

If you think about the ‘traditional’ challenges that enterprises have faced in managing data and meta data — issues like integration, disparate data, unstructured data, governance, legacy systems, and data quality (to name a few) — Semantic Technologies offer solutions.  They’re not always the best solution for every problem, and I don’t expect that RDBMS systems will go away, but there are companies using Semantic Technology today to make money and save money.

From your perspective: what are the most exciting things to look out for in the near future?

There is a great opportunity for tool developers to enter the marketplace. The community is hungry for new tools and for semantic development to be integrated into the tools and development environments they are already using.  Another area that I believe the industry is hungry for is good UI development.  Data is powerful, but its usefulness is often only seen in solid visualizations and reporting.  I expect that more of these tools will emerge in the very near future.

Tools for publishers like OpenCalais, Zemanta, and the rich semantics available in Drupal 7 are making it possible for less-technical people to include semantics in their web pages.

Another area to watch is consumer applications. Tripit, Siri, and Adaptive Blue’s Glue have shown that there is a market for data-driven applications for consumers.

About Eric A. Franzon

Over the last decade, Eric Franzon has served as VP of Wilshire Conferences, where he has been exploring the world of enterprise data. As VP of Semantic Universe, he has worked to raise awareness and explain the usage of Semantic Technologies and Web 3.0 in business and consumer settings.  A lifelong learner and teacher, Eric is frequently called on as a consultant, coach, and trainer around complex technical topics. He is an advisory committee representative with the World Wide Web Consortium and an Affiliate Analyst with Guidewire Group.  Eric has also taught improvisational comedy, early childhood education, blues harmonica, and gender studies.  A Chicago native, he now lives in Los Angeles.

Sphere: Related Content

Sören Auer: “Establishing a network effect around linked data is the most important R&D goal for the near future.”

April 15, 2010 By: Tassilo Pellegrini Category: Conferences & Events, Linked Data & Open Data, Politics, Privacy & Information Ethics No Comments →

Leipzig is one of Germany’s Semantic Web hotspots. From May 5-6, 2010 the annual Semantic Web Day provides the opportunity to catch up with latest developments especially in the domain of Linked Data and the foundation of the German chapter of the Open Knowledge Foundation. Organizer Sören Auer gave us some background information.

From May 5 – 6, 2010 the 3rd Semantic Web Day in Leipzig will take place. What will be this year’s topics? Who should attend?

The Semantic Web Day is targeting IT people, software developers, decision makers and users interested in learning about the potential of semantic technologies. The language during the event is German, so primarily Austrians, Swiss and Germans will attend. Beside semantic technologies a particular focus of this years event is open data in governments, public administrations and science. Although the programme is not yet finalized we already compiled an interesting number of talks and presentations including talks about the open biodiversity database Fishbase, the European Digital Library Europeana, a Linked Data project of the German Umweltbundesamt, use case presentations in the pharma, publishing and telecommunication industries and many more (cf. http://aksw.org/LSWT). Also, in addition to AKSW the Topic Maps Lab and the Web Data Integration Labs from Universität Leipzig be present at LSWT.

One of the highlights of this year`s Semantic Web Day is the official institutionalization of the German Chapter of the Open Knowledge Foundation. How did this come around? What does this mean for the OKF as a whole?

OKFN started to work in 2006 and since then managed to sucessfully complete a number of projects facilitating open knowledge. In particular, the Comprehensive Knowledge Archive Network (CKAN), the OKCon conference series, the open knowledge definition and recently OKFN’s involvement in the launch of data.gov.uk are prominent examples of OKFN’s successful work. However, many of the OKFN activities were primarily driven by an active group of volunteers in the UK. With the official launch of the German OKFN branch we will strengthen the international dimension of OKFN’s work. Especially for Germany, where data privacy and security are perceived to be most important, raising awareness for enabling open, standards compliant access to public information will be an important target of OKFN’s activities.

The InFAI has become one of the hotspots in Semantic Web development in Germany over the past few years. What are you working on at the moment? What are the most interesting research and development aspects for the near future?

From our point of view establishing a network effect around the publishing and use of linked data is the most important research and development goal for the near future. We just completed a first draft and implementations of a semantic enabled pingback method (http://aksw.org/Projects/SemanticPingBack), which applies a similar peer notification mechanism to linked data endpoints as it is widely deployed on the blogosphere. Other important research issues we are tackling with our partners are closing the performance gap between RDF and relational data management, increasing the coherence and quality of linked data and the provisioning of adaptive user interfaces for authoring and maintaining information on the data web.

About Sören Auer

Dr. Sören Auer leads the research group Agile Knowledge Engineering and Semantic Web (AKSW) at University of Leipzig. His research interests include Semantic Web technologies, knowledge representation, engineering and management, agile methodologies as well as databases and information systems. Sören is founder (respectively co-founder) of several high-impact research and community projects such as the Wikipedia semantification project DBpedia, the open-source innovation platform Cofundos.org or the social Semantic Web toolkit OntoWiki. Sören is author of over 50 peer-reviewed scientific publications, co-organiser of several workshops, chair of the Social Semantic Web conference 2007 and I-Semantics 2008, serves as an expert for industry, the European Commission, the W3C and is member of the advisory board of the Open Knowledge Foundation.

Sphere: Related Content

Interview with Juan Sequeda: “I believe Linked Data will enable new killer apps that are only possible thanks to Linked Data.”

April 14, 2010 By: Tassilo Pellegrini Category: Calls & Competitions, Linked Data & Open Data, Semantic Web Applications 1 Comment →

Juan Sequeda, co-chair of the Triplification Challenge 2010 and one of the core figures in the Linked Data movement, gives us his view how the Semantic Web might evolve. His central message: “Once there is an incentive to create quality links, these links will start to show up. And then users will start linking to the data hubs of their interest.”

Linked Data itself has grabbed a lot of attention inside the Semantic Web community recently. But what about the outside perspective? Could linked data be called the killer app for the Semantic Web?

I foresee two things happening with Linked Data. One is from the web development perspective (the so-called Web 2.0 developers) and the other is from the enterprise perspective. The web development community will sooner than later realize that Linked Data will enable easy integration of data and therefore will ease the pain of consuming data from different data sources. Thanks to big organizations such as BBC, New York Times, Reuters, Best Buy, etc. web developers will start paying attention to this “new thing” called Linked Data.

What we need is that the inside Semantic Web community starts to create applications on top of current Linked Data so when the outside web development community starts to pay attention, they have something to chew on. We (the semantic web community) needs to start speaking the web development language. There is still a big gap. I have had personal experiences with people in the web development community who think that RDF is XML and because they hate XML, they will never consider it. This is false and this is something that we need to change.

From the enterprise perspective, Linked Data is another data integration solution. Data integration has been a problem since day one of relational databases. I believe enterprises will be open to consider new solutions with new technologies. I’m hoping to see new startups tackling the enterprise domain. Imagine being able to query “get all my clients from cities whose population is greater than 1 million” even though I don’t have the data about population of cities in my database.

Is Linked Data the killer app for the Semantic Web? Before I answer that, I would like to ask, what was the killer app of the Web? Was it the browser? Was it e-commerce? Was it search? Was it Amazon or Ebay or Google? I believe Linked Data will enable new killer apps, apps that are only possible thanks to Linked Data. The browser was only possible because of HTML. So let’s ask ourselves what is possible because of Linked Data, and there we will find our killer app.

One of the core deficiencies of the young open data cloud is the little amount of interlinks between datasets. Is it just a matter of time to overcome this or are there other measures needed to turn the existing datasets into a true giant global graph?

I like to remind myself that this new wave of semantic web technologies is an extension of the current web. Therefore we should analyze how the web evolved in the beginning. Initially, everything were a bunch of documents on the web in which people manually created links to other documents. When Google started, it created an incentive to offer quality links between documents. This also created data hubs. If you write a blog post about a book, most probably you will link to the web document of that book either on Amazon and/or Wikipedia. I believe that this will happen with Linked Data. Once there is an incentive to create quality links, these links will start to show up. And then users will start linking to the data hubs of their interest.

Open Governmental Data is a big issue at the moment. The US and UK government have started to apply Linked Data principles to turn this vision into reality. Lots of other countries are following. What do you expect from this trend?

I believe that Linked Data will take off thanks to the initiative of governments. We always talk about the chicken and egg problem of the semantic web. Once we have organizations that don’t even think about it and are just interested in putting their data on the web, the semantic web will start to grow. If Bookstore ABC puts their data on the web, it may not be so meaningful. But if the US and UK government puts their data on the web, following the Linked Data principles, then people can wake up and say “ok, so this is for real. Let me start paying attention to this”.

You are one of the chairs of the Triplification Challenge 2010. Can you give us a brief insight what to expect from this year’s challenge? What are the conditions to participate?

The Triplification Challenge this year has grown and is very exciting. For the first time, it is offering two different tracks.

The first track, the Open Track will accept submissions on three areas 1) new datasets that are published following the Linked Data principles and that show potential benefit, 2) generic methods, mechanisms and approaches of creating Linked Data from legacy datasets and 3) applications that make use of Linked Data.

The second track is the New York Times track which will accept submissions of applications that make use of the New York Times Linked Data and one or more government dataset. The objective is to create an application powered by Linked Data that would be of interest to any constituent of that government.

I personally believe that the year 2010 is the year of creating Linked Data applications and the Triplification Challenge is the way to be part of it.

Sphere: Related Content

The Open Government Data Meetup in Vienna

April 10, 2010 By: Thomas Thurner Category: Open Government Data, Politics No Comments →

Show what is possible! As Martin Kaltenböck – one of the organizers oft the recently held Semantic Web Meetup on an Austrian Open Government Data Initiative – said, there is a lot of enthusiasm and energy to inform the public and engage politics about the impact a initative similar to those in US and UK may have for Austria. And the KickOff was promissing. Inspiring talks by Rufus Pollock (UK) and Stefano Bertolo (EU) where giving an insight whats possible in the specific field of Open Government Data, as well as how a start of an initiative can look like.

As ePSI-Platform wrote in their blog
The Austrian Open Data initiative is online and at work.

The event was very well attended, and brought together stakeholders from science, industry, government and citizen activists, A promising melange of people which may carry the project forward to very concrete UseCases and Trials in the very near future. As the initiative is ment to be carried by a broad group of proponents, the follow-up of the meeting will be a round table talk, of those who are willing to contribute in upcoming light-tower projects and opening concrete sets of government data for that.

The next meeting of the Austrian Open Data Initiative
takes place on the 12th May at 9.30 a.m. in
Room D, quartier 21 of the Vienna Museum Quarter.

Find Documentation of the Meetup on Zukunftsweb, browse the Picture’s Album or read the conclusions at ePSI-Platform.

More resources

Sphere: Related Content

55 people enjoyed the first semantic web meetup in vienna

July 17, 2009 By: Thomas Thurner Category: Conferences & Events No Comments →

dsc_0494Yesterdays first “semantic web meetup” attracted 55 attendees to join in for presenting, talking and socialising. Approximately one year after the series of semantic web meetups started in NYC, there is now also a vital community gathering in vienna. Beside an inside view on brandnew ideas and developments of austrias semweb-labs in presenations and lightning talks, Steve Sandhouse of New York Times joined in via webmeeing to give an insight on NY-Times’s Semantic Web – efforts, which have a back-history of about 100 years now – as he explained.

In conclusion: A good start for the First Vienna Semantic Web Meetup, which may paved the way for a next meeting in the very next future. In the meanwhile some pictures of the venue to amuse those which were there and to inspire new people to join: www.meetup.com

Reblog this post [with Zemanta]
Sphere: Related Content

Keep the Semantic Web trusty

March 13, 2009 By: Thomas Thurner Category: Corporate Semantic Web, Mashups & Web services, Politics, Privacy & Information Ethics, Text Mining 1 Comment →

Tim Berners-Lee at a Podcast Interview
Image via Wikipedia

In recent days – here at Semantic Web Company – we have had a lot of discussions on how the future of the Semantic Web (name it Web3.0 if you like) will develop. Several stakeholders on the future of the Semantic Web see already, that also a potential danger will come along with the technical realisation of the web3.0: This is the present possibility to create applications and mashups with semantic technologies that are a real drain on privacy and information ethics. Without an underpinning discussion about the ethical framework within technolgies like linked data, text-mining, biometric-systems and geo-systems in combination with the web of data, the whole domain is in danger to be doomed like genetic engineering some years ago.

It’s crucial for the public opinion on the Semantic Web, to adress the immanent risks regarding privacy and ethics. In this context I’ll see also Tim Berners-Lee’s statement yesterday: “W3C wants to help make sure data use is appropriate,” he said. Berners-Lee, who is director of W3C, said in an interview on Wednesday that the teams working on the Semantic Web project are making sure that privacy principles are included in its architecture: “The Semantic Web project is developing systems which will answer where data came from and where it’s going to — the system will be architectured for a set of appropriate uses.”

Maybe it’s an important step in keeping the further development of Semantic Web trusty in the eyes of public opinion, that the W3C has privacy and information ethics on their agenda and persons like Berners-Lee stand with their reputation for it. But it is also crucial to build this awareness on the corporate side. Only if everyone within the domain follows a common ethic understanding we have a public opinion, which is on the future potential of the Semantic Web, and not in fear of the same.

Reblog this post [with Zemanta]
Sphere: Related Content

Just released: UMBEL – A New Vocabulary for the Semantic Web

July 17, 2008 By: Jana Herwig Category: Ontology Engineering, Vocabularies & Languages No Comments →

UMBELNews has reached me this morning that UMBEL has now been publicly released! UMBEL is a new vocabulary for the Semantic Web – I first learned about it when Andreas Blumauer returned from LinkedData Planet where he had met up with Mike Bergman from Zitgist LLC who are working on UMBEL.

Here is the release announcement Mike communicated via email yesterday:

UMBEL (Upper Mapping and Binding Exchange Layer) [1] is a lightweight ontology for relating Web content and data to a standard set of 20,000 subject concepts. Based on OpenCyc [2], these subject concepts have defined relationships between them, and can act as semantic binding nodes for any data or Web content. A further 1.5 million named entities have been extracted from Wikipedia and mapped to the UMBEL reference structure with cross-links to YAGO [3] and DBpedia [4]. The system can easily be extended with additional dictionaries of named entities, including ones specific to enterprises or domains.

UMBEL is provided as open source under the Creative Commons 3.0 Attribution-Share Alike license. The complete ontology with all subject concepts, definitions, terms and relationships can be freely downloaded [see 5]. All subject concepts and named entities are available as Linked Data [see 5]. Five volumes of documentation [5] are also available.

The release is accompanied by about a dozen Web services [6] for using or manipulating UMBEL, along with a new introductory slide show [7]. Additional release information may be found on Fred’s [8] or my [9] separate blog postings. We welcome those with interest or suggestions for improvements to do so through the UMBEL discussion forum [10]. We will shortly be putting easier services online for such input.

So, enjoy! We look forward to your commentary, suggestions and putting UMBEL under production-grade stress. We know will be doing the same!

Regards, Mike

Great release! They have also given us access to a media-oriented article which you can read on our portal.

Sphere: Related Content

LinkedData Planet in New York: A great community event for all things semantic

June 18, 2008 By: Andreas Blumauer Category: Conferences & Events 1 Comment →

Roosevelt HotelFirst of all: LinkedData Planet is a big success in terms of visitor numbers to begin with. The “Grand Ballroom” at Hotel Roosevelt, a lovely old hotel in Manhattan, was packed not only when Tim Berners-Lee gave his keynote this afternoon.

But not only the quantity of attendees, also the high quality of talks and discussions which are going on at this first conference on the “commercialization of the web of Linked Data” show that we are facing a fast growing phenomenon with a “great momentum” as Berners-Lee stated.

Kingsley Idehen from OpenLink Software started the first day of the conference with his keynote in which he tried to “demystify” the term Linked Data. He said that “Linked Data is the foundation of the semantic web, its connectivity is growing and the line between enterprise and individual level is blurring”. He also stressed the similarities between ODBC (Open DataBase Connectivity) and Linked Data – which might be interesting for my next talk with an “old-fashioned” CTO.

Uche Ogbuji from Zepheira referred to DBpedia as “the star” of the Linked Data Cloud and gave an interesting talk about the possibilities of Linking Enterprise Data.

Tim Berners-Lee listed in his keynote the areas in which the LOD-community is now facing the biggest challenges:

  • standards, for instance levels of inference, link following on linked data clients and servers
  • federated query; query service descriptions
  • ultimate human interface to all the data there are
  • balancing diversity & harmony in ontology development
  • and, of course, continuing the great momentum

Tim Berners-Lee also emphasised that hiding information in some cases like product information is “just crazy”. One way to expand the LOD cloud could be “lobbying” for data sources (with governments, providers of commercial information, etc).

Between the talks people spent their time in the exhibition area. Dean Allemang from TopQuadrant gave a demo of TopBraid Composer. I talked to Tom Tague from OpenCalais about the new features the next releases will have and how to use their service behind the firewall and Mike Bergman from Zitgist showed me the power of UMBEL web services.

All in all – LinkedData Planet is a great community event, well organised and well populated by people who want to use the semantic web in different commercial settings.

And to those who weren’t able to attend: I recommend to take a look at the Linked Data Shopping List, a page within the Linked Data Initiative’s wiki where you can add the data that you want to see published as Linked Data.

Read also pt. 2 of our conference report: The social hub @ LinkedData Planet 2008

[Image: official-ly cool]

Zemanta Pixie
Sphere: Related Content