Andreas Blumauer

It’s All about Finding the Needle in the Big Data Haystack

Wolters Kluwer Deutschland GmbH and Viennese Semantic Web Company agree on cooperating on the development of innovative and highly efficient products for data, information and metadata management.

Cologne/Vienna (February 05, 2013) – Wolters Kluwer Deutschland (WKD), knowledge and information service provider located in Cologne and the Austrian Semantic Web Company (SWC) act in collusion with each other. The aim of this cooperation is to offer the sustainable creation and targeted usage of domain specific thesauri and enterprise taxonomies based on Linked Data technologies as a market-ready and ready to use product.

Whereas WKD with its core competences in law, business, tax, finance and health is covering the domain and methodological dimension of the cooperation, SWC is contributing the technological know-how. Main target is to fulfill concrete needs and requirements of customers in a highly efficient and practical way.
“Our offering is addressing large national institutions like ministries, federal agencies and social insurances as well as banks. This includes also larger NGOs and administrations that have an international focus”, explains WKD content architect Christian Dirschl the direction of the cooperation. As with searching and finding the ‘proverbial needle in the data haystack’, we also address specialists like large law firms or smaller units in large enterprises, “who are specifically working on legal matters, making with their work an important contribution to the success of the company as a whole.” Especially knowledge domains like law, industry, tax and finance are getting more and more intransparent on global scale, Dirschl explains, “so that semantic technologies and Linked Data methods gain importance”.
Amongst others, the following services and products are offered, based on this cooperation:
  • Metadata management and enterprise thesaurus management
  • Semantic search and data integration
  • Text mining and knowledge extraction
  • Creation of knowledge networks and knowledge management systems
  • Supporting the creation of Linked Data and Open Data infrastructures
We observed in the last 10 years “how search and linking of information have gained importance in certain domains and what competitive advantages can evolve from that,” stresses Andreas Blumauer, managing director of SWC. “Our customer base profits from this cooperation. We immediately guarantee state-of-the-art technologies paired with professional domain assistance, e.g. with the creation of domain taxonomies and thesauri, so that information resources can be used more efficiently”, Blumauer says.

About Wolters Kluwer Germany
Wolters Kluwer Germany is an information services company specializing in the legal, business and tax sectors. Wolters Kluwer provides pertinent information to professionals in the form of literature, software and services. Headquartered in Cologne, it has over 1,200 employees located at over 20 offices throughout Germany and has been conducting business on the German market for over 25 years.
Wolters Kluwer Germany is part of the leading international information services company, Wolters Kluwer n.v., located in Alphen aan den Rijn (The Netherlands). The core market segments, targeting an audience of professional users, are legal, business, tax, accounting, corporate and finance services, and healthcare.  Its shares are quoted on the Euronext Amsterdam (WKL), and are included in the AEX and the Euronext 100 indices. Wolters Kluwer has annual sales of 3.4 billion Euros (2011), employs approx.19,000 people worldwide and has over 40 offices located throughout Europe, North America and Asia Pacific and in Latin America.
Thomas Thurner

Wolters Kluwer Deutschland is publishing 2 legal thesauri as Linked Open Data

Wolters Kluwer Deutschland GmbH (WKD) publishes two legal thesauri as Linked Open Data for free re-use by public administrations, industry and the Open Data community

(Munich, 12.07.2012, WKD) From today on, two thesauri (controlled vocabularies) covering juridical/legal topics are provided for free re-use as Linked Open Data: One thesaurus is covering topics around labor law in German language, while the other one describes German and European courts. Both vocabularies can be accessed at: vocabulary.wolterskluwer.de/.

Labor law thesaurus covers all main areas of labor law, like the roles of employee and employer; legal aspects around labor contracts and dismissal; also co-determination and industrial action. Therefore, this thesaurus is interesting and relevant for all parties, who are dealing with labor law – professionals like specialized lawyers as well as for employees looking for definitions of legal terms. Linking to thematically similar thesauri (Linked Open Data paradigm) has already taken place and is therefore available as well.

Courts thesaurus is structuring German and European courts in a hierarchical fashion and includes e.g. address information. This thesaurus is not only dedicated to parties interested in legal matters, but also to developers developing geo data applications. Information concerning courts and their roles and responsibilities can become an interesting aspect of many applications in the future.

Publication of these data sets as Open Data is motivated by many reasons. In particular two major directions should be mentioned here: first is to help our customers with their information overload and the other one is to support activities in the OGD (Open Government Data) community.

The creation of legal vocabularies is far from being a trivial thing and there are hardly any resources available in German language. By making these thesauri publicly available, we want to support especially administrations to classify and structure their internal data, in order to easily connect this data to relevant WKD legal resources afterwards (Interoperability of data). The Community on the other hand is very active in some domains, but unfortunately very reluctant when it comes to legal topics. Our aim here is to give initial support in order to create awareness, that also with this data it is possible to create highly interesting and relevant applications. In the end, all interested parties have to work together in a collaborative fashion, in order to bring transparency to the diversity and sheer amount of legal information – this is not possible within insular silos of applications and isolated approaches.

With this effort, Wolters Kluwer Deutschland GmbH is becoming part of the global Open Data movement, which is also heavily promoted by the European Commission, in order to strengthen Europe as an industrial location.

License models used here (like Creative Commons, CC-BY 3.0 for the contents) are as open as possible, in order to have available a real basis for further development in a collaborative fashion.

This commitment also implies next steps: both thesauri will be communicated to different target groups and the resulting discussions will hopefully generate many new requirements and concrete models for collaboration.

Facts and Figures

Licenses of WKD thesauri

  • Data is licensed using ‘Creative Commons Namensnennung 3.0 Deutschland (CC BY 3.0)’ License.
  • Data model is licensed using ‘ODBL’ License.
  • Links to external sources are licensed using a ‘CC0 1.0 Universal (CC0 1.0) Public Domain Dedication’ License.

Published as Linked Open Data (LOD)

WKD Thesauri are linked with

Programming interfaces as API / SPAQRL endpoints available at:

Used software tool

PoolParty Thesaurus Management Suite (www.poolparty.biz)

Both thesauri are described in ADMS format

coming from the European Commission, in order to be easily re-used in e-government services: http://joinup.ec.europa.eu/asset/adms/description

This project was implemented in a partnership between

Wolters Kluwer Deutschland GmbH (http://www.wolterskluwer.de), Semantic Web Company Wien (http://www.semantic-web.at) and the FP7 Project LOD2 (http://lod2.eu)

for more information you may contact

Christian Dirschl
Wolters Kluwer Deutschland GmbH (WKD)
Freisinger Strasse 3
D-85716 Unterschleißheim
cdirschl@wolterskluwer.de

 

Martin Kaltenböck

ADMS implemented in PoolParty Thesaurus Manager (PPT) 3.1.0 Release

ADMS – the Asset Description Metadata Schema of the European Commission Joinup Initiative, is a metadata vocabulary to describe semantic interoperability assets.

The Asset Description Metadata Schema (ADMS) is a common way to describe semantic interoperability assets making it possible for everyone to search and discover them once shared through the forthcoming federation of asset repositories.

 

One of the main objectives of ADMS is e.g. to foster cross-boarder services in Europe by the efficient re-use of semantic assets in e-Government.

 

Description of ADMS
ADMS will allow public administrations, businesses, standardisation bodies and academia to:
  • keep their own system for documenting and storing semantic assets;
  • improve indexing and visibility of their own assets;
  • describe semantic assets in a common way so that they can be seamlessly cross-queried and discovered by the community through a single access point (Joinup);
  • retrieve, compare and potentially link semantic assets to one another in cross-border and cross-sector settings;
  • identify assets to be reused avoiding duplication and expensive design work.

Outreach of ADMS
ADMS is the first step towards a federation of european assets repositories. From mid 2012, Joinup will make available a large number of semantic interoperability assets, described using ADMS, through a federation of asset repositories of Member States, standardisation bodies and other relevant stakeholders. Through this federation, semantic interoperability assets will become retrievable and available via a single point of access.

Please consult the ADMS brochure for further information.

 

ADMS implemented in PoolParty Thesaurus Manager (PPT) 3.1.0 Release

 

The current release 3.1.0 of the PoolParty Thesaurus Manager (PPT) of the Semantic Web Company provides now full ADMS capability!

Figure: PoolParty GUI for metadata management of controlled vocabularies – ADMS tab.

 

Therefore PoolParty Thesaurus Manager (PPT) now allows the content architect to fill in the full description of a controlled vocabulary (a SKOS Thesaurus) – means the meta data of a controlled vocabulary – following now also ADMS standards – these ADMS relevant meta data is automatically published with a controlled vocabulary using the ADMS RDF schema and thereby can be used to publish a vocabulary in the repository of Joinup or another relevant repository of semantic assets to ensure re-use of the controlled vocabulary and thereby interoperability for services et al.

 

Example

The SWC Social Semantic Web Thesaurus Linked Data Frontend: http://vocabulary.semantic-web.at/semweb.html

And: the corresponding ADMS description in RDF:
http://vocabulary.semantic-web.at/semweb/adms/0.1.rdf

More information also available in PoolParty Thesaurus Manager (PPT) 3.1.0 Release Notes.

If you are interested in this topic around Joinup and ADMS as well as the respective PoolParty implementation then participate in the SEMIC2012 conference on 18 June 2012 in Brussels, Belgium.

Helmut Nagy

LOD2 Plenary Vienna (March 2012) – 3rd day – afternoon session

Promising title. After two and a half day (well for almost all of us) we entered the final phase of the plenary. So two and a half days of intense and interesting discussions catching up with all that has been done so far and planning what should happen the next half year. But still two session in front of us.

The afternoon started with the discussion of WP9 the “Open Government Data” use case. First Uroš Milošević from Institut Mihajlo Pupin (IMP) reported about the Serbian CKAN project already holding some data from the Statistical Office of Serbia. Also tools from the LOD2 stack have been and will be used for this project. Sounds great!

Then Irina Bolychevsky of OKFN continued the session announcing that a better integration between CKAN and LOD2Stack should be made  to get more RDF in publicdata.eu. Good idea! We were collecting ideas for integration and talked about e.g. a wizard for generating RDF from .csv files (ULEI is working on something like that). Also a integration of google refine has been discussed. The consortium decided to make an extraction sprint transforming a (to be defined) number of  interesting data sets from CKAN to RDF.

Finally we had a discussion if linked data is a (the) solution for CKAN to find data and find related data etc. Well i think the people in the consortium are pretty sure it is (not so sure if people from OKFN are). Irina and Mark from OKFN invited everyone to provide input to the Use Case.

This session ended with a presentation about WP9a from Jindřich Mynarz from UEP and Martin Nečaský from CU. They are developing a distributed market place for public contracts. A ontology for public contracts has been developed and is open for review on google code. Next step here will be a web application for filing/creating public contracts in RDF as linked data using tools from the stack. So all in all pretty good progress in WP9.

The third day and the plenary ended with Martin Kaltenböck from SWC and Sören Auer our project lead from ULEI presenting WP10-11-12 Dissemination, Exploitation and Project Management. First we voted for our next plenary to be in Cambridge (hosted by OKFN). Past dissemination activities have already been presented on day one, so Martin reminded us all to write blog posts about all the great things we are doing in LOD2. Next big dissemination activity and also a good opportunity to meet people from the consortium will be the European Data Forum from June 6-7 in Copenhagen.

And that was pretty much it. I as i hope all the others enjoyed three days with a bunch of great people from all over Europe working on a great project. As always it was intense but it was also fun. Hope everyone had a save trip home!

Enhanced by Zemanta