OWL – an Ontology Language Standard with Sound Logical Underpinning
Submitting Institution
University of ManchesterUnit of Assessment
Computer Science and InformaticsSummary Impact Type
TechnologicalResearch Subject Area(s)
Mathematical Sciences: Applied Mathematics
Information and Computing Sciences: Computation Theory and Mathematics, Information Systems
Summary of the impact
Ontologies are used to describe the meaning of terms in a domain.
Manchester has had a leading role in the design of ontology languages,
algorithms and tools. Through standardization, algorithm development and
tool creation, we have significantly influenced the uptake of the Ontology
Web Language (OWL) and Semantic Web Technologies by public service
providers and industry. For example, the NCI thesaurus and SNOMED CT are
medical terminologies in OWL; specialised semantic web companies such as
Clark & Parsia, Racer Systems and TopQuadrant provide semantic
technologies and services that build on OWL; and companies such as Oracle
and B2i Healthcare include tool support for OWL.
Underpinning research
By the end of the 1990s, Description Logics (DLs) were well-understood
logical formalisms, e.g., with respect to their computational complexity,
model theory and reasoning services. They were used in niche applications,
but of limited expressive power and with limited tool support — the latter
due to the lack of a standardised syntax. Research carried out in
Manchester dramatically improved the expressive power of and tool support
for DLs, which led to the adoption of DLs as the logical underpinning of
ontology languages, to their standardisation and integration with Semantic
Web infrastructure in OWL and OWL 2, and to their usage in biohealth and
eScience applications.
Key researchers from Manchester are:
- Sean Bechhofer (1993 — present: RA, Lecturer 2004, Senior Lecturer
2013)
- Ian Horrocks (1996 - 2007: PhD student, RA, Lecturer 1999, Senior
Lecturer 2002, Professor 2003)
- Bijan Parsia (2006 — present: Lecturer, Senior Lecturer 2012)
- Uli Sattler (2003 — present: Senior Lecturer, Professor 2007)
Key research results that underpinned the impact were:
i. Motivated by application examples, the design and investigation of a
range of extensions to existing DLs, establishing various (un)decidability
and computational complexity results [2,3]. In particular, the design of
the SHIQ family of DLs, which show good computational properties
while being highly expressive: they support general concept inclusion,
transitive roles and role hierarchies, inverse roles, cardinality
restrictions and nominals. Each of these features was thought to be
problematic yet desirable, and the research proved that their combination
is of high computational complexity yet decidable.
ii. The design and analysis of reasoning algorithms for SHIQ DLs
[2,3]. These mainly tableau-based algorithms provide sound and complete
decision procedures for the basic DL reasoning problems, and depend on
sophisticated blocking techniques for their termination. The
implementation and optimisation of these algorithms in the FaCT
reasoner provided evidence of their applicability in practice, overcoming
the commonly held belief that such algorithms are impractical [1].
iii. Investigation of ontology engineering problems, in particular for
modularity, entailment explanation and query answering, and development of
logically sound yet practical solutions for these [4].
iv. Design and implementation of tools such as ontology-based editors,
reasoners, APIs, and web applications that both showcase the benefits of
OWL for various applications and demonstrate the practicality of the
developed algorithms [5]. The resulting reasoners, editors, and APIs were
highly influential: further research into optimisation in FaCT++
showed that reasoners for complex DLs can indeed cope with large-scale
ontologies; the ontology editor OiLEd was the first one that
exposed ontology designers to a reasoner, and it heavily influenced
Protégé and other editors; and the OWL API is the main API used to
interact programmatically with an ontology and reasoners [6].
Our research involved a unique combination of investigations of the
computational complexity of logics with informed language, algorithm
design and tool development that changed the general understanding of what
it means for an ontology language to be practical and expressive.
References to the research
Papers that describe this research have been published in international
leading journals.
Key references:
[1] I. Horrocks. Using an Expressive Description Logic: FaCT or Fiction?
In Proc. of the 6th Int. Conf. on Principles of Knowledge Representation
and Reasoning (KR'98), pp. 636-647, 1998. [Google Scholar: 575 citations]
KR Outstanding Paper Award.
[2] I. Horrocks, and U. Sattler. Decidability of SHIQ with complex role
inclusion axioms. Artificial Intelligence, 160 (1-2), pp. 79-104.
2004. DOI: 10.1016/j.artint.2004.06.002 [Google Scholar: 97 citations].
[3] I. Horrocks and U. Sattler. A Tableaux Decision Procedure for SHOIQ.
Journal of Automated Reasoning, Springer Verlag, 39(3), 245-429,
2007. DOI:
10.1007/s10817-007-9079-9 [Google Scholar: 419 citations].
Other references:
[4] B. Cuenca Grau, I. Horrocks, Y. Kazakov, and U. Sattler. Modular
Reuse of Ontologies: Theory and Practice. Journal of Artificial
Intelligence Research (JAIR), Vol. 31, pp. 273-318, 2008. DOI:/10.1613/jair.2375.
[Google Scholar: 215 citations].
[5] BC Grau, I Horrocks, B Motik, B Parsia, P Patel-Schneider, U Sattler,
OWL 2: The next step for OWL, J. Web Semantics 6 (4), 309-322,
2008. DOI:
10.1016/j.websem.2008.05.001 [Google Scholar: 351 citations].
[6] M. Horridge, S. Bechhofer. The OWL API: A Java API for OWL
ontologies. Semantic Web 2(1): 11-21, 2011. DOI:10.3233/SW-2011-0025.
[Google Scholar: 70 citations].
Details of the impact
Context
There were promising, prototypical implementations of DL reasoners that
were respected in the knowledge representation community, but found little
adoption: the logics were inexpressive, there was limited tool support, no
methodologies, no editors and no standardized syntax.
Pathways to Impact
The research that underpins the impact was reported in high profile
publications that were widely read and cited. Associated with these
publications, several software systems were released that enabled
experimentation with and application of the research results. For example,
the DL reasoners FaCT and FACT++ each received tens of
thousands of downloads, as did OiLEd, the first ontology editor that was
tightly integrated with a DL reasoner, thereby providing a tool to
showcase the feasibility and potential benefits of supporting ontology
engineers through DL reasoning. These tools, together with the OWL API,
led to a wide user base and support by other tool developers, which helped
to establish the utility and maturity of DLs for a range of applications,
including in the web. This, in turn, enabled Horrocks and others to
convince the Semantic Web community to adopt DLs as the logical
foundations of Semantic Web Ontology languages.
Reach and Significance of the Impact
This section describes impacts that have resulted from the research.
Although some of these are economic, there have also been important
impacts on practitioners and service providers through the introduction of
influential de jure standards and changes to best practice, in
particular for the design and development of widely deployed biomedical
terminologies.
Development of Standards in the W3C. The widespread adoption of
the research results has been made possible by the development of World
Wide Web Consortium (W3C) standards for ontology languages and associated
interfaces. These standards are designed by working groups that
have members from outside academia and require serious implementation and
usage efforts to be established. Standards that build directly on
Manchester research, and in which Manchester authors collaborated with
other academics and industrial partners include:
- OWL (2004): the first W3C ontology language standard, which builds on
Manchester research as described in [5]; contributors: Horrocks and
Bechhofer.
- OWL 2 (2009): a revision of OWL, heavily driven by the OWL
Experiences and Directions group founded by Horrocks, Parsia, et al.
and by research on qualified number restrictions, key constraints, and
rich property axioms by Horrocks, Parsia, and Sattler; contributors:
Horrocks, Parsia and Sattler.
- SKOS (2009): a standard language for knowledge organisation systems
based on OWL; contributors: Bechhofer.
- SPARQL (2008) and SPARQL 1.1 (2010-12): a query language for RDF and
OWL, influenced by work on OWL query answering by Horrocks and Sattler;
contributors: Parsia.
These de jure standards have been widely deployed throughout the
REF period, as described below, thus impacting on practitioners who both
develop and apply semantic web technologies.
Uptake of Reasoning Algorithms and Tools. The standards have
enabled the development of products and tools that in turn have
facilitated the widespread application of semantic web techniques. In this
section we focus principally on two examples:
- Pellet: Pellet, a commercial reasoning system that supports
OWL 2, is marketed by Clark & Parsia, and includes algorithms
developed at Manchester [A]. Pellet has been used by organisations
including NASA, US Army, US Banking Institutions, NATO, NCI, Ordnance
Survey and iPlant Collaborative [A]. In addition, Pellet has been
integrated with and is used in Oracle 11g [A].
- Protégé: Protégé is the most widely used computer system for
engineering ontologies, with 225,000 registered users, and more than
17,000 members of email discussion groups [B]. Protégé supports OWL 2,
the user interface for the current version of Protégé was designed and
implemented at Manchester, and Protégé implements the Manchester OWL API
[B].
Manchester techniques and tools, in conjunction with the W3C Standards,
inform other commercial platforms. For example: the commercial reasoner
RacerPro incorporates algorithms designed in Manchester and pioneered in
FaCT, and implements the Manchester OWL API [C]; and the KnowledgeServer
semantic infrastructure of derivo GmbH implements OWL and the OWL API [D].
OWL Tools and Ontologies. Today, thousands of OWL ontologies are
available on the Web. The National Centre for Biomedical Ontologies
BioPortal ontology repository [E] contains 365 OWL ontologies, which are
used in all areas of biomedical activity. The following three ontologies
are amongst the most well established and widely used ones and are built
using OWL and OWL 2:
- SNOMED CT [F], from the International Health Terminology Standards
Development Organisation, is the prime medical thesaurus and is used
worldwide in a variety of healthcare applications, e.g., in NHS
Connecting for Health. The company B2i Healthcare provides specialist
support around OWL and SNOMED CT, in particular Snow OWL [G].
- National Cancer Institute (NCI) Thesaurus [H] is a key biomedical
research vocabulary used in OWL that uses Pellet for classification [A].
- The 11th version of the International Statistical Classification of
Diseases and Related Health Problems (ICD-11) is currently being
developed in OWL under WHO leadership [I]. ICD is used to classify
diseases and other health problems, e.g., in death certificates and
health records. These records also provide the basis for the compilation
of national mortality and morbidity statistics by WHO Member States, and
are used for reimbursement and resource allocation decision-making by
governments.
The W3C maintains a list of use-cases of semantic web standards [J] that
detail the users that have built on the standards to which Manchester
contributed. These include: the Norwegian National Broadcaster, Cleveland
Clinic, Ordnance Survey, IBM, National Archives of Korea, Food and
Agriculture Organization of the United Nations, BBC, Chevron, Renault,
Agfa Healthcare and Vodaphone.
Sources to corroborate the impact
Supporting material is available from the university for the
corroborating sources below.
[A] Letter from CEO of Clark & Parsia (http://clarkparsia.com/).
Confirms use of Manchester research in Pellet, and provides details of its
use.
[B] Letter from Professor of Medicine (Biomedical Informatics), Stanford
Centre for Biomedical Informatics Research. Confirms the role of
Manchester research in Protégé, and the scale/nature of the user
community.
[C] Racer Systems GmbH & Co: RacerPro User's Guide, Version 2.0,
October 2012 (http://www.racer-systems.com),
also (http://www.racer-systems.com/technology/references.phtml),
on
29th August 2013. Confirms the influence of Manchester research
on RacerPro.
[D] derivo GMBH (http://www.derivo.de/),
also (http://www.derivo.de/ressourcen/owllink.html),
on 29th August 2013. Confirms the use of the Manchester OWL
API.
[E] BioPortal: (http://bioportal.bioontology.org/),
on 29th August 2013. Provides information on the number and
scale of biological ontologies.
[F] SNOMED CT Technical Implementation Guide, International Health
Terminology Standards Organisation (www.snomed.org/tig.pdf).
Confirms role of OWL in SNOMED CT.
[G] B2i Healthcare Ltd (http://www.b2international.com/),
on 29th August 2013. Demonstrating a commercial use of OWL.
[H] NCI Thesaurus (http://ncicb.nci.nih.gov/download/evsportal.jsp),
on 29th August 2013. Confirms role of OWL in NCI Thesaurus.
[I] Tudorache T, Falconer S, Nyulas C, Storey MA, Ustün TB, Musen MA.
Supporting the Collaborative Authoring of ICD-11 with WebProtégé. AMIA
Annu Symp Proc. 2010 Nov 13; 2010:802-6, http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3041458/.
Confirms that ICD-11 uses OWL.
[J] W3C list of use cases: http://www.w3.org/2001/sw/sweo/public/UseCases/,
on 29th August 2013.