Belief Contractions on Large Ontologies with Minimal Knowledge Loss
The following was work I did during my NSERC Undergraduate Research Assistant position at Simon Fraser University during Summer 2018. The report assumes the reader has prior knowledge about knowledge representation particularly descriptive logics.
Code can be found here.
The field of knowledge representation deals with finding efficient methods to represent, store and perform inference on large collections of data. When dealing with large knowledge bases, a user may require to remove a fact that was previously believed to be true however has been rendered false after the addition of new information. What makes this operation difficult is that simply removing the belief (represented as a formal logic axiom) is often not enough since the combinations of many other beliefs in the knowledge base can also infer the same false fact resulting in no knowledge actually being removed. Various contraction methods working on different formal logics have been proposed which ensures that a belief is completely forgotten by removing multiple axioms from a knowledge base. In [Dawood17], a kernel contraction algorithm was constructed for TBox. The contraction is performed by removing a minimum set of axioms which infer the belief and using a heuristic to select the a prefered set when multiple minimum sets exist. One of these heuristics is Specificity which weighs axioms by their generality within the domain. We will be expanding upon the Specificity heuristic to create a total preorder relation that orders axioms based on the amount of epistemic loss that they cause when removed from a TBox. The Hierarchical Total Preorder, will work on TBoxes and will be shown how it can be implemented into the kernel contraction algorithm.
Description Logic
Description Logics (DL) [Baader07] are a family of logics used to model relationships between entities in a domain. DLs consist of three types of entities, concepts which represent sets of individuals, roles which describe relationships between individuals and singleton individuals from a domain. A DL knowledge base is composed of two parts, the ABox containing extensional knowledge and the TBox containing intensional knowledge. The ABox states assertions about individuals using concepts and roles such as and . The TBox contains subsumption axioms that describe relationships between concepts and roles such and . Many DLs exist with varying expressibility and reasoning complexity. The language that we will be using is .
[Baader05], an extension of , is a lightweight DL that has limited expressibility but boasts polynomial time reasoning and is used on large ontologies like SNOMED CT. The table below outlines the syntax of the language.
Name | Syntax | Semantics |
---|---|---|
top | ||
bottom | ||
nominal | {} | {} |
conjunction | ||
existential restriction | { : } | |
concrete domain | for | { : for } |
GCI | ||
RI |
An TBox is a finite and consistent set of GCIs and RIs. We refer to the left hand side expression as the sub-concept or sub-role and the right hand side expression as the super-concept or super-role for GCIs and RIs respectively.
Belief Change
The most prominently used construction of belief change is the AGM framework [Alchourron85]. The framework models an agent’s state of knowledge with a belief set which is a closed under logical implication set of sentences. Belief sets state exactly what the agent currently perceives as true. There are three belief change operations for modifying these sets:
- Expansion: Adding a new belief to a belief set.
- Contraction: Removing a belief from a belief set.
- Revision: Adding a new belief which may create an inconsistent belief set that requires other beliefs to be removed.
The operation we will be focusing on is contraction.
While AGM describes contractions using belief sets, [Hansson93] describes a different approach using belief bases. Belief bases are sets of beliefs not closed under logical implication which better models what an agent with finite memory would store and are equivalent to DL TBoxes.
Two methods of belief base contractions are regularly used, partial meet contractions and kernel contractions. Partial meet contractions [Alchourron85] are done by using remainder sets, maximal subsets of a belief base that do not entail the axiom we wish to contract, . The contracted belief base is the intersection of a select set of remainder sets. Kernels [Hansson94] are minimal subsets of that entail . To perform a kernel contraction of we select an axiom from each kernel and remove them from the belief base. Kernel contraction is the method that we will be considering from now on and is denoted by .
The following five postulates [Hansson93] are used to capture the definition of a belief base kernel contraction:
- Success: If then
- Inclusion:
- Core retainment: If and then there is a set such that but
- Uniformity: If for every we have iff then
- Relative closure:
If a function satisfies the first four postulates then it is a kernel contraction and if all five hold then it is a smooth kernel contraction.
The kernel contraction algorithm that we will be working was introduced in [Dawood17]. The algorithm calculates all -kernels, kernels that entail , in using the axiom pinpointing algorithm [Baader08]. We denote the set of -kernels in belief base with . An incision function then selects axioms from each kernel to remove from the belief base. The set of axioms chosen by the incision function is called the drop set, . Since we prefer to remove as few axioms as possible, the incision function selects a minimum drop set. The calculation for minimum drop sets is equivalent to the minimum hitting set problem [Garey79, Dawood17] therefore a hitting set algorithm is used to find drop sets. Once a minimum hitting set is selected for the drop set, the axioms are removed to form the contracted belief base, .
Hierarchical Total Preorder
Since an TBox is equivalent to a belief base we can use the kernel contraction on some TBox and axiom . Once the kernels are calculated and the minimum hitting sets are found we typically have multiple equal sized sets to choose as the drop set. Aside from simply removing as few axioms as possible, we ideally want to achieve the contraction with as minimal knowledge loss to the TBox as possible. Expanding on the specificity heuristic from [Dawood17] and exploiting the axiom hierarchy found in , we can define a total preorder binary relation which can order axioms by their importance within the TBox to help make our decision.
The hierarchical preorder relation, , is based off the concept of an epistemic entrenchment [Gardenfors88], . An epistemic entrenchment is a total preorder over the axioms of a belief set that represents the relative epistemic loss caused by removing each axiom. The relation states that is equally or more entrenched in the knowledge base as and therefore during contractions we would prefer to remove over . An epistemic entrenchment is defined by five postulates, transitivity, dominance, conjunctiveness, minimality and maximality. These postulates capture the definition of epistemic loss in standard logics however not all postulates can be applied to description logics.
For the hierarchical preorder, we will form a new set of postulates to formulate a preorder that still uses the metric of epistemic loss to order the axioms but can applied on TBoxes. In we can measure epistemic loss as the number of entailments related to the most general expression that are lost. For example, in the TBox {}, removing results in losing the entailments and , however removing only loses the entailment . Since removing causes less epistemic loss we have .
Before we formulate the postulates we need to first define some terminology that will help us describe the subsumption hierarchy of axioms.
Def 1 Connected Axioms:
For some TBox and axioms {} , where are either all concepts, existential restrictions and conjunctions (GCI) or all roles (RI). If either:
- {}
- {}, where are support axioms, and no subset of {} entails
then is connected with , . We refer to as a of and as a of .
Def 2 Subsumption Path:
For some TBox , axioms and are on the same if there exists a sequence of axioms {} for , where both:
- for all .
- and .
{} is a of and .
We now define the four postulates that we will be following while constructing the hierarchical preorder:
- Transitivity: If and , then .
- Totality: For all , or .
- Minimality: If belief base is consistent, then iff for all .
- Hierarchical: If and {} for some belief base , then .
Transitivity and totality are the two properties a total preorder must follow. The hierarchical postulate is a new postulate that captures the connection between subsumption hierarchies and epistemic loss in description logics. For example, if we have then is higher in the subsumption hierarchy than which means removing it will cause more entailments to be lost from the TBox, therefore causes more epistemic loss than .
Hierarchical Weighting Function
When given a hierarchical preorder relation like , we must determine which of the axioms causes the greater amount of epistemic loss when removed. To measure this, we will weights axioms based on their position within the TBox using the hierarchical weighting function.
The function takes an axiom and first confirms if (if not we assign ) and then calculates by going through the following 4 phases.
Subsumption Hierarchy Phase
The initial phase weighs based off its placement within the TBox’s subsumption hierarchy. This is calculated by using the set of LHS connecting axioms of , . To find the LHS connecting axioms that appear from using support axioms we first calculate the indirect children of the existential restrictions in the TBox.
Def 3 Indirect Child:
Given is a concept, existential restriction or a conjunction of concepts and existential restrictions, and are concepts and and are roles:
- If {} , then {}
- If {} , then {}
- If {} , then {}
We can use these indirect children to find all LHS connecting axioms for each axiom by using the rule:
- if or
The subsumption hierarchy weighting procedure can now be executed as follows:
Subsumption Hierarchy Weighting Procedure:
- If then .
- If then where is the maximum subsumption hierarchy weight among all axioms in .
Since cycles are allowed to occur in the TBox we have an anti-cycling check for the recursive step in the above procedure. While keeping a list of all previously visited axioms in the current recursive stack, if we try to get and is already in the visited axiom list we do not consider its weight at the current recursion level.
An example of the subsumption hierarchy weightings procedure is shown in the following:
Example 1:
TBox:
LHS Connecting Axioms:
Weights:
Support Axiom Phase
An issue with subsumption hierarchy weights is that support axioms are under-weighted. If we consider Example 1, we have because of the support axioms {}. With the axioms of S we have however if we remove any of these axioms this would not hold. When we perform a contraction of on TBox T we get 2 kernels:
- {}
- {}
Removing the lowest weighted axiom in we simply choose , however has 3 axioms with a weight of 0 to choose from, , and . The issue is removing preserves , while removing either or does not because its causes . Since removing these two axioms causes a greater amount epistemic loss to the TBox we need to adjust their weights to reflect this.
The support axiom weighting phase makes adjustments by matching the support axioms’ sub-concept/role with axioms that have existential restrictions with the same concept/role in their super-concept.
Support Axiom Weighting Procedure:
Given are concepts and are roles:
For axioms in the form with :
- If with and , then .
For axioms in the form with :
- If with and , then .
Continuing with Example 1, applying the support axiom weighting procedure results in the following:
Example 1 cont.
Support Axiom Adjustments:
- Since {} , and , set .
- Since {} , and , set .
Weights:
Cycle Adjustment Phase:
Next is an optional phase to deal with cyclic TBoxes. Say we have the TBox:
The weights assigned in the subsumption hierarchy phase varies depending on the order the axioms are processed. For example, if we start with we would gets the weights:
The anti-cycling check prevents the procedure from entering an infinite loop however the hierarchical postulate is broken because . Another problem is that the current weights state that removing causes the least amount of epistemic loss however all of the axioms in the loop cause the same amount of loss. To fix both these issues, the cycle adjustment procedure identifies loops and increases all of the loop’s axioms to the maximum weight among these axioms.
Cycle Adjustment Procedure:
- If is in a cycle comprised of the set of axioms , then where for all .
Applying this procedure on the above TBox gives us the weights:
Offset Adjustment Phase
The support axiom phase adjusts support axioms’ weights to better reflect their potential epistemic loss within the TBox, however these adjustments can break the hierarchical postulate. In Example 1’s TBox, we currently have and however since we require . The offset adjustment procedure fixes this by checking that all of the axioms have a larger weight than their LHS connecting axioms and increases the axiom’s weight when this does not hold.
Offset Adjustment Procedure:
- If :
- Else, For all :
- If , then
- Else,
Like the subsumption hierarchy weighting procedure, the offset adjustment procedure also contains the same anti-cycling check during the recursive step of calculating to prevent infinite loops.
Finishing Example 1, we go through the offset adjustment phase and get the following:
Example 1 cont.
Offset Adjustments:
- Since :
- .
Weights:
Using Hierarchical Weighting Function in
With the hierarchical weighting function we can now implement to solve hierarchical preorder relations using the following rule:
- iff
Algorithm 1 outlines the relation validity checking and weight calculation processes. Here we are assuming that the TBox T stores pairs of axioms and their weights.

Postulate Proofs
We will now prove that the hierarchical preorder relation follows the postulates previously introduced.
Theorem: The hierarchical preorder binary relation satisfies the transitivity, totality, minimality and hierarchical postulates when applied to an EL++ TBox.
Proof: Given TBox and axioms .
Transitivity: Assume and . This means that and which implies therefore giving .
Totality: For all we have their weights, and . If then and if then . Therefore for all , we have or
Minimality: () Assume T is consistent and . In the hierarchical weighting function we initially check if . Since this is false, we assign . The minimum weight any axiom can have is -1, therefore and for all .
() Assume T is consistent and for all . We then get for all . Since is consistent, we know that there exists some axiom that would make inconsistent, therefore . Since for all we must have which can only occur when .
Hierarchical: Assume and . To check if we begin by getting and . At the start of offset adjustment phase we have and where . During the phase, offsets for and , and , are calculated and added to each of the axioms’ weights. Depending on the values of and , has the value:
- If then
- If i j then
Applying these offsets, we get:
- If :
- If :
Therefore the hierarchical weighting function always terminates with and thus always returns whenever and .
This proves that the hierarchical preorder binary relation satisfies all four postulates.
Kernel Contraction with Hierarchical Preorder
Returning to the problem of selecting which hitting set to remove in the kernel contraction algorithm, we can extend the definition of to create a new total preorder relation that works with sets of axioms, . The relation where states removing causes an equal or greater amount of epistemic lost as removing . For set weights can say for all axioms . To validate relations we use the rule:
- if where
In order for the kernel contraction algorithm to be smooth, we must ensure that the same drop set is chosen every time we repeat a contraction. Therefore we assume that a canonical ordering of the axioms exists which can be used in tiebreakers where multiple axioms have the same minimum weight.
When choosing which hitting set to remove, all of the hitting sets are ordered with and then the set that is at the bottom of the preorder is selected as the drop set. Algorithm 2 outlines the kernel contraction algorithm using .

Smooth Kernel Contraction Proof
We will now show that is a smooth kernel contraction by proving that all five postulates hold.
Theorem 2: The kernel contraction described in Algorithm 2 satisfies all five smooth kernel contraction postulates when applied an TBox.
Proof: Given a TBox and axioms
Success: Assume . When the pinpointing algorithm is applied, a finite number of -kernels are calculated that each entail . is a minimum hitting set of the kernels so the set will include axioms from every kernel. Therefore the contracted TBox will have no -kernels and .
Inclusion: The algorithm never adds new axioms to therefore .
Core Retainment: Assume and . That means . Set which means and . If we add to then at least one -kernel exists in since is in a minimal hitting set of the kernels. Therefore .
Uniformity: Assume for every , we have iff . In [Hansson94], it is explained that our assumption is equivalent to . When the incision function is run on and , the same set of hitting sets will be calculated and since the preorder using is unique (assuming exists), and therefore .
Relative Closure: For some axiom , if and , then trivially . Also trivially, if then .
For the case where and . Let us assume that . We know that therefore is found in at least one -kernel. More specifically, since all hitting sets calculated are minimum, we know that for all -kernels that contain , there must be some -kernel where no other is found. If this is not true then the hitting set would not be minimal since would be unnecessary. After removing the axioms of , we should have contracted and have however since , there exists a subset of axioms where and therefore . This contradicts the success postulate for kernel contractions. Therefore thus . Since all axioms fall into one of these cases, we have always holds.
This proves that Algorithm 2 is a smooth kernel contraction.
Future Work
The hierarchical total preorder relation is a versatile method for ordering axioms and sets of axioms in TBoxes that do not need to be normalized and can be cyclic. When used in kernel contractions, the preorder limits the amount of epistemic loss caused to the TBox and maintains the smooth kernel contraction property.
This hierarchical approach is simply one way of ordering axioms. Future work can be done into developing new preorders or epistemic entrenchments that use different approaches to weighting axioms that work better under certain contexts. An issue with the hierarchical weighting function currently is that after every contraction, the entire TBox needs to be re-weighted in order to update the preorder. Developing a way to adjust weights after a contraction is an avenue to explore further. Finally, though the preorder was construction for TBoxes, it would be interesting to see if the method can be expanded to work with more expressive descriptions logics like the family.
References
[Alchourron85] Alchourrón, C., Gärdenfors, P., and Makinson, D. (1985). On the logic of theory change: Partial meet contraction and revi- sion functions. 50(2):510–530.
[Baader05] Baader, F., Brandt, S., and Lutz, C. (2005). Pushing the EL envelope. In Proceedings of the Nineteenth International Joint Confer- ence on Artificial Intelligence IJCAI-05, Edinburgh, UK. Morgan-Kaufmann Publishers.
[Baader07] Baader, F., Calvanese, D., McGuiness, D., Nardi, D., and Patel-Schneider, P., editors (2007). The Description Logic Handbook. Cam- bridge University Press, Cambridge, second edition.
[Baader08] Baader, F. and Suntisrivaraporn, B. (2008). Debugging SNOMED CT using axiom pinpointing in the description logic EL+ . In Proceedings of the 3rd Knowledge Representation in Medicine (KR-MED’08): Representing and Sharing Knowledge Using SNOMED, vol- ume 410 of CEUR-WS.
[Dawood17] Dawood, A., Delgrande, J., and Liao, Z. (2017). A study of kernel contraction in EL. In Gordon, A., Miller, R., and Turan, G., editors, Thirteenth International Symposium on Logical Formalizations of Common- sense Reasoning, London, UK. (7 double-column AAAI-style pages).
[Gardenfors88] Gärdenfors, P. and Makinson, D. (1988). Re- visions of knowledge systems using epistemic entrenchment. In Proceedings of the 2Nd Conference on Theoretical Aspects of Reasoning About Knowl- edge, TARK ’88, pages 83–95, San Francisco, CA, USA. Morgan Kaufmann Publishers Inc.
[Garey79] Garey, M. and Johnson, D. (1979). Computers and Intractability: A Guide to the Theory of NP-Completeness. W.H. Freeman and Co., New York.
[Hansson93] Hansson, S. O. (1993). Reversing the levi identity. 22(6):637– 669.
[Hansson94] Hansson, S. O. (1994). Kernel contraction. 59(3):845–859.
Next
- An Empirical Analysis Packing Discriminators in Generative Adversarial Networks