A Model Theory for a
Quantified Generalized Logic of Contexts

[ - ]

Selene Makarios

Abstract:

We investigate the formal semantics of a quantified version of a generalization of the logic of contexts that uses the operator $ist(c,\phi)$ introduced by Guha 3 (3) and consonant to the notion of context logic as proposed by McCarthy 8 (8). We propose a model theory for this logic, one in which various properties on contexts, deemed as being intuitively desirable, hold in all models. We believe this to be the first offering of a full, rigorous, model-theoretic treatment of the quantified context-logic formalism, since context logic was first described in 1987.

Introduction

The notion of formalized contexts was proposed 8 (8) as a way of addressing the problem of generality in AI. The original idea was to create AI systems that never get ``stuck'' with the set of concepts they are using at any given moment, always being able to transcend any given context to a more general one.

Another perspective is that, for the sake of tractability, contexts allow humans to temporarily fix a domain of discourse and a set of interpretations, for purposes of performing some cognitive or communicative task. Without this ability, that is, if humans were required to always account for all entities, relationships, and the interpretations thereof, while performing a cognitive or communicative task, all such tasks would likely become unmanageable.

McCarthy's and Guha's approach to formalizing context 3 (3); 4 (4); 2 (2) was to introduce an operator `' , with the intended intuitive meaning that if is a context, and $\phi$ is a proposition, then the expression

$\begin{displaymath} ist(c,\phi) \end{displaymath}$

(1)

is considered true exactly in case the proposition $\phi$ , interpreted in the context

, is true. The name `

' is derived from `is true' (according to the account of a given context).

The notion of ``context'' itself is taken as a mathematical primitive in the formalism discussed here. Regardless of what motivating conception of context one employs, what matters is that propositions can be asserted in a context, and then other propositions can be derived in it, or possibly in other contexts that are related to it.

The following are some examples of motivating conceptions of context: a constraint upon time, such as in the Middle Ages, or last Tuesday afternoon; a constraint upon place, such as in the Gobi desert; the context of a given conversation; the context of a particular news report; the context of a fictitious universe, such as the world of Sherlock Holmes; the context of a set of beliefs; a linguistic context, such as a particular tongue, or a collection of application-specific verbal efficiencies¹.

As originally described 8 (8); 4 (4), context logic was a relatively informal idea, presented as a syntax with a framework of examples and intuitions, and suggested approaches to applications. Since that time, the goal of providing a complete semantic account of what the language of context logic really says, and what its operations really do - especially for the quantificational case - has proved difficult to achieve.

We believe the present work to be the first offering of a full, rigorous, model-theoretic treatment of quantificational context logic, during the time since context logic was proposed as a technique in Artificial Intelligence almost twenty years ago. In addition to providing semantics, the formalism described here further generalizes the original concept of , allowing an elaboration where the truth-conditions that determine a system's modalities can themselves be vary with context - in other words, the meaning of itself can be context-dependent.

The Context Logic Language $\cL$

Abstract Syntax.

The syntax of context logic is not, as is customarily the case with formal languages, defined over the collection of sequences of some set of symbols, though it is generally convenient to project context logic expressions onto paper as arrangements of typographic symbols.

The reason for this decision is that there may exist in one's mind a qualitative distinction between ``terms'' and ``sentences''. First-order logic makes this distinction quite formal. But in context logic, we will need to move smoothly from the idea of propositions whose arguments are merely (what might be called) ``terms'', to propositions having arguments that are (what might be called) ``sentences''.

Thus we avoid the expressions ``symbol'', ``term'', and ``sentence'' altogether. Following the approach proposed by McCarthy 5 (5), define the syntax of context logic in terms of ``abstract syntax-objects'' and ``constructs''. It helps to imagine constructs as data-structures, built up from syntax-objects and other constructs via constructors; the entities that do such building will in fact be called ``constructors''. So, the current sentence is the last place the reader will find the words ``symbol'', ``term'', or ``sentence'' used in this work.

The language $\cL$ of context logic is defined over a signature

$\begin{displaymath} \Tuple{\cF,\cP,\cC,\cA} \comma \end{displaymath}$

(2)

where $\cF$ contains function-objects, $\cP$ contains predicate-objects, and $\cC$ and $\cA$ each contain constant-objects. $\cC$ and $\cA$ are disjoint, and intuitively, $\cC$ represents the collection of context objects, while $\cA$ represents the collection of ``ordinary'' or ``domain'' objects. Note that we are still in the realm of syntax here - these are not intended to be semantic objects.

Constructors.

The syntax of context logic employs constructors corresponding to the standard logical connectives $\land$ , $\lor$ , and $\lnot$ (we will liberally use constructors for implication as well, assuming their analogous definitions in terms of the other constructors); the context-logic specific quantifier-constructors ${\forall_{\mbox{{\scriptsize\texttt{C}}}}{}{}}$ , $\forall_{\mbox{{\scriptsize\texttt{A}}}}{}{}$ , $\forall_{\mbox{{\scriptsize\texttt{L}}}}{}{}$ , and $\forall_\circ$ , for quantifying respectively over contexts, over domain objects, over proposition-objects in $\cL$ itself, and over the objects associated with various contexts (in a way to be made precise); the existential counterparts $\exists_{\mbox{{\scriptsize\texttt{C}}}}{}{}$ , $\exists_{\mbox{{\scriptsize\texttt{A}}}}{}{}$ , $\exists_{\mbox{{\scriptsize\texttt{L}}}}{}{}$ , and $\exists_\circ$ of each universal quantifier-construct; and finally, the constructor $\Istsymb{}$ .

The constructor $\Istsymb{}$ is the `' constructor, which is a two-place constructor over $\Times{\cC }{\cL }$ . Intuitively, the construct

$\begin{displaymath} \Ist{}{c}{\phi} \comma \end{displaymath}$

(3)

to be formally defined later, produces a proposition-object concerning the status of $\phi$ according to the account of the context

The set of predicate-objects $\cP$ optionally includes $\Istsymb{}$ . This inclusion corresponds to the case where is allowed to nest, that is, where the truth of some proposition asserted relative to a context is itself asserted relative to some context. The function-objects in $\cF$ may only apply to term-objects, which are either domain-objects in $\cA$ , or are the result of applying function-objects in $\cF$ to term-objects. The predicate-objects in $\cP$ , except for $\Istsymb{}$ if it is included, may likewise only apply to term-objects.

For brevity, we may resort to using the expressions ``term'', ``function'', ``predicate'', and ``object'' as abbreviations for ``term-object'', ``function-object'', ``predicate-object'', and ``domain-object'', respectively, with the understanding the use of these expressions is simply a convenience with no ontological significance.

Proposition-Objects

The language $\cL$ has two kinds of ``atomic'' constructs,

$\begin{displaymath} \FuncApp{P}{\Dots{x_1}{x_k}} \end{displaymath}$

(4)

and

$\begin{displaymath} \Ist{}{c}{\phi} \comma \end{displaymath}$

(5)

where $\phi$ is itself a construct in $\cL$ , in which $\Istsymb{}$ is not allowed to appear, unless $\Istsymb{}$ is included in $\cP$ . If constructs $\phi$ and $\psi$ are in $\cL$ , then so are constructs $\Land{\phi}{\psi}$ , $\Lor{\phi}{\psi}$ , $\Lnot{\phi}$ , and $\Limplies{\phi}{\psi}$ . If $\phi$ is in $\cL$ , then so are

$\begin{displaymath} % {\forall_{\mbox{{\scriptsize\texttt{C}}}}{(c)}{(\phi)}} \comma \end{displaymath}$

(6)

where

is in $\cC$ and occurs free in $\phi$ , and

$\begin{displaymath} % \forall_{\mbox{{\scriptsize\texttt{A}}}}{(a)}{(\phi)} \comma \end{displaymath}$

(7)

where

is in $\cA$ and occurs free in $\phi$ , and

$\begin{displaymath} % \forall_{\mbox{{\scriptsize\texttt{L}}}}{(\psi)}{(\phi)} \comma \end{displaymath}$

(8)

where $\psi$ is in $\cL$ and occurs free in $\phi$ . If $\phi$ is in $\cL$ ,

is in $\cC$ , and

is in $\cA$ , then

$\begin{displaymath} % \forall_\circ {(c)}{(a)}{(\phi)} \comma \end{displaymath}$

(9)

where

occurs free in $\phi$ , is in $\cL$ . There are analogous constructors $\exists_{\mbox{{\scriptsize\texttt{C}}}}{}{}$ , $\exists_{\mbox{{\scriptsize\texttt{A}}}}{}{}$ , $\exists_{\mbox{{\scriptsize\texttt{L}}}}{}{}$ , and $\exists_\circ {}{}$ . The construct $\forall_\circ {(c)}{(a)}{(\phi)}$ from (9) will henceforth be abbreviated as

$\begin{displaymath} \forall_{{c}}{(a)}{(\phi)} \comma \end{displaymath}$

(10)

with a similar abbreviation for $\exists_\circ$ . In all quantificational forms, parentheses are optional and will be used as needed for clarity.

Semantics of $\cL$

Extending Classical Structures

Our notion of a semantic structure is an extension of the classical notion of a semantic structure for first-order logic. For each in $\cC$ , our context logic structure $\cS$ assigns a context object ${{c}^{\cS }_{}}$ , a function ${{f}^{{\cS }{c}}_{}}$ to each in $\cF$ , a predicate ${{P}^{{\cS }{c}}_{}}$ to each in $\cP$ , and a domain object ${{a}^{{\cS }{c}}_{}}$ to each in $\cA$ . The structure $\cS$ will also assign a semantic proposition-object ${{\phi}^{{\cS }{c}}_{}}$ to each $\phi$ in $\cL$ , as will be discussed in more detail later.

Thus all in $\cC$ are rigid designators with respect to context-relative interpretation, but this turns out not to matter. The reason is that there is more to the semantics of contexts than the particular objects they designate, which turn out to be irrelevant (and could just as well been made non-rigid, writing ${{c}^{{\cS }{c}}_{1}}$ , with notational cost and no semantic gain). The only characteristic of ${{c}^{\cS }_{}}$ that has semantic significance, is in fact context-relative, see definition (20). Meanwhile, no syntax objects in $\cA$ , $\cF$ , or $\cP$ need be rigid, nor, as it turns out, do the syntax objects in $\cL$ .

The expression ${\cC}^{\cS }$ stands for the collection of all ${{c}^{\cS }_{}}$ , ${\cF}^{{\cS }{c}}$ stands for all ${{f}^{{\cS }{c}}_{}}$ , ${\cP}^{{\cS }{c}}$ stands for all ${{P}^{{\cS }{c}}_{}}$ , and ${\cA}^{{\cS }{c}}$ stands for all ${{a}^{{\cS }{c}}_{}}$ . Furthermore, ${\cF}^{{\cS }{}}$ is the union of all ${\cF}^{{\cS }{c}}$ , ${\cP}^{{\cS }{}}$ is the union of all ${\cP}^{{\cS }{c}}$ , and ${\cA}^{{\cS }{}}$ is the union of all ${\cA}^{{\cS }{c}}$ .

The semantics of the logical connectives based on a classical first-order structure are in effect for each . If is in $\cP$ , we write

$\begin{displaymath} \Miff {\Ent{{\cS }{c}}{}{P(a_1,\cdots,a_n)} } {\In {\Tup... ...dots,{{a}^{{\cS }{c}}_{n}}}} {{{P}^{{\cS }{c}}_{}}} } \comma \end{displaymath}$

(11)

and for constructs $\phi$ and $\psi$ ,

$\begin{displaymath} \Miff {\Ent{{\cS }{c}}{}{\Land{\phi}{\psi}}} {\Mand {\Ent{{\cS }{c}}{}{\phi}} {\Ent{{\cS }{c}}{}{\psi}} } \comma \end{displaymath}$

(12)

$\begin{displaymath} \Miff {\Ent{{\cS }{c}}{}{\Lor{\phi}{\psi}}} {\Mor {\Ent{{\cS }{c}}{}{\phi}} {\Ent{{\cS }{c}}{}{\psi}} } \comma \end{displaymath}$

(13)

and,

$\begin{displaymath} \Miff {\Ent{{\cS }{c}}{}{\Lnot{\phi}}} {{} \nentsymb_{{\cS }{c}} {\phi}} \period \end{displaymath}$

(14)

If $\Ent{{\cS }{c}}{}{\phi}$ for every structure $\cS$ , then

$\begin{displaymath} {\Ent{c}{}{\phi}} \period \end{displaymath}$

(15)

If $\Ent{c}{}{\phi}$ for every context

, then

$\begin{displaymath} {\Ent{}{}{\phi}} \period \end{displaymath}$

(16)

Quantification

With $\phi[a]$ denoting occurring free in $\phi$ , and $\phi[a_1/a]$ denoting $\phi$ with replacing , we define

$\begin{displaymath} \begin{array}{c} \Miff {\Ent{{\cS }{c}}{}{\forall_{\mbox{{\... ...} {\Ent{{\cS }{c}}{}{\phi[{b}_{}/a]} } \comma } \end{array}\end{displaymath}$

(17)

and similarly for ${\forall_{\mbox{{\scriptsize\texttt{C}}}}{}{}}$ , $\exists_{\mbox{{\scriptsize\texttt{C}}}}{}{}$ , $\forall_{\mbox{{\scriptsize\texttt{L}}}}{}{}$ , and $\exists_{\mbox{{\scriptsize\texttt{L}}}}{}{}$ . At ${{\cS }{c}}$ , to each

in $\cC$ , the structure assigns an object ${\mbox{\texttt{U}}_{c_1}^{{{\cS }{c}}}}$ , which is a set

$\begin{displaymath} \Subset {{\mbox{\texttt{U}}_{c_1}^{{{\cS }{c}}}}} {{{\cA }^{{\cS }{c}}_{}}} \period \end{displaymath}$

(18)

Intuitively, ${\mbox{\texttt{U}}_{c_1}^{{{\cS }{c}}}}$ is the collection of items associated with the context

, as determined by the structure at ${{\cS }{c}}$ . Note that a given item can be associated with multiple contexts. We define

$\begin{displaymath} \begin{array}{c} \Miff {\Ent{{\cS }{c}}{}{\forall_{{c_1}}{a... ...} {\Ent{{\cS }{c}}{}{\phi[{b}_{}/a]} } } \end{array}\period \end{displaymath}$

(19)

Context-Relative Contexts

Possible Worlds

The structure $\cS$ includes a set of possible worlds $\cW$ (intended as the same abstract notion of possible worlds as used in modal logic semantics) and a ``consistency relation'' $\kappa_{}$ , where

$\begin{displaymath} % \Subset {\kappa_{}} {(\TimesThree{{\cC}^{\cS }}{\cW }{{\cC}^{\cS }})} \period \end{displaymath}$

(20)

Intuitively, we think of a context as a condition on possible worlds. Together, the contexts subsume, but do not necessarily partition, the collection of all possible worlds. Intuitively, the consistency relation $\kappa_{}$ specifies what possible worlds are associated with what contexts. However, each context is allowed to have its own specification, under the structure $\cS$ , of which possible worlds are consistent with each context. Whatever these various relative constitutions of the contexts are, however, the set of syntactic context-objects for them is not context-variant, being given as part of the definition of $\cL$ .

Intuitively, the expression

$\begin{displaymath} \kappa_{}({{c}^{\cS }_{1}},{w},{{c}^{\cS }_{2}}) \end{displaymath}$

(21)

indicates that according to the account of

, the possible world

is consistent with context

. This level of generality is employed because we see no reason a priori to rule out the idea of context-dependent contexts, and this subsumes the case of context-independent contexts.

Examples

As an example of a context logic model, let contexts correspond to closed intervals of real time, and then the possible worlds consistent with a given context could be given as those worlds wherein the time falls within . As another example, if we take contexts to be the logical closures $[\alpha]$ of assertions $\alpha$ about an arrangement of objects on a table, then the possible worlds consistent with $[\alpha]$ could be given as those worlds wherein $\alpha$ is satisfied, but where other aspects of the arrangement might vary.

Views and Truths

To say that truth is context-relative is meaningless without some way of comparing truths. One could devise an absolute framework within which relative truth is to be assessed; this is the case with the modal valuation function $v(\phi,w)$ , which is a universal entity governing all world-relative truth.

But we shall not do this, and instead we leave it to each context to define its own framework within which to assess relative truth. For, ``who'' else is there to assess it? We are declining to require an absolute arbiter of either truth or relative truth, and yet there is no a priori reason to distinguish any one among the contexts themselves for this purpose. The result, of course, is that the assessment of relative truth is itself relative, with each context not only having its own account of what is true, but also a view on what truth means for each among the collection of contexts. Thus, every context can be an arbiter of relative truth, each in its own relative way.

So, for each , the structure $\cS$ assigns an object ${\mbox{\texttt{T}}_{}^{{{\cS }{c}}}}$ , which is a set

$\begin{displaymath} \Subset {{\mbox{\texttt{T}}_{}^{{{\cS }{c}}}}} {(\Times {\cW } {{{\cL }^{{\cS }{c}}_{}}} )} \period \end{displaymath}$

(22)

Intuitively, ${\mbox{\texttt{T}}_{}^{{{\cS }{c}}}}$ can be thought of as giving what ``true'' means according to the account of

For each , the structure $\cS$ assigns an object ${\texttt{V}_{}^{{{\cS }{c}}}}$ , which is a set

$\begin{displaymath} \Subset {{\texttt{V}_{}^{{{\cS }{c}}}}} {(\TimesThree {\c... ...{{\cC }^{{\cS }{}}_{}}} {{{\cL }^{{\cS }{c}}_{}}} )} \period \end{displaymath}$

(23)

Intuitively, ${\texttt{V}_{}^{{{\cS }{c}}}}$ can be thought of as giving context 's view of what ``true in a context'' means, for each among the collection of contexts.

Self-Interpreting Structures

The semantics of context logic employ self-interpreting structures, that is, ones where the semantic objects in the range of the structure $\cS$ will themselves have interpretations assigned to them by $\cS$ . This will prove useful in handling arbitrarily nested s (among other things), by offering a kind of internal self-reference at the semantic level, rather than a language that is expressive enough to represent its own syntax and allow the construction of self-referential sentences.

Moreover, we are not aware of any a priori reason to rule out the idea of self-interpreting semantic structures. Our model-theoretic structures have both domains and ranges consisting of abstract objects, so it's not difficult to imagine the domain subsuming the range, resulting in self-interpretation. If interpretation under $\cS$ is idempotent, the resulting structure will likely be simpler, but this is not a requirement of the model theory.

Constructs

For each context , the structure $\cS$ assigns to each atomic construct

$\begin{displaymath} % P(\Dots{a_1}{a_k}) \end{displaymath}$

(24)

an object which is a tuple from the cartesian product

$\begin{displaymath} {\cA}^{{\cS }{c}} \times \cdots \times {\cA}^{{\cS }{c}} \times \{{{P}^{{\cS }{c}}_{}}\} \comma \end{displaymath}$

(25)

where ${{\cA}^{{\cS }{c}}_{}}{}$ occurs

times, and $\{{{P}^{{\cS }{c}}_{}}\}$ is a singleton set containing ${{P}^{{\cS }{c}}_{}}$ , and so

$\begin{displaymath} \DefineEq {{{[P(\Dots{a_1}{a_k})]}^{{\cS }{c}}_{}}} {\Tupl... ... {{{a}^{{\cS }{c}}_{k}}}, {{P}^{{\cS }{c}}_{}} } } \period \end{displaymath}$

(26)

Note that this should not be confused with the definition of $\Ent{{\cS }{c}}{}{P(\Dots{a_1}{a_k})}$ , which is given as a condition on objects in the range of the structure $\cS$ , but rather is a mapping of first-order atomic constructs to objects in the range of $\cS$ . By definition,

$\begin{displaymath} \Mifflnln {\left[\Mforallln {\cW } {w} {\Mimplies {\kap... ...c}}_{}}}} } {{{({{P}^{{\cS }{c_1}}_{}})}^{{\cS }{c}}_{}}} } \end{displaymath}$

(27)

and

$\begin{displaymath} \Mifflnln {\left[\Mforallln {\cW } {w} {\Mimplies {\kap... ... } {{{({{P}^{{\cS }{c_1}}_{}})}^{{\cS }{c_1}}_{}}} } \period \end{displaymath}$

(28)

If $\phi$ is a first-order atomic construct like (24), then we define, for each in $\cC$ ,

$\begin{displaymath} % \Mifflnln {\Ent{{\cS }{c}}{}{\Ist{}{c_1}{\phi}} } { {\... ...}_{}^{{\cS }{{c}_{1}}}}} } } \right)} \right]} } } \period \end{displaymath}$

(29)

Moreover, this same definition holds for non-atomic $\phi$ , via the following compositional rules. For every

in $\cC$ and

in $\cW$ , we have,
for $\land$ ,

$\begin{displaymath} % \begin{array}{c} \Miffln { {\In {\Tuple{w, {{c}^{\cS }_{... ...x{\texttt{T}}_{}^{{\cS }{{c}_{1}}}}} } } } \comma \end{array}\end{displaymath}$

(30)

for $\lor$ ,

$\begin{displaymath} % \begin{array}{c} \Miffln { {\In {\Tuple{w, {{c}^{\cS }_{... ...x{\texttt{T}}_{}^{{\cS }{{c}_{1}}}}} } } } \comma \end{array}\end{displaymath}$

(31)

for $\lnot$ ,

$\begin{displaymath} % \begin{array}{c} \Miff { {\In {\Tuple{w, {{c}^{\cS }_{1}... ...mbox{\texttt{T}}_{}^{{\cS }{{c}_{1}}}}} } } \comma \end{array}\end{displaymath}$