Statistiques
| Révision :

root / CSL17 / arithmetic.tex @ 221

Historique | Voir | Annoter | Télécharger (18,55 ko)

1
\section{An arithmetic for the polynomial hierarchy}\label{sect:arithmetic}
2
%Our base language is $\{ 0, \succ{} , + , \times, \smsh , |\cdot| , \leq \}$. 
3
Our base language consists of constant and function symbols $\{ 0, \succ{} , + , \times, \smsh , |\cdot|, \hlf{}.\}$ and predicate symbols 
4
 $\{\leq, \safe, \normal \}$. The function symbols are interpreted in the intuitive way, with $|x|$ denoting the length of $x$ seen as a binary string, and $\smash(x,y)$ denoting $2^{|x||y|}$, so a string of length $|x||y|+1$.
5
 We may write $\succ{0}(x)$ for $2\cdot x$, $\succ{1}(x)$ for $\succ{}(2\cdot x)$, and $\pred (x)$ for $\hlf{x}$, to better relate to the $\bc$ setting.
6
 
7
We consider formulas of classical first-order logic, over $\neg$, $\cand$, $\cor$, $\forall$, $\exists$, along with usual shorthands and abbreviations. 
8
%The formula $A \cimp B$ will be a notation for $\neg A \cor B$.
9
%We will also use as shorthand notations:
10
%$$ (s=t) = (s\leq t) \cand (t\leq s), \quad (s\neq t) = \neg(s=t).$$ 
11
\textit{Atomic formulas} formulas are of the form $(s\leq t)$, for terms $s,t$.
12
 We will assume, without loss of generality, that formulas are in \textit{De Morgan normal form}, that is to say that in formulas negation can only occur on atomic formulas, and that there is not any occurrence of a subformula of the form $\neg \neg A$.
13
 We write $\exists x^{N_i} . A$ or $\forall x^{N_i} . A$ for $\exists x . (N_i (x) \cand A)$ and $\forall x . (N_i (x) \cimp A)$ respectively.
14

    
15

    
16
The theories we introduce are directly inspired from bounded arithmetic, namely the theories $S^i_2$.
17
We include a similar set of axioms for direct comparison, although in our setting these are not minimal.
18
Indeed, $\#$ can be defined using induction in our setting.
19

    
20
The $\basic$ axioms of bounded arithmetic give the inductive definitions and interrelationships of the various function symbols.
21
It also states the fundamental algebraic properties, i.e.\ $(0,\succ{ } )$ is a free algebra, and, for us, it will also give us certain `typing' information for our function symbols based on their $\bc$ specification, with safe inputs ranging over $\safe$ and normal ones over $\normal$.
22
\begin{definition}
23
	[Basic theory]
24
	The theory $\basic$ consists of the axioms from Appendix \ref{appendix:arithmetic}.
25
	\end{definition}
26

    
27

    
28
Notation: if $\vec t=t_0,\dots, t_k$, we will denote as $\safe(\vec t)$ the sequence of formulas $\safe(t_0),\dots, \safe(t_k)$. Similarly for $\normal(\vec t)$.
29
  
30
\begin{definition}
31
[Derived functions and notations]
32
We write $1,2,3,\dots$ for the terms $\succ{} 0, \succ{} \succ{} 0, \succ{} \succ{} \succ{} 0 \dots$, and frequently omit the $\times$ symbol.
33
We define the functions $\succ 0 x , \succ 1 x$ as $2 x$ and $2x +1$ respectively.
34

    
35
Need $bit$, $\beta$ , $\pair{}{}{}$.
36
\end{definition}
37

    
38
(Here use a variation of S12 with sharply bounded quantifiers and safe quantifiers)
39

    
40
Use base theory + sharply bounded quantifiers.
41

    
42

    
43
\begin{definition}
44
[Quantifier hierarchy]
45
$\Sigma^\safe_0 = \Pi^\safe_0 $ is the set of formulae whose only quantifiers are sharply bounded.
46
We define $\Sigma^\safe_{i+1}$ as the closure of $\Pi^\safe_i $ under $\cor, \cand $, safe existentials and sharply bounded quantifiers.
47
We define $\Pi^\safe_{i+1}$ as the closure of $\Sigma^\safe_i $ under $\cor, \cand $, safe universals and sharply bounded quantifiers.
48
\end{definition}
49

    
50

    
51
\anupam{Collection principles for prenexing? Otherwise need to add closure under sharply bounded quantifiers.}
52
\begin{definition}\label{def:polynomialinduction}
53
[Polynomial induction]
54
The \emph{polynomial induction} axiom schema, $\pind$, consists of the following axioms,
55
\[
56
A(0) 
57
\cimp (\forall x^{\normal} . ( A(x) \cimp A(\succ{0} x) ) )
58
\cimp  (\forall x^{\normal} . ( A(x) \cimp A(\succ{1} x) ) ) 
59
\cimp  \forall x^{\normal} . A(x)
60
\]
61
for each formula $A(x)$.
62

    
63
For a class $\Xi$ of formulae, $\cax{\Xi}{\pind}$ denotes the set of induction axioms when $A(x) \in \Xi$. 
64

    
65
%We write $I\Xi$ to denote the theory consisting of $\basic$ and $\cax{\Xi}{\ind}$.
66
\end{definition}
67

    
68

    
69
\begin{definition}\label{def:ariththeory}
70
Define the theory $\arith^i$ consisting of the following axioms:
71
\begin{itemize}
72
	\item $\basic$;
73
	\item $\cpind{\Sigma^\safe_i } $:
74
\end{itemize}
75
and an inference rule, called $\rais$, for closed formulas $\exists y^\normal . A$:
76
\[
77
 \dfrac{\forall \vec x^\normal . \exists  y^\safe .  A }{ \forall \vec x^\normal .\exists y^\normal . A}
78
\]
79
\end{definition}
80
\patrick{I think in the definition of  $\arith^i$ we should impose that the formulas considered are \textit{integer positive}, that is to say that the only negative occurrences of atoms $\safe(t)$, $\normal(t)$ are those occurring in $\forall^{\safe}$ and $\forall^{\normal}$.  Indeed I don't think this can be just proved to be a consequence of a kind of 'normal form' of proofs, as we had discussed (see sect 4.4)}
81

    
82
\anupam{In induction,for inductive cases, need $u\neq 0$ for $\succ 0$ case.}
83

    
84
It is often useful for us to work with \emph{length-induction}, which is equivalent to polynomial induction and well known from bounded arithmetic:
85
\begin{proposition}
86
	[Length induction]
87
	The axiom schema of formulae,
88
\begin{equation}
89
\label{eqn:lind}
90
	( A(0) \cand \forall x^\normal . (A(x) \cimp A(\succ{} x)) ) \cimp \forall x^\safe. A(|x|)
91
\end{equation}
92
	for formulae $A \in \Sigma^\safe_i$
93
	is equivalent to $\cpind{\Sigma^\safe_i}$.
94
\end{proposition}
95
\begin{proof}
96
	Suppose we have $A(0)$ and $A(a) \cimp A(\succ{} a)$ for each $a \in \normal$.
97
	Then, by $\basic$, we have that $A(|a|) \cimp A(|2a|)$ and $A(|a|) \cimp A(|2a+1|)$ for each $a \in \normal$, whence we may conclude $\forall x. A(|x|)$ by polynomial induction on $A(|x|)$.
98
\end{proof}
99

    
100
Let us refer to the axiom schema in \eqref{eqn:lind} as $\clind{\mathcal C}$, when $A \in \mathcal C$.
101
We will freely use this in place of polynomial induction whenever it is convenient.
102

    
103

    
104

    
105

    
106
\subsection{Graphs of some basic functions}\label{sect:graphsbasicfunctions}
107
%Todo: $+1$,  
108

    
109
We say that a function $f$ is represented by a formula $A_f$ if the arithmetic can prove (in the forthcoming proof system) a formula of the form $\forall ^{\normal} \vec u, \forall ^{\safe} x, \exists^{\safe}! y. A_f$. The variables $\vec u$ and $\vec x$ can respectively be thought of as normal and safe arguments of $f$, and $y$ is the result of $f(\vec u; \vec x)$.
110
 
111
Let us give a few examples of formulas representing basic functions.
112
\begin{itemize}
113
\item Projection $\pi_k^{m,n}$: $\forall^{\normal} u_1, \dots, u_m,  \forall^{\safe} x_{m+1}, \dots, x_{m+n}, \exists^{\safe} y. y=x_k$.
114
\item Successor $\succ{}$: $\forall^{\safe} x, \exists^{\safe} y. y=x+1.$. The formulas for the binary successors $\succ{0}$, $\succ{1}$ and the constant functions $\epsilon^k$ are defined in a similar way.
115
\item Predecessor $p$:   $\forall^{\safe} x, \exists^{\safe} y. (x=\succ{0} y \cor x=\succ{1} y \cor (x=\epsilon \cand y= \epsilon)) .$
116
\item Conditional $C$: 
117
$$\begin{array}{ll}
118
\forall^{\safe} x, y_{\epsilon}, y_0, y_1, \exists^{\safe} y. & ((x=\epsilon)\cand (y=y_{\epsilon})\\
119
                                                                                                   & \cor( \exists^{\safe}z.(x=\succ{0}z) \cand (y=y_0))\\
120
                                                                                                   & \cor( \exists^{\safe}z.(x=\succ{1}z) \cand (y=y_1)))\
121
\end{array}
122
$$
123
\item Addition:
124
$\forall^{\safe} x, y,  \exists^{\safe} z. z=x+y$. 
125
\item Prefix:
126

    
127
  This is a predicate that we will need for technical reasons, in the completeness proof. The predicate $\pref(k,x,y)$ holds if the prefix of string $x$
128
  of length $k$ is $y$. It is defined as:
129
  $$\begin{array}{ll}
130
\exists^{\safe} z, \exists^{\normal} l\leq |x|. & (k+l= |x|\\
131
                                                                                                   & \cand \; |z|=l\\
132
                                                                                                   & \cand \; x=y\smsh(2,l)+z)
133
                                                                                                   \end{array}
134
$$
135
\item The following predicates will also be needed in proofs:
136

    
137
$\zerobit(x,k)$ (resp. $\onebit(x,k)$) holds iff the $k$th bit of $x$ is 0 (resp. 1). The predicate $\zerobit(x,k)$  can be
138
defined by:
139
$$ \exists^{\safe} y.(\pref(k,x,y) \cand C(y,0,1,0)).$$
140
\end{itemize}
141

    
142

    
143
\subsection{Encoding sequences in the arithmetic}
144
\todo{}
145

    
146
\anupam{Assume we have a $\Sigma^\safe_1$ predicate $\beta(i,x,y)$, expressing that the $i$th element of the sequence $x$ is $y$, such that $\arith^1 \proves \forall i^\normal , x^\safe . \exists ! y^\safe . \beta (i,x,y)$.}
147

    
148

    
149
\subsection{A sequent calculus presentation}
150

    
151
 We denote sequence as $\Gamma \seqar \Delta$ where $\Gamma$, $\Delta$ are multi sets of formulas. The sequent calculus rules are displayed on Fig. \ref{fig:sequentcalculus} in Appendix~\ref{sect:app-sequent-calculus}.
152

    
153
%We consider \emph{systems} of `nonlogical' rules extending this sequent calculus, which we write as follows,
154
% \[
155
% \begin{array}{cc}
156
%    \vlinf{(R)}{}{ \Gamma , \Sigma' \seqar \Delta' , \Pi  }{ \{\Gamma , \Sigma_i \seqar \Delta_i , \Pi \}_{i \in I} }
157
%\end{array}
158
%\]
159
% where, in each rule $(R)$, $I$ is a finite possibly empty set (indicating the number of premises) and we assume the following conditions and terminology:
160
% \begin{enumerate}
161
% \item In $(R)$ the formulas of $\Sigma', \Delta'$  are called \textit{principal}, those of $\Sigma_i, \Delta_i$ are called \textit{active}, and those of   
162
%$ \Gamma,  \Pi$ are called \textit{context formulas}. 
163
%\item Each rule $(R)$ comes with a list $a_1$, \dots, $a_k$ of eigenvariables such that each $a_j$ appears in exactly one $\Sigma_i, \Delta_i$ (so in some active formulas of exactly one premise)  and does not appear in  $\Sigma', \Delta'$ or $ \Gamma,  \Pi$.
164
%    \item A system $\mathcal{S}$ of rules must be closed under substitutions of free variables by terms (where these substitutions do not contain the eigenvariables $a_j$ in their domain or codomain).  
165
%   \end{enumerate}
166
% 
167
%%The distinction between modal and nonmodal formulae in $(R)$ induces condition 1
168
% Conditions 2 and 3 are standard requirements for nonlogical rules, independently of the logical setting, cf.\ \cite{Beckmann11}. Condition 2 reflects the intuitive idea that, in our nonlogical rules, we often need a notion of \textit{bound} variables in the active formulas (typically for induction rules), for which we rely on eigenvariables. Condition 3 is needed for our proof system to admit elimination of cuts on quantified formulas.
169
%
170
%%\begin{definition}
171
%%[Polynomial induction]
172
%%The \emph{polynomial induction} axiom schema, $\pind$, consists of the following axioms,
173
%%\[
174
%%A(0) 
175
%%\cimp (\forall x^{\normal} . ( A(x) \cimp A(\succ{0} x) ) )
176
%%\cimp  (\forall x^{\normal} . ( A(x) \cimp A(\succ{1} x) ) ) 
177
%%\cimp  \forall x^{\normal} . A(x)
178
%%\]
179
%%for each formula $A(x)$.
180
%%
181
%%For a class $\Xi$ of formulae, $\cax{\Xi}{\pind}$ denotes the set of induction axioms when $A(x) \in \Xi$. 
182
%%
183
%%We write $I\Xi$ to denote the theory consisting of $\basic$ and $\cax{\Xi}{\ind}$.
184
%%\end{definition}
185
%
186
%As an example any axiom can be represented by such a nonlogical rule $(R)$, with no premise ($I=\emptyset$), $\Delta'$ equal to the axiom and $\Gamma=\Sigma'=\Pi$.
187

    
188
We extend the purely logical calculus with certain non-logical rules and initial sequents corresponding to our theories $\arith^i$.
189
 For instance the axiom $\pind$ of Def. \ref{def:polynomialinduction} is represented by the following rule:
190
\begin{equation}
191
\label{eqn:ind-rule}
192
\small
193
\vliinf{\pind}{}{ \normal(t) , \Gamma , A(0) \seqar A(t), \Delta }{ \normal(a) , \Gamma, A(a) \seqar A(\succ{0} a) , \Delta }{ \normal(a) , \Gamma, A(a) \seqar A(\succ{1} a) , \Delta  }
194
\end{equation}
195
where $I=2$ and  in all cases, $t$ varies over arbitrary terms and the eigenvariable $a$ does not occur in the lower sequent of the $\pind$ rule.
196

    
197
Similarly the $\rais$ inference rule of Def. \ref{def:ariththeory} is represented by the nonlogical rule:
198
 \[
199
 \begin{array}{cc}
200
    \vlinf{\rais}{}{  \normal(t_1), \dots, \normal(t_k) \seqar  \exists  y^\normal .  A }{  \normal(t_1), \dots, \normal(t_k) \seqar \exists  y^\safe .  A}
201
\end{array}
202
\]
203

    
204
%\patrick{In fact, I think we rather need the following nonlogical rule, which implies the previous one but is I guess more general:
205
%\[
206
% \begin{array}{cc}
207
%    \vlinf{\rais}{}{  \normal(t_1), \dots, \normal(t_k) \seqar  \normal(t) }{  \normal(t_1), \dots, \normal(t_k) \seqar \safe(t)}
208
%\end{array}
209
%\]
210
%}
211

    
212
The $\basic$ axioms are equivalent to the following nonlogical rules, that we will also designate by $\basic$:
213
\[
214
\small
215
\begin{array}{l}
216
\begin{array}{cccc}
217
\vlinf{}{}{\seqar \safe (0)}{}&
218
\vlinf{}{}{\safe(t) \seqar \safe(\succ{} t)}{}&
219
\vlinf{}{}{ \safe (t)   \seqar 0 \neq \succ{} t}{} &
220
\vlinf{}{}{\safe (s) , \safe (t)  , \succ{} s = \succ{} t\seqar s = t }{}\\
221
\end{array}
222
\\
223
\vlinf{}{}{\safe (t) \seqar t = 0 \cor \exists y^\safe . t = \succ{} y  }{}
224
\qquad
225
\vlinf{}{}{\safe(s), \safe(t) \seqar \safe(s+t) }{}\\
226
\vlinf{}{}{\normal (s), \safe(t) \seqar \safe(s \times t)  }{}
227
\qquad
228
\vlinf{}{}{\normal (s), \normal(t) \seqar \safe(s \smsh t)  }{}\\
229
\vlinf{}{}{\normal(t) \seqar \safe(t)  }{}
230
\end{array}
231
\]
232

    
233
 The sequent calculus for $\arith^i$ is that of Fig. \ref{fig:sequentcalculus} extended with the $\basic$,  $\cpind{\Sigma^\safe_i } $ and $\rais$ nonlogical rules.
234
 
235
 \begin{lemma}
236
 For any term $t$, its free variables can be split in two sets $\vec{x}$ and $\vec{y}$ such  that the sequent $\normal(\vec x), \safe(\vec y) \seqar \safe(t)$ is provable. 
237
 \end{lemma}
238
 
239
\subsection{Free-cut free normal form of proofs}
240
\todo{State theorem, with references (Takeuti, Cook-Nguyen) and present the important corollaries for this work.}
241

    
242
Since our nonlogical rules may have many principal formulae on which cuts may be anchored, we need a slightly more general notion of principality.
243
    \begin{definition}\label{def:anchoredcut}
244
  We define the notions of \textit{hereditarily principal formula} and \textit{anchored cut} in a $\system$-proof, for a system $\system$, by mutual induction as follows:
245
  \begin{itemize}
246
  \item A formula $A$ in a sequent $\Gamma \seqar \Delta$ is \textit{hereditarily principal} for a rule instance (S) if either (i) the sequent is in the conclusion of (S) and $A$ is principal in it, or 
247
(ii)  the sequent is in the conclusion of an anchored cut, the direct ancestor of $A$ in the corresponding premise is hereditarily principal for the rule instance (S), and the rule (S) is nonlogical.
248
  \item A cut-step is an \textit{anchored cut} if the two occurrences of its cut-formula $A$ in each premise are hereditarily principal for nonlogical steps, or one is hereditarily principal for a nonlogical step and the other one is principal for a logical step.
249
  \end{itemize}
250
     A cut which is not anchored will also be called a \textit{free-cut}.
251
  \end{definition}
252
  As a consequence of this definition, an anchored cut on a formula $A$ has the following properties:
253
  \begin{itemize}
254
  \item At least one of the two premises of the cut has above it a sub-branch of the proof which starts (top-down) with a nonlogical step (R) with $A$ as one of its principal formulas, and then a sequence of anchored cuts in which $A$ is part of the context.
255
  \item The other premise is either of the same form or is a logical step with principal formula $A$. 
256
  \end{itemize}
257
   
258
   Now we have (see \cite{Takeuti87}): 
259
   \begin{theorem}
260
   [Free-cut elimination]\label{thm:freecutelimination}
261
   \label{thm:free-cut-elim}
262
    Given a system  $\mathcal{S}$, any  $\mathcal{S}$-proof $\pi$ can be transformed into a $\system$-proof $\pi'$ with same end sequent and without any free-cut.
263
   \end{theorem}
264
   Now we want to deduce from that theorem a normal form property for proofs of certain formulas. But before that let us define some particular classes of sequents and proofs.
265
   
266
   Say that a sequent $\Gamma \seqar \Delta$ is \textit{well-typed} if for any free variable $x$ occurring in $\Gamma$ or $\Delta$, there exists a formula $\safe(x)$ or $\normal(x)$ in $\Gamma$. A proof is well-typed if its sequence are.
267
   
268
   \begin{lemma}\label{lem:welltyped}
269
   If a well-typed sequent $\Gamma \seqar \Delta$ is provable, then there exists $\vec u$  such that
270
 the sequent $\normal(\vec u), \Gamma \seqar \Delta$ admits a well-typed proof.
271
   \end{lemma}
272
   \patrick{It seems to me the statement had to be modified so as to prove the lemma. Maybe I misunderstand something.}
273
   \begin{proof}[Proof sketch]
274
   First by Thm \ref{thm:freecutelimination} we know that $\Gamma \seqar \Delta$ admits a proof $\pi$ without any free-cut. Let us then prove that $\pi$ can be transformed in a proof $\pi'$ of conclusion of the form  $\normal(\vec u), \Gamma \seqar \Delta$ and such that, for any sequent, if it is well-typed then its premises are well-typed.
275
  
276
   Observe first that by definition of $\arith^i$ and the absence of free cut, all quantifiers occurring in a formula of the proof are of one of the forms
277
   $\forall^{\safe}$,   $\exists^{\safe}$,  $\forall^{\normal}$,   $\exists^{\normal}$, and for the last two ones they are sharply bounded.
278
  
279
  Then, one can check that for all rules but the quantifier rules and the cut rule, if the conclusion is well-typed, then so are the two premises.  For the remaining rules, $\forall-r$ and $\exists-l$ are unproblematic, because of the observation above. Let us now examine the case of $\exists-r$, with a $\safe$ label, and the other rules can be treated in the same way. In the premise we get a formula $\safe(t) \cand A(t)$. Then what we do is that, if  $\vec u$ denote the free variables of $t$, we add to the context of all sequents of the proof $\normal(\vec u)$. We obtain in this way a valid proof new proof,  and the premises of the rule have become well-typed.
280
       \end{proof}
281
     
282
     \patrick{As mentioned after Def 14, I don't think that we can prove that the proofs we consider are equivalent to integer positive proofs, by arguing that negative occurrences $\neg \safe(t)$ could be replaced by 'false', by using the lemma above. Indeed even if for all its free variables we have $\safe(\vec x)$, $\normal(\vec u)$ on the l.h.s. of the sequent, it is not clear to me why that would prove $\safe(t)$. My proposition is thus to restrict 'by definition' of $\arith^i$ to integer positive formulas.}
283
     
284
 \begin{theorem}
285
   Assume the $\arith^i$ sequent calculus proves a closed formula $\forall \vec u^\normal . \forall \vec x^\safe . \exists y^\safe . A(\vec u ; \vec x , y)$. Then there exists a proof $\pi$ of the sequent 
286
   $\normal(\vec u), \safe(\vec x) \seqar \exists y^\safe . A(\vec u ; \vec x , y)$ satisfying:
287
   \begin{enumerate}
288
    \item $\pi$  only contains  $\Sigma^\safe_{i}$ formulas,
289
    \item $\pi$ is a well-typed and integer-positive proof. 
290
   \end{enumerate}
291
   \end{theorem}