Statistiques
| Révision :

root / CSL17 / arithmetic.tex @ 182

Historique | Voir | Annoter | Télécharger (17,83 ko)

1
\section{An arithmetic for the polynomial hierarchy}
2
%Our base language is $\{ 0, \succ{} , + , \times, \smsh , |\cdot| , \leq \}$. 
3
Our base language is defined by the set of functions (and constants) symbols $\{ 0, \succ{} , + , \times, \smsh , |\cdot|\}$ and the set of predicate symbols 
4
 $\{\leq, \safe, \normal \}$.
5
We use classical logic connectives $\neg$, $\cand$, $\cor$, $\forall$, $\exists$. The formula $A \cimp B$ will be a notation for $\neg A \cor B$.
6
We will also use as shorthand notations:
7
$$ (s=t) = (s\leq t) \cand (t\leq s), \quad (s\neq t) = \neg(s=t).$$ 
8
We call \textit{atomic formulas} formulas of the form $(s\leq t)$ or $(s=t)$.
9
 As we are in classical logic, we will assume, without loss of generality, that formulas are in \textit{De Morgan normal form}, that is to say that in formulas negation can only occur on atomic formulas, and that there is not any occurrence of subformula of the form $\neg \neg A$.
10

    
11
Now, let us describe the axioms we are considering.The $\basic$ axioms are as follows:
12
\[
13
\begin{array}{l}
14
\safe (0) \\
15
\forall x^\safe . \safe (\succ{} x) \\
16
\forall x^\safe . 0 \neq \succ{} (x) \\
17
\forall x^\safe , y^\safe . (\succ{} x = \succ{} y \cimp x = y) \\
18
\forall x^\safe . (x = 0 \cor \exists y^\safe.\  x = \succ{} y   )\\
19
\forall x^\safe, y^\safe . \safe(x+y)\\
20
\forall u^\normal, x^\safe . \safe(u\times x) \\
21
\forall u^\normal , v^\normal . \safe (u \smsh v)\\
22
\forall u^\normal \safe(u) 
23
\end{array}
24
\]
25
\anupam{in fact, we use essentially the same language, so just take Buss' Basic axioms after proper typing. Should also add the symbol $\hlf{\cdot}$ for binary predecessor then we have the full language of bounded arithmetic.}
26

    
27

    
28
Notation: if $\vec t=t_0,\dots, t_k$, we will denote as $\safe(\vec t)$ the sequence of formulas $\safe(t_0),\dots, \safe(t_k)$. Similarly for $\normal(\vec t)$.
29
  
30
\begin{definition}
31
[Derived functions and notations]
32
We write $1,2,3,\dots$ for the terms $\succ{} 0, \succ{} \succ{} 0, \succ{} \succ{} \succ{} 0 \dots$, and frequently omit the $\times$ symbol.
33
We define the functions $\succ 0 x , \succ 1 x$ as $2 x$ and $2x +1$ respectively.
34

    
35
Need $bit$, $\beta$ , $\pair{}{}{}$.
36
\end{definition}
37

    
38
(Here use a variation of S12 with sharply bounded quantifiers and safe quantifiers)
39

    
40
Use base theory + sharply bounded quantifiers.
41

    
42

    
43
\begin{definition}
44
[Quantifier hierarchy]
45
$\Sigma^\safe_0 = \Pi^\safe_0 $ is the set of formulae whose only quantifiers are sharply bounded.
46
We define $\Sigma^\safe_{i+1}$ as the closure of $\Pi^\safe_i $ under $\cor, \cand $, safe existentials and sharply bounded quantifiers.
47
We define $\Pi^\safe_{i+1}$ as the closure of $\Sigma^\safe_i $ under $\cor, \cand $, safe universals and sharply bounded quantifiers.
48
\end{definition}
49

    
50

    
51
\anupam{Collection principles for prenexing? Otherwise need to add closure under sharply bounded quantifiers.}
52
\begin{definition}\label{def:polynomialinduction}
53
[Polynomial induction]
54
The \emph{polynomial induction} axiom schema, $\pind$, consists of the following axioms,
55
\[
56
A(0) 
57
\cimp (\forall x^{\normal} . ( A(x) \cimp A(\succ{0} x) ) )
58
\cimp  (\forall x^{\normal} . ( A(x) \cimp A(\succ{1} x) ) ) 
59
\cimp  \forall x^{\normal} . A(x)
60
\]
61
for each formula $A(x)$.
62

    
63
For a class $\Xi$ of formulae, $\cax{\Xi}{\pind}$ denotes the set of induction axioms when $A(x) \in \Xi$. 
64

    
65
%We write $I\Xi$ to denote the theory consisting of $\basic$ and $\cax{\Xi}{\ind}$.
66
\end{definition}
67

    
68

    
69
\begin{definition}\label{def:ariththeory}
70
Define the theory $\arith^i$ consisting of the following axioms:
71
\begin{itemize}
72
	\item $\basic$;
73
	\item $\cpind{\Sigma^\safe_i } $:
74
\end{itemize}
75
and an inference rule, called $\rais$, for closed formulas $\exists y^\normal . A$:
76
\[
77
 \dfrac{\forall \vec x^\normal . \exists  y^\safe .  A }{ \forall \vec x^\normal .\exists y^\normal . A}
78
\]
79
\end{definition}
80
\patrick{I think in the definition of  $\arith^i$ we should impose that the formulas considered are \textit{integer positive}, that is to say that the only negative occurrences of atoms $\safe(t)$, $\normal(t)$ are those occurring in $\forall^{\safe}$ and $\forall^{\normal}$.  Indeed I don't think this can be just proved to be a consequence of a kind of 'normal form' of proofs, as we had discussed (see sect 4.4)}
81

    
82
\anupam{In induction,for inductive cases, need $u\neq 0$ for $\succ 0$ case.}
83

    
84
\begin{lemma}
85
[Sharply bounded lemma]
86
Let $f_A$ be the characteristic function of a predicate $A(u , \vec u ; \vec x)$.
87
Then the characteristic functions of $\forall u \prefix v . A(u,\vec u ; \vec x)$ and $\exists u \prefix v . A(u , \vec u ; \vec x)$ are in $\bc(f_A)$.
88
\end{lemma}
89
\begin{proof}
90
	We give the $\forall$ case, the $\exists$ case being dual.
91
	The characteristic function $f(v , \vec u ; \vec x)$ is defined by predicative recursion on $v$ as:
92
	\[
93
	\begin{array}{rcl}
94
	f(0, \vec u ; \vec x) & \dfn & f_A (0 , \vec u ; \vec x) \\
95
	f(\succ i v , \vec u ; \vec x) & \dfn & \cond ( ; f_A (\succ i v, \vec u ; \vec x) , 0 , f(v , \vec u ; \vec x) )
96
	\end{array}
97
	\]
98
\end{proof}
99

    
100
Notice that $\prefix$ suffices to encode usual sharply bounded inequalities,
101
since $\forall u \leq |t| . A(u , \vec u ; \vec x) \ciff \forall u \prefix t . A(|u|, \vec u ; \vec x)$.
102

    
103

    
104
\subsection{Graphs of some basic functions}
105
Todo: $+1$,  
106

    
107
\subsection{Encoding sequences in the arithmetic}
108
\todo{}
109

    
110
\anupam{Assume we have a $\Sigma^\safe_1$ predicate $\beta(i,x,y)$, expressing that the $i$th element of the sequence $x$ is $y$, such that $\arith^1 \proves \forall i^\normal , x^\safe . \exists ! y^\safe . \beta (i,x,y)$.}
111

    
112

    
113
\subsection{A sequent calculus presentation}
114

    
115
\begin{figure}
116
\[
117
\small
118
\begin{array}{l}
119
\begin{array}{cccc}
120
%\vlinf{\lefrul{\bot}}{}{p, \lnot{p} \seqar }{}
121
%& \vlinf{\id}{}{p \seqar p}{}
122
%& \vlinf{\rigrul{\bot}}{}{\seqar p, \lnot{p}}{}
123
%& \vliinf{\cut}{}{\Gamma, \Sigma \seqar \Delta , \Pi}{ \Gamma \seqar \Delta, A }{\Sigma, A \seqar \Pi}
124
 \vlinf{id}{}{\Gamma, p \seqar p, \Delta }{}
125
& \vliinf{cut}{}{\Gamma \seqar \Delta }{ \Gamma \seqar \Delta, A }{\Gamma, A \seqar \Delta}
126
&&
127
\\
128
\noalign{\bigskip}
129
%\noalign{\bigskip}
130
\vliinf{\lefrul{\cor}}{}{\Gamma, A \cor B \seqar \Delta}{\Gamma , A \seqar \Delta}{\Gamma, B \seqar \Delta}
131
&
132
\vlinf{\lefrul{\cand}}{}{\Gamma, A\cand B \seqar \Delta}{\Gamma, A , B \seqar \Delta}
133
&
134
%\vlinf{\lefrul{\laand}}{}{\Gamma, A\laand B \seqar \Delta}{\Gamma, B \seqar \Delta}
135
%\quad
136
\vlinf{\rigrul{\cor}}{}{\Gamma \seqar \Delta, A \cor B}{\Gamma \seqar \Delta, A, B}
137
&
138
%\vlinf{\rigrul{\laor}}{}{\Gamma \seqar \Delta, A\laor B}{\Gamma \seqar \Delta, B}
139
%\quad
140
\vliinf{\rigrul{\cand}}{}{\Gamma \seqar \Delta, A \cand B }{\Gamma \seqar \Delta, A}{\Gamma \seqar \Delta, B}
141
\\
142
\noalign{\bigskip}
143

    
144
\vlinf{\lefrul{\neg}}{}{\Gamma, \neg A \seqar \Delta}{\Gamma \seqar A, \Delta}
145
&
146

    
147
\vlinf{\lefrul{\neg}}{}{\Gamma, \seqar \neg A, \Delta}{\Gamma, A \seqar  \Delta}
148
&
149
&
150
%\vliinf{\lefrul{\cimp}}{}{\Gamma, A \cimp B \seqar \Delta}{\Gamma \seqar A, \Delta}{\Gamma, B \seqar \Delta}
151
%&
152
%
153
%\vlinf{\rigrul{\cimp}}{}{\Gamma \seqar \Delta, A \cimp B}{\Gamma, A \seqar \Delta,  B}
154

    
155

    
156
\\
157

    
158
\noalign{\bigskip}
159
%\text{Structural:} & & & \\
160
%\noalign{\bigskip}
161

    
162
%\vlinf{\lefrul{\wk}}{}{\Gamma, A \seqar \Delta}{\Gamma \seqar \Delta}
163
%&
164
\vlinf{\lefrul{\cntr}}{}{\Gamma, A \seqar \Delta}{\Gamma, A, A \seqar \Delta}
165
%&
166
%\vlinf{\rigrul{\wk}}{}{\Gamma \seqar \Delta, A }{\Gamma \seqar \Delta}
167
&
168
\vlinf{\rigrul{\cntr}}{}{\Gamma \seqar \Delta, A}{\Gamma \seqar \Delta, A, A}
169
&
170
&
171
\\
172
\noalign{\bigskip}
173
\vlinf{\lefrul{\exists}}{}{\Gamma, \exists x . A(x) \seqar \Delta}{\Gamma, A(a) \seqar \Delta}
174
&
175
\vlinf{\lefrul{\forall}}{}{\Gamma, \forall x. A(x) \seqar \Delta}{\Gamma, A(t) \seqar \Delta}
176
&
177
\vlinf{\rigrul{\exists}}{}{\Gamma \seqar \Delta, \exists x . A(x)}{ \Gamma \seqar \Delta, A(t)}
178
&
179
\vlinf{\rigrul{\forall}}{}{\Gamma \seqar \Delta, \forall x . A(x)}{ \Gamma \seqar \Delta, A(a) } \\
180
%\noalign{\bigskip}
181
% \vliinf{mix}{}{\Gamma, \Sigma \seqar \Delta , \Pi}{ \Gamma \seqar \Delta}{\Sigma \seqar \Pi} &&&
182
\end{array}
183
\end{array}
184
\]
185
\caption{Sequent calculus rules}\label{fig:sequentcalculus}
186
\end{figure}
187
 We denote sequence as $\Gamma \seqar \Delta$ where $\Gamma$, $\Delta$ are multi sets of formulas. The sequent calculus rules are displayed on Fig. \ref{fig:sequentcalculus},  where $p$ is atomic, $i \in \{ 1,2 \}$, $t$ is a term and the eigenvariable $a$ does not occur free in $\Gamma$ or $\Delta$.
188

    
189
We consider \emph{systems} of `nonlogical' rules extending this sequent calculus, which we write as follows,
190
 \[
191
 \begin{array}{cc}
192
    \vlinf{(R)}{}{ \Gamma , \Sigma' \seqar \Delta' , \Pi  }{ \{\Gamma , \Sigma_i \seqar \Delta_i , \Pi \}_{i \in I} }
193
\end{array}
194
\]
195
 where, in each rule $(R)$, $I$ is a finite possibly empty set (indicating the number of premises) and we assume the following conditions and terminology:
196
 \begin{enumerate}
197
 \item In $(R)$ the formulas of $\Sigma', \Delta'$  are called \textit{principal}, those of $\Sigma_i, \Delta_i$ are called \textit{active}, and those of   
198
$ \Gamma,  \Pi$ are called \textit{context formulas}. 
199
\item Each rule $(R)$ comes with a list $a_1$, \dots, $a_k$ of eigenvariables such that each $a_j$ appears in exactly one $\Sigma_i, \Delta_i$ (so in some active formulas of exactly one premise)  and does not appear in  $\Sigma', \Delta'$ or $ \Gamma,  \Pi$.
200
    \item A system $\mathcal{S}$ of rules must be closed under substitutions of free variables by terms (where these substitutions do not contain the eigenvariables $a_j$ in their domain or codomain).  
201
   \end{enumerate}
202
 
203
%The distinction between modal and nonmodal formulae in $(R)$ induces condition 1
204
 Conditions 2 and 3 are standard requirements for nonlogical rules, independently of the logical setting, cf.\ \cite{Beckmann11}. Condition 2 reflects the intuitive idea that, in our nonlogical rules, we often need a notion of \textit{bound} variables in the active formulas (typically for induction rules), for which we rely on eigenvariables. Condition 3 is needed for our proof system to admit elimination of cuts on quantified formulas.
205

    
206
%\begin{definition}
207
%[Polynomial induction]
208
%The \emph{polynomial induction} axiom schema, $\pind$, consists of the following axioms,
209
%\[
210
%A(0) 
211
%\cimp (\forall x^{\normal} . ( A(x) \cimp A(\succ{0} x) ) )
212
%\cimp  (\forall x^{\normal} . ( A(x) \cimp A(\succ{1} x) ) ) 
213
%\cimp  \forall x^{\normal} . A(x)
214
%\]
215
%for each formula $A(x)$.
216
%
217
%For a class $\Xi$ of formulae, $\cax{\Xi}{\pind}$ denotes the set of induction axioms when $A(x) \in \Xi$. 
218
%
219
%We write $I\Xi$ to denote the theory consisting of $\basic$ and $\cax{\Xi}{\ind}$.
220
%\end{definition}
221

    
222
As an example any axiom can be represented by such a nonlogical rule $(R)$, with no premise ($I=\emptyset$), $\Delta'$ equal to the axiom and $\Gamma=\Sigma'=\Pi$. For instance the axiom $\pind$ of Def. \ref{def:polynomialinduction}.
223

    
224
Actually  $\pind$ is equivalent to the following rule:
225
\begin{equation}
226
\label{eqn:ind-rule}
227
\small
228
\vliinf{\pind}{}{ \normal(t) , \Gamma , A(0) \seqar A(t), \Delta }{ \normal(a) , \Gamma, A(a) \seqar A(\succ{0} a) , \Delta }{ \normal(a) , \Gamma, A(a) \seqar A(\succ{1} a) , \Delta  }
229
\end{equation}
230
where $I=2$ and  in all cases, $t$ varies over arbitrary terms and the eigenvariable $a$ does not occur in the lower sequent of the $\pind$ rule.
231

    
232
Similarly the $\rais$ inference rule of Def. \ref{def:ariththeory} is represented by the nonlogical rule:
233
 \[
234
 \begin{array}{cc}
235
    \vlinf{\rais}{}{  \normal(t_1), \dots, \normal(t_k) \seqar  \exists  y^\normal .  A }{  \normal(t_1), \dots, \normal(t_k) \seqar \exists  y^\safe .  A}
236
\end{array}
237
\]
238

    
239
%\patrick{In fact, I think we rather need the following nonlogical rule, which implies the previous one but is I guess more general:
240
%\[
241
% \begin{array}{cc}
242
%    \vlinf{\rais}{}{  \normal(t_1), \dots, \normal(t_k) \seqar  \normal(t) }{  \normal(t_1), \dots, \normal(t_k) \seqar \safe(t)}
243
%\end{array}
244
%\]
245
%}
246

    
247
The $\basic$ axioms are equivalent to the following nonlogical rules, that we will also designate by $\basic$:
248
\[
249
\small
250
\begin{array}{l}
251
\begin{array}{cccc}
252
\vlinf{}{}{\seqar \safe (0)}{}&
253
\vlinf{}{}{\safe(t) \seqar \safe(\succ{} t)}{}&
254
\vlinf{}{}{ \safe (t)   \seqar 0 \neq \succ{} t}{} &
255
\vlinf{}{}{\safe (s) , \safe (t)  , \succ{} s = \succ{} t\seqar s = t }{}\\
256
\end{array}
257
\\
258
\vlinf{}{}{\safe (t) \seqar t = 0 \cor \exists y^\safe . t = \succ{} y  }{}
259
\qquad
260
\vlinf{}{}{\safe(s), \safe(t) \seqar \safe(s+t) }{}\\
261
\vlinf{}{}{\normal (s), \safe(t) \seqar \safe(s \times t)  }{}
262
\qquad
263
\vlinf{}{}{\normal (s), \normal(t) \seqar \safe(s \smsh t)  }{}\\
264
\vlinf{}{}{\normal(t) \seqar \safe(t)  }{}
265
\end{array}
266
\]
267

    
268
 The sequent calculus for $\arith^i$ is that of Fig. \ref{fig:sequentcalculus} extended with the $\basic$,  $\cpind{\Sigma^\safe_i } $ and $\rais$ nonlogical rules.
269
 
270
 \begin{lemma}
271
 For any term $t$, its free variables can be split in two sets $\vec{x}$ and $\vec{y}$ such  that the sequent $\normal(\vec x), \safe(\vec y) \seqar \safe(t)$ is provable. 
272
 \end{lemma}
273
 
274
\subsection{Free-cut free normal form of proofs}
275
\todo{State theorem, with references (Takeuti, Cook-Nguyen) and present the important corollaries for this work.}
276

    
277
Since our nonlogical rules may have many principal formulae on which cuts may be anchored, we need a slightly more general notion of principality.
278
    \begin{definition}\label{def:anchoredcut}
279
  We define the notions of \textit{hereditarily principal formula} and \textit{anchored cut} in a $\system$-proof, for a system $\system$, by mutual induction as follows:
280
  \begin{itemize}
281
  \item A formula $A$ in a sequent $\Gamma \seqar \Delta$ is \textit{hereditarily principal} for a rule instance (S) if either (i) the sequent is in the conclusion of (S) and $A$ is principal in it, or 
282
(ii)  the sequent is in the conclusion of an anchored cut, the direct ancestor of $A$ in the corresponding premise is hereditarily principal for the rule instance (S), and the rule (S) is nonlogical.
283
  \item A cut-step is an \textit{anchored cut} if the two occurrences of its cut-formula $A$ in each premise are hereditarily principal for nonlogical steps, or one is hereditarily principal for a nonlogical step and the other one is principal for a logical step.
284
  \end{itemize}
285
     A cut which is not anchored will also be called a \textit{free-cut}.
286
  \end{definition}
287
  As a consequence of this definition, an anchored cut on a formula $A$ has the following properties:
288
  \begin{itemize}
289
  \item At least one of the two premises of the cut has above it a sub-branch of the proof which starts (top-down) with a nonlogical step (R) with $A$ as one of its principal formulas, and then a sequence of anchored cuts in which $A$ is part of the context.
290
  \item The other premise is either of the same form or is a logical step with principal formula $A$. 
291
  \end{itemize}
292
   
293
   Now we have (see \cite{Takeuti87}): 
294
   \begin{theorem}
295
   [Free-cut elimination]\label{thm:freecutelimination}
296
   \label{thm:free-cut-elim}
297
    Given a system  $\mathcal{S}$, any  $\mathcal{S}$-proof $\pi$ can be transformed into a $\system$-proof $\pi'$ with same end sequent and without any free-cut.
298
   \end{theorem}
299
   Now we want to deduce from that theorem a normal form property for proofs of certain formulas. But before that let us define some particular classes of sequents and proofs.
300
   
301
   Say that a sequent $\Gamma \seqar \Delta$ is \textit{well-typed} if for any free variable $x$ occurring in $\Gamma$ or $\Delta$, there exists a formula $\safe(x)$ or $\normal(x)$ in $\Gamma$. A proof is well-typed if its sequence are.
302
   
303
   \begin{lemma}\label{lem:welltyped}
304
   If a well-typed sequent $\Gamma \seqar \Delta$ is provable, then there exists $\vec u$  such that
305
 the sequent $\normal(\vec u), \Gamma \seqar \Delta$ admits a well-typed proof.
306
   \end{lemma}
307
   \patrick{It seems to me the statement had to be modified so as to prove the lemma. Maybe I misunderstand something.}
308
   \begin{proof}[Proof sketch]
309
   First by Thm \ref{thm:freecutelimination} we know that $\Gamma \seqar \Delta$ admits a proof $\pi$ without any free-cut. Let us then prove that $\pi$ can be transformed in a proof $\pi'$ of conclusion of the form  $\normal(\vec u), \Gamma \seqar \Delta$ and such that, for any sequent, if it is well-typed then its premises are well-typed.
310
  
311
   Observe first that by definition of $\arith^i$ and the absence of free cut, all quantifiers occurring in a formula of the proof are of one of the forms
312
   $\forall^{\safe}$,   $\exists^{\safe}$,  $\forall^{\normal}$,   $\exists^{\normal}$, and for the last two ones they are sharply bounded.
313
  
314
  Then, one can check that for all rules but the quantifier rules and the cut rule, if the conclusion is well-typed, then so are the two premises.  For the remaining rules, $\forall-r$ and $\exists-l$ are unproblematic, because of the observation above. Let us now examine the case of $\exists-r$, with a $\safe$ label, and the other rules can be treated in the same way. In the premise we get a formula $\safe(t) \cand A(t)$. Then what we do is that, if  $\vec u$ denote the free variables of $t$, we add to the context of all sequents of the proof $\normal(\vec u)$. We obtain in this way a valid proof new proof,  and the premises of the rule have become well-typed.
315
       \end{proof}
316
     
317
     \patrick{As mentioned after Def 14, I don't think that we can prove that the proofs we consider are equivalent to integer positive proofs, by arguing that negative occurrences $\neg \safe(t)$ could be replaced by 'false', by using the lemma above. Indeed even if for all its free variables we have $\safe(\vec x)$, $\normal(\vec u)$ on the l.h.s. of the sequent, it is not clear to me why that would prove $\safe(t)$. My proposition is thus to restrict 'by definition' of $\arith^i$ to integer positive formulas.}
318
     
319
 \begin{theorem}
320
   Assume the $\arith^i$ sequent calculus proves a closed formula $\forall \vec u^\normal . \forall \vec x^\safe . \exists y^\safe . A(\vec u ; \vec x , y)$. Then there exists a proof $\pi$ of the sequent 
321
   $\normal(\vec u), \safe(\vec x) \seqar \exists y^\safe . A(\vec u ; \vec x , y)$ satisfying:
322
   \begin{enumerate}
323
    \item $\pi$  only contains  $\Sigma^\safe_{i}$ formulas,
324
    \item $\pi$ is a well-typed and integer-positive proof. 
325
   \end{enumerate}
326
   \end{theorem}