-
Notifications
You must be signed in to change notification settings - Fork 21
/
Copy pathhints.txt
270 lines (216 loc) · 10.2 KB
/
hints.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
Hints on effectively using TLAPS
The TLA+ proof system is designed to check the validity of claims as
independently as possible of specific proof back-ends. We believe
that users should concentrate on writing proofs in terms of their
particular applications, not in terms of the capabilities of a
particular proof system. In particular, TLAPS invokes its back-ends
with some default setup for automatic proof, and we try to make it
hard for users to change this default setup. Expert users of back-end
provers may be frustrated because they may have to develop proofs
somewhat further than what would be necessary with a fine-tuned tactic
script. The main payoff of limited access to the nitty-gritty details
of provers is greater clarity of the resulting proofs. They are also
easier to maintain across minor changes of the specification or new
releases of the TLA prover.
On some occasions, users will encounter situations where the prover
cannot prove an "obvious" proof obligation. Here are a few hints on
what to try to make the proof go through. Your additions to this
lists are welcome.
HINTS FOR SOLVING PARTICULAR PROBLEMS
1. Proving Simple Facts About Large Formulas
-----------------------------------------
Our provers are currently not good at making abstractions that humans
understand immediately. They are easily confused by moderately big
proof obligations and are just as likely to work on a top-level
conjunction as on a set construction buried deeply inside the formula.
This can cause back-ends to become very slow or even unable to
complete seemingly trivial steps.
If you are trouble proving a fact that seems to follow by simple
reasoning about large formulas, try making the formulas small by using
hidden definitions. (Keep in mind that definitions introduced in a
proof are usable by default and must be hidden explicitly, unlike
definitions in specifications, which must be explicitly USEd.)
Here is a contrived example:
LEMMA /\ x \in SomeVeryBigExpression
/\ y \in AnotherBigExpression
<=>
/\ y \in AnotherBigExpression
/\ x \in SomeVeryBigExpression
<1> DEFINE S == SomeVeryBigExpression
\** here and in the following, you may use positional subexpression
\** names to avoid repeating the big expressions
<1> DEFINE T == AnotherBigExpression
<1>1. x \in S <=> x \in SomeVeryBigExpression
OBVIOUS
\** The provers should not have any trouble proving that
\** a very large expression is equivalent to itself. If it
\** can't prove this, you've almost certainly made a mistake
\** in the definition of S.
<1>2. y \in T <=> y \in AnotherBigExpression
OBVIOUS
<1> HIDE DEF S, T
<1>3. /\ x \in S
/\ y \in T
<=>
/\ y \in T
/\ x \in S
OBVIOUS
<1>4. QED
BY <1>1, <1>2, <1>3
This kind of problem typically arises when reasoning about LET
expressions, which are silently expanded by the proof manager. In a
proof, introduce local definitions corresponding to the LET (using
copy and paste from the specification or subexpression names), show
that the overall expression equals the body of the LET (trivial by
reflexivity), establish the necessary facts about these locally
defined operators, and HIDE the definitions afterwards.
Introducing definitions to hide irrelevant expressions is a useful
thing to do whenever you're having trouble proving something. It
makes the obligations whose proofs fail easier to read, making
mistakes easier to find.
2. Avoiding "Circular" (Sets of) Equations
----------------------------------------
Rewriting is one effective way to reason about equations, and it underlies the
automatic proof methods used by the Isabelle back-end. The basic idea is to
orient equalities such that the expressions on the left-hand side are
systematically replaced by the right-hand sides. However, if the set of
equations contains cycles as in
s = f(t)
t = g(s)
then rewriting may never terminate. Isabelle employs some (incomplete)
heuristics to detect such cycles and will refuse to rewrite equations that it
determines to be circular. This usually leads to its inability to infer anything
about these equations. If circularity is not detected, it may cause Isabelle to
enter an infinite loop. The suggested remedy is again to introduce local
definitions that are hidden to break the loops.
As a concrete example consider the following proof snippet:
<4>17. foo.name = "xyz"
<5>1. foo = [name |-> "xyz", value = foo.value]
BY <2>2
<5>2. QED
BY <5>1 \** may not work because <5>1 is a circular equation
One possible workaround is as follows:
<4>17. foo.name = "xyz"
<5> DEFINE fooval == foo.value
<5>1. foo = [name |-> "xyz", value = fooval]
BY <2>2
<5> HIDE DEF fooval
<5>2. QED
BY <5>1
3. Reasoning About CHOOSE
----------------------
Consider a definition such as
foo == CHOOSE x \in S : P(x)
In order to prove a property Q(foo), you will typically prove the two following
assertions:
(a) \E x \in S : P(x)
(b) \A x \in S : P(x) => Q(x)
In some cases, assertion (b) can be trivial and need not be shown explicitly.
Reasoning about an unbounded CHOOSE expression is analogous.
Remember that CHOOSE always denotes some value, even if P(x) holds for no
x \in S (in particular, if S = {}), in which case the CHOOSE expression is
fixed, but arbitrary. In practice, CHOOSE expressions usually arise when
condition (a) is satisfied. Should you have designed your property to work even
if the domain of the CHOOSE is empty, property Q must be trivial in that case,
and you can structure your proof as follows:
<3>5. Q(foo)
<4>1. CASE \E x \in S : P(x)
<5>1. \A x \in S : P(x) => Q(x)
<5>2. QED
BY <4>1, <5>1
<4>2. CASE ~ \E x \in S : P(x)
<5>1. \A x : Q(x)
<5>2. QED
BY <5>1
<4>3. QED
BY <4>1, <4>2
A frequent TLA+ idiom is to define a "null" value by writing
NoValue == CHOOSE x : x \notin Value
The laws of set theory ensure that no set is universal, hence there exists an x
that is not an element of set Value, ensuring condition (a) above. The theorem
NoSetContainsEverything in the standard module TLAPS can be used to prove this
condition.
4. Reasoning About Records
-----------------------
4a. Using the Values of Record Fields
---------------------------------
In one proof, we had
mb == [type |-> "1b", bal |-> b, acc |-> self,
mCBal |-> maxCBal[self], mCVal |-> maxCVal[self]]
and were trying to prove
m1 # mb /\ m2 # mb
from facts that included
m1.type = "2av" /\ m2.type = "2av"
Zenon failed on the proof and Isabelle proved it only after a long
time. (In fact, we originally stopped the proof because it was taking
so long.) However, Zenon proved it instantly when we added mb.type =
"1b" to the BY statement's list of facts. The provers are reluctant
to try finding relations of the form record.field = value; they often
need help.
4b. Deducing the Value of a Record
------------------------------
You may run into trouble deducing a fact of the form
r = [a1 |-> v1, ... , aN |-> vN]
from
r \in [a1 : S1, ... , aN : SN]
and facts about the values of r's fields. An archtypal example is
proving the following trivial fact:
THEOREM ASSUME NEW A, NEW B, NEW C,
NEW r \in [a : A, b : B, c : C]
PROVE r = [a |-> r.a, b |-> r.b, c |-> r.c]
The proof of this is simple, but not easy to discover. The trick in
the following proof should enable you to solve this class of problem.
<1> DEFINE rx == [a |-> r.a, b |-> r.b, c |-> r.c]
<1>1. /\ rx = [i \in {"a", "b", "c"} |-> rx[i]]
/\ r = [i \in {"a", "b", "c"} |-> r[i]]
OBVIOUS
<1>2. QED
BY <1>1
GENERAL HINTS FOR WRITING TLAPS PROOFS
5. Divide and Conquer
------------------
When the provers can't prove something that you think is obvious, it's
usually because it isn't true. You can easily spend hours looking at
a proof obligation without noticing a tiny mistake. The best way to
find a mistake is by breaking the proof into simpler steps.
Continuing to do this on the step or steps whose proof fails will
eventually lead you to discover the problem--usually a missing
hypothesis or a mistake in a formula. When you correct the mistake in
the original proof step, the prover will usually be able to prove it.
6. Don't Reinvent Mathematics
--------------------------
We expect that most people who use TLAPS will do so because they want
to verify properties of an algorithm or system. We have therefore not
devoted our limited resources to building libraries of mathematical
results. If you want to create such libraries, we would welcome your
help. However, if you are concerned with an algorithm or system, you
should not be spending your time proving basic mathematical facts.
Instead, you should assert the mathematical theorems you need as
assumptions or theorems.
Asserting facts is dangerous, because it's easy to make a mistake and
assert something false, making your entire proof unsound.
Fortunately, you can use the TLC model checker to avoid such mistakes.
For example, our example correctness proof of Euclid's algorithm uses
this assumption
ASSUME GCDProperty3 ==
\A p, q \in Nat \ {0}: (p < q) => GCD(p, q) = GCD(p, q-p)
TLC cannot check this assumption because it can't evaluate a
quantification over an infinite set. However, you can tell TLC to
replace the definition of Nat with
Nat == 0..50
(In the Toolbox, use the Definition Override section of the model's
Advanced Options page.) TLC quickly verifies this assumption. (TLC
checks each ASSUME; to add an assumption that you don't want TLC to check,
make it an AXIOM.)
This kind of checking is almost certain to catch an error in
expressing a fundamentally correct mathematical result--except when
the only counterexamples are infinite. Fortunately, this is rarely
the case when the result is needed for reasoning about an algorithm or
system.
7. It's Easier to Prove Something if it's True
-------------------------------------------
Before trying to prove a property of an algorithm or system, try to
check it with TLC. Even if TLC cannot check a large enough model to
catch all errors, running it on a small model can still catch many
simple errors. You will save a lot of time if you let TLC find these
errors instead of discovering them while writing the proof.