# SMT + SymEx Lab

The goal of this lab is to build a working symbolic execution engine on top of
our SAT solver.

### STOP!!!
Before you proceed, make sure you do one of:
- If you finished the "fast" SAT solver last week, copy the `fast` binary into
  `code/` as `code/fast`.
- If you only finished the "basic" SAT solver last week, copy the `basic`
  binary into `code/` as `code/fast` (yes, pretend it's fast!).
- If you finished neither, but downloaded minisat and it's on your path, pass
  `-DUSE_MINISAT` to all compilation steps (see the Makefile).

### SMT Solver
The core of our symex engine will be an SMT solver. An SMT solver is a nice
frontend to a SAT solver, allowing you to construct constraints involving
arrays and bitvectors (finite bitwidth integers, like in C).

For simplicity, we'll be building an eager SMT solver, an approach also known
as bit blasting. This is the same fundamental approach used by the STP solver:
https://github.com/stp/stp

The idea is to "compile" the higher-level constraints into a big SAT problem,
then use our SAT solver to solve this problem. This approach is nice because
it makes a clean separation between the SMT solver and SAT solver: you can test
your SMT solver against any SAT solver to better localize bugs, and we don't
have to change our SAT solver at all.

When you create a bitvector of size N, the solver actually creates N new
variables in the SAT problem, one for each bit of the bitvector. The method
`bv_eq(b1,b2)` creates a new SAT variable `v_eq` and spits out clauses ensuring
that `v_eq` is true if and only if all of the bits in `b1` and `b2` are
identical. Similarly, `bv_add(b1,b2)` essentially spits out a ripple-carry
adder in the underlying SAT constraints.
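
To make the ripple-carry idea concrete, here is a minimal, self-contained
sketch. It is not the lab's `smt.c`: the clause store and the names
`fresh_var`, `add_clause`, `full_adder`, `ripple_add`, `satisfied` and
`check_adder_2bit` are all hypothetical, and it assumes wraparound addition
with the final carry dropped. The brute-force checker at the end stands in for
a SAT solver on this tiny instance.

```c
#include <assert.h>

#define MAX_CLAUSES 256
#define MAX_LITS 4

/* Clause database: each clause is a 0-terminated list of DIMACS-style literals. */
static int clauses[MAX_CLAUSES][MAX_LITS + 1];
static int n_clauses = 0, n_vars = 0;

static int fresh_var(void) { return ++n_vars; }

/* Add a clause of up to four literals; pass 0 for unused slots. */
static void add_clause(int a, int b, int c, int d) {
    int lits[MAX_LITS] = {a, b, c, d}, k = 0;
    assert(n_clauses < MAX_CLAUSES);
    for (int i = 0; i < MAX_LITS; i++)
        if (lits[i] != 0) clauses[n_clauses][k++] = lits[i];
    clauses[n_clauses++][k] = 0;
}

/* Full adder: *sum <=> a xor b xor cin, *cout <=> at least two of (a, b, cin). */
static void full_adder(int a, int b, int cin, int *sum, int *cout) {
    int s = fresh_var(), co = fresh_var();
    for (int row = 0; row < 8; row++) { /* one clause per truth-table row */
        int va = row & 1, vb = (row >> 1) & 1, vc = (row >> 2) & 1;
        add_clause(va ? -a : a, vb ? -b : b, vc ? -cin : cin,
                   (va ^ vb ^ vc) ? s : -s);
    }
    add_clause(-a, -b, co, 0);   add_clause(a, b, -co, 0);
    add_clause(-a, -cin, co, 0); add_clause(a, cin, -co, 0);
    add_clause(-b, -cin, co, 0); add_clause(b, cin, -co, 0);
    *sum = s;
    *cout = co;
}

/* out = a + b (mod 2^width); index 0 is the least significant bit. */
static void ripple_add(const int *a, const int *b, int *out, int width) {
    int carry = fresh_var();
    add_clause(-carry, 0, 0, 0); /* unit clause: no carry into bit 0 */
    for (int i = 0; i < width; i++)
        full_adder(a[i], b[i], carry, &out[i], &carry);
}

/* Does the assignment (bit v-1 holds variable v) satisfy every clause? */
static int satisfied(unsigned m) {
    for (int c = 0; c < n_clauses; c++) {
        int ok = 0;
        for (int i = 0; clauses[c][i] != 0; i++) {
            int lit = clauses[c][i], var = lit > 0 ? lit : -lit;
            if ((lit > 0) == (int)((m >> (var - 1)) & 1u)) ok = 1;
        }
        if (!ok) return 0;
    }
    return 1;
}

/* Brute-force check of a 2-bit adder: returns the model count, or -1 on a wrong sum. */
static int check_adder_2bit(void) {
    int a[2], b[2], out[2], models = 0;
    n_clauses = n_vars = 0;
    a[0] = fresh_var(); a[1] = fresh_var();
    b[0] = fresh_var(); b[1] = fresh_var();
    ripple_add(a, b, out, 2);
    for (unsigned m = 0; m < (1u << n_vars); m++) {
        if (!satisfied(m)) continue;
        unsigned va = m & 3u;        /* variables 1 and 2 */
        unsigned vb = (m >> 2) & 3u; /* variables 3 and 4 */
        unsigned vo = ((m >> (out[0] - 1)) & 1u) |
                      (((m >> (out[1] - 1)) & 1u) << 1);
        if (vo != ((va + vb) & 3u)) return -1;
        models++;
    }
    return models;
}
```

Calling `check_adder_2bit()` enumerates all assignments of the nine variables
and should find exactly one model per input pair, each with the correct sum.
The same truth-table trick (one clause per row that forbids the wrong output)
works for any small gate, at the cost of more clauses than a hand-tuned
encoding.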

Arrays are a bit more complicated. When the user requests `arr[x]`, we give
back a fresh bitvector and record separately that it's supposed to be `arr[x]`.
Then, before spitting out the final DIMACS file, we look at every earlier call
to `arr[y]` and add assertions that `arr[x] = arr[y]` if `x = y` (being careful
to handle cases where we overwrite `arr[x]`!).
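
The bookkeeping for reads can be sketched as follows. This is only an
illustration under simplifying assumptions: plain integer handles stand in for
bitvectors, the names `ArrayRead`, `Axiom`, `sketch_array_get` and
`sketch_array_axioms` are hypothetical (they are not the signatures in
`smt.h`), and stores are ignored entirely, so the overwrite case mentioned
above is not handled here.

```c
#include <assert.h>

#define MAX_READS 64

/* Integer handles stand in for bitvectors in this sketch. */
typedef struct { int index_bv; int value_bv; } ArrayRead;

/* Represents the constraint (idx_a == idx_b) => (val_a == val_b). */
typedef struct { int idx_a, idx_b, val_a, val_b; } Axiom;

static ArrayRead reads[MAX_READS];
static int n_reads = 0, next_handle = 100;

/* arr[x]: hand back a fresh bitvector and remember which index it belongs to. */
static int sketch_array_get(int index_bv) {
    assert(n_reads < MAX_READS);
    reads[n_reads].index_bv = index_bv;
    reads[n_reads].value_bv = next_handle++;
    return reads[n_reads++].value_bv;
}

/* Before solving: one consistency axiom per pair of recorded reads.
 * Returns the number of axioms written to out. */
static int sketch_array_axioms(Axiom *out, int cap) {
    int n = 0;
    for (int i = 0; i < n_reads; i++) {
        for (int j = 0; j < i; j++) {
            assert(n < cap);
            out[n++] = (Axiom){reads[i].index_bv, reads[j].index_bv,
                               reads[i].value_bv, reads[j].value_bv};
        }
    }
    return n;
}
```

In a real encoding, each `Axiom` would presumably become a single clause
`{-e_idx, e_val}`, where `e_idx` and `e_val` are the variables returned by
`bv_eq` on the two indices and the two values. Note that the number of axioms
grows quadratically in the number of reads.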

Then, we print the constraints to a DIMACS file, run our SAT solver, and read
back in the results.
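
As a reminder of the file format: a DIMACS CNF file starts with a header line
`p cnf <num_vars> <num_clauses>`, and each clause is a list of nonzero integer
literals terminated by `0`. Below is a minimal sketch of the printing step; the
clause layout and the names `write_dimacs` and `dimacs_demo` are assumptions
made for illustration. Reading the results back is not shown, since the output
format depends on which SAT solver you are running.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

#define MAX_LITS 8

/* Write 0-terminated clauses to out in DIMACS CNF format. */
static void write_dimacs(FILE *out, int n_vars, int n_clauses,
                         int clauses[][MAX_LITS + 1]) {
    fprintf(out, "p cnf %d %d\n", n_vars, n_clauses);
    for (int c = 0; c < n_clauses; c++) {
        for (int i = 0; clauses[c][i] != 0; i++)
            fprintf(out, "%d ", clauses[c][i]);
        fprintf(out, "0\n");
    }
}

/* Demo: encode v3 <=> (v1 and v2) and return the rendered file contents. */
static const char *dimacs_demo(void) {
    static char buf[256];
    int clauses[3][MAX_LITS + 1] = {
        {-3, 1, 0},     /* v3 => v1 */
        {-3, 2, 0},     /* v3 => v2 */
        {-1, -2, 3, 0}, /* v1 and v2 => v3 */
    };
    FILE *tmp = tmpfile();
    assert(tmp != NULL);
    write_dimacs(tmp, 3, 3, clauses);
    rewind(tmp);
    size_t n = fread(buf, 1, sizeof buf - 1, tmp);
    buf[n] = '\0';
    fclose(tmp);
    return buf;
}
```

Taking a `FILE *` rather than a path keeps the writer easy to test and lets the
caller decide whether the output goes to a temporary file or straight to a
pipe.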

Some tips/reminders for the SAT encoding:
- `x <=> y` is the same as `x => y` and `y => x`
- `x => z` is equivalent to the clause `{-x, z}`
- `x and y => z` is `{-x, -y, z}`
- `x or y => z` is `x => z` and `y => z`
- `x => y and z` is `x => y` and `x => z`
- `x => y or z` is `{-x, y, z}`
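
If you ever doubt one of these rewrites, a truth table settles it. The sketch
below checks all six rules over every assignment of `x`, `y` and `z`; the
helper names `implies` and `count_valid_rules` are just for this example.

```c
#include <assert.h>
#include <stdbool.h>

/* Implication defined by cases, so the clause forms below are checked
 * rather than assumed. */
static bool implies(bool p, bool q) { return p ? q : true; }

/* Returns how many of the six rules hold on all 8 assignments of x, y, z. */
static int count_valid_rules(void) {
    bool ok[6] = {true, true, true, true, true, true};
    for (int m = 0; m < 8; m++) {
        bool x = (m & 1) != 0, y = (m & 2) != 0, z = (m & 4) != 0;
        ok[0] = ok[0] && ((x == y) == (implies(x, y) && implies(y, x)));
        ok[1] = ok[1] && (implies(x, z) == (!x || z));            /* {-x, z} */
        ok[2] = ok[2] && (implies(x && y, z) == (!x || !y || z)); /* {-x, -y, z} */
        ok[3] = ok[3] && (implies(x || y, z) == (implies(x, z) && implies(y, z)));
        ok[4] = ok[4] && (implies(x, y && z) == (implies(x, y) && implies(x, z)));
        ok[5] = ok[5] && (implies(x, y || z) == (!x || y || z));  /* {-x, y, z} */
    }
    int valid = 0;
    for (int i = 0; i < 6; i++) valid += ok[i];
    return valid;
}
```

The same exhaustive check is a cheap way to validate any new gate encoding
before wiring it into the solver.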

I have provided some "unit tests" for the SMT solver which you can run with
`make do_tests`. I suggest the following order of implementation:
1. Implement `solve` and `get_solution`
2. Implement `new_bv`, `const_bv`, `bv_eq`
3. Run `make do_tests` --- the first test (`test_basic_bv`) should pass
4. Implement `bv_add`
5. Run `make do_tests` --- the first two tests (`test_basic_bv`, `test_bv_add`)
   should pass
6. Implement `new_array`, `array_store`, `array_get`, `array_axioms`
7. Run `make do_tests` --- all tests should pass

Then you should be in a good position to move on to the SymEx engine!

### Symbolic Interpreter
We'll work on a simple little assembly-like IR. This IR has:
- Arbitrarily many registers
- A word size determined by the `WORD_SIZE` global in `main.c`
- A heap memory indexed by registers
- A branching instruction that jumps to a relative offset if two registers have
  the same value
- A "failure" instruction that indicates any path reaching this is a bug

The idea is basically to implement a little interpreter for this language,
except:
1. Instead of using concrete integers for register/memory values, use
   bitvectors and array operations as exposed by the SMT solver library.
2. When a branch statement is reached, simply fork the interpreter. In one
   process, assert the branch is taken and proceed. In the other, assert it's
   not and proceed.
3. When a failure statement is reached, try to solve the current path
   constraints. If a solution is found, that represents an input that can reach
   this failure location, i.e., a bug. If we visit all possible paths and none
   can reach fail, we've proven the absence of such bugs.
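
The forking strategy can be sketched with plain POSIX `fork`. This toy version
has no SMT solver and no real instructions: it just walks a hypothetical
program containing `n_branches` branch instructions and counts how many paths
get explored. The names `explore` and `count_paths` are made up for this
example, and the comments mark where the real engine would add its path
constraints.

```c
#include <assert.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Walk a toy program with n_branches branch instructions, forking at each one.
 * Every process that reaches the end of its path reports one byte on the pipe. */
static void explore(int n_branches, int report_fd) {
    for (int pc = 0; pc < n_branches; pc++) {
        pid_t pid = fork();
        if (pid < 0) _exit(1);
        if (pid == 0) {
            /* Child: the real engine would assert "branch taken" and jump. */
        } else {
            /* Parent: the real engine would assert "branch not taken". */
        }
    }
    char done = 1;
    if (write(report_fd, &done, 1) != 1) _exit(1);
    while (wait(NULL) > 0) { /* reap the children this process forked */ }
    _exit(0);
}

/* Runs the exploration in a subprocess and counts the paths that reported in. */
static int count_paths(int n_branches) {
    int fds[2];
    if (pipe(fds) != 0) return -1;
    pid_t root = fork();
    if (root < 0) return -1;
    if (root == 0) {
        close(fds[0]);
        explore(n_branches, fds[1]); /* never returns */
    }
    close(fds[1]);
    int paths = 0;
    char byte;
    while (read(fds[0], &byte, 1) == 1) paths++; /* EOF once every path exits */
    waitpid(root, NULL, 0);
    close(fds[0]);
    return paths;
}
```

With `n` branches on every path, this explores `2^n` paths, which is the path
explosion problem in miniature. Forking is attractive here because each process
gets its own copy of the solver state for free, so no explicit save/restore of
path constraints is needed.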

The trickiest part is actually printing out the solution at the end. We need to
track the first time we use a register, as well as the first time we read a
certain value in memory (since these represent possible inputs to our program).

All you need to do is implement the instructions symbolically by calling out to
the methods in `smt.h`. Use `get_register` and `set_register` to get/set
bitvectors representing register values; these methods will automatically
update `IN_VARS` for register operations, but you'll need to do it yourself for
memory operations.

### Symbolic Interpreter Without Arrays
If you want to try doing the symex engine without finishing the arrays portion
of `smt.c`, you can do that too. You should still be able to run
`test_programs/test_prog`.

### Extensions
- Extend the symex IR to support more operations
- Write a compiler (or interpreter!) from a higher-level language (some useful
  subset of C?) to the symex IR
- Modern SMT solvers often use a *lazy* encoding instead of eager bit blasting.
  The idea is to only give the SAT solver a subset of the clauses; if it says
  "unsat" with that subset, you're already done. Otherwise, check whether its
  solution also satisfies the remaining clauses. If not, give it a few more
  clauses that rule out the current attempted solution. Repeat this until you
  either get unsat or find a solution that works for all the clauses. This is
  often faster, in part because it lets you check the remaining clauses however
  you want, without having to somehow encode them into CNF. Also, you may not
  need all the clauses to prove unsat (or even to find a solution), so adding
  them only as needed is a good idea.
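
The lazy loop described in the last extension can be sketched on plain CNF
clauses. Everything here is an assumption made for illustration: `toy_solve` is
a brute-force stand-in for the real SAT solver, the loop adds one violated
clause per round, and the names `Clause`, `clause_holds`, `lazy_solve` and
`lazy_demo` are hypothetical. A real lazy SMT solver would check theory
constraints directly rather than pre-generated clauses.

```c
#include <assert.h>
#include <stdbool.h>

#define N_VARS 3
#define MAX_LITS 3
#define MAX_CLAUSES 16

typedef struct { int lits[MAX_LITS + 1]; } Clause; /* 0-terminated literals */

/* Does the model (bit v-1 holds variable v) satisfy this clause? */
static bool clause_holds(const Clause *c, unsigned model) {
    for (int i = 0; c->lits[i] != 0; i++) {
        int lit = c->lits[i], var = lit > 0 ? lit : -lit;
        bool val = ((model >> (var - 1)) & 1u) != 0;
        if ((lit > 0) == val) return true;
    }
    return false;
}

/* Stand-in for the SAT solver: brute force over the active clauses only.
 * Returns a model, or -1 if the active subset is unsatisfiable. */
static int toy_solve(const Clause *all, const bool *active, int n) {
    for (unsigned m = 0; m < (1u << N_VARS); m++) {
        bool ok = true;
        for (int c = 0; c < n && ok; c++)
            if (active[c]) ok = clause_holds(&all[c], m);
        if (ok) return (int)m;
    }
    return -1;
}

/* Lazy loop: returns a model of all n clauses, or -1 if unsat.
 * *n_used reports how many clauses were actually handed to the solver. */
static int lazy_solve(const Clause *all, int n, int *n_used) {
    bool active[MAX_CLAUSES] = {false};
    assert(n <= MAX_CLAUSES);
    *n_used = 0;
    for (;;) {
        int model = toy_solve(all, active, n);
        if (model < 0) return -1; /* unsat on a subset means unsat overall */
        int violated = -1;
        for (int c = 0; c < n && violated < 0; c++)
            if (!active[c] && !clause_holds(&all[c], (unsigned)model))
                violated = c;
        if (violated < 0) return model; /* candidate satisfies everything */
        active[violated] = true;        /* rule out this candidate */
        (*n_used)++;
    }
}

/* Demo instance: x1, x1 => x2, (x1 or x2 or x3), x2 => x3. */
static int lazy_demo(int *n_used) {
    static const Clause cnf[4] = {
        {{1, 0}}, {{-1, 2, 0}}, {{1, 2, 3, 0}}, {{-2, 3, 0}},
    };
    return lazy_solve(cnf, 4, n_used);
}
```

On the demo instance the loop ends with all three variables true after handing
the solver only three of the four clauses; the clause `{1, 2, 3}` is never
needed because every candidate already satisfies it.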