DRMacIver's Notebook

Published: 2018-09-09

I previously wrote a post about NP-hardness in decision theory. On rereading it, I think its tone really doesn’t help its point at all, so I thought I’d quickly write up a more formal version.

The basic point is this: If you don’t assume computation is free, NP-hard problems prove an interesting barrier to decision making that satisfies the classical “rationality” axioms.

Suppose you have an NP-hard problem (say a SAT instance), \(S\), and are offered a choice between the following three options:

A certain reward \(R_1\).
A reward \(R_2 > R_1\) if a particular solution \(x\) witnesses that \(S\) is satisfiable.
A reward \(R_3 > R_2\) if \(S\) is satisfiable.

Arrange the values such that \(\alpha_2 \ll R_2 - R_1 < R_3 - R_1 < \alpha_3\), where \(\alpha_i\) is the cost of solving the computational problem that would allow you to determine the pay off of these choices. i.e. the difference in reward is big enough that it is worth evaluating one solution, but small enough that it’s not worth solving the problem.

You should always strictly prefer \(3\) to \(2\), because under every circumstance where \(2\) pays anything, \(3\) pays a larger amount.

Additionally, you should prefer \(2\) to \(1\) if and only if \(x\) is a witness - because it’s cheap enough to check, you just acquire the information and be done with it (if you want to quibble about expected payoffs, assume that \(\alpha_2\) is really really small).

However, you should also prefer \(1\) to \(3\), because the possible reward you can gain by solving the problem is not worth the cost, so you should take the certain reward instead.

This means that if \(x\) is chosen so that it is a witness, you have an intransitive preference \(2 > 3 > 2 > 1\).

Another way of thinking about this is that the choices over this problem are not subset consistent. The choice you make from \(\{1, 2, 3\}\) in the case that \(x\) is a solution is either \(3\) - evaluating \(2\) tells you that it’s worth choosing \(3\) over \(1\), so you can skip paying the cost and just choose the good option. In contrast, your choice when picking from \(\{1, 3\}\) would be \(2\) - removing a value that was not the chosen answer has caused your opinion to flip.