Sumcheck

(Most of the content of this section is from Section 4.1 in Proofs, Args, and ZK by Justin Thaler, which has more detailed explanations of the below.)

This section will first cover the background behind the sumcheck protocol, and then provide an introduction as to why this may be useful in verifying the computation of layerwise arithmetic circuits.

The sumcheck protocol is an interactive protocol which verifies claims of the form: $H = ? x_{i} \in {0, 1} \sum f (x_{1}, \dots, x_{n}) .$ In this statement, $f$ is not necessarily multilinear, and $H \in F$ .

In other words, the prover, $P$ , claims that the sum of the evaluations of a function $f : F^{n} \to F$ over the boolean hypercube of dimension $n$ is $H .$ Naively, the verifier $V$ can verify this statement by evaluating this sum themselves in $O (2^{n})$ time assuming oracle access to $f$ (being able to query evaluations of $f$ in $O (1)$ time), with perfect completeness (true claims are always identified by the verifier) and perfect soundness (false claims are always identified by the verifier).

Sumcheck relaxes the perfect soundness to provide a probabilistic protocol which verifies the claim in $O (n \cdot d)$ time with a soundness error of $\leq \frac{n \cdot d}{∣ F ∣}$ where $d$ is the maximum degree of any variable $x_{1}, \dots, x_{n}$ .

The Interactive Protocol

We start with a straw-man interactive protocol which still achieves perfect completeness and soundness in verifier time $O (2^{n})$ and prover time $O (2^{n}) .$ Then, we build on this version of the protocol and introduce randomness to achieve $O (n \cdot d)$ verifier time, with a soundness error of $\leq \frac{n \cdot d}{∣ F ∣} .$

A non-probabilistic protocol

Note that the sum we are trying to verify can be rewritten as such: $H = ? x_{1} \in {0, 1} \sum x_{2} \in {0, 1} \sum x_{3} \in {0, 1} \sum \dots x_{n} \in {0, 1} \sum f (x_{1}, \dots, x_{n}) .$ Let's say $P$ sends $V$ the following univariate: $f_{1} (X) = ? x_{2} \in {0, 1} \sum x_{3} \in {0, 1} \sum \dots x_{n} \in {0, 1} \sum f (X, x_{2}, \dots, x_{n}) .$ One way for $P$ to communicate $g (X)$ to $V$ the univariate $g (X)$ is to send $d + 1 = degree (g) + 1$ evaluations of $g (X)$ . While $P$ can alternatively send coefficients, we focus on this method of defining a univariate and assume $P$ sends the evaluations $g (0), g (1), \dots, g (d)$ to $V$ .

$V$ can verify whether $H$ is correct in relation to $f_{1} (X)$ by checking whether $H = f_{1} (0) + f_{1} (1) .$ In other words, we have reduced the validity of claim that $H$ is the sum of the evaluations of $f$ over the $n$ -dimensional boolean hypercube to the claim that $f_{1} (X)$ is the univariate polynomial over a smaller sum.

Now the verifier has evaluations $f_{1} (0), f_{1} (1)$ to verify. We can similarly reduce this to claims over even smaller summations. Namely, now the prover sends over the following univariates: $f_{2, j} (X) = ? x_{3} \in {0, 1} \sum x_{4} \in {0, 1} \sum \dots x_{n} \in {0, 1} \sum f (j, X, x_{2}, \dots, x_{n}), \forall j \in [0, 1] .$

$P$ and $V$ keep engaging in such reductions until $V$ is left to verify $2^{n}$ evaluations of $f$ : this is exactly the evaluations of $f$ over the boolean hypercube, assuming that $V$ has oracle access to $f$ (it can query evaluations of $f$ in $O (1)$ time).

We have transformed the naive solution, where $V$ just evaluates the summation on their own, into an interactive protocol. In the next section we will go over how to slightly modify this by adding randomness to significantly reduce the costs incurred by $P$ and $V$ .

Schwartz-Zippel Lemma

As a brief interlude, let us go over the Schwartz-Zippel Lemma, which we can use to modify the straw-man protocol. It states that if $f (x)$ is a nonzero polynomial with degree $d$ , then the probability that $f (r) = 0$ for some random value $r$ sampled from a set $S$ is upper-bounded by $\frac{d}{∣ S ∣}$ .

This is can be seen because by the Fundamental Theorem of Alegbra, $f (x)$ has at most $d$ roots. We take the probability that we randomly sampled one of those roots out of a set of size $∣ S ∣$ .

In the case of sumcheck, we consider the polynomial to be over a field $F$ , and our randomly sampled element to be uniformly sampled from $F$ .

Introducing randomness

Our main blow-up with the straw-man interactive protocol came from the exponentially growing number of claims $V$ had to verify, ending up with $2^{n}$ evaluations of $f$ at the end. Instead, if we found a way for the reduction from $f_{i} (X)$ to claims on $f_{i + 1} (X)$ (or the reduction from the claim of $H$ to claims on $f_{1} (X)$ ) to be a one-to-one reduction in terms of number of claims, rather than one claim reduced to two claims, $V$ would only have to verify $n$ claims, and $P$ would only have to send over $n$ univariate polynomials.

Let us keep the first step the same, where $P$ first sends the following univariate polynomial: $f_{1} (X) = ? x_{2} \in {0, 1} \sum x_{3} \in {0, 1} \sum \dots x_{n} \in {0, 1} \sum f (X, x_{2}, \dots, x_{n}) .$

Now, $V$ checks whether $H = g (0) + g (1) .$ Instead of $P$ sending both $f_{2, 0} (X)$ and $f_{2, 1} (X)$ , $V$ uniformly samples a random challenge $r_{1}$ from $F$ and sends this to $P .$ $P$ sends a single univariate: $f_{2} (X) = ? x_{3} \in {0, 1} \sum x_{4} \in {0, 1} \sum \dots x_{n} \in {0, 1} \sum f (r_{1}, X, x_{3}, \dots, x_{n}) .$ $V$ checks whether $f_{1} (r_{1}) = f_{2} (0) + f_{2} (1) .$ This process is repeated iteratively, until finally in the last round, where $P$ sends $V$ the following:

$f_{n} (X) = ? f (r_{1}, \dots, r_{n - 1}, X) .$

Assuming the verifier has oracle access to $f$ , the verifier can check whether $f_{n} (r_{n}) = f (r_{1}, \dots, r_{n})$ . The difference between this protocol and the naive protocol above is that at each step, instead of individually verifying $f_{i} (0)$ and $f_{i} (1)$ , $V$ sends $P$ a "challenge" $r_{i}$ in which $P$ responds to by sending over the appropriate univariate polynomial. Therefore we have achieved a one-to-one claim reduction, and the verifier having to only verify one equation per round.

Soundness Intuition

We provide brief intuition for the soundness bound from the above protocol. At any step $i$ , the prover can cheat by sending a different univariate polynomial $h_{i} (X)$ instead of the expected $f_{i} (X)$ such that $h_{i} (r_{i}) = f_{i} (r_{i})$ , but $h_{i} (X) \neq = f_{i} (X)$ . This allows them to, ultimately, prove a different original statement: that the sum $\sum_{x \in {0, 1}^{n}} f (x) = H^{'} \neq = H$ . Because $V$ sends $r_{i}$ to $P$ , we can be confident that $P$ does not adversarially choose $r_{i}$ to be one of the roots of $h_{i} (X) - f_{i} (X)$ . Then, by the Schwartz-Zippel lemma, the probability that $r_{i}$ happened to be one of the "zeros" of $h_{i} (X) - f_{i} (X) = \frac{d _{i}}{∣ F ∣}$ where $d_{i}$ is the degree of $h_{i} (X) - f_{i} (X) .$

Example

We do a short example of the sumcheck protocol in the integers. Let $f (x) = 3 x_{1} x_{2}^{2} + 4 x_{3} x_{2} + 5 x_{1}^{3} x_{3} + 2.$ $P$ rightfully claims that $\sum_{x_{1} \in {0, 1}} \sum_{x_{2} \in {0, 1}} \sum_{x_{1} \in {0, 1}} f (x_{1}, x_{2}, x_{3}) = H = 40.$ In order to verify this claim, $P$ and $V$ engage in a sumcheck protocol.

$P$ sends $V$ the univariate: $f_{1} (X) = x_{2} \in {0, 1} \sum x_{3} \in {0, 1} \sum (3 X x_{2}^{2} + 4 x_{3} x_{2} + 5 X^{3} x_{3} + 2) = 10 X^{3} + 6 X + 12.$ $V$ verifies that $(f_{1} (0) = 12) + (f_{1} (1) = 28) = 40 = H .$ Then, $V$ samples the challenge $r_{1} = 5$ and now $P$ computes: $f_{2} (X) = x_{3} \in {0, 1} \sum (3 (5) X^{2} + 4 X x_{3} + 5 (5)^{3} x_{3} + 2) = 30 X^{2} + 4 X + 629.$

$V$ checks that $(f_{2} (0) = 629) + (f_{2} (1) = 663) = 1292 = f_{1} (5) .$ Next, $V$ samples another challenge $r_{2} = 7$ and sends it to $P$ who then computes and sends $f_{3} (X) = 653 X + 737.$ Finally, $V$ samples another random challenge $r_{3} = 3$ and checks whether $f_{3} (3) = f (5, 7, 3) .$ Indeed, $f_{3} = 2696 = f (5, 7, 3) .$

Why Sumcheck?

In the previous section, we introduced the notion of a multilinear extension of a polynomial $f$ , which is defined as $f (x_{1}, \dots, x_{n}) = z_{i} \in {0, 1} \sum eq (x; z) \cdot f (z_{1}, \dots, z_{n}) .$ Notice that naturally, a multilinear extension is defined by taking the sum over a boolean hypercube, which is what sumcheck proves claims over.

In the next section, we will go over how we can encode layers of circuits as multilinear extensions, and prove statements about the output of these layers using sumcheck.

The Remainder Book