GKR Claims

Claim definition

"Claims" in GKR are statements which the prover has yet to show correctness for. As described earlier, the first step in proving the correctness of a GKR circuit (after sending over all circuit inputs, both public and committed) is to take the circuit's (public) output layer $V_{0}$ and send over all of its evaluations to the verifier.

For example, let's say that we have a circuit whose output layer contains 4 elements, i.e. whose representative MLE can be described by $V_{0} (x_{1}, x_{2}) : F^{2} \mapsto F$ . Additionally, let's say that these evaluations are $[a_{1}, a_{2}, a_{3}, a_{4}]$ , such that

$V_{0}(0, 0) = a_{1}, \quad V_{0}(0, 1) = a_{2}, \quad V_{0}(1, 0) = a_{3}, \quad V_{0}(1, 1) = a_{4}$

These four equalities above are actually the first claims whose validity the prover wishes to demonstrate to the verifier. The verifier doesn't know what the true values of $V_{0}$ are, of course, but would be able to check each of these relationships with the prover's help via sumcheck. This would be rather expensive, however, as the number of claims is exactly equal to the number of circuit outputs/evaluations within the circuit's output layer. Instead, the verifier can sample some randomness and have the prover prove the following:

$\text{Sample } r_{1}, r_{2} \xleftarrow{\$} F$

$a^{⋆} := (1 - r_{1})(1 - r_{2}) \cdot a_{1} + (1 - r_{1}) \cdot r_{2} \cdot a_{2} + r_{1} \cdot (1 - r_{2}) \cdot a_{3} + r_{1} \cdot r_{2} \cdot a_{4}$

$\text{Prove } V_{0}(r_{1}, r_{2}) \overset{?}{=} a^{⋆}$

Note that the above follows precisely from the definition of a multilinear extension (MLE): $a^{⋆}$ is exactly the evaluation of $V_{0}$ at the random point $(r_{1}, r_{2})$. The protocol takes a slight soundness hit here, as a cheating prover might get away with an incorrect circuit output (say, $V_{0}^{*} \neq V_{0}$, but $V_{0}^{*}(r_{1}, r_{2}) = V_{0}(r_{1}, r_{2})$). By the Schwartz-Zippel lemma, however, two distinct MLEs in $n$ variables agree at a uniformly random point with probability at most $\frac{n}{|F|}$ (here, $\frac{2}{|F|}$), which is negligible for a large field.
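The random-evaluation check above can be sketched numerically. The following is a minimal illustration, assuming a toy prime modulus `P` and a hypothetical helper `mle_eval_2`; neither is taken from Remainder itself.

```python
import random

P = 2**61 - 1  # illustrative prime modulus; an assumption, not Remainder's field

def mle_eval_2(evals, r1, r2):
    """Evaluate the 2-variable MLE with evaluations [a1, a2, a3, a4]
    (ordered as V(0,0), V(0,1), V(1,0), V(1,1)) at the point (r1, r2)."""
    a1, a2, a3, a4 = evals
    return (
        (1 - r1) * (1 - r2) * a1
        + (1 - r1) * r2 * a2
        + r1 * (1 - r2) * a3
        + r1 * r2 * a4
    ) % P

outputs = [3, 1, 4, 1]

# On the Boolean hypercube the MLE reproduces the original evaluations.
assert [mle_eval_2(outputs, x1, x2) for x1 in (0, 1) for x2 in (0, 1)] == outputs

# The verifier samples (r1, r2) and computes a_star on its own from the outputs.
r1, r2 = random.randrange(P), random.randrange(P)
a_star = mle_eval_2(outputs, r1, r2)

# A different output vector yields the same a_star only if the difference of
# the two MLEs (here r1 * r2) happens to vanish at the sampled point.
cheating_outputs = [3, 1, 4, 2]
agree = mle_eval_2(cheating_outputs, r1, r2) == a_star
```

In this toy instance the two MLEs differ by $r_{1} \cdot r_{2}$, so `agree` is true only when $r_{1} = 0$ or $r_{2} = 0$, which happens with probability roughly $\frac{2}{|F|}$.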

In general, claims take the following form:

$V_{i} (g_{1}, ..., g_{s_{i}}) = c_{i}$

In other words, the prover wishes to convince the verifier that the evaluation of the MLE representing the $i$'th layer at the challenge point $(g_{1}, ..., g_{s_{i}}) \in F^{s_{i}}$ is $c_{i} \in F$.

Claim Propagation

For another example of claim propagation/reduction, see this section. Note that the below example uses a structured GKR relationship while the other example uses a canonic GKR relationship.

Recall the general sumcheck relationship for a function $f : F^{n} \mapsto F$ ; the prover claims that the following relationship is true for $H \in F$ :

$H = \sum_{(b_{1}, ..., b_{n}) \in \{0, 1\}^{n}} f(b_{1}, ..., b_{n})$

Assuming that $b_{1}, ..., b_{n}$ are bound to $r_{1}, ..., r_{n} \in F$ during the sumcheck process, the final verifier check within sumcheck is the following, where the RHS must be an "oracle query", i.e. the verifier must know that the evaluation of $f$ on $r_{1}, ..., r_{n}$ is correct:

$f_{n}(r_{n}) \overset{?}{=} f(r_{1}, ..., r_{n})$

How does this oracle query actually get evaluated in GKR? The answer is claims and sumcheck over claims for a previous layer. Specifically, let's consider the following relationship (see structured GKR section for more information about the $eq$ polynomial and this kind of layerwise relationship):

$V_{i}(X_{1}, ..., X_{n}) = \sum_{(b_{1}, ..., b_{n}) \in \{0, 1\}^{n}} eq(X_{1}, ..., X_{n}; b_{1}, ..., b_{n}) \cdot V_{i + 1}(b_{1}, ..., b_{n})^{2}$

This is the polynomial relationship between layer $i$ and layer $i + 1$ of a circuit where the $i$'th layer's values are exactly those of the $(i + 1)$'th layer's values squared (recall that layer $0$ is the output layer, so layer $i + 1$ is closer to the circuit inputs than layer $i$). For example, if the evaluations of $V_{i + 1}$ are $[a_{1}, a_{2}, a_{3}, a_{4}]$ then we expect the evaluations of $V_{i}$ to be $[a_{1}^{2}, a_{2}^{2}, a_{3}^{2}, a_{4}^{2}]$.

The prover starts with a claim

$V_{i} (g_{1}, ..., g_{n}) = c_{i}$

for $g_{1}, ..., g_{n}, c_{i} \in F$ , and wishes to prove it to the verifier. It does so by running sumcheck on the RHS of the above equation, i.e.

$c_{i} \overset{?}{=} \sum_{(b_{1}, ..., b_{n}) \in \{0, 1\}^{n}} eq(g_{1}, ..., g_{n}; b_{1}, ..., b_{n}) \cdot V_{i + 1}(b_{1}, ..., b_{n})^{2}$

Let $b_{1}, ..., b_{n}$ be bound to $r_{1}, ..., r_{n} \in F$ during the rounds of sumcheck. Additionally, let $f_{n} (X_{n})$ be the univariate polynomial the prover sends in the $n$ 'th round of sumcheck. The oracle query check is then

$f_{n}(r_{n}) \overset{?}{=} eq(g_{1}, ..., g_{n}; r_{1}, ..., r_{n}) \cdot V_{i + 1}(r_{1}, ..., r_{n})^{2}$

The verifier is able to compute $eq(g_{1}, ..., g_{n}; r_{1}, ..., r_{n})$ on its own in $O(n)$ time, but unless $V_{i + 1}$ is an MLE within an input layer of the GKR circuit, it will not be able to determine the value of $V_{i + 1}(r_{1}, ..., r_{n})$. Instead, the prover sends over a claimed value $c_{i + 1}$ for $V_{i + 1}(r_{1}, ..., r_{n})$, and the verifier checks that

$f_{n}(r_{n}) \overset{?}{=} eq(g_{1}, ..., g_{n}; r_{1}, ..., r_{n}) \cdot c_{i + 1}^{2}$

The only thing left to check is whether $V_{i + 1}(r_{1}, ..., r_{n}) \overset{?}{=} c_{i + 1}$. Notice, however, that this is now a new claim on an MLE residing in layer $i + 1$, and that we started with a claim on layer $i$. In other words, we've reduced the validity of a claim on layer $i$ to that of a claim on layer $i + 1$, which is the core idea behind GKR: start with claims on circuit output layers, and reduce those using sumcheck to claims on earlier layers of the circuit. Eventually all remaining claims will be those on circuit input layers, which can be checked either via a direct verifier MLE evaluation for public input layers, or via a PCS evaluation proof for committed input layers.
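The reduction described above can be sketched end to end for the squaring relationship. This is a minimal, unoptimized illustration with prover and verifier interleaved in one function; the modulus `P` and the names `eq`, `fold`, `mle_eval`, `interpolate`, and `reduce_claim` are assumptions made for this example and are not Remainder's API.

```python
import random

P = 2**61 - 1  # illustrative prime modulus; an assumption, not Remainder's field

def eq(g, b):
    """eq(g; b) = prod_k (g_k * b_k + (1 - g_k) * (1 - b_k)), computed in O(n)."""
    acc = 1
    for gk, bk in zip(g, b):
        acc = acc * (gk * bk + (1 - gk) * (1 - bk)) % P
    return acc

def fold(table, r):
    """Bind the first (most significant) variable of an MLE evaluation table to r."""
    half = len(table) // 2
    return [((1 - r) * table[k] + r * table[half + k]) % P for k in range(half)]

def mle_eval(evals, point):
    """Evaluate the MLE with hypercube evaluations `evals` at `point`."""
    table = [e % P for e in evals]
    for r in point:
        table = fold(table, r)
    return table[0]

def interpolate(ys, x):
    """Evaluate at x the polynomial of degree < len(ys) with p(k) = ys[k], k = 0, 1, ..."""
    total = 0
    for k, yk in enumerate(ys):
        num, den = 1, 1
        for l in range(len(ys)):
            if l != k:
                num = num * (x - l) % P
                den = den * (k - l) % P
        total = (total + yk * num * pow(den, -1, P)) % P
    return total

def reduce_claim(g, c_i, next_layer):
    """Sumcheck on c_i = sum_b eq(g; b) * V_{i+1}(b)^2.
    Returns the new claim (r, c_next) on layer i+1; raises ValueError on a failed check."""
    n = len(g)
    eq_table = [eq(g, [(b >> (n - 1 - k)) & 1 for k in range(n)]) for b in range(2**n)]
    w_table = [w % P for w in next_layer]
    claim, r = c_i % P, []
    for _ in range(n):
        # Prover: evaluations of the degree-3 round polynomial at X = 0, 1, 2, 3.
        msg = []
        for x in range(4):
            e, w = fold(eq_table, x), fold(w_table, x)
            msg.append(sum(ek * wk * wk for ek, wk in zip(e, w)) % P)
        # Verifier: check consistency with the running claim, then sample a challenge.
        if (msg[0] + msg[1]) % P != claim:
            raise ValueError("sumcheck round check failed")
        rj = random.randrange(P)
        claim = interpolate(msg, rj)
        eq_table, w_table = fold(eq_table, rj), fold(w_table, rj)
        r.append(rj)
    # Prover sends c_next; verifier computes eq(g; r) itself for the final check.
    c_next = w_table[0]
    if claim != eq(g, r) * c_next * c_next % P:
        raise ValueError("oracle query check failed")
    return r, c_next

next_layer = [2, 3, 5, 7]                    # evaluations of V_{i+1}
layer = [a * a % P for a in next_layer]      # V_i holds the squares
g = [random.randrange(P) for _ in range(2)]
c_i = mle_eval(layer, g)                     # honest claim V_i(g) = c_i
r, c_next = reduce_claim(g, c_i, next_layer)
assert c_next == mle_eval(next_layer, r)     # the new claim on layer i+1 holds
```

The prover here recomputes each round message by brute force for readability; a real prover would reuse the folded tables across evaluation points.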

Claim Aggregation

In the above example, we reduced a single claim on layer $i$ to claim(s) on MLEs residing in previous layers. What happens when there are multiple claims on the same layer, e.g.

$\begin{aligned} V_{i}(g_{1}^{(1)}, ..., g_{n}^{(1)}) &\overset{?}{=} c_{i}^{(1)} \\ V_{i}(g_{1}^{(2)}, ..., g_{n}^{(2)}) &\overset{?}{=} c_{i}^{(2)} \\ &\;\;⋮ \\ V_{i}(g_{1}^{(m)}, ..., g_{n}^{(m)}) &\overset{?}{=} c_{i}^{(m)} \end{aligned}$

One method would be to simply run sumcheck $m$ times, once for each of the above claims, and reduce to $\geq 2 m$ separate claims on MLEs residing in previous layers. This strategy, however, leads to an exponential number of claims in the depth of the circuit, which is undesirable.

Instead, Remainder implements two primary modes of claim aggregation, i.e. methods for using a single sumcheck to prove the validity of many claims on the same MLE.

RLC (Random Linear Combination) Claim Aggregation

Additional reading: See XZZ+19, page 10 ("Combining two claims: random linear combination").

The idea behind RLC claim aggregation is precisely what it sounds like: the prover shows that a random linear combination of the claimed values indeed equals the corresponding random linear combination of the summations on the RHS of e.g. the third equation in the above section. The implementation of RLC claim aggregation within Remainder works for structured layers and gate layers, but not for matrix multiplication layers or input layers (as explained below).

We defer to the corresponding pages for more detailed explanations of the layerwise relationships, but review their general forms here and show how RLC claim aggregation can be done for each.

Structured Layers

We start with structured layers, and use the same example relationship from above:

$V_{i}(X_{1}, ..., X_{n}) = \sum_{(b_{1}, ..., b_{n}) \in \{0, 1\}^{n}} eq(X_{1}, ..., X_{n}; b_{1}, ..., b_{n}) \cdot V_{i + 1}(b_{1}, ..., b_{n})^{2}$

For simplicity, we aggregate two claims rather than $m$ claims, but the methodology generalizes in a straightforward fashion. Our aggregated claim is constructed as follows:

$\text{Sample } α \xleftarrow{\$} F$

$\text{Let } c_{i}^{⋆} = c_{i}^{(1)} + α \cdot c_{i}^{(2)}$

Similarly, we take an RLC of the summations and create a new summation to sumcheck over (we let $b = b_{1}, ..., b_{n}$ and $g^{(j)} = g_{1}^{(j)}, ..., g_{n}^{(j)}$ for concision):

$\begin{aligned} c_{i}^{⋆} &\overset{?}{=} \sum_{b \in \{0, 1\}^{n}} eq(g^{(1)}; b) \cdot V_{i + 1}(b)^{2} + α \cdot \sum_{b \in \{0, 1\}^{n}} eq(g^{(2)}; b) \cdot V_{i + 1}(b)^{2} \\ &= \sum_{b \in \{0, 1\}^{n}} [eq(g^{(1)}; b) + α \cdot eq(g^{(2)}; b)] \cdot V_{i + 1}(b)^{2} \end{aligned}$

For structured layers, in other words, the prover and verifier simply take a random linear combination of the claims and perform sumcheck over a polynomial which is identical to the original layerwise relationship polynomial but with the $eq$ term replaced with an RLC of $eq$ terms in the same manner as the RLC of the original claims.
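As a quick numeric sanity check of the identity above, the following sketch aggregates two honest claims on the squaring layer and confirms that the combined summation equals $c_{i}^{(1)} + α \cdot c_{i}^{(2)}$. The modulus `P` and the helpers `eq`, `mle_eval`, and `rlc_sum` are hypothetical names chosen for illustration.

```python
import itertools
import random

P = 2**61 - 1  # illustrative prime modulus; an assumption, not Remainder's field

def eq(g, b):
    """eq(g; b) = prod_k (g_k * b_k + (1 - g_k) * (1 - b_k))."""
    acc = 1
    for gk, bk in zip(g, b):
        acc = acc * (gk * bk + (1 - gk) * (1 - bk)) % P
    return acc

def mle_eval(evals, point):
    """Evaluate the MLE of `evals` (first variable = most significant bit) at `point`."""
    table = [e % P for e in evals]
    for r in point:
        half = len(table) // 2
        table = [((1 - r) * table[k] + r * table[half + k]) % P for k in range(half)]
    return table[0]

def rlc_sum(g1, g2, alpha, next_layer):
    """sum_b [eq(g1; b) + alpha * eq(g2; b)] * V_{i+1}(b)^2 over the Boolean hypercube."""
    total = 0
    for idx, b in enumerate(itertools.product((0, 1), repeat=len(g1))):
        combined_eq = (eq(g1, b) + alpha * eq(g2, b)) % P
        total = (total + combined_eq * next_layer[idx] ** 2) % P
    return total

next_layer = [2, 3, 5, 7]                  # evaluations of V_{i+1}
layer = [a * a % P for a in next_layer]    # V_i holds the squares

g1 = [random.randrange(P) for _ in range(2)]
g2 = [random.randrange(P) for _ in range(2)]
c1, c2 = mle_eval(layer, g1), mle_eval(layer, g2)

alpha = random.randrange(P)
c_star = (c1 + alpha * c2) % P
assert rlc_sum(g1, g2, alpha, next_layer) == c_star
```

A single sumcheck over the combined summand then stands in for the two separate sumchecks.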

Gate Layers

A similar idea applies to gate layers. We use mul gate as the example layerwise relationship here:

$V_{i}(Z_{1}, ..., Z_{s_{i}}) = \sum_{x \in \{0, 1\}^{s_{j}}, \, y \in \{0, 1\}^{s_{k}}} mul_{i, j, k}(Z, x, y) \cdot [V_{j}(x) \cdot V_{k}(y)]$

Again, we aggregate just two claims for simplicity, although the idea generalizes very naturally to $m$ claims:

$\text{Sample } α \xleftarrow{\$} F$

$\text{Let } c_{i}^{⋆} = c_{i}^{(1)} + α \cdot c_{i}^{(2)}$

The polynomial relationship to run sumcheck over is constructed using a similar idea as that of structured layers:

$\begin{aligned} c_{i}^{⋆} &\overset{?}{=} \sum_{x \in \{0, 1\}^{s_{j}}, \, y \in \{0, 1\}^{s_{k}}} mul_{i, j, k}(g^{(1)}, x, y) \cdot V_{j}(x) \cdot V_{k}(y) + α \cdot \sum_{x \in \{0, 1\}^{s_{j}}, \, y \in \{0, 1\}^{s_{k}}} mul_{i, j, k}(g^{(2)}, x, y) \cdot V_{j}(x) \cdot V_{k}(y) \\ &= \sum_{x \in \{0, 1\}^{s_{j}}, \, y \in \{0, 1\}^{s_{k}}} [mul_{i, j, k}(g^{(1)}, x, y) + α \cdot mul_{i, j, k}(g^{(2)}, x, y)] \cdot [V_{j}(x) \cdot V_{k}(y)] \end{aligned}$

Rather than taking a linear combination of the $eq$ polynomials, we instead take a linear combination of the $mul_{i, j, k}$ polynomials.
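The same identity can be checked numerically for a mul gate layer. In the sketch below the wiring is given as a hypothetical list of `(z, x, y)` triples meaning "output `z` is `V_j[x] * V_k[y]`"; this representation, the modulus `P`, and all helper names are assumptions for illustration only.

```python
import random

P = 2**61 - 1  # illustrative prime modulus; an assumption, not Remainder's field

def bits(index, n):
    """Big-endian bit decomposition of `index` into n bits."""
    return [(index >> (n - 1 - k)) & 1 for k in range(n)]

def eq(g, b):
    """eq(g; b) = prod_k (g_k * b_k + (1 - g_k) * (1 - b_k))."""
    acc = 1
    for gk, bk in zip(g, b):
        acc = acc * (gk * bk + (1 - gk) * (1 - bk)) % P
    return acc

def mle_eval(evals, point):
    """Evaluate the MLE of `evals` (first variable = most significant bit) at `point`."""
    table = [e % P for e in evals]
    for r in point:
        half = len(table) // 2
        table = [((1 - r) * table[k] + r * table[half + k]) % P for k in range(half)]
    return table[0]

def mul_at(g, x, y, gates, s_i):
    """mul(g, x, y) for Boolean-indexed x, y: sum of eq(g; z) over gates wired as (z, x, y)."""
    return sum(eq(g, bits(z, s_i)) for (z, gx, gy) in gates if (gx, gy) == (x, y)) % P

def rlc_gate_sum(g1, g2, alpha, gates, v_j, v_k, s_i):
    """sum_{x,y} [mul(g1, x, y) + alpha * mul(g2, x, y)] * V_j(x) * V_k(y)."""
    total = 0
    for x in range(len(v_j)):
        for y in range(len(v_k)):
            wiring = (mul_at(g1, x, y, gates, s_i) + alpha * mul_at(g2, x, y, gates, s_i)) % P
            total = (total + wiring * v_j[x] * v_k[y]) % P
    return total

gates = [(0, 0, 1), (1, 2, 3), (2, 1, 1), (3, 3, 0)]  # hypothetical wiring
v_j, v_k, s_i = [2, 3, 5, 7], [11, 13, 17, 19], 2

# Layer i evaluations implied by the wiring.
v_i = [0] * (2**s_i)
for z, x, y in gates:
    v_i[z] = (v_i[z] + v_j[x] * v_k[y]) % P

g1 = [random.randrange(P) for _ in range(s_i)]
g2 = [random.randrange(P) for _ in range(s_i)]
alpha = random.randrange(P)
c_star = (mle_eval(v_i, g1) + alpha * mle_eval(v_i, g2)) % P
assert rlc_gate_sum(g1, g2, alpha, gates, v_j, v_k, s_i) == c_star
```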

Costs

The prover costs for RLC claim aggregation are as follows -- assume that we are working with a structured layer (the analysis is similar for gate layers) and that the degree of every sumcheck variable is $d$ (in the above example for a structured layer, $d = 3$ ). Additionally, assume that we have $m$ claims over a layer with $n$ variables.

As shown above, RLC claim aggregation for structured layers simply involves "factoring out" the $eq$ term between each of the $g^{(i)}$ 's and the $b$ 's. Rather than multiplying the structured polynomial relationship by a single $eq$ , we multiply by an RLC of $m$ $eq$ terms.
For each additional $eq$ term, the prover incurs an additional $d + 1$ evaluations' worth of work in each sumcheck round. Evaluating $eq$ can be done in $O(2^{j})$ time by the prover for $j$ variables, and thus the total cost (for $m$ claims) is $m \cdot (d + 1) \cdot \sum_{j = 1}^{n} 2^{j} = m \cdot (d + 1) \cdot (2^{n + 1} - 2)$ across all rounds of sumcheck. The total prover runtime is thus $O(m \cdot d \cdot 2^{n + 1}) = O(m \cdot d \cdot 2^{n})$.

The proof size is identical to that of the single-claim sumcheck case, since the degree of the sumcheck messages does not change.

Finally, the verifier cost is slightly increased. Specifically, during intermediate rounds of sumcheck the verifier does not do any additional work (compared to the single-claim sumcheck case), but during the oracle query the verifier must evaluate $m$ separate instances of $eq$ at fixed points. This takes the verifier $O (mn)$ additional time.

Matrix Multiplication Layers (counterexample)

Prerequisite: matrix multiplication layers page.

For matrix multiplication layers: consider $A B \overset{?}{=} C$, and consider the sumcheck relationship $C(X, Z) \overset{?}{=} \sum_{y \in \{0, 1\}^{j}} A(X, y) \cdot B(y, Z)$.

In matrix multiplication layers, the claim is always of the form $C(g_{X}, g_{Z}) \overset{?}{=} c$, and the prover proceeds by first binding $A(g_{X}, Y)$ and $B(Y, g_{Z})$ before showing that $c = \sum_{y \in \{0, 1\}^{j}} A(g_{X}, y) \cdot B(y, g_{Z})$. In the RLC claim aggregation case, we have claims

$C(g_{X}, g_{Z}) \overset{?}{=} c, \quad C(g_{X}^{'}, g_{Z}^{'}) \overset{?}{=} c^{'}$

where $g_{X} \neq g_{X}^{'}$ and $g_{Z} \neq g_{Z}^{'}$ (otherwise they would be claims from the same "source" layer and would therefore be identical). The verifier samples a random challenge $α \xleftarrow{\$} F$. In this case, our sumcheck relationship is the following:

$c + α \cdot c^{'} \overset{?}{=} \sum_{y \in \{0, 1\}^{j}} [A(g_{X}, y) \cdot B(y, g_{Z}) + α \cdot A(g_{X}^{'}, y) \cdot B(y, g_{Z}^{'})]$

Because $g_{X} \neq g_{X}^{'}$ and $g_{Z} \neq g_{Z}^{'}$, there is no way to factor the above expression's RHS to combine terms in any way, and thus RLC claim aggregation is equivalent to not aggregating claims at all and simply running two separate sumchecks on

$c \overset{?}{=} \sum_{y \in \{0, 1\}^{j}} A(g_{X}, y) \cdot B(y, g_{Z}), \quad c^{'} \overset{?}{=} \sum_{y \in \{0, 1\}^{j}} A(g_{X}^{'}, y) \cdot B(y, g_{Z}^{'})$

Input Layers (counterexample)

For input layers: RLC claim aggregation combines claims

$V_{d}(g_{1}, ..., g_{n}) \overset{?}{=} c_{d}, \quad V_{d}(g_{1}^{'}, ..., g_{n}^{'}) \overset{?}{=} c_{d}^{'}$

into a single claimed statement

$V_{d}(g_{1}, ..., g_{n}) + α \cdot V_{d}(g_{1}^{'}, ..., g_{n}^{'}) \overset{?}{=} c_{d} + α \cdot c_{d}^{'}$

For public inputs, the verifier must evaluate each of $V_{d}(g_{1}, ..., g_{n})$ and $V_{d}(g_{1}^{'}, ..., g_{n}^{'})$ on its own, and thus nothing is gained by the combination.

For committed inputs, a polynomial commitment scheme may allow for cheaper evaluation proofs in the above form (vs. two separate evaluation proofs; one for each claim), but this is generally not the case.

Interpolative Claim Aggregation

Additional reading: See Tha13, page 15 ("reducing to verification of a single point"), for another description of the protocol, and Mod24, page 15 (Section 3.4, "Claim aggregation"), for a thorough description + optimization.

Interpolative claim aggregation works by having the prover and verifier both compute an interpolating polynomial $ℓ : F \mapsto F^{n}$, such that for the claims described earlier, i.e.

$\begin{aligned} V_{i}(g_{1}^{(1)}, ..., g_{n}^{(1)}) &\overset{?}{=} c_{i}^{(1)} \\ V_{i}(g_{1}^{(2)}, ..., g_{n}^{(2)}) &\overset{?}{=} c_{i}^{(2)} \\ &\;\;⋮ \\ V_{i}(g_{1}^{(m)}, ..., g_{n}^{(m)}) &\overset{?}{=} c_{i}^{(m)} \end{aligned}$

we have that

$\begin{aligned} ℓ(1) &= (g_{1}^{(1)}, ..., g_{n}^{(1)}) \\ ℓ(2) &= (g_{1}^{(2)}, ..., g_{n}^{(2)}) \\ &\;\;⋮ \\ ℓ(m) &= (g_{1}^{(m)}, ..., g_{n}^{(m)}) \end{aligned}$

Note that the degree of $ℓ$ is $m - 1$ , as there are $m$ points for each of the $n$ coordinates which must be interpolated.

The prover then sends over the polynomial $V_{i} \circ ℓ : F \mapsto F$ , i.e. the restriction of $V_{i}$ to points in $F^{n}$ generated by $ℓ$ . Note that the degree of $V_{i} \circ ℓ$ is $(m - 1) \cdot n$ , as $V_{i}$ is multilinear in each of its variables, and each of those variables is degree at most $m - 1$ in the input variable $X$ for $(V_{i} \circ ℓ) (X)$ .

The verifier samples $r^{⋆} \xleftarrow{\$} F$ and sends it to the prover. The prover and verifier both compute $ℓ(r^{⋆}) = (r_{1}^{⋆}, ..., r_{n}^{⋆})$, and the prover proves the single claim

$V_{i}(r_{1}^{⋆}, ..., r_{n}^{⋆}) \overset{?}{=} (V_{i} \circ ℓ)(r^{⋆})$

where $V_{i} \circ ℓ$ was sent by the prover and the verifier evaluates it at $r^{⋆}$ on its own.
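The protocol above can be sketched as follows, with $V_{i} \circ ℓ$ represented by its evaluations at $t = 1, ..., n (m - 1) + 1$. This is an unoptimized illustration; the modulus `P`, the evaluation-point choice, and the names `lagrange_eval`, `ell`, and `aggregate` are assumptions rather than Remainder's implementation.

```python
import random

P = 2**61 - 1  # illustrative prime modulus; an assumption, not Remainder's field

def mle_eval(evals, point):
    """Evaluate the MLE of `evals` (first variable = most significant bit) at `point`."""
    table = [e % P for e in evals]
    for r in point:
        half = len(table) // 2
        table = [((1 - r) * table[k] + r * table[half + k]) % P for k in range(half)]
    return table[0]

def lagrange_eval(xs, ys, x):
    """Evaluate at x the unique polynomial of degree < len(xs) through (xs[k], ys[k])."""
    total = 0
    for k in range(len(xs)):
        num, den = 1, 1
        for l in range(len(xs)):
            if l != k:
                num = num * (x - xs[l]) % P
                den = den * (xs[k] - xs[l]) % P
        total = (total + ys[k] * num * pow(den, -1, P)) % P
    return total

def ell(points, t):
    """l(t): interpolate each coordinate through the m points, so that l(j) = points[j - 1]."""
    m, n = len(points), len(points[0])
    xs = list(range(1, m + 1))
    return [lagrange_eval(xs, [pt[k] for pt in points], t) for k in range(n)]

def aggregate(layer, points, claims, r_star):
    """Reduce m claims V(points[j]) = claims[j] to one claim (new_point, new_claim)."""
    m, n = len(points), len(points[0])
    # Prover: V∘l has degree at most n(m - 1), so n(m - 1) + 1 evaluations determine it.
    ts = list(range(1, n * (m - 1) + 2))
    restriction = [mle_eval(layer, ell(points, t)) for t in ts]
    # Verifier: the first m evaluations must match the original claims.
    if restriction[:m] != [c % P for c in claims]:
        raise ValueError("restriction disagrees with the claims")
    # Both parties compute l(r*); the verifier evaluates V∘l at r* on its own.
    return ell(points, r_star), lagrange_eval(ts, restriction, r_star)

layer = [2, 3, 5, 7, 11, 13, 17, 19]       # evaluations of V_i, n = 3
points = [[random.randrange(P) for _ in range(3)] for _ in range(2)]
claims = [mle_eval(layer, g) for g in points]
new_point, new_claim = aggregate(layer, points, claims, random.randrange(P))
assert mle_eval(layer, new_point) == new_claim   # holds for an honest prover
```

In a real protocol the prover would send only the evaluations beyond the first $m$, since those are already fixed by the claims.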

Costs

The prover cost for interpolative claim aggregation is as follows:

Given $m$ claims with $n$ variables each, $ℓ$ is a degree-$(m - 1)$ function in each of its components, and since $V_{i}$ is multilinear in each of its $n$ variables, $V_{i} \circ ℓ$ is a univariate polynomial with degree $n (m - 1)$. The prover must send $n (m - 1) + 1$ evaluations to the verifier, although the first $m$ have already been sent implicitly in the form of the $m$ claims.
The prover thus must evaluate $V_{i} \circ ℓ$ at $n (m - 1) + 1 - m = (n - 1) (m - 1)$ points. Each evaluation requires the prover to evaluate $ℓ$ in $O (mn)$ time, and then $V_{i}$ in $O (2^{n})$ time. The prover's total runtime is thus $O ((n - 1) (m - 1) \cdot (mn + 2^{n}))$, which is $O (mn 2^{n})$ whenever $mn = O (2^{n})$.

The proof size for interpolative claim aggregation is as follows:

As reasoned earlier in the prover cost section, the prover sends over $(n - 1) (m - 1)$ evaluations of $V_{i} \circ ℓ$ . The proof size is thus $O (mn)$ field elements.

The verifier runtime for interpolative claim aggregation is as follows:

The verifier receives evaluations of $V_{i} \circ ℓ$ from the prover and evaluates it at a random point $r^{⋆}$ . This takes $O (mn)$ time. Additionally, the verifier evaluates $ℓ (r^{⋆})$ , which takes $O (mn)$ time as well. The verifier's total runtime is thus $O (mn)$ .

Optimizations

Remainder has a few built-in optimizations for interpolative claim aggregation which substantially lower the prover costs for claims with "structure" within their evaluation points. For more details, see Mod24, page 15 (Claim Aggregation section).
