Structured ("Sector") GKR

Source: Tha13, section 5 ("Time-Optimal Protocols for Circuit Evaluation").

Review: Equality MLE

We begin by briefly recalling the MLE (see this section for more details). We first consider the binary string equality function $\text{eq} : \{0,1\}^n \times \{0,1\}^n \to \{0,1\}$, where

$$\text{eq}(b, c) = \begin{cases} 1 & b = c \\ 0 & \text{otherwise.} \end{cases}$$

This function is $1$ if and only if $b$ and $c$ are equal as binary strings, and $0$ otherwise. We can extend this to a multilinear extension via the following -- consider $\widetilde{\text{eq}} : \mathbb{F}^n \times \mathbb{F}^n \to \mathbb{F}$, where

$$\widetilde{\text{eq}}(x, y) = \prod_{i=1}^{n} \left( x_i y_i + (1 - x_i)(1 - y_i) \right).$$
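As a quick sanity check, the product formula above can be exercised in a few lines of Python over a toy prime field (the modulus here is an illustrative choice, not Remainder's actual field):

```python
import itertools

# Illustrative prime field modulus (not Remainder's actual field)
P = 2**61 - 1

def eq_tilde(x, y):
    """Multilinear extension of binary-string equality:
    eq~(x, y) = prod_i (x_i * y_i + (1 - x_i) * (1 - y_i))."""
    acc = 1
    for xi, yi in zip(x, y):
        acc = acc * (xi * yi + (1 - xi) * (1 - yi)) % P
    return acc

# On the boolean hypercube, eq~ agrees exactly with string equality.
n = 3
for x in itertools.product((0, 1), repeat=n):
    for y in itertools.product((0, 1), repeat=n):
        assert eq_tilde(x, y) == (1 if x == y else 0)
```

Since each factor is multilinear in $x_i$ and $y_i$, the product is the unique multilinear extension of the equality predicate.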

Structured Layerwise Relationship

See Tha13, page 25 ("Theorem 1") for a more rigorous treatment. Note that Remainder does not implement Theorem 1 in its entirety, and that many circuits which do fulfill the criteria of Theorem 1 are currently not expressible within Remainder's circuit frontend.

Structured layerwise relationships can loosely be thought of as data relationships where the bits of the index of the "destination" value in the $i$'th layer are a (optionally subset) permutation of the bits of the index of the "source" value in the $(i+1)$'th layer, for each layer index $i$. As a concrete example, we consider a layerwise relationship where the destination layer is half the size of the source layer, and its values are the element-wise products of the two halves of the source layer's values: Let $\tilde{g}$ represent the MLE of the destination layer, and let $\tilde{f}$ represent the MLE of the source layer.

Let the evaluations of $\tilde{f}$ over the hypercube $\{0,1\}^n$ be $f_0, f_1, \ldots, f_{2^n - 1}$. Then we wish to create a layerwise relationship such that the evaluations of $\tilde{g}$ over the hypercube $\{0,1\}^{n-1}$ are $f_0 \cdot f_{2^{n-1}},\; f_1 \cdot f_{2^{n-1}+1},\; \ldots,\; f_{2^{n-1}-1} \cdot f_{2^n - 1}$. We can actually write this as a simple rule in terms of the (integer) indices of $g$ as follows:

$$g_i = f_i \cdot f_{i + 2^{n-1}} \quad \text{for } i \in \{0, \ldots, 2^{n-1} - 1\}.$$

If we allow for our arguments to be the binary decomposition of $i$ rather than $i$ itself, we might have the following relationship:

$$g_{(b_1, \ldots, b_{n-1})} = f_{(0, b_1, \ldots, b_{n-1})} \cdot f_{(1, b_1, \ldots, b_{n-1})},$$

where $(0, b_1, \ldots, b_{n-1})$ is the binary representation of $i$ and $(1, b_1, \ldots, b_{n-1})$ is the binary representation of $i + 2^{n-1}$. This is in fact very close to the exact form-factor of the polynomial layerwise relationship which we should create between the layers -- we now consider the somewhat un-intuitive relationship

$$\tilde{g}(b_1, \ldots, b_{n-1}) = \sum_{x \in \{0,1\}^{n-1}} \widetilde{\text{eq}}\left((b_1, \ldots, b_{n-1}), x\right) \cdot \tilde{f}(0, x) \cdot \tilde{f}(1, x).$$

One way to read the above relationship is the following: for any (binary decomposition) $b = (b_1, \ldots, b_{n-1})$, the value of the destination layer at the index represented by $b$ should be $\tilde{f}(0, b) \cdot \tilde{f}(1, b)$. We are summing over all possible values of the hypercube above, $x \in \{0,1\}^{n-1}$, and for each value we check whether the current iterated hypercube value $x$ "equals" the argument value $b$. If so, we contribute $\tilde{f}(0, x) \cdot \tilde{f}(1, x)$ to the sum, and if not, we contribute zero to the sum.

In this way we see that for binary $b$, all of the summed values will be zero except for when $x$ is exactly identical to $b$, and thus only the correct value $\tilde{f}(0, b) \cdot \tilde{f}(1, b)$ will contribute to the sum (and thus the value of $\tilde{g}(b)$).

As described, the above relationship looks extremely inefficient in some sense -- why bother summing over all the hypercube values when we already know that all but one of the summands will be zero, because $\widetilde{\text{eq}}(b, x)$ evaluates to zero at every $x$ except $x = b$?

The answer is that it's not enough to only consider $\tilde{g}(b)$ for binary $b$, as our claims will be of the form $\tilde{g}(r) = v$, where $r \in \mathbb{F}^{n-1}$, and $\tilde{g}$ is the multilinear extension of the destination layer's values (see claims section for more information on prover claims). Another way to see this is that the above relationship is able to be shown for each binary $b$ individually, but we want to make sure that the relationship holds for all $2^{n-1}$ indices. Rather than checking each index individually, it's much more efficient to check a "random combination" of all values simultaneously by evaluating at a random point $r \in \mathbb{F}^{n-1}$. We thus have, instead, that

$$\tilde{g}(r) = \sum_{x \in \{0,1\}^{n-1}} \widetilde{\text{eq}}(r, x) \cdot \tilde{f}(0, x) \cdot \tilde{f}(1, x).$$

Since $\widetilde{\text{eq}}$ is identical to $\text{eq}$ (and similarly $\tilde{f}$ to $f$ and $\tilde{g}$ to $g$) everywhere on the hypercube, the above relationship should still hold for all binary $b$ in place of $r$. Moreover, the above relationship is now one which we can directly apply sumcheck to, since we have a summation over the hypercube!
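The identity can be verified numerically. Below is a small Python sketch over a toy prime field (the modulus and helper names `eq_tilde`/`mle_eval` are choices made here, not Remainder's): it builds a random source layer, forms the two-halves product layer, and checks the $\widetilde{\text{eq}}$-summation identity at a random point.

```python
import itertools, random

P = 2**61 - 1  # illustrative prime field modulus

def eq_tilde(x, y):
    """Multilinear extension of binary-string equality."""
    acc = 1
    for xi, yi in zip(x, y):
        acc = acc * (xi * yi + (1 - xi) * (1 - yi)) % P
    return acc

def mle_eval(evals, point):
    """Evaluate the MLE of `evals` (length 2^k, index bits big-endian)
    at `point` by folding out one variable at a time."""
    table = list(evals)
    for r in point:
        half = len(table) // 2
        table = [((1 - r) * table[i] + r * table[i + half]) % P
                 for i in range(half)]
    return table[0]

random.seed(0)
n = 3
f = [random.randrange(P) for _ in range(2**n)]        # source layer
half = 2**(n - 1)
g = [f[i] * f[i + half] % P for i in range(half)]     # products of the two halves

r = [random.randrange(P) for _ in range(n - 1)]       # random point in F^{n-1}
lhs = mle_eval(g, r)
rhs = sum(eq_tilde(r, x)
          * mle_eval(f, [0] + list(x)) % P
          * mle_eval(f, [1] + list(x))
          for x in itertools.product((0, 1), repeat=n - 1)) % P
assert lhs == rhs
```

The right-hand side is exactly the multilinear extension of $b \mapsto f_{(0,b)} \cdot f_{(1,b)}$, so agreement at a random point is expected.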

But wait, you might say. This still seems wasteful -- why are we bothering with this summation and the $\widetilde{\text{eq}}$ polynomial? Why can't we just have something like

$$\tilde{g}(r) = \tilde{f}(0, r) \cdot \tilde{f}(1, r)?$$

Unfortunately the above relationship cannot work, as the variables $r_1, \ldots, r_{n-1}$ appear linearly on the LHS but quadratically on the RHS. The purpose of the summation and the $\widetilde{\text{eq}}$ polynomial is to "linearize" the RHS and quite literally turn any high degree polynomial (such as $\tilde{f}(0, r) \cdot \tilde{f}(1, r)$) into its unique multilinear extension (recall the definition of multilinear extension).
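A tiny worked example makes the degree mismatch concrete. With source evaluations $(1, 2, 3, 4)$ the destination MLE is $3 + 5r$, while the naive product is $(1 + r)(3 + r)$; the two agree on $\{0, 1\}$ but diverge at $r = 2$ (illustrative Python, toy modulus):

```python
P = 2**61 - 1  # illustrative prime field modulus

def mle_eval(evals, point):
    """Evaluate the MLE of `evals` at `point` by folding variables."""
    table = list(evals)
    for r in point:
        half = len(table) // 2
        table = [((1 - r) * table[i] + r * table[i + half]) % P
                 for i in range(half)]
    return table[0]

f = [1, 2, 3, 4]                          # source layer evaluations
g = [f[0] * f[2] % P, f[1] * f[3] % P]    # destination layer: [3, 8]

# The naive identity does hold on the hypercube...
for b in (0, 1):
    assert mle_eval(g, [b]) == mle_eval(f, [0, b]) * mle_eval(f, [1, b]) % P

# ...but fails off it: the LHS is linear in r, the RHS quadratic.
r = 2
lhs = mle_eval(g, [r])                               # 3 + 5r = 13
rhs = mle_eval(f, [0, r]) * mle_eval(f, [1, r]) % P  # (1 + r)(3 + r) = 15
assert lhs == 13 and rhs == 15 and lhs != rhs
```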

We see that the general pattern of creating a "structured" layerwise relationship is as follows:

  • First, write the relationship in terms of the binary indices of values between each layer. In our case, $g_{(b_1, \ldots, b_{n-1})} = f_{(0, b_1, \ldots, b_{n-1})} \cdot f_{(1, b_1, \ldots, b_{n-1})}$.
  • Next, replace $(b_1, \ldots, b_{n-1})$ on the LHS of the equation with formal variables $(r_1, \ldots, r_{n-1})$, and allow the LHS to be a multilinear extension. We now have $\tilde{g}(r_1, \ldots, r_{n-1})$ on the LHS.
  • Next, replace $(b_1, \ldots, b_{n-1})$ on the RHS of the equation with boolean values $x = (x_1, \ldots, x_{n-1})$ and add an $\widetilde{\text{eq}}$ predicate between $r$ and $x$, and add a summation over all values $x \in \{0,1\}^{n-1}$. Additionally, extend all $f$'s to their multilinear extensions (this is importantly only for sumcheck):

$$\tilde{g}(r) = \sum_{x \in \{0,1\}^{n-1}} \widetilde{\text{eq}}(r, x) \cdot \tilde{f}(0, x) \cdot \tilde{f}(1, x).$$

Structured "Selector" Variables

Some relationships between layers are best expressed piece-wise. For example, let's say that we have a destination layer, $\tilde{g}$, and a source layer of the same size, $\tilde{f}$, each with four evaluations, where we'd like to square the first two evaluations but double the last two.

In other words, if $\tilde{f}$ has evaluations $(f_0, f_1, f_2, f_3)$ over the boolean hypercube, then $\tilde{g}$ should have evaluations $(f_0^2, f_1^2, 2f_2, 2f_3)$. If we follow our usual protocol for writing the layerwise relationship here, we would have something like the following for the "integer index" version of the relationship:

$$g_i = \begin{cases} f_i^2 & i \in \{0, 1\} \\ 2 f_i & i \in \{2, 3\}. \end{cases}$$

We notice that in binary form, with $i = (b_1, b_2)$, we have $i \in \{0, 1\}$ whenever $b_1 = 0$ (and $i \in \{2, 3\}$ when $b_1 = 1$). We can thus re-write the above as

$$g_{(b_1, b_2)} = (1 - b_1) \cdot f_{(b_1, b_2)}^2 + b_1 \cdot 2 f_{(b_1, b_2)}.$$

In other words, when $b_1 = 0$ the second summand on the RHS is zero, and the first summand is just $f_{(0, b_2)}^2$ since we already know that $b_1 = 0$, and vice versa for when $b_1 = 1$. At a first glance, this may look similar to the earlier example in which we took the products of paired values from the two halves of a layer, but the two are not the same --

  • First, the current setup is semantically quite different; in the previous example we applied a binary "shrinking" transformation between the source and destination layers by multiplying pairs of values, while in the current example we are "splitting" the circuit into two semantic halves and applying a different unary operation element-wise to each half.
  • Second, the current setup actually has its $b_1$ variable appearing outside of the argument to an MLE representing data in a previous layer. This is precisely what allows us to "select" between the two semantic halves of the circuit and compute an element-wise squaring in the first and a doubling in the second.

Applying the third transformation rule from above and extending everything into its multilinear form, we get

$$\tilde{g}(r_1, r_2) = \sum_{(x_1, x_2) \in \{0,1\}^2} \widetilde{\text{eq}}\left((r_1, r_2), (x_1, x_2)\right) \cdot \left( (1 - x_1) \cdot \tilde{f}(x_1, x_2)^2 + x_1 \cdot 2 \tilde{f}(x_1, x_2) \right).$$

However, now the observation that $\widetilde{\text{eq}}$ should only apply to variables which are nonlinear on the RHS is helpful here -- notice that although $x_2$ as a variable would be quadratic on the RHS (through the squared $\tilde{f}$ term), $x_1$ is linear once we substitute the forced values $\tilde{f}(0, x_2)$ and $\tilde{f}(1, x_2)$ into the two branches, and can thus be removed from the summation altogether and replaced directly with $r_1$:

$$\tilde{g}(r_1, r_2) = (1 - r_1) \sum_{x_2 \in \{0,1\}} \widetilde{\text{eq}}(r_2, x_2) \cdot \tilde{f}(0, x_2)^2 + r_1 \sum_{x_2 \in \{0,1\}} \widetilde{\text{eq}}(r_2, x_2) \cdot 2 \tilde{f}(1, x_2).$$

This layerwise relationship form-factor is called a "selector" in Remainder terminology and in general refers to an in-circuit version of an "if/else" statement where MLEs representing the values of layers can be broken into power-of-two-sized pieces.
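The selector form can be sanity-checked numerically. The sketch below (illustrative Python over a toy prime field; the modulus and helper names are choices made here, not Remainder's) verifies that the MLE of $(f_0^2, f_1^2, 2f_2, 2f_3)$ matches the selector expression, with the selector variable kept outside the sum, at a random point:

```python
import random

P = 2**61 - 1  # illustrative prime field modulus

def eq_tilde(x, y):
    """Multilinear extension of binary-string equality."""
    acc = 1
    for xi, yi in zip(x, y):
        acc = acc * (xi * yi + (1 - xi) * (1 - yi)) % P
    return acc

def mle_eval(evals, point):
    """Evaluate the MLE of `evals` at `point` by folding variables."""
    table = list(evals)
    for r in point:
        half = len(table) // 2
        table = [((1 - r) * table[i] + r * table[i + half]) % P
                 for i in range(half)]
    return table[0]

random.seed(1)
f = [random.randrange(P) for _ in range(4)]   # source layer (f0, f1, f2, f3)
g = [f[0] * f[0] % P, f[1] * f[1] % P,        # square the first half...
     2 * f[2] % P, 2 * f[3] % P]              # ...double the second half

r1, r2 = random.randrange(P), random.randrange(P)

# Selector form: r1 stays outside the sums; only x2 is linearized via eq~.
sq = sum(eq_tilde([r2], [x2]) * pow(mle_eval(f, [0, x2]), 2, P)
         for x2 in (0, 1)) % P
db = sum(eq_tilde([r2], [x2]) * 2 * mle_eval(f, [1, x2])
         for x2 in (0, 1)) % P
assert mle_eval(g, [r1, r2]) == ((1 - r1) * sq + r1 * db) % P
```

Both sides are multilinear in $(r_1, r_2)$ and agree on the hypercube, so they agree everywhere.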

Why Structured Circuits?

Compared to canonical GKR, structured circuits are good for two reasons -- verifier runtime and circuit description size. Note that every layerwise relationship which can be expressed as a structured layer (using $\widetilde{\text{eq}}$) can also be written as an equivalent series of $\widetilde{\text{add}}$ and $\widetilde{\text{mult}}$ gate layers.

To see why structured layers are good for circuit description size, we compare the following layer-wise relationships which describe the same circuit wiring pattern, but with different verifier cost and circuit description complexities (let $\tilde{g}$ be the destination layer's MLE over $n - 1$ variables and $\tilde{f}$ the source layer's MLE over $n$ variables, as before). Firstly, the structured version, which computes an element-wise product between paired evaluations from the two halves of the source layer:

$$\tilde{g}(r) = \sum_{x \in \{0,1\}^{n-1}} \widetilde{\text{eq}}(r, x) \cdot \tilde{f}(0, x) \cdot \tilde{f}(1, x).$$

And secondly, the multiplication gate version:

$$\tilde{g}(r) = \sum_{x, y \in \{0,1\}^n} \widetilde{\text{mult}}(r, x, y) \cdot \tilde{f}(x) \cdot \tilde{f}(y),$$

where $\widetilde{\text{mult}}$ is the multilinear extension of the wiring predicate $\text{mult}(z, x, y)$, which is $1$ exactly when destination gate $z$ computes the product of source values $x$ and $y$ (here, when $x = (0, z)$ and $y = (1, z)$).

We first consider the verifier runtime for both. Note that the sumcheck verifier performs $O(d)$ field operations per round of sumcheck for degree-$d$ round polynomials, plus the work necessary for the oracle query. In the structured relationship's case, there are $n - 1$ rounds of sumcheck, and assuming that $x_1, \ldots, x_{n-1}$ get bound to $r'_1, \ldots, r'_{n-1}$, the oracle query which the verifier must evaluate is of the form

$$s_{n-1}(r'_{n-1}) \stackrel{?}{=} \widetilde{\text{eq}}(r, r') \cdot \tilde{f}(0, r') \cdot \tilde{f}(1, r'),$$

where $s_i$ is the $i$'th univariate polynomial which the prover sends during sumcheck. The prover sends the claimed values for both $\tilde{f}(0, r')$ and $\tilde{f}(1, r')$, and so the verifier doesn't do any work there. The verifier additionally evaluates $\widetilde{\text{eq}}(r, r')$ on its own, which it can do in $O(n)$ time.

Next, we consider the verifier runtime for the multiplication gate case: let $x$ be bound to $r' \in \mathbb{F}^n$ and let $y$ be bound to $r'' \in \mathbb{F}^n$ during sumcheck. The oracle query is then

$$s_{2n}(r''_n) \stackrel{?}{=} \widetilde{\text{mult}}(r, r', r'') \cdot \tilde{f}(r') \cdot \tilde{f}(r'').$$

Similarly to the structured case, the prover sends claimed values for $\tilde{f}(r')$ and $\tilde{f}(r'')$, and so the verifier doesn't have to do any work here. However, the verifier must also evaluate $\widetilde{\text{mult}}(r, r', r'')$ on its own. This requires time linear in the sparsity of the polynomial, i.e. $O(2^{n-1})$ in this example, since there are $2^{n-1}$ nonzero multiplication gates (one for each pair of values in the source layer).

A circuit description size comparison between the two can be seen in a very similar light. In particular, the representation of $\widetilde{\text{eq}}$ requires just $O(n)$ words to store (assuming each word can hold an $n$-bit value), as we simply enumerate the index correspondence between the $x$'s and the $r$'s. On the other hand, storing the sparse representation of $\widetilde{\text{mult}}$ required for linear-time proving requires storing all nonzero evaluations, i.e. $2^{n-1}$ such indices in the above example (although one might argue that the $\widetilde{\text{mult}}$ representation here is quite structured and can therefore be further compressed).
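As a concrete check on the sparsity claim, one can enumerate the boolean evaluations of the wiring predicate for the two-halves product layer (a toy Python count; the predicate encoding below is an assumption of this sketch, not Remainder's representation):

```python
import itertools

n = 3
# Wiring predicate for the "two halves" product layer: destination gate z
# (n-1 bits) multiplies source wires x = (0, z) and y = (1, z).
def mult(z, x, y):
    return 1 if x == (0,) + z and y == (1,) + z else 0

nonzero = sum(mult(z, x, y)
              for z in itertools.product((0, 1), repeat=n - 1)
              for x in itertools.product((0, 1), repeat=n)
              for y in itertools.product((0, 1), repeat=n))
# The sparse representation must store one entry per nonzero gate.
assert nonzero == 2**(n - 1)
```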

A note on claims

The above example (and other similar layerwise relationships) give us a further prover speedup through the structure of the claims which arise from the oracle query during sumcheck. In particular, the claims which the prover makes to the verifier in the structured case are as follows:

$$\tilde{f}(0, r'_1, \ldots, r'_{n-1}), \quad \tilde{f}(1, r'_1, \ldots, r'_{n-1}).$$

These claims can be aggregated with almost no additional cost to the prover and verifier, as the last $n - 1$ challenges are identical between the two and the first challenge is precisely a $0$ and a $1$. In particular, the verifier can simply sample $r^* \in \mathbb{F}$ and have the prover instead show that

$$\tilde{f}(r^*, r'_1, \ldots, r'_{n-1}) = (1 - r^*) \cdot \tilde{f}(0, r') + r^* \cdot \tilde{f}(1, r').$$

On the other hand, the claims generated by the multiplication gate version of the layer above are of the form

$$\tilde{f}(r'_1, \ldots, r'_n), \quad \tilde{f}(r''_1, \ldots, r''_n).$$

These claims have no challenges in common, and can only be aggregated through interpolative or RLC (random linear combination) claim aggregation, both of which are significantly more expensive than the above method.
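The cheap aggregation step can be checked numerically. The sketch below (illustrative Python over a toy modulus; names like `mle_eval` and `r_star` are choices made here) verifies that two claims sharing all but their first coordinate lie on a line in that coordinate, by multilinearity:

```python
import random

P = 2**61 - 1  # illustrative prime field modulus

def mle_eval(evals, point):
    """Evaluate the MLE of `evals` at `point` by folding variables."""
    table = list(evals)
    for r in point:
        half = len(table) // 2
        table = [((1 - r) * table[i] + r * table[i + half]) % P
                 for i in range(half)]
    return table[0]

random.seed(2)
n = 4
f = [random.randrange(P) for _ in range(2**n)]
r_prime = [random.randrange(P) for _ in range(n - 1)]

c0 = mle_eval(f, [0] + r_prime)   # claim f~(0, r')
c1 = mle_eval(f, [1] + r_prime)   # claim f~(1, r')

# f~ is multilinear in its first variable, so a single fresh challenge
# r* aggregates the two claims into one claim at (r*, r').
r_star = random.randrange(P)
agg = ((1 - r_star) * c0 + r_star * c1) % P
assert mle_eval(f, [r_star] + r_prime) == agg
```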

Costs

In general (note that this does not capture all possible structured layer cases, but should be enough to give some intuition on the prover/verifier/proof size costs for most structured layers), let us assume that our layerwise relationship can be expressed in the following manner:

$$\tilde{g}(r) = \sum_{x \in \{0,1\}^{n}} \widetilde{\text{eq}}(r, x) \cdot \sum_{j=1}^{M} \prod_{k=1}^{d} \tilde{f}_{j,k}(x).$$

In other words, we have a layerwise relationship over $n$ variables, where the values in $\tilde{g}$ are a function of those in source layers represented by the MLEs $\tilde{f}_{j,k}$, such that we have $M$ total summand "groups" of products of MLEs over $n$ variables, each of size up to $d$, i.e. the total polynomial degree in each of the $x_i$ is at most $d + 1$ (including the $\widetilde{\text{eq}}$ factor).

The prover costs are as follows:

  • For simplicity here, we assume that the prover has access to any value in $\{\widetilde{\text{eq}}(r, x)\}_{x \in \{0,1\}^n}$ (note that these evaluations must be either precomputed in linear time, e.g. Tha13, or can be streamed in linear time in a clever way, e.g. Rot24).

  • Additionally, we assume that the prover has access to any value in any $\{\tilde{f}_{j,k}(x)\}_{x \in \{0,1\}^n}$, since the prover presumably knows all of the circuit values ahead of time.

  • Finally, we assume that the prover can use the "bookkeeping table folding" trick from Tha13 to compute, after binding $x_i$ to $r'_i$, both $\{\widetilde{\text{eq}}(r, (r'_1, \ldots, r'_i, x))\}_{x \in \{0,1\}^{n-i}}$ from $\{\widetilde{\text{eq}}(r, (r'_1, \ldots, r'_{i-1}, x))\}_{x \in \{0,1\}^{n-i+1}}$ and $\{\tilde{f}_{j,k}(r'_1, \ldots, r'_i, x)\}_{x \in \{0,1\}^{n-i}}$ from $\{\tilde{f}_{j,k}(r'_1, \ldots, r'_{i-1}, x)\}_{x \in \{0,1\}^{n-i+1}}$ in $O(2^{n-i})$ field operations.

  • We thus see that in the first round of sumcheck, the prover must do the following:

    • Evaluate each of the $M$ product terms in the summation for every $x \in \{0,1\}^n$ and sum them together. Each evaluation of the inner sum at a single $x$ takes $O(Md)$ field operations, as there are $M$ summands and each requires up to $d$ multiplications.
    • Since there are $2^n$ values of $x$, the above costs $O(2^n \cdot Md)$ field operations. This is the total cost for the prover to compute the claimed sum for the first round.
    • Next, the prover must evaluate the RHS of the sumchecked equation at $t \in \{2, \ldots, d + 1\}$ in the place of $x_1$ to compute the univariate sumcheck message. Each evaluation of $s_1(t)$, similarly to the above, costs $O(2^{n-1} \cdot Md)$ field operations.
    • The total cost for the prover to compute the claimed sum + univariate message in the first round is thus $O(2^n \cdot Md) + O(d \cdot 2^{n-1} \cdot Md) = O(2^n \cdot Md^2)$ field operations.
  • We can generalize the above to the $i$'th round of sumcheck (let $i = 1$ for the first round) by noting that rather than $n$ free variables, there are $n - i$ variables which are being summed over in the outer sum. Since the other values (those under the already-bound challenges $r'_1, \ldots, r'_{i-1}$) remain constant, the prover's total cost in round $i$ is simply $O(2^{n-i} \cdot Md^2)$.

  • Finally, as mentioned earlier, the prover can generate the necessary precomputed values for the $(i+1)$'th round from those of the $i$'th round in $O(Md \cdot 2^{n-i})$ field operations, as there are $Md + 1$ MLEs (including the $\widetilde{\text{eq}}$ polynomial) which need their bookkeeping tables to be "folded" in $O(2^{n-i})$ each.

  • Putting it altogether, the prover's total cost across all sumcheck rounds is thus

$$\sum_{i=1}^{n} O\left( 2^{n-i} \cdot M d^2 \right).$$

  • Since the above is a geometric series in $i$, we have that the prover's total cost across all rounds is simply $O(2^n \cdot M d^2)$ field operations.

The proof size is as follows:

  • For each of the $n$ rounds of sumcheck, the prover must send over a degree-$(d+1)$ univariate polynomial (i.e. $d + 2$ field elements) to the verifier. Additionally, the prover must send the original sum (although this is actually free in GKR since the verifier already has the prover-claimed sum implicitly through the prover's claim from a previous sumcheck's oracle query).
  • Finally, the prover must send over each of its claimed values for the $\tilde{f}_{j,k}(r')$ at the end of sumcheck. There are at most $Md$ such claims.
  • The proof size is thus simply $O(nd + Md)$ field elements.

The verifier runtime is as follows:

  • For each round $i$ of sumcheck, let the prover's univariate polynomial message be $s_i$. The verifier samples a random challenge $r'_i$ and checks whether

$$s_{i-1}(r'_{i-1}) \stackrel{?}{=} s_i(0) + s_i(1),$$

where $s_0(r'_0)$ is taken to be the original claimed sum.
  • Since the verifier can evaluate $s_i(0)$, $s_i(1)$, and $s_i(r'_i)$ in $O(d)$ and performs this check for each of the $n$ rounds of sumcheck, they can compute all the intermediate checks in $O(nd)$.
  • During the final oracle query, the verifier must check whether

$$s_n(r'_n) \stackrel{?}{=} \widetilde{\text{eq}}(r, r') \cdot \sum_{j=1}^{M} \prod_{k=1}^{d} \tilde{f}_{j,k}(r').$$

  • The verifier can compute $\widetilde{\text{eq}}(r, r')$ on its own in $O(n)$, and has access to each of the prover-claimed values for the $\tilde{f}_{j,k}(r')$. It can thus compute the RHS of the above in $O(Md)$.
  • The verifier's total runtime is thus $O(nd + Md)$.
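To make the round-by-round checks concrete, here is a toy end-to-end sumcheck transcript for the two-halves product layer. This is an illustrative Python sketch, not Remainder's implementation: the field modulus, the helper names, and the naive (non-linear-time, no table folding) prover are all choices made here for clarity.

```python
import itertools, random

P = 2**61 - 1  # illustrative prime field modulus

def eq_tilde(x, y):
    """Multilinear extension of binary-string equality."""
    acc = 1
    for xi, yi in zip(x, y):
        acc = acc * (xi * yi + (1 - xi) * (1 - yi)) % P
    return acc

def mle_eval(evals, point):
    """Evaluate the MLE of `evals` at `point` by folding variables."""
    table = list(evals)
    for r in point:
        half = len(table) // 2
        table = [((1 - r) * table[i] + r * table[i + half]) % P
                 for i in range(half)]
    return table[0]

def interp_eval(ys, t):
    """Lagrange-evaluate, at t, the polynomial taking values ys at 0..len(ys)-1."""
    total = 0
    for j, yj in enumerate(ys):
        num, den = 1, 1
        for k in range(len(ys)):
            if k != j:
                num = num * (t - k) % P
                den = den * (j - k) % P
        total = (total + yj * num * pow(den, P - 2, P)) % P
    return total

random.seed(3)
n = 3
m = n - 1                                        # number of sumcheck variables
f = [random.randrange(P) for _ in range(2**n)]
g = [f[i] * f[i + 2**m] % P for i in range(2**m)]
r = [random.randrange(P) for _ in range(m)]

def q(pt):  # the summand eq~(r, x) * f~(0, x) * f~(1, x); degree <= 3 per variable
    return eq_tilde(r, pt) * mle_eval(f, [0] + pt) % P * mle_eval(f, [1] + pt) % P

claim = mle_eval(g, r)                           # prover's claimed sum g~(r)
rs = []
for i in range(m):
    # prover: round polynomial s_i, sent as 4 evaluations (degree <= 3)
    s = [sum(q(rs + [t] + list(x))
             for x in itertools.product((0, 1), repeat=m - 1 - i)) % P
         for t in range(4)]
    # verifier: round check, then bind a fresh challenge
    assert (s[0] + s[1]) % P == claim
    ri = random.randrange(P)
    claim = interp_eval(s, ri)
    rs.append(ri)

# final oracle query: eq~ evaluated by the verifier, f~ values claimed by the prover
assert claim == q(rs)
```

Note how the final check mirrors the oracle query from the verifier-runtime discussion: the verifier computes the $\widetilde{\text{eq}}$ factor itself and plugs in the prover's two claimed $\tilde{f}$ values.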