
120 Chapter 4
The geometric priors we have described so far do not prescribe a specific
architecture for building such representation, but rather a series of necessary
conditions. However, they hint at an axiomatic construction that provably sat-
isfies these geometric priors, while ensuring a highly expressive representation
that can approximate any target function satisfying such priors.
4.5.1 Invariant and Equivariant Layers
A simple initial observation is that, in order to obtain a highly expressive repre-
sentation, we are required to introduce a non-linear element, since if f is linear
and G-invariant, then for all x ∈X(Ω),
12
f (x) =
1
µ(G)
Z
G
f (g.x)dµ(g) = f
1
µ(G)
Z
G
(g.x)dµ(g)
,
which indicates that F only depends on x through the G-average Ax =
1
µ(G)
R
G
(g.x)dµ(g). In the case of images and translation, this would entail using
only the average RGB color of the input!
While this reasoning shows that the family of linear invariants is not a very
rich object, the family of linear equivariants provides a much more powerful
tool, since it enables the construction of rich and stable features by com-
position with appropriate non-linear maps, as we will now explain. Indeed,
if B : X(Ω, C) →X(Ω, C
′
) is G-equivariant satisfying B(g.x) = g.B(x) for all
x ∈X and g ∈G, and σ : C
′
→C
′′
is an arbitrary (non-linear) map, then we
easily verify that the composition U := (σ ◦B) : X(Ω, C) →X(Ω, C
′′
) is also G-
equivariant, where σ : X(Ω, C
′
) →X(Ω, C
′′
) is the element-wise instantiation
of σ given as (σ(x))(u) := σ(x(u)).
This simple property allows us to define a very general family of G-
invariants, by composing U with the group averages A ◦U : X(Ω, C) →C
′′
.
A natural question is thus whether any G-invariant function can be approx-
imated at arbitrary precision by such a model, for appropriate choices of B
and σ. It is not hard to adapt the standard Universal Approximation Theorems
from unstructured vector inputs to show that shallow ‘geometric’ networks are
also universal approximators, by properly generalising the group average to
a general non-linear invariant
13
. However, as already described in the case of
Fourier versus Wavelet invariants, there is a fundamental tension between shal-
low global invariance and deformation stability. This motivates an alternative
representation, which considers instead localised equivariant maps
14
.
Assuming that Ω is further equipped with a distance metric d, we call an
equivariant map U localised if (Ux)(u) depends only on the values of x(v) for
N
u
= {v : d(u, v) ≤r}, for some small radius r; the latter set N
u
is called the
receptive field.