Kendall shape space: a working sketch

This note is meant as a complete sketch rather than a finished tutorial. The goal is to collect the main definitions, mental pictures, and technical facts I want to use later when writing a fuller note on Kendall shape space.

Why Kendall Shape Space Shows Up So Often

When shapes are represented by labeled landmarks, the raw coordinates still contain nuisance degrees of freedom:

translation
global scale
rotation

Kendall’s construction removes those transformations and asks us to study the geometry of what remains: shape itself.

That is the attractive part of the framework. It turns a messy collection of landmark configurations into a geometric object on which distances, means, geodesics, and statistics make sense.

What is removed

Absolute position, overall size, and orientation.

What remains

The relative arrangement of landmarks, which is what we usually want to call shape.

The Starting Point: Landmark Configurations

Take (k) ordered landmarks in (\mathbb{R}^m). A configuration can be written as a matrix

[ X \in \mathbb{R}^{k \times m}. ]

Each row corresponds to a landmark. At this stage we are not yet in shape space. We are still in the ambient Euclidean space of all configurations.

The first two operations are straightforward:

center the configuration to remove translation
normalize its size to remove global scale

If (H) is the centering operator, then the centered and normalized configuration is

[ Z = \frac{HX}{|HX|_F}. ]

This (Z) is called a pre-shape.

Configuration space

All landmark coordinates, including translation, scale, and rotation.

Pre-shape space

Centered, unit-norm configurations. This is already a curved object: a sphere.

Shape space

Pre-shapes modulo rotation. This is Kendall's shape space.

The Pre-shape Sphere

The normalization step places (Z) on a unit sphere in a linear subspace. For centered configurations, the dimension drops by (m), and the unit-norm constraint removes one more degree of freedom. So pre-shapes live on

[ S^{m(k-1)-1}. ]

This sphere is not the final answer, but it is the crucial intermediate object. Many formulas become cleaner once we realize that landmark shape analysis first passes through this spherical geometry.

Technical note

Calling pre-shape space a sphere is not just a visual analogy. The unit Frobenius norm constraint really defines a sphere inside the centered configuration subspace.

Quotienting Out Rotations

Two pre-shapes that differ only by a rotation should represent the same shape. So the final shape space is the quotient

[ \Sigma_m^k = S^{m(k-1)-1} / SO(m). ]

This is Kendall shape space for (k) landmarks in (m) dimensions.

That quotient notation is the compact way to say:

start from normalized landmark configurations
identify all rotated versions as equivalent
treat each equivalence class as one shape

This is where the geometry becomes richer. We are no longer on an ordinary Euclidean space, and many statistics need to be defined intrinsically or through tangent approximations.

The Famous Two-dimensional Case

For planar landmarks, the quotient has a particularly elegant form:

[ \Sigma_2^k \cong \mathbb{CP}^{k-2}. ]

So in 2D, Kendall shape space is complex projective space.

This is one reason the theory is so nice in the planar case: the quotient geometry can be written very compactly, and several quantities admit closed or well-studied expressions.

For 3D landmarks, the geometry is still meaningful and useful, but it is less algebraically neat than the planar setting.

Distances: Procrustes Enters the Picture

One of the central quantities is the full Procrustes distance. On pre-shapes (Z_1) and (Z_2), the corresponding shape distance can be written through the optimal alignment over rotations:

[ d([Z_1], [Z_2]) = \arccos \left( \max_{R \in SO(m)} \langle Z_1, Z_2 R \rangle \right). ]

This expression already reveals the structure of the problem:

align the two pre-shapes as well as possible
measure how close they are once only shape-relevant variation remains

The important point is that distances are not measured directly in raw coordinate space. They are measured after quotienting out nuisance transformations.

Tangent Spaces and Linearization

Even though the geometry is nonlinear, local computations often happen in tangent spaces. Around a reference shape, one can:

map nearby shapes to a tangent space
perform linear statistics there
map results back when needed

This is the basic logic behind many practical procedures:

tangent PCA or principal geodesic analysis
approximate means and covariances
local regression and visualization

This is also where a lot of misuse can happen: once the data are spread too widely, tangent linearization may stop being a good approximation.

Statistical Questions That Matter

Once shapes live on a manifold or quotient manifold, classical Euclidean statistics need reinterpretation.

The questions I want this note to keep in focus are:

what is the right notion of mean shape?
how do we describe principal directions of variation?
when is a tangent approximation good enough?
how much of the observed variability is geometric, and how much is due to landmarking noise?

The Fréchet mean becomes the natural replacement for the Euclidean average, and principal geodesic analysis becomes the curved analogue of PCA.

Why This Matters For Modern Geometry Processing

Kendall shape space is not only a classical statistics object. It is also a useful conceptual bridge for current geometric learning problems.

It reminds us that:

representation choices encode invariances
quotient geometry often appears before learning does
a good latent space should respect the symmetries of the data, not merely fit it

That is why Kendall’s construction still matters even in deep learning contexts: it gives a clean example of how to separate nuisance transformations from intrinsic variation.

A Minimal Computational Skeleton

def preshape(X):
    Xc = X - X.mean(axis=0, keepdims=True)
    return Xc / frobenius_norm(Xc)


def kendall_distance(Z1, Z2):
    R = optimal_rotation(Z1, Z2)
    return arccos(inner_product(Z1, Z2 @ R))

This pseudocode hides several details, but it captures the logic:

center
normalize
align over rotations
measure distance on the quotient

What I Still Want To Expand

This sketch is complete enough to orient the note, but the full version should still add:

a clean derivation of the tangent space constraint
the difference between full and partial Procrustes viewpoints
more intuition for geodesics on the pre-shape sphere versus the quotient
one concrete worked example with a tiny landmark dataset
a short discussion of how Kendall geometry compares with diffeomorphic or correspondence-based shape spaces

Keywords

Kendall shape space, statistical shape analysis, Procrustes analysis, landmark configurations, pre-shape sphere, quotient geometry, Fréchet mean, principal geodesic analysis.