Face Recognition

Definition

The task of verifying or identifying a person from their face image; framed as a metric learning problem where the model learns an embedding space where same-identity faces cluster together.

Intuition

Training a separate classifier per identity requires retraining when new identities are added. Metric learning instead trains an embedding where distance directly measures similarity — enabling one-shot recognition of unseen identities.

Formal Description

Verification vs identification:

  • Verification: given two images, are they the same person? (binary)
  • Identification: given a probe image, who is this person? (retrieval)

Embedding network , typically or ; trained so is small for same identity, large for different.

Triplet loss: see triplet_loss in 01_foundations/deep_learning_theory; anchor-positive-negative triplets with margin .

Siamese networks: two branches with shared weights processing two images; outputs compared directly.

Deployment: compare probe embedding against gallery embeddings; threshold on distance for accept/reject.

Applications

Phone unlock, building access control, photo management (tagging), law enforcement (surveillance).

Trade-offs

  • Requires large labeled datasets of identity pairs/triplets
  • Hard triplet mining is critical for convergence
  • Bias/fairness concerns (demographic performance disparities)
  • Privacy implications