Face Recognition

Definition

The task of verifying or identifying a person from their face image; framed as a metric learning problem where the model learns an embedding space where same-identity faces cluster together.

Intuition

Training a separate classifier per identity requires retraining when new identities are added. Metric learning instead trains an embedding where distance directly measures similarity — enabling one-shot recognition of unseen identities.

Formal Description

Verification vs identification:

  • Verification: given two images, are they the same person? (binary)
  • Identification: given a probe image, who is this person? (retrieval)

Embedding network , typically or ; trained so is small for same identity, large for different.

Triplet loss: anchor-positive-negative triplets with margin :

Hard triplet mining (selecting difficult negatives) is critical for convergence.

Siamese networks: two branches with shared weights processing two images; outputs compared directly.

Deployment: compare probe embedding against gallery embeddings; threshold on distance for accept/reject.

Applications

Phone unlock, building access control, photo management (tagging), law enforcement (surveillance).

Trade-offs

  • Requires large labeled datasets of identity pairs/triplets
  • Hard triplet mining is critical for convergence
  • Bias/fairness concerns (demographic performance disparities)
  • Privacy implications