Skip to content
Technical GlossarySpeech, Voice and Audio AI

Speaker Embeddings

Dense vector representations that capture speaker identity in a discriminative form.

Speaker embeddings form the basis of modern speaker recognition systems. The goal is to place samples from the same person close together and samples from different people far apart in vector space. This supports verification, clustering, and diarization workflows alike. It is one of the key representations that makes voice biometrics scalable and flexible.