Skip to content
Technical GlossaryComputer Vision

Spatio-Temporal Convolution

A convolutional approach that jointly models spatial patterns and temporal change within video.

Spatio-temporal convolution treats video not as a set of independent frames, but as a volumetric structure containing motion. This allows more effective learning of action patterns, motion flow, and event dynamics. It produces strong representations especially for action recognition and behavior analytics. It is one of the core building blocks of deep learning for video.