Skip to content
Technical GlossaryDeep Learning

Truncated BPTT

A method that makes training more tractable on long sequences by applying backpropagation over a limited window.

Truncated BPTT is used to reduce the cost of full backpropagation through time on very long sequences. The model computes gradients only across a fixed historical window. This improves computational efficiency, but it may weaken learning of very long dependencies. It therefore represents an engineering trade-off between practicality and contextual coverage.