JMLR

Test-Time Training on Video Streams

Authors
Renhao Wang Yu Sun Arnuv Tandon Yossi Gandelsman Xinlei Chen Alexei A. Efros Xiaolong Wang
Research Topics
Machine Learning
Paper Information
  • Journal:
    Journal of Machine Learning Research
  • Added to Tracker:
    Jul 15, 2025
Abstract

Prior work has established Test-Time Training (TTT) as a general framework to further improve a trained model at test time. Before making a prediction on each test instance, the model is first trained on the same instance using a self-supervised task such as reconstruction. We extend TTT to the streaming setting, where multiple test instances - video frames in our case - arrive in temporal order. Our extension is online TTT: The current model is initialized from the previous model, then trained on the current frame and a small window of frames immediately before. Online TTT significantly outperforms the fixed-model baseline for four tasks, on three real-world datasets. The improvements are more than 2.2x and 1.5x for instance and panoptic segmentation. Surprisingly, online TTT also outperforms its offline variant that accesses strictly more information, training on all frames from the entire test video regardless of temporal order. This finding challenges those in prior work using synthetic videos. We formalize a notion of locality as the advantage of online over offline TTT, and analyze its role with ablations and a theory based on bias-variance trade-off.

Author Details
Renhao Wang
Author
Yu Sun
Author
Arnuv Tandon
Author
Yossi Gandelsman
Author
Xinlei Chen
Author
Alexei A. Efros
Author
Xiaolong Wang
Author
Research Topics & Keywords
Machine Learning
Research Area
Citation Information
APA Format
Renhao Wang , Yu Sun , Arnuv Tandon , Yossi Gandelsman , Xinlei Chen , Alexei A. Efros & Xiaolong Wang . Test-Time Training on Video Streams. Journal of Machine Learning Research .
BibTeX Format
@article{JMLR:v26:24-0439,
  author  = {Renhao Wang and Yu Sun and Arnuv Tandon and Yossi Gandelsman and Xinlei Chen and Alexei A. Efros and Xiaolong Wang},
  title   = {Test-Time Training on Video Streams},
  journal = {Journal of Machine Learning Research},
  year    = {2025},
  volume  = {26},
  number  = {9},
  pages   = {1--29},
  url     = {http://jmlr.org/papers/v26/24-0439.html}
}
Related Papers