logo 4DSegStreamer

Streaming 4D Panoptic Segmentation
via Dual Threads Track

Ling Liu1*, Jun Tian2*, Li Yi2,3,4†
1Institut Polytechnique de Paris, 2IIIS, Tsinghua University, 3Shanghai Qi Zhi Institute, 4Shanghai AI Lab
ICCV 2025
*Equal Contribution Corresponding Author

Abstract

4D panoptic segmentation in a streaming setting is critical for highly dynamic environments, such as evacuating dense crowds and autonomous driving in complex scenarios, where real-time, fine-grained perception within a constrained time budget is essential. In this paper, we introduce 4DSegStreamer, a novel framework that employs a Dual-Thread System to efficiently process streaming frames. Our method consists of a predictive thread and an inference thread. The predictive thread leverages historical motion and geometric information to extract features and forecast future dynamics. The inference thread ensures timely prediction for incoming frames by aligning with the latest memory and compensating for ego-motion and dynamic object movements. We evaluate 4DSegStreamer on the indoor HOI4D dataset and the outdoor SemanticKITTI and nuScenes datasets. Comprehensive experiments demonstrate the effectiveness of our approach, particularly in accurately predicting dynamic objects in complex scenes.