Table 1. Quantitative comparison on Neural 3D dataset. In the Colmap column, SA denotes 'Sparse point cloud for All frames' and D0 denotes 'Dense point cloud for the 0th frame'. Following the original STG paper, which reports training six models for every 50 frames, we provide results for both the multi-model approach and a single-model approach trained on the full 300-frame sequence.

Method Colmap Preproc. Time ↓ PSNR ↑SSIM ↑LPIPS ↓ Train Time ↓FPS ↑Storage ↓Frames
4DGSD06 mins 28.720.93060.1528 33 mins9840.3300
STGSA25 mins 31.750.94730.1423 2h 43mins683127.550×6
STGSA25 mins 31.460.94320.1474 29 mins53254.0300
TaylorGSA25 mins 29.800.95580.1597 9 hours125205.7300
Swift4DD018 mins 29.930.93830.1370 19 mins273141.2300
DeGaussD06 mins 30.160.93570.1430 1h 27mins95117.5300
OURS-35K4 sec 32.350.94800.1295 10 mins76623.1300
OURS-45K4 sec 32.720.95020.1221 14 mins75523.7300