Unlocking Structural, Photometric, and Semantic Cues for Sparse-View 3D Gaussian Splatting

Overview

Sparse-view 3D Gaussian Splatting often suffers from structural degradation, floating primitives, fragmented geometry, and unreliable appearance estimation under limited supervision. SPS-GS addresses this problem by jointly modeling structural, photometric, and semantic cues within a unified optimization framework.

Specifically, SPS-GS is built on three complementary components:

Structural cue: a topology-aware graph regularizer that removes spurious Gaussians and improves connectivity, producing a more coherent structural scaffold.
Photometric cue: a GNN-based propagation module that transfers reliable appearance information to weakly observed regions, improving texture consistency under sparse views.
Semantic cue: a rarity- and boundary-aware modulation strategy that places more emphasis on informative boundaries and underrepresented structures, helping preserve thin structures and fine details.

By combining these three cues, SPS-GS improves both geometric stability and rendering quality for sparse-view novel view synthesis.

Related Work Comparison

Comparison of representative sparse-view reconstruction methods with respect to structural, photometric, and semantic cues.

Qualitative Results

LLFF (3 Views)

Qualitative comparison on LLFF with three input views.

mip-NeRF 360 (24 Views)

Qualitative comparison on mip-NeRF 360 with twenty-four input views.

Blender (8 Views) and Tanks and Temples (3 Views)

Qualitative results on Blender and Tanks and Temples

Qualitative comparison on Blender with eight input views and Tanks and Temples with three input views.

Environmental Setups

Tested on Ubuntu, CUDA 11.8, and Python 3.10.

conda create -n SPSgs python=3.10
conda activate SPSgs

Install PyTorch and the Python dependencies:

pip install -r requirements.txt --extra-index-url https://download.pytorch.org/whl/cu118

Fetch the third-party repositories on demand:

bash script/fetch_third_party.sh

Then install the CUDA extensions:

pip install submodules/diff-gaussian-rasterization
pip install submodules/simple-knn

Running

We directly provide the dense initialization point clouds used by the method. No additional initialization-point generation step is required before training.

Taking LLFF as an example, the expected dataset layout is:

├── data
    ├── nerf_llff_data_colmap
        ├── fern
            ├── sparse
            │   └── 0
            └── images
├── keypoints_to_3d
    ├── LLFF
        └── fern_keypoints_to_3d.ply

LLFF

Download the LLFF dataset and place each scene under:

./data/nerf_llff_data_colmap/

The provided dense initialization point clouds are already included under:

./keypoints_to_3d/LLFF/

Run using the following commands:

CUDA_VISIBLE_DEVICES=0 python train.py \
    -s ./data/nerf_llff_data_colmap/fern \
    -m ./output/LLFF/fern/3 \
    --n_views 3 \
    --dataset_name LLFF \
    --resolution 2 \
    --eval

CUDA_VISIBLE_DEVICES=0 python render.py \
    --model_path ./output/LLFF/fern/3 \
    --n_views 3 \
    --skip_train \
    --resolution 2 \
    --eval \
    --dataset_name LLFF

CUDA_VISIBLE_DEVICES=0 python metrics.py \
    -m ./output/LLFF/fern/3

Acknowledgements

Our code follows several awesome repositories. We appreciate them for making their codes available to public.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
arguments		arguments
assets		assets
gaussian_renderer		gaussian_renderer
keypoints_to_3d/LLFF		keypoints_to_3d/LLFF
lpipsPyTorch		lpipsPyTorch
scene		scene
script		script
utils		utils
LICENSE.md		LICENSE.md
README.md		README.md
metrics.py		metrics.py
render.py		render.py
requirements.txt		requirements.txt
spiral.py		spiral.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Unlocking Structural, Photometric, and Semantic Cues for Sparse-View 3D Gaussian Splatting

Overview

Related Work Comparison

Qualitative Results

LLFF (3 Views)

mip-NeRF 360 (24 Views)

Blender (8 Views) and Tanks and Temples (3 Views)

Environmental Setups

Running

LLFF

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Unlocking Structural, Photometric, and Semantic Cues for Sparse-View 3D Gaussian Splatting

Overview

Related Work Comparison

Qualitative Results

LLFF (3 Views)

mip-NeRF 360 (24 Views)

Blender (8 Views) and Tanks and Temples (3 Views)

Environmental Setups

Running

LLFF

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages