Splat Feature Solver

arXiv GitHub stars License Website Bilibili YouTube

📄 Paper

This repository contains the official implementation of our paper:

Splat Feature Solver
arXiv:2508.12216

If you find this code useful, please consider citing our work.

Quickstart

  • A simple tutorial can be found via the Bilibili and YouTube badges above, or directly via the following links:
  • User Simplified Version: August 22nd

This project is built on top of the gaussian-splatting, gsplat, beta-splatting, LAGA, and FeatUp code bases. The authors are grateful to the original authors for their open-source contributions.

Installation Steps

  1. Clone the Repository:

    git clone --single-branch --branch main https://github.com/saliteta/splat-distiller.git
    cd splat-distiller
  2. Set Up the Conda Environment: Note that if the following commands work, you are most likely on a Linux machine; otherwise, check the compiler setup in step 3. A quick sanity check of the finished install is sketched after these steps.

    conda env create -f environment.yml
    conda activate splat-distiller
    pip install torch==2.7.0 torchvision==0.22.0 torchaudio==2.7.0  --index-url https://download.pytorch.org/whl/cu128
    conda install -c pytorch -c nvidia faiss-gpu -y
    pip install .
  3. Set Up the Compiler (GCC | MSVC) (Optional; if the previous steps produced no errors, skip this)

    # On Linux, make sure a compiler is installed in the conda env; you can install compilers manually through conda-forge.
    # On Windows, nvcc relies on the Microsoft Visual C++ compiler; set it to MSVC 2022 (or whichever version matches your nvcc).
    # You may need to set the path manually, like the following:
    set PATH=C:\Program Files (x86)\Microsoft Visual Studio\2022\BuildTools\VC\Tools\MSVC\${your MSVC version}\bin\Hostx64\x64\;%PATH%
  4. Set Up SAM (Optional; needed if you want to use the SAM OpenCLIP model)

      wget https://dl.fbaipublicfiles.com/segment_anything/sam_vit_h_4b8939.pth
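
After these steps, a quick sanity check confirms that the CUDA-enabled PyTorch build can see your GPU. This is a generic PyTorch check, not a project-specific script:

    # Check that the installed PyTorch build has working CUDA support
    python -c "import torch; print(torch.__version__, torch.cuda.is_available())"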

Processing your own Scenes

  • If you just want to reproduce the results in the paper, skip this part. The project expects the following dataset structure in the source path location:
<location>
|---images
|   |---<image 0>
|   |---<image 1>
|   |---...
|---features
|   |---<feature 0>
|   |---<feature 1>
|   |---...
|---sparse
    |---0
        |---cameras.bin
        |---images.bin
        |---points3D.bin

To prepare the required dataset, please put the images you want to use in a directory <location>/input.

<location>
|---input
    |---<image 0>
    |---<image 1>
    |---...

Then:

1. Calibrate images via COLMAP

python convert.py -s <location>
Command Line Arguments for convert.py

  • --no_gpu: Flag to avoid using the GPU in COLMAP.
  • --skip_matching: Flag to indicate that COLMAP info is already available for the images.
  • --source_path / -s: Location of the inputs.
  • --camera: Which camera model to use for the early matching steps; OPENCV by default.
  • --resize: Flag for creating resized versions of the input images.
  • --colmap_executable: Path to the COLMAP executable (.bat on Windows).
  • --magick_executable: Path to the ImageMagick executable.
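
For example, a typical calibration run might look like the following (data/my_scene is a placeholder path; only flags listed above are used):

python convert.py -s data/my_scene --camera OPENCV --resize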


2. Extract features via FeatUp or SAMOpenCLIP

python feature_extractor.py -s <location> --model <model> --sam_ckpt_path <required only if using SAMOpenCLIP>
Command Line Arguments for feature_extractor.py

  • --source_path / -s: Location of the inputs.
  • --model: Select the 2D foundation model from the list: dino16, dinov2, clip, maskclip, vit, resnet50, SAMOpenCLIP (used for the paper metrics).
  • --sam_ckpt_path: Path to the SAM checkpoint (optional; only needed when using SAMOpenCLIP).
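
For example, to extract the SAMOpenCLIP features used for the paper metrics, with the checkpoint downloaded in installation step 4 (data/my_scene and the checkpoint path are placeholders):

python feature_extractor.py -s data/my_scene --model SAMOpenCLIP --sam_ckpt_path ./sam_vit_h_4b8939.pth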


3. Feature Lifting

python distill.py --method {3DGS, 2DGS, or DBS} --ckpt {path to the pretrained model; currently supports .ply (inria/dbs) and .ckpt (gsplat)} --dir {COLMAP folder} --feature_folder {subfolder, e.g. SAMOpenCLIP_features} --quantize {whether to use post-lifting aggregation} --tikhonove {lambda parameter, around 1 to 1.2; 1 means no Tikhonov regularization}
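
As a concrete sketch (all paths are illustrative placeholders; the ckpt_29999 name follows the convention mentioned in the older-viewer section below, and --quantize is omitted here):

python distill.py --method 3DGS --ckpt results/my_scene/ckpts/ckpt_29999.ckpt --dir data/my_scene --feature_folder SAMOpenCLIP_features --tikhonove 1.1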

4. Visualization

python general_viewer.py --ckpt {path to the pretrained model; currently supports .ply (inria/dbs) and .ckpt (gsplat)} --splat_method {3DGS, 2DGS, or DBS} --feature_ckpt {feature location as an absolute path} --text_encoder {SAMOpenCLIP, SAM2OpenCLIP, ...}
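
For example (placeholder paths; note that the feature checkpoint is expected as an absolute path):

python general_viewer.py --ckpt results/my_scene/ckpts/ckpt_29999.ckpt --splat_method 3DGS --feature_ckpt /absolute/path/to/ckpt_29999_feature --text_encoder SAMOpenCLIP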

Expected Output

We currently support RGB, feature PCA, and attention map rendering. One can also perform segmentation using the segmentation button. We also support 2DGS, 3DGS, and DBS.

Evaluation Scripts

python eval.py --data-dir ${the COLMAP path} --ckpt ${where you stored the trained model} --label-dir ${directory of stored JSON label files}
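
For example (placeholder paths, following the results layout described below):

python eval.py --data-dir data/my_scene --ckpt results/my_scene/ckpts/ckpt_29999.ckpt --label-dir data/my_scene/labels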

All in One Scripts

To run feature extraction, model training, feature lifting, rendering, and evaluation metrics in one script for lerf_ovs, we provide the following:

  • Download Dataset
gdown 1QF1Po5p5DwTjFHu6tnTeYs_G0egMVmHt
unzip lerf_ovs
rm lerf_ovs.zip
##### Modify the config file accordingly before running
python benchmark.py --config ${Your config .yaml} # default is for_metrics.yaml in config/

All the results should be in the "results" folder, separated by scene name. Each scene folder contains the following; an illustrative layout is sketched after the list:

  1. ckpts: the trained geometry and the lifted features
  2. Features: rendered features from the evaluation camera poses
  3. Feature_PCA: rendered features after PCA, for visualization, from the evaluation camera poses
  4. metrics_images: segmented results compared to ground truth
  5. renders: additional rendered RGB images from different angles, at different training iterations
  6. RGB: final rendered RGB at the evaluation camera poses
  7. frame_metrics.csv: per-frame calculated results and the whole-scene average
  8. others: training logs
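
Putting this together, an illustrative results layout (folder names as listed above; exact names may differ slightly):

results
|---<scene name>
|   |---ckpts
|   |---Features
|   |---Feature_PCA
|   |---metrics_images
|   |---renders
|   |---RGB
|   |---frame_metrics.csv
|---...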

Here are some examples: the left image is the feature PCA, and the right one is the metrics image.

Visualization and Query in 3D (older viewer: only supports 3DGS and has no segmentation, but offers more visualization modes, such as depth)

To visualize and query in 3D, one can do the following. If one is using 3DGS:

cd gaussian_splatting
python simple_viewer.py --ckpt {in the ckpt folder, there is a ckpt_29999} --feature_ckpt {optional; the default is ckpt_29999_feature in the same folder}

Examples: the left image shows relevance mode with the text query "eggs", and the right one is the rendered RGB.

Adapting LAGA format Feature Extraction

LAGA, OpenGaussian, and LangSplat all use roughly the same feature-extraction pipeline; however, it usually takes around 2 hours to process one scene. By utilizing their preprocessed features, we can further boost our mIoU at the cost of time. To use them, simply do the following:

pip install git+https://github.com/facebookresearch/segment-anything.git

Then we can run:

cd laga
python pre_processing.py --help
python feature_converter.py --help

The first script generates the LangSplat original data; the second converts the features to our format.

LangSplat Feature and Positional Embeddings

Currently, we add all of this information into metrics.py. One needs to explicitly set --text-encoder to SAMOpenCLIP to make it work:

python eval.py --text-encoder {sam_clip} ...

We re-wrote the CLIP+SAM feature extraction plus clustering for segmentation

  • It is still lifting-based, and actually really fast.
  • We added the quick clustering; it is a little hard to install on a 4090, so we changed to rapidsai 25.06.
  • CUDA Toolkit 12.6, PyTorch 2.7.1.
  • You should additionally install omegaconf and hydra.

To run it, modify the relevant part of config/for_metrics.yaml and run:

python benchmark.py

Some examples

📚 Citation

If you use this code or paper in your research, please cite:

@misc{xiong2025splatfeaturesolver,
    title={Splat Feature Solver}, 
    author={Butian Xiong and Rong Liu and Kenneth Xu and Meida Chen and Andrew Feng},
    year={2025},
    eprint={2508.12216},
    archivePrefix={arXiv},
    primaryClass={cs.CV},
    url={https://arxiv.org/abs/2508.12216}, 
}
