Skip to content

Code for "SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields" (ECCV 2024)

License

Notifications You must be signed in to change notification settings

YuLiu-LY/SlotLifter

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SlotLifter

Paper arXiv Paper PDF Project Page

This repository contains the official implementation of the ECCV 2024 paper:

SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields

YuLiu*,Baoxiong Jia*,Yixin Chen, Siyuan Huang

Environment Setup

We provide all environment configurations in requirements.txt. To install all packages, you can create a conda environment and install the packages as follows:

conda create -n slotlifter python=3.8
conda activate slotlifter
conda install pytorch==2.1.0 torchvision==0.16.0 torchaudio==2.1.0 pytorch-cuda=12.1 -c pytorch -c nvidia
pip install -r requirements.txt

In our experiments, we used NVIDIA CUDA 12.1 on Ubuntu 22.04. Similar CUDA version should also be acceptable with corresponding version control for torch and torchvision.

Dataset

1. CLEVR567, Room-Chair, Room-Diverse

CLEVR567, Room-Chair, and Room-Diverse datasets are provided by uORF.

2. Room-Texture, Kitchen-Matte, Kitchen-Shiny

Room-Texture, Kitchen-Matte, and Kitchen-Shiny datasets are provided by uOCF.

3.ScanNet

Download the ScanNet dataset here and process it with the official codes to obtain images, poses, and intrinsics. We use 100 scenes (from scene0001_00 to scene0101_00 except scene0079_00 which is in the test split) for training. We sample about 400 views and resize each image to a resolution of 640 × 480. We organize the dataset as below:

├──scannet/
    ├──scene0001_00/
        ├──color_480640/
            ├──0.jpg
        ├──pose/
            ├──0.txt
        ├──intrinsic/
            ├──intrinsic_color.txt

Following previous works, we use the test data provided by NerfingMVS.

4.DTU

Download the DTU dataset provided by PixelNeRF here.

Training & Evaluation

We provide training and testing scripts under scripts/ for all datasets.

  • train_uorf_data.sh and eval_uorf_data.sh: CLEVR567, Room-Chair, Room-Diverse, Room-Texture, Kitchen-Matte, and Kitchen-Shiny dataset
  • train_scannet.sh and eval_scannet.sh: Scannet dataset
  • train_dtu.sh and eval_dtu.sh: DTU dataset

Download pre-trained checkpoints here.

Citation

If you find our paper and/or code helpful, please consider citing:

@inproceedings{Liu2024slotlifter,
  title={SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields},
  author={Liu, Yu and Jia, Baoxiong and Chen, Yixin and Huang, Siyuan},
  booktitle={European Conference on Computer Vision (ECCV)},
  year={2024}
}

Acknowledgement

This code heavily used resources from PanopticLifting, BO-QSA, SLATE, OSRT, IBRNet, and uORF. We thank the authors for open-sourcing their awesome projects.

About

Code for "SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields" (ECCV 2024)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published