Skip to content

Cross-modal Semantic Transfer for Point Cloud Semantic Segmentation

Notifications You must be signed in to change notification settings

a4152684/StreetSeg

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

StreetSeg

Please refer to https://github.com/WHU-USI3DV/StreetSeg

Cross-modal Semantic Transfer for Point Cloud Semantic Segmentation
Zhen Cao, Xiaoxin Mi, Bo Qiu, Zhipeng Cao, Chen Long, Xinrui Yan, Chao Zheng, Zhen Dong, Bisheng Yang

StreetSeg

Abstract: 3D street scene semantic segmentation is essential for urban understanding. However, supervised point cloud semantic segmentation networks heavily rely on expensive manual annotations and demonstrate limited generalization capabilities across datasets, which poses limitations in a range of downstream tasks. In contrast, image segmentation networks exhibit stronger generalization. Fortunately, mobile laser scanning systems can collect images and point clouds simultaneously, offering a potential solution for 2D-3D semantic transfer. In this paper, we introduce a cross-modal label transfer framework for point cloud semantic segmentation, without the supervision of 3D semantic annotation. Specifically, the proposed method takes point clouds and the associated posed images of a scene as inputs, and accomplishes the pointwise semantic segmentation for point clouds. We first get the image semantic pseudo-labels through a pre-trained image semantic segmentation model. Building on this, we construct implicit neural radiance fields (NeRF) to achieve multi-view consistent label mapping by jointly constructing color and semantic fields. Then, we design a superpoint semantic module to capture the local geometric features on point clouds, which contributes a lot to correcting semantic errors in the implicit field. Moreover, we introduce a dynamic object filter and a pose adjustment module to address the spatio-temporal misalignment between point clouds and images, further enhancing the consistency of the transferred semantic labels. The proposed approach has shown promising outcomes on two street scene datasets, namely KITTI-360 and WHU-Urban3D, highlighting the effectiveness and reliability of our method.

Dataset

Download the two datasets:

For pseudo-labels, refer to SegFormer, Mask2Former or other image semantic segmentation methods.

For superpoint generation, refer to Lin et al.

Run

To train/test the method on WHU-Urban3D dataset, you can edit the parameter in the file wuhan.sh, and run:

sh scripts/wuhan.sh

For KITTI-360 dataset, you can edit the parameter in the file kitti360.sh, and run:

sh scripts/kitti360.sh

About

Cross-modal Semantic Transfer for Point Cloud Semantic Segmentation

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published