PSNet: Perspective-sensitive convolutional network for object detection

This is the official implementation of PSNet (NeuroComputing), an object detection method that is robust to perspective variances. For more details, please refer to:

PSNet: Perspective-sensitive convolutional network for object detection [Paper]
XinZhang, Yicheng Liu, Chunlei Huo, Nuo Xu, Lingfeng Wang, Chunhong Pan

Get Started

Preparation

Clone the code

git clone https://github.com/XinZhangNLPR/NC_PSNet.git

Download the model weight used in the paper to /models/res101/spacenet/:

ResNet101

Installation

Python 2.7 or 3.6
Pytorch 0.4.0 (now it does not support 0.4.1 or higher)
CUDA 8.0 or higher

Pretrained Model (If you want to train the model by yourself)

We used ResNet101 in our experiments. You can download the pretrain model from:

ResNet101: Dropbox, VT Server

Download them and put them into the data/pretrained_model/.

Compilation

As pointed out by ruotianluo/pytorch-faster-rcnn, choose the right -arch in make.sh file, to compile the cuda code:

GPU model	Architecture
TitanX (Maxwell/Pascal)	sm_52
GTX 960M	sm_50
GTX 1080 (Ti)	sm_61
Grid K520 (AWS g2.2xlarge)	sm_30
Tesla K80 (AWS p2.xlarge)	sm_37

More details about setting the architecture can be found here or here

Install all the python dependencies using pip:

pip install -r requirements.txt

Compile the cuda dependencies using following simple commands:

cd lib
sh make.sh

It will compile all the modules you need, including NMS, ROI_Pooing, ROI_Align and ROI_Crop. The default version is compiled with Python 2.7, please compile by yourself if you are using a different python version.

As pointed out in this issue, if you encounter some error during the compilation, you might miss to export the CUDA paths to your environment.

Train

Before training, set the right directory to save and load the trained models. Change the arguments "save_dir" and "load_dir" in trainval_net.py and test_net.py to adapt to your environment.

To train a faster R-CNN model with vgg16 on pascal_voc, simply run:

CUDA_VISIBLE_DEVICES=$GPU_ID python trainval_net.py \
                   --dataset pascal_voc --net vgg16 \
                   --bs $BATCH_SIZE --nw $WORKER_NUMBER \
                   --lr $LEARNING_RATE --lr_decay_step $DECAY_STEP \
                   --cuda

where 'bs' is the batch size with default 1. Alternatively, to train with resnet101 on pascal_voc, simple run:

 CUDA_VISIBLE_DEVICES=$GPU_ID python trainval_net.py \
                    --dataset pascal_voc --net res101 \
                    --bs $BATCH_SIZE --nw $WORKER_NUMBER \
                    --lr $LEARNING_RATE --lr_decay_step $DECAY_STEP \
                    --cuda

Above, BATCH_SIZE and WORKER_NUMBER can be set adaptively according to your GPU memory size. On Titan Xp with 12G memory, it can be up to 4.

If you have multiple (say 8) Titan Xp GPUs, then just use them all! Try:

python trainval_net.py --dataset pascal_voc --net vgg16 \
                       --bs 24 --nw 8 \
                       --lr $LEARNING_RATE --lr_decay_step $DECAY_STEP \
                       --cuda --mGPUs

Change dataset to "coco" or 'vg' if you want to train on COCO or Visual Genome.

Test

If you want to evlauate the detection performance of a pre-trained vgg16 model on pascal_voc test set, simply run

python test_net.py --dataset pascal_voc --net vgg16 \
                   --checksession $SESSION --checkepoch $EPOCH --checkpoint $CHECKPOINT \
                   --cuda

Specify the specific model session, chechepoch and checkpoint, e.g., SESSION=1, EPOCH=6, CHECKPOINT=416.

Demo

If you want to run detection on your own images with a pre-trained model, download the pretrained model listed in above tables or train your own models at first, then add images to folder $ROOT/images, and then run

python demo.py --net vgg16 \
               --checksession $SESSION --checkepoch $EPOCH --checkpoint $CHECKPOINT \
               --cuda --load_dir path/to/model/directoy

Then you will find the detection results in folder $ROOT/images.

Citation

@article{zhang2022psnet,
  title={PSNet: Perspective-sensitive convolutional network for object detection},
  author={Zhang, Xin and Liu, Yicheng and Huo, Chunlei and Xu, Nuo and Wang, Lingfeng and Pan, Chunhong},
  journal={Neurocomputing},
  volume={468},
  pages={384--395},
  year={2022},
  publisher={Elsevier}
}

Name		Name	Last commit message	Last commit date
Latest commit History 568 Commits
cfgs		cfgs
command		command
lib		lib
models/res101/spacenet		models/res101/spacenet
pic		pic
.gitignore		.gitignore
Network.jpg		Network.jpg
PCA_VIS.py		PCA_VIS.py
README.md		README.md
_init_paths.py		_init_paths.py
affine_ransac.py		affine_ransac.py
affine_transform.py		affine_transform.py
align_transform.py		align_transform.py
demo.py		demo.py
demo_from_list.py		demo_from_list.py
eval_from_pkl.py		eval_from_pkl.py
geo_distance.py		geo_distance.py
requirements.txt		requirements.txt
roi_pooling_test.py		roi_pooling_test.py
test_net.py		test_net.py
trainval_net.py		trainval_net.py
view_computer.py		view_computer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PSNet: Perspective-sensitive convolutional network for object detection

Get Started

Preparation

Installation

Pretrained Model (If you want to train the model by yourself)

Compilation

Train

Test

Demo

Citation

About

Releases

Packages

Contributors 18

Languages

XinZhangRadar/NC_PSNet

Folders and files

Latest commit

History

Repository files navigation

PSNet: Perspective-sensitive convolutional network for object detection

Get Started

Preparation

Installation

Pretrained Model (If you want to train the model by yourself)

Compilation

Train

Test

Demo

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 18

Languages

Packages