Skip to content

Commit 17736e1

Browse files
committed
released version 1.0
1 parent 90c0ae3 commit 17736e1

14 files changed

+1720
-1
lines changed

README.md

+81-1
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,84 @@
11
# VSE-HAL
22
Code release for "Improved Text-Image Matching by Mitigating Visual Semantic Hubs" [\[arxiv\]](https://arxiv.org/pdf/1911.10097v1.pdf) at AAAI 2020.
33

4-
[coming soon]
4+
5+
6+
## Train
7+
8+
Run `train.py`
9+
10+
#### COCO
11+
12+
##### w/o global weighting
13+
14+
```bash
15+
python3 train.py \
16+
--data_path "data/data/resnet_precomp" \
17+
--vocab_path "data/vocab/" \
18+
--data_name coco_precomp \
19+
--batch_size 512 \
20+
--learning_rate 0.001 \
21+
--lr_update 8 \
22+
--num_epochs 13 \
23+
--img_dim 2048 \
24+
--logger_name runs/COCO \
25+
--local_alpha 30.00 \
26+
--local_ep 0.3
27+
```
28+
29+
##### with global weighting
30+
31+
```bash
32+
python3 train.py \
33+
--data_path "data/data/resnet_precomp" \
34+
--vocab_path "data/vocab/" \
35+
--data_name coco_precomp \
36+
--batch_size 512 \
37+
--learning_rate 0.001 \
38+
--lr_update 8 \
39+
--num_epochs 13 \
40+
--img_dim 2048 \
41+
--logger_name runs/COCO_mb \
42+
--local_alpha 30.00 \
43+
--local_ep 0.3 \
44+
--memory_bank \
45+
--global_alpha 40.00 \
46+
--global_beta 40.00 \
47+
--global_ep_posi 0.20 \
48+
--global_ep_nega 0.10 \
49+
--mb_rate 0.05 \
50+
--mb_k 250
51+
```
52+
53+
#### Flickr30k
54+
55+
```bash
56+
python3 train.py \
57+
--data_path "data/data" \
58+
--vocab_path "data/vocab/" \
59+
--data_name f30k_precomp \
60+
--batch_size 128 \
61+
--learning_rate 0.001 \
62+
--lr_update 8 \
63+
--num_epochs 13 \
64+
--logger_name runs/f30k \
65+
--local_alpha 60.00 \
66+
--local_ep 0.7
67+
```
68+
69+
## Evaluate
70+
71+
run `compute_results.py`
72+
73+
#### COCO
74+
75+
```bash
76+
python3 compute_results.py --data_path data/data/resnet_precomp --fold5 --model_path runs/COCO/model_best.pth.tar
77+
```
78+
79+
#### Flickr30k
80+
81+
```bash
82+
python3 compute_results.py --data_path data/data --model_path runs/f30k/model_best.pth.tar
83+
```
84+

compute_results.py

+17
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,17 @@
1+
from vocab import Vocabulary
2+
import evaluation
3+
4+
import argparse
5+
6+
parser = argparse.ArgumentParser()
7+
parser.add_argument('--model_path', default='$RUN_PATH/coco_vse/model_best.pth.tar', help='path to model')
8+
parser.add_argument('--data_path', default='data/data', help='path to datasets')
9+
parser.add_argument('--fold5', action='store_true',
10+
help='Use fold5')
11+
parser.add_argument('--save_embeddings', action='store_true',
12+
help='save_embeddings')
13+
parser.add_argument('--save_csv', default='')
14+
15+
opt_eval = parser.parse_args()
16+
17+
evaluation.evalrank(opt_eval, split='test')

0 commit comments

Comments
 (0)