S³NAS: Fast NPU-aware Neural Architecture Search

We conduct NAS Following three steps : Supernet design, Single-Path NAS with modification, Scaling and post-processing.

Results

We share our weight and model files at google drive.

Requirements

Access to Cloud TPUs (Official Cloud TPU Tutorial)
Tensorflow 1.15.3 for TPU, 1.13 for GPU
Python 3.5+
python-box 3.4.6

Usage

Set up ImageNet dataset

To setup the ImageNet follow the instructions from here

Or you can just copy from other bucket using gsutil -m cp -r, or transfer from other bucket.
Set up the profiled latency files
```
latency_folder
|-- Conv2D
|-- Dense
|-- GlobalAvgPool
|-- MBConvBlock
|-- MixConvBlock
    |-- r1_k3,5_s22_e2,4_i32_o32_c100_noskip_relu_imgsize112
    |-- ...
```
each latency file contains a dictionary with latency value. For example, the content of r1_k3,5_s22_e2,4_i32_o32_c100_noskip_relu_imgsize112 may be {"latency": 364425}

For blocks, imgsize indicate width/height of input image of the block. Activation function is set to be relu as default. For more information, refer latency_estimator.py and blockargs.py.

For other components, the file name rule is similar, you can refer to get_str ftns of each BasicOps. refer block_ops.py

to use our profiled latency files for MIDAP, please type
```
git submodule update --init --recursive
```
Set up flags and run

Refer to the script files in base_experiment_scripts, or set up flags yourself. When you use scripts in base_experiment_scripts, please MODIFY
- Google Cloud Storage Bucket
- Model file name
- Google Cloud TPU name
- Target latency
- Latency folder name --constraint_lut_folder=XXX
We provide script templates for NAS / train / post_process
Run the script file.

Note

When running on Multi-GPU, set --moving_average_decay=0.0

Multi-GPU in TF 1.x does not support tf.train.ExponentialMovingAverage. refer
When running on GPU, set --use_tpu=False --transpose_input=False
Setting --use_cache=False can reduce memory usage.
We didn't check the validity on GPU environment.

Citation

If it helps your research, please cite

@misc{lee2020s3nas,
    title={S3NAS: Fast NPU-aware Neural Architecture Search Methodology},
    author={Jaeseong Lee and Duseok Kang and Soonhoi Ha},
    year={2020},
    eprint={2009.02009},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
base_experiment_scripts		base_experiment_scripts
etc_utils		etc_utils
figures		figures
graph		graph
latency @ 6d5010d		latency @ 6d5010d
models/supergraphs		models/supergraphs
run		run
util		util
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
tester.py		tester.py
tester_README.md		tester_README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

S³NAS: Fast NPU-aware Neural Architecture Search

Results

Requirements

Usage

Note

Citation

About

Releases

Packages

Contributors 2

Languages

License

cap-lab/S3NAS

Folders and files

Latest commit

History

Repository files navigation

S3NAS: Fast NPU-aware Neural Architecture Search

Results

Requirements

Usage

Note

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

S³NAS: Fast NPU-aware Neural Architecture Search

Packages