Saturn: Optimized Training of Multiple Large Deep Learning Models

Saturn is a novel system for training multiple deep learning models efficiently. For each submitted job, it automatically selects a parallelization technique, determines an optimized resource allocation, and constructs an execution schedule. Applying Saturn to hyperparameter optimization or model selection requires only a few lines of code.
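To make the three decisions above concrete, here is a minimal toy sketch of the kind of joint choice involved: picking a parallelization technique and GPU count per job, then placing the jobs on a shared set of GPUs. This is not Saturn's API or its solver. Every name (`ESTIMATES`, `makespan`, `best_plan`), the job and technique labels, and all runtime numbers are hypothetical and chosen only for illustration.

```python
from itertools import product

TOTAL_GPUS = 4  # hypothetical single node with 4 GPUs

# Hypothetical runtime estimates in seconds:
# job -> {(technique, gpu_count): estimated runtime}
ESTIMATES = {
    "job_a": {
        ("data_parallel", 2): 60.0,
        ("data_parallel", 4): 35.0,
        ("pipeline", 2): 50.0,
    },
    "job_b": {
        ("data_parallel", 2): 40.0,
        ("pipeline", 4): 30.0,
    },
}


def makespan(plan):
    """Place jobs in the given order, each as early as enough GPUs are free."""
    free_at = [0.0] * TOTAL_GPUS  # time at which each GPU becomes idle
    end = 0.0
    for job, (technique, gpus) in plan:
        free_at.sort()
        start = free_at[gpus - 1]  # earliest time `gpus` GPUs are all idle
        finish = start + ESTIMATES[job][(technique, gpus)]
        for i in range(gpus):
            free_at[i] = finish
        end = max(end, finish)
    return end


def best_plan():
    """Exhaustively try every (technique, gpu_count) choice per job."""
    jobs = list(ESTIMATES)
    choices = [list(ESTIMATES[job]) for job in jobs]
    best = None
    for combo in product(*choices):
        plan = list(zip(jobs, combo))
        cost = makespan(plan)
        if best is None or cost < best[0]:
            best = (cost, plan)
    return best


if __name__ == "__main__":
    cost, plan = best_plan()
    print(cost, plan)
```

With these made-up numbers, running each job alone on all four GPUs is fastest per job, yet the shortest overall schedule runs both jobs side by side on two GPUs each. The sketch enumerates every combination and keeps a fixed job order, which only works for a handful of jobs.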

Saturn is designed to be extensible: users can specify new execution procedures, which are then included in its optimization plan and search space. In this way, you can keep up with the latest advances in model execution optimizations without having to wait for library updates.

Install Saturn

To install Saturn, please read the instructions in INSTALL.md. We're always excited to hear about new use cases and your experience with Saturn, so feel free to contact us at [email protected] if you want to share news.

Framework Support

We currently prioritize PyTorch support, but Saturn's general techniques are framework-independent. We would welcome contributions for TensorFlow and JAX.

Contributing

We welcome contributions to Saturn. Areas of particular interest are an alternative solver (e.g. using reinforcement learning), new interfaces, dashboards, and ways to support online job submissions. Please let us know if you encounter any bugs or have any suggestions by submitting an issue.

You can join the Slack here: https://join.slack.com/t/saturn-dl/shared_invite/zt-267mfi3s4-ifUYLiJUtaVeGFcYe9vbxA

Documentation

You can find the docs for Saturn here.

How to Cite this Work

If you use this system in an academic work, please cite our tech report as follows.

@article{nagrechasaturn,
  title={Saturn: An Optimized Data System for Multi-Large-Model Deep Learning Workloads (Information System Architectures)},
  author={Nagrecha, Kabir and Kumar, Arun}
}

The Team

Saturn is currently developed and maintained by Kabir Nagrecha at UCSD.

License

Saturn is released under the Apache License 2.0.
