| Documentation | Users Forum | #sig-spyre |
IBM Spyre is the first production-grade Artificial Intelligence Unit (AIU) accelerator born out of the IBM Research AIU family, and is part of a long-term strategy of developing novel architectures and full-stack technology solutions for the emerging space of generative AI. Spyre builds on the foundation of IBM’s internal AIU research and delivers a scalable, efficient architecture for accelerating AI in enterprise environments.
The vLLM Spyre plugin (vllm-spyre
) is a dedicated backend extension that enables seamless integration of IBM Spyre Accelerator with vLLM. It follows the architecture described in vLLM's Plugin System, making it easy to integrate IBM's advanced AI acceleration into existing vLLM workflows.
For more information, check out the following:
- 📚 Meet the IBM Artificial Intelligence Unit
- 📽️ AI Accelerators: Transforming Scalability & Model Efficiency
- 🚀 Spyre Accelerator for IBM Z
Visit our documentation:
We welcome and value any contributions and collaborations. Please check out Contributing to vLLM Spyre for how to get involved.
You can reach out for discussion or support in the #sig-spyre
channel in the vLLM Slack workspace or by opening an issue.