Introduction
This project addresses the resource management problems of AIGC instances by providing an AIGC serverless gateway built on the auto-scaling capabilities of cloud-native architecture.
The gateway has the following features:
User management. Each user has their own AIGC instance, and the gateway maintains the mapping between users and instances.
User-level resource management. AIGC compute instances are created when a user logs in and destroyed when the user goes offline, while the user's data is preserved.
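The user-to-instance mapping that the gateway maintains can be pictured as a small lookup table. The following is a minimal sketch, not the project's actual implementation: the class names, the `aigc-<user>` naming scheme, and the `online` flag are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class UserEntry:
    """Gateway-side record for one user (field names are illustrative)."""
    instance_name: str    # assumed to stay the same across sessions
    online: bool = False  # True while a compute instance should exist


class UserInstanceMap:
    """Keeps one entry per user; entries survive the user going offline."""

    def __init__(self) -> None:
        self._entries: Dict[str, UserEntry] = {}

    def on_login(self, user_id: str) -> UserEntry:
        # First login creates the entry; later logins reuse it.
        entry = self._entries.setdefault(
            user_id, UserEntry(instance_name=f"aigc-{user_id}")
        )
        entry.online = True
        return entry

    def on_offline(self, user_id: str) -> None:
        entry = self._entries.get(user_id)
        if entry is not None:
            entry.online = False  # the mapping itself is kept

    def lookup(self, user_id: str) -> Optional[UserEntry]:
        return self._entries.get(user_id)
```

Keeping the entry after the user goes offline is what would let a later login find the same instance name again.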
Design
User Usage Flow
As shown in the figure below, AIGC-Gateway can manage multiple AIGC model collections. When user A logs in, the user sends a request to the gateway and selects a model; the gateway returns the access endpoint of the corresponding instance so that the user can connect to it.
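The lookup in this flow, from a user and a chosen model to an access endpoint, might be sketched as below. This is an illustration under assumptions: `ModelCollection`, `Gateway.resolve`, and the example endpoint URL are hypothetical names, not taken from the project.

```python
from typing import Dict


class ModelCollection:
    """One AIGC model and the endpoints of its per-user instances."""

    def __init__(self, model: str) -> None:
        self.model = model
        self.endpoints: Dict[str, str] = {}  # user_id -> access endpoint


class Gateway:
    """Routes a (user, model) request to the matching instance endpoint."""

    def __init__(self) -> None:
        self._collections: Dict[str, ModelCollection] = {}

    def register(self, collection: ModelCollection) -> None:
        self._collections[collection.model] = collection

    def resolve(self, user_id: str, model: str) -> str:
        """Return the endpoint of the user's instance for the chosen model."""
        collection = self._collections.get(model)
        if collection is None:
            raise KeyError(f"unknown model collection: {model}")
        endpoint = collection.endpoints.get(user_id)
        if endpoint is None:
            raise LookupError(f"no instance for {user_id} in {model}")
        return endpoint
```

Separating the two failure cases (unknown model versus missing instance) lets a caller decide whether to reject the request or to create an instance.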
New User Online
As shown in the figure below, when new user B logs in and selects an AIGC model, the gateway calls the instance collection interface to create a new instance for that user.
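A create-on-first-login step could look roughly like the sketch below. The `InstanceCollection` protocol, the in-memory backend, and the `<model>-<user>` naming scheme are assumptions for illustration; the real instance collection interface is not described in this document.

```python
from typing import Dict, Protocol


class InstanceCollection(Protocol):
    """Hypothetical interface to one model's set of instances."""

    def exists(self, name: str) -> bool: ...
    def create(self, name: str) -> None: ...


class InMemoryCollection:
    """Stand-in backend for illustration; a real one would call a cluster API."""

    def __init__(self) -> None:
        self.running: Dict[str, bool] = {}

    def exists(self, name: str) -> bool:
        return name in self.running

    def create(self, name: str) -> None:
        self.running[name] = True


def handle_login(collection: InstanceCollection, user_id: str, model: str) -> str:
    """Create the user's instance if it is missing; repeated logins change nothing."""
    name = f"{model}-{user_id}"  # assumed naming scheme
    if not collection.exists(name):
        collection.create(name)
    return name
```

Checking for an existing instance before creating one keeps the login handler idempotent, so a retried request does not create a second instance.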
Old User Offline
As shown in the figure below, when old user A logs out or the session expires, the gateway calls the instance collection interface to scale in that specific instance, releasing its computing resources while preserving its storage resources.
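The logout and session-expiry trigger might be sketched as a periodic sweep over last-activity timestamps. Everything here is an assumption for illustration: the `SessionTracker` class, the TTL value, and the `release` callback that stands in for the call to the instance collection interface.

```python
from typing import Callable, Dict, List


class SessionTracker:
    """Tracks last activity per user and reports sessions past a TTL."""

    def __init__(self, ttl_seconds: float) -> None:
        self.ttl = ttl_seconds
        self._last_seen: Dict[str, float] = {}

    def touch(self, user_id: str, now: float) -> None:
        self._last_seen[user_id] = now

    def logout(self, user_id: str) -> None:
        # Explicit logout: the session counts as expired on the next sweep.
        self._last_seen[user_id] = float("-inf")

    def sweep(self, now: float, release: Callable[[str], None]) -> List[str]:
        """Call release(user_id) for every expired session, then forget it."""
        expired = [u for u, seen in self._last_seen.items() if now - seen > self.ttl]
        for user_id in expired:
            release(user_id)  # frees compute only; storage is not touched here
            del self._last_seen[user_id]
        return expired
```

Passing the current time in as an argument, rather than reading a clock inside the class, keeps the sweep easy to test.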
Old User Back Online
As shown in the figure below, when old user A logs in again and selects an AIGC model, the gateway calls the instance collection interface to create the corresponding instance for user A. The instance name and access endpoint are the same as before, and the data on the mounted persistent storage disk is not lost.
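One way to get a stable name and endpoint across sessions is to derive both from inputs that never change, such as the user ID and the model. The sketch below illustrates that idea with in-memory stand-ins; the naming scheme, the `aigc.example.com` domain, and the `disks` dictionary are assumptions, not the project's actual design.

```python
from typing import Dict, Set


class Gateway:
    """In-memory model of compute that comes and goes and disks that stay."""

    def __init__(self) -> None:
        self.compute: Set[str] = set()              # names of running instances
        self.disks: Dict[str, Dict[str, str]] = {}  # instance name -> persisted files

    @staticmethod
    def instance_name(user_id: str, model: str) -> str:
        # Derived only from stable inputs, so it is identical on every login.
        return f"{model}-{user_id}"

    def login(self, user_id: str, model: str) -> str:
        name = self.instance_name(user_id, model)
        self.compute.add(name)
        self.disks.setdefault(name, {})  # reuse the existing disk if there is one
        return f"https://{name}.aigc.example.com"  # placeholder endpoint

    def logout(self, user_id: str, model: str) -> None:
        # Only the compute instance is removed; the disk entry is kept.
        self.compute.discard(self.instance_name(user_id, model))
```

Because the disk is keyed by the same derived name, a recreated instance finds the data written during the earlier session.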