Skip to content

Commit 07abbd6

Browse files
committed
Revert "[Feature] Integrate new deepgemm (vllm-project#19820)"
This reverts commit c6e3bba. Signed-off-by: yewentao256 <[email protected]>
1 parent 0d06b53 commit 07abbd6

File tree

8 files changed

+268
-234
lines changed

8 files changed

+268
-234
lines changed

benchmarks/kernels/benchmark_moe.py

Lines changed: 0 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -86,9 +86,6 @@ def benchmark_config(
8686
(num_experts, 2 * shard_intermediate_size), dtype=torch.float32
8787
)
8888
w2_scale = torch.randn((hidden_size, num_experts), dtype=torch.float32)
89-
if use_deep_gemm:
90-
# we use the default block shape for deepgemm
91-
block_quant_shape = [128, 128]
9289
if use_fp8_w8a8:
9390
if block_quant_shape:
9491
block_n, block_k = block_quant_shape[0], block_quant_shape[1]

0 commit comments

Comments
 (0)