vllm.model_executor.models.glm4_moe_lite ¶
Inference-only GLM-4.7-Flash model compatible with HuggingFace weights.
vllm.model_executor.models.glm4_moe_lite ¶Inference-only GLM-4.7-Flash model compatible with HuggingFace weights.