qwen3moe instead of qwen3_moe?
I found that you named the architecture qwen3moe, while the original one is qwen3_moe.
This makes the model incompatible with several frameworks, including Transformers.
Would it be possible to rename it to the standard name? (Or is that not possible because the name is hardcoded in llama.cpp?)
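For reference, the name appears to be stored in the GGUF metadata under the `general.architecture` key, so it is at least easy to confirm what the file actually contains. A minimal sketch, assuming the `gguf` Python package from llama.cpp's gguf-py is installed and using a placeholder file name:

```python
# Minimal sketch: read the architecture string stored in the GGUF metadata.
# Assumes `pip install gguf` (llama.cpp's gguf-py package); the file name below
# is a placeholder, not the actual checkpoint name.
from gguf import GGUFReader

reader = GGUFReader("model.gguf")
field = reader.get_field("general.architecture")
# For string fields, the part referenced by `data` holds the raw UTF-8 bytes.
arch = bytes(field.parts[field.data[0]]).decode("utf-8")
print(arch)  # expected to print "qwen3moe" for this checkpoint
```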
Same issue.
Yes, it is not possible to load it using vllm 10.0.1.1:
Value error, The checkpoint you are trying to load has model type `qwen3moe` but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.
Is there a way to change the gguf file to rename it?
Or maybe somehow change the vllm code to also register the config for Qwen3MoeForCausalLM under qwen3moe?
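In case it helps, here is an untested sketch of the kind of alias I have in mind, done on the Transformers side rather than inside vllm: it registers the qwen3moe name against the existing Qwen3-MoE classes before the checkpoint is resolved. The class names are assumed from recent Transformers releases, and vLLM has its own model registry on top of this, so it may not be enough there.

```python
# Untested sketch: alias the model type "qwen3moe" to the existing Qwen3-MoE
# config/model classes so AutoConfig/AutoModelForCausalLM can resolve it.
# Class names assumed from recent Transformers releases; run this in the same
# process before the checkpoint is loaded. May not be sufficient for vLLM.
from transformers import (
    AutoConfig,
    AutoModelForCausalLM,
    Qwen3MoeConfig,
    Qwen3MoeForCausalLM,
)


class Qwen3MoeAliasConfig(Qwen3MoeConfig):
    # Same config as upstream, but under the name baked into the GGUF metadata.
    model_type = "qwen3moe"


class Qwen3MoeAliasForCausalLM(Qwen3MoeForCausalLM):
    # Point config_class at the alias so the Auto* registration checks pass.
    config_class = Qwen3MoeAliasConfig


AutoConfig.register("qwen3moe", Qwen3MoeAliasConfig)
AutoModelForCausalLM.register(Qwen3MoeAliasConfig, Qwen3MoeAliasForCausalLM)
```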