Kimi-K2 (non-thinking) QAT

#35
by geboh67859 - opened

Any plans for a new non-thinking model release with QAT or a Kimi-K2-Instruct-0905 refresh with QAT? This would make inference and deployment a lot easier and faster!

Sign up or log in to comment