Kimi-K2 (non-thinking) QAT
#35
by
geboh67859
- opened
Any plans for a new non-thinking model release with QAT or a Kimi-K2-Instruct-0905 refresh with QAT? This would make inference and deployment a lot easier and faster!