inference.plus
Bringing State-of-the-Art AI Models to Intel® NPUs
A breakdown of our joint work with Intel to land whisper-large-v3-turbo, qwen-3, and phi-4-mini on the NPU, covering compiler patches, INT4 weight packing, and Windows packaging for AI PC developers.
Jul 29, 2025