Wow, AI models can run on your phone without losing smarts! Researchers from Nanjing University and Microsoft AI present EdgeRazor — a lightweight framework that uses mixed-precision quantization-aware distillation. It assigns different bit-widths to different parts of the
EdgeRazor: AI Models Run on Phones via Mixed-Precision Quantization
By
–
