Seminar by Giuseppe Pagano: A Practical SIMD Implementation and Analysis of Power of Two Quantization on Modern Hardware
Abstract: The rising use of AI in the last few years has greatly reshaped day-to-day productivity in many industries and caused a surge in the demand for AI-capable hardware. To reduce the imposing requirements that running a state of the art model entails many quantization techniques have been introduced.Among them there is Power of Two […]