11.4 IBM NorthPole: An Architecture for Neural Network Inference with a 12nm ChipAndrew S. CassidyJohn V. Arthuret al.2024ISSCC 2024
Neural inference at the frontier of energy, space, and timeDharmendra S. ModhaFilipp Akopyanet al.2023Science
Discovering Low-Precision Networks Close to Full-Precision Networks for Efficient InferenceJeffrey L. McKinstrySteven K. Esseret al.2019EMC2-NIPS 2019