INVITED: Accelerator Design for Deep Learning Training: Extended Abstract: Invited | Ankur Agrawal, Chia-Yu Chen, et al. | 2017 | DAC 2017
24 Feb 2025 | US12236338 | Single Function To Perform Combined Matrix Multiplication And Bias Add Operations
11 Nov 2024 | US12141513 | Method To Map Convolutional Layers Of Deep Neural Network On A Plurality Of Processing Elements With SIMD Execution Units, Private Memories, And Connected As A 2D Systolic Processor Array
30 Apr 2024 | TWI840790 | Single Function To Perform Combined Matrix Multiplication And Bias Add Operations
21 Apr 2024 | JP7477249 | System-aware Selective Quantization For Performance Optimized Distributed Deep Learning