Swagath Venkataramani

Title

Principal Research Scientist, AIU Architecture and Compilers

Publications

Eliminating Redundancy: Ultra-compact Code Generation for Programmable Dataflow Accelerators
- - Prasanth Chatarasi
  - Alex Gatea
  - et al.
- 2026
- CGO 2026
Breaking the HBM Bit Cost Barrier: Domain-Specific ECC for AI Inference Infrastructure
- - Rui Xie
  - Asad Ul Haq
  - et al.
- 2025
- IEEE Computer Architecture Letters
MixTrain: accelerating DNN training via input mixing
- - Sarada Krithivasan
  - Sanchari Sen
  - et al.
- 2024
- Frontiers in Artificial Intelligence
A Software-Assisted Peak Current Regulation Scheme to Improve Power-Limited Inference Performance in a 5nm AI SoC
- - Monodeep Kar
  - Joel Silberman
  - et al.
- 2024
- ISSCC 2024
Power-Limited Inference Performance Optimization Using a Software-Assisted Peak Current Regulation Scheme in a 5-nm AI SoC
- - Monodeep Kar
  - Joel Silberman
  - et al.
- 2024
- IEEE Journal of Solid-State Circuits
DNNDaSher: A Compiler Framework for Dataflow Compatible End-to-End Acceleration on IBM AIU
- - Sanchari Sen
  - Shubham Jain
  - et al.
- 2024
- IEEE Micro
Deep Compression of Pre-trained Transformer Models
- - Naigang Wang
  - Chi-Chun Liu
  - et al.
- 2022
- NeurIPS 2022
Approximate computing and the efficient machine learning expedition
- - Jörg Henkel
  - Hai Li
  - et al.
- 2022
- ICCAD 2022
OnSRAM: Efficient Inter-Node On-Chip Scratchpad Management in Deep Learning Accelerators
- - Subhankar Pal
  - Swagath Venkataramani
  - et al.
- 2022
- Transactions on Embedded Computing Systems
Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization
- - Andrea Fasoli
  - Chia-Yu Chen
  - et al.
- 2022
- INTERSPEECH 2022

Top collaborators

Alberto Mannari

Software Developer

Matthew Ziegler

Principal Research Scientist

Xiaodong Cui

Principal Research Scientist

Prasanth Chatarasi

Staff Research Scientist, AIU Accelerator Compilers and Architecture

Swagath Venkataramani

Title

Publications

Eliminating Redundancy: Ultra-compact Code Generation for Programmable Dataflow Accelerators

Breaking the HBM Bit Cost Barrier: Domain-Specific ECC for AI Inference Infrastructure

MixTrain: accelerating DNN training via input mixing

A Software-Assisted Peak Current Regulation Scheme to Improve Power-Limited Inference Performance in a 5nm AI SoC

Power-Limited Inference Performance Optimization Using a Software-Assisted Peak Current Regulation Scheme in a 5-nm AI SoC

DNNDaSher: A Compiler Framework for Dataflow Compatible End-to-End Acceleration on IBM AIU

Deep Compression of Pre-trained Transformer Models

Approximate computing and the efficient machine learning expedition

OnSRAM: Efficient Inter-Node On-Chip Scratchpad Management in Deep Learning Accelerators

Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization

Patents

Programmable Multicast Protocol For Ring-topology Based Artificial Intelligence Systems

System And Method For Consensus-based Representation And Error Checking For Neural Networks

Single Function To Perform Combined Convolution And Select Operations

Reformatting Of Tensors To Provide Sub-tensors

Sparse Systolic Array Design

Exploiting Fine-grained Structured Weight Sparsity In Systolic Arrays

Programmable Data Delivery To A System Of Shared Processing Elements With Shared Memory

Reducing The Cost Of N Modular Redundancy For Neural Networks

Hybrid Data-model Parallelism For Efficient Deep Learning

System-aware Selective Quantization For Performance Optimized Distributed Deep Learning

Top collaborators

Alberto Mannari

Matthew Ziegler

Xiaodong Cui

Prasanth Chatarasi