Analyzing source code vulnerabilities in the D2A dataset with ML ensembles and C-BERT
- Saurabh Pujar
- Yunhui Zheng
- et al.
- 2024
- Empirical Software Engineering
Currently working in the Hybrid Cloud department of IBM Research, my team is involved in multiple aspects of future cloud platforms: using OpenShift/Kubernetes based computing for training and using large AI models; developing identity-based zero-trust networking architectures; contributing to adoption of system attestation in cloud environments; and development of multi-cluster control planes to extend the resources available to applications.
As part of the Watson/Deep QA project I was responsible for scaling out Watson's computation over thousands of compute cores in order to achieve the speed needed to be competitive in the 2011 IBM-Jeopardy! exhibition. For that effort our team heavily utilized the Apache UIMA framework that we had been developing since its inception.
As a Research Staff Member and manager within IBM's large vocabulary speech recognition research department, I contributed custom DSP-based computing hardware and software used for IBM's Tangora system prototype, and later VoiceType and ViaVoice speech products.
Before coming to IBM, I worked for Technicon Instruments on development of white blood cell differential and CBC systems, including the Hemalog D, H6000 and H-1 products.