A work-stealing scheduler for X10's task parallelism with suspensionOlivier TardieuHaichuan Wanget al.2012PPoPP 2012
Large-scale fast Fourier transform on a heterogeneous multi-core systemYan LiJeffrey R. Diamondet al.2012IJHPCA
Providing source code level portability between CPU and GPU with MapCGChun-Tao HongDe-Hao Chenet al.2012Journal of Computer Science and Technology