Generalizing run-time tiling with the loop chain abstraction. Michelle Mills Strout, Fabio Luporini, et al. 2014. IPDPS 2014.
Empirical performance model-driven data layout optimization and library call selection for tensor contraction expressions. Qingda Lu, Xiaoyang Gao, et al. 2012. JPDC.
Loop transformations: Convexity, pruning and optimization. Louis-Noël Pouchet, Uday Bondhugula, et al. 2011. POPL 2011.
Combined iterative and model-driven optimization in an automatic parallelization framework. Louis-Noël Pouchet, Uday Bondhugula, et al. 2010. SC 2010.