Critical lock analysis: Diagnosing critical section bottlenecks in multithreaded applications. Guancheng Chen, Per Stenstrom. SC 2012.
Auto-tuning Spark Big Data Workloads on POWER8: Prediction-Based Dynamic SMT Threading. Zhen Jia, Chao Xue, et al. PACT 2016.
Breaking the boundary for whole-system performance optimization of big data. Yan Li, Kun Wang, et al. ISLPED 2013.
Correlation-based performance analysis for full-system MapReduce optimization. Qi Guo, Yan Li, et al. Big Data 2013.
PATer: A hardware prefetching automatic tuner on IBM POWER8 processor. Minghua Li, Guancheng Chen, et al. IEEE Computer Architecture Letters, 2016.