A high-performance SIMD floating point unit for BlueGene/L: Architecture, compilation, and algorithm designLeonardo BachegaSiddhartha Chatterjeeet al.2004PACT 2004