Double Product combine1: Maximum use of data abstraction: 22.16 cycles/element Double Product combine2: Take vec_length() out of loop: 15.78 cycles/element Double Product combine3: Array reference to vector data: 5.02 cycles/element Double Product combine4: Array reference, accumulate in temporary: 5.00 cycles/element Double Product combine4p: Pointer reference, accumulate in temporary: 5.00 cycles/element Double Product Array code, unrolled by 2: 5.00 cycles/element Double Product combine5p: Pointer code, unrolled by 3, for loop: 5.00 cycles/element Function Array code, unrolled by 3, while loop, Should be 1, Got -1 Double Product Array code, unrolled by 3, while loop: 5.00 cycles/element Double Product Array code, unrolled by 4: 5.00 cycles/element Double Product Array code, unrolled by 8: 5.01 cycles/element Double Product Array code, unrolled by 16: 5.00 cycles/element Double Product Pointer code, unrolled by 2: 5.00 cycles/element Double Product Pointer code, unrolled by 3: 5.00 cycles/element Double Product Pointer code, unrolled by 4: 5.00 cycles/element Double Product Pointer code, unrolled by 8: 5.02 cycles/element Double Product Pointer code, unrolled by 16: 5.00 cycles/element Double Product combine6: Array code, unrolled by 2, Superscalar x2: 2.50 cycles/element Double Product Array code, unrolled by 4, Superscalar x2: 2.50 cycles/element Double Product Array code, unrolled by 8, Superscalar x2: 2.52 cycles/element Double Product Array code, unrolled by 3, Superscalar x3: 2.00 cycles/element Double Product Array code, unrolled by 4, Superscalar x4: 2.00 cycles/element Double Product Array code, unrolled by 8, Superscalar x4: 2.16 cycles/element Double Product Array code, unrolled by 6, Superscalar x6: 2.13 cycles/element Double Product Array code, unrolled by 8, Superscalar x8: 2.10 cycles/element Double Product Array code, unrolled by 10, Superscalar x10: 2.09 cycles/element Double Product Array code, unrolled by 12, Superscalar x6: 2.06 cycles/element Double Product Array code, unrolled by 12, Superscalar x12: 2.10 cycles/element Double Product Pointer code, unrolled by 8, Superscalar x2: 2.52 cycles/element Double Product Pointer code, unrolled by 8, Superscalar x4: 2.10 cycles/element Double Product Pointer code, unrolled by 8, Superscalar x8: 2.05 cycles/element Double Product Pointer code, unrolled by 9, Superscalar x3: 2.03 cycles/element Double Product Array code, Unroll x2, Superscalar x2, noninterleaved: 2.50 cycles/element Double Product Array code, unrolled by 2, different associativity: 2.84 cycles/element Double Product Array code, unrolled by 3, Different Associativity: 2.11 cycles/element Double Product Array code, unrolled by 4, Different Associativity: 2.06 cycles/element Double Product Array code, unrolled by 6, Different Associativity: 2.10 cycles/element Double Product Array code, unrolled by 8, Different Associativity: 2.10 cycles/element Double Product SSE code, 1*VSIZE-way parallelism: 2.50 cycles/element Double Product SSE code, 2*VSIZE-way parallelism: 2.02 cycles/element Double Product SSE code, 4*VSIZE-way parallelism: 2.09 cycles/element Double Product SSE code, 8*VSIZE-way parallelism: 2.01 cycles/element Double Product SSE code, 12*VSIZE-way parallelism: 2.01 cycles/element Double Product SSE code, 2*VSIZE-way parallelism, reassociate: 2.02 cycles/element Double Product SSE code, 4*VSIZE-way parallelism, reassociate: 2.11 cycles/element Double Product SSE code, 8*VSIZE-way parallelism, reassociate: 2.04 cycles/element