MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchAutoVec<uint8_t>/127
|
-80.41% |
277.820 |
54.437 |
0.000 |
-0.00% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC15
|
-73.56% |
54.434 |
14.391 |
0.001 |
-0.00% |
0.001 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC15
|
-73.56% |
54.432 |
14.391 |
0.001 |
0.00% |
0.001 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC31
|
-70.31% |
80.091 |
23.776 |
0.001 |
0.00% |
0.001 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC31
|
-70.31% |
80.088 |
23.776 |
0.000 |
-0.00% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForBigLoopWithReductionAutoVecTC15
|
-69.06% |
165.802 |
51.305 |
0.001 |
-0.00% |
0.001 |
SingleSource/Benchmarks/Shootout/Shootout-matrix
|
-66.26% |
8.266 |
2.789 |
0.000 |
-0.00% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForBigLoopWithReductionAutoVecTC31
|
-62.69% |
209.601 |
78.211 |
0.090 |
0.00% |
0.090 |
SingleSource/Benchmarks/Shootout-C++/Shootout-C++-matrix
|
-62.63% |
3.609 |
1.349 |
0.000 |
-0.06% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchAutoVec<uint16_t>/127
|
-61.32% |
198.975 |
76.961 |
0.065 |
0.00% |
0.065 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC8
|
-58.93% |
35.037 |
14.390 |
0.001 |
-0.00% |
0.001 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC8
|
-58.93% |
35.038 |
14.391 |
0.000 |
0.00% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC7
|
-54.91% |
31.913 |
14.391 |
0.000 |
-0.00% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC16
|
-54.90% |
31.911 |
14.391 |
0.000 |
0.00% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC7
|
-54.90% |
31.910 |
14.390 |
0.001 |
-0.00% |
0.001 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC16
|
-54.90% |
31.909 |
14.391 |
0.000 |
0.00% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC63
|
-54.67% |
99.392 |
45.050 |
0.002 |
0.00% |
0.002 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC63
|
-54.59% |
99.198 |
45.048 |
0.003 |
-0.00% |
0.003 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC31
|
-52.88% |
65.072 |
30.660 |
0.004 |
-0.00% |
0.004 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForBigLoopWithReductionAutoVecTC63
|
-49.88% |
263.413 |
132.024 |
0.076 |
-0.02% |
0.076 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC15
|
-48.54% |
47.419 |
24.403 |
0.001 |
0.00% |
0.001 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForBigLoopWithReductionAutoVecTC8
|
-46.41% |
95.729 |
51.305 |
0.004 |
-0.00% |
0.004 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC32
|
-42.43% |
41.296 |
23.776 |
0.001 |
-0.00% |
0.001 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC32
|
-42.43% |
41.296 |
23.776 |
0.000 |
0.00% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC127
|
-40.40% |
142.732 |
85.073 |
0.011 |
-0.02% |
0.011 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC127
|
-40.36% |
142.679 |
85.092 |
0.002 |
-0.00% |
0.002 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForBigLoopWithReductionAutoVecTC7
|
-40.15% |
85.718 |
51.305 |
0.001 |
-0.00% |
0.001 |
MicroBenchmarks/ImageProcessing/Blur/blur.test:BENCHMARK_boxBlurKernel/256
|
-40.03% |
3129.777 |
1876.922 |
88.987 |
-6.16% |
88.987 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC4
|
-39.71% |
23.869 |
14.391 |
0.000 |
0.00% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC4
|
-39.47% |
23.776 |
14.391 |
0.000 |
0.00% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC63
|
-37.84% |
70.077 |
43.561 |
0.051 |
0.03% |
0.051 |
MicroBenchmarks/ImageProcessing/Blur/blur.test:BENCHMARK_boxBlurKernel/512
|
-37.69% |
11481.979 |
7154.850 |
285.468 |
-6.34% |
285.468 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForBigLoopWithReductionAutoVecTC127
|
-36.19% |
376.043 |
239.938 |
0.097 |
-0.06% |
0.097 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC16
|
-36.06% |
38.167 |
24.402 |
0.000 |
0.00% |
0.000 |
MultiSource/Benchmarks/BitBench/uuencode/uuencode
|
-35.10% |
0.055 |
0.036 |
0.000 |
-0.06% |
0.000 |
MicroBenchmarks/ImageProcessing/Blur/blur.test:BENCHMARK_boxBlurKernel/128
|
-34.36% |
640.513 |
420.413 |
29.277 |
0.98% |
29.277 |
SingleSource/Benchmarks/Stanford/FloatMM
|
-33.06% |
0.609 |
0.407 |
0.000 |
-0.04% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchAutoVec<uint32_t>/127
|
-31.31% |
173.757 |
119.352 |
0.074 |
-0.13% |
0.074 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC3
|
-30.64% |
20.749 |
14.391 |
0.000 |
-0.00% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC3
|
-30.30% |
20.647 |
14.390 |
0.000 |
0.00% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC8
|
-30.30% |
35.008 |
24.402 |
0.001 |
-0.00% |
0.001 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC64
|
-29.97% |
64.329 |
45.049 |
0.002 |
0.00% |
0.002 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC64
|
-29.93% |
64.292 |
45.049 |
0.001 |
0.00% |
0.001 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<1, GreaterThanZero, Mid>
|
-28.19% |
24994.526 |
17947.576 |
0.539 |
-0.00% |
0.539 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<1, GreaterThanZero, Last>
|
-28.19% |
24993.402 |
17948.002 |
0.806 |
-0.00% |
0.806 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<1, GreaterThanZero, First>
|
-28.18% |
24993.194 |
17949.123 |
0.268 |
0.00% |
0.268 |
MultiSource/Benchmarks/DOE-ProxyApps-C/SimpleMOC/SimpleMOC
|
-27.01% |
14.399 |
10.510 |
0.012 |
-0.22% |
0.012 |
MicroBenchmarks/ImageProcessing/Blur/blur.test:BENCHMARK_boxBlurKernel/1024
|
-26.52% |
38003.366 |
27925.900 |
451.394 |
-4.82% |
451.394 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC32
|
-24.10% |
40.670 |
30.870 |
0.151 |
0.18% |
0.151 |
SingleSource/Benchmarks/Linpack/linpack-pc
|
-23.42% |
9.049 |
6.929 |
0.005 |
0.07% |
0.005 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint16_t_To_uint8_t_
|
-23.08% |
48.803 |
37.541 |
0.000 |
-0.00% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC2
|
-20.69% |
18.145 |
14.391 |
0.000 |
0.00% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC2
|
-20.69% |
18.145 |
14.391 |
0.000 |
-0.00% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC128
|
-19.83% |
105.923 |
84.916 |
0.085 |
0.05% |
0.085 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, EqZero, First>
|
-18.76% |
20514.365 |
16665.474 |
0.638 |
-0.00% |
0.638 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, EqZero, Last>
|
-18.74% |
20509.281 |
16665.278 |
0.549 |
-0.00% |
0.549 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<2, EqZero, Mid>
|
-18.74% |
20509.107 |
16665.585 |
1.043 |
-0.00% |
1.043 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC128
|
-18.62% |
104.558 |
85.092 |
0.006 |
0.00% |
0.006 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<1, EqZero, First>
|
-18.60% |
27560.183 |
22435.259 |
0.737 |
-0.00% |
0.737 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<1, EqZero, Last>
|
-18.59% |
27558.993 |
22434.891 |
0.462 |
-0.00% |
0.462 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<1, EqZero, Mid>
|
-18.59% |
27557.531 |
22435.302 |
0.964 |
0.00% |
0.964 |
MultiSource/Benchmarks/TSVC/Equivalencing-dbl/Equivalencing-dbl
|
-18.45% |
7.056 |
5.755 |
0.322 |
-5.86% |
0.322 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint16_t_To_uint8_t_
|
-15.82% |
31.962 |
26.905 |
0.000 |
0.00% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint32_t_To_uint8_t_
|
-15.00% |
100.111 |
85.092 |
0.004 |
-0.00% |
0.004 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC127
|
-14.06% |
80.087 |
68.829 |
0.005 |
0.00% |
0.005 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForBigLoopWithReductionAutoVecTC16
|
-13.68% |
59.440 |
51.310 |
0.001 |
0.01% |
0.001 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC7
|
-13.33% |
28.157 |
24.402 |
0.001 |
0.00% |
0.001 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint32_t_To_uint16_t_
|
-13.11% |
76.335 |
66.324 |
0.002 |
-0.00% |
0.002 |
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchAutoVec<uint8_t>/65
|
-13.00% |
62.587 |
54.448 |
0.003 |
-0.00% |
0.003 |
MicroBenchmarks/SLPVectorization/SLPVectorizationBenchmarks.test:benchmark_xor_runtime_checks_pass<4, int>
|
-12.50% |
10.013 |
8.762 |
0.000 |
-0.01% |
0.000 |
SingleSource/Benchmarks/Shootout/Shootout-heapsort
|
-12.43% |
23.014 |
20.154 |
0.040 |
-10.01% |
0.040 |
MicroBenchmarks/LCALS/SubsetALambdaLoops/lcalsALambda.test:BM_ENERGY_CALC_LAMBDA/171
|
-12.07% |
16.830 |
14.799 |
0.012 |
-0.35% |
0.012 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, EqZero, First>
|
-11.99% |
16024.663 |
14102.644 |
0.315 |
-0.00% |
0.315 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, EqZero, Last>
|
-11.99% |
16024.269 |
14102.318 |
0.033 |
-0.01% |
0.033 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, EqZero, Mid>
|
-11.98% |
16022.872 |
14102.535 |
0.357 |
0.00% |
0.357 |
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchAutoVec<uint16_t>/65
|
-11.96% |
73.205 |
64.446 |
0.001 |
-0.00% |
0.001 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint16_t_To_uint32_t_
|
-11.78% |
63.186 |
55.739 |
1.455 |
-5.44% |
1.455 |
MicroBenchmarks/LCALS/SubsetARawLoops/lcalsARaw.test:BM_ENERGY_CALC_RAW/171
|
-11.26% |
16.672 |
14.794 |
0.146 |
-0.08% |
0.146 |
MicroBenchmarks/LCALS/SubsetBRawLoops/lcalsBRaw.test:BM_MULADDSUB_RAW/171
|
-11.25% |
0.497 |
0.441 |
0.004 |
1.05% |
0.004 |
MicroBenchmarks/SLPVectorization/SLPVectorizationBenchmarks.test:benchmark_xor_runtime_checks_fail<4, int>
|
-11.22% |
11.262 |
9.998 |
0.078 |
-0.69% |
0.078 |
MicroBenchmarks/LCALS/SubsetARawLoops/lcalsARaw.test:BM_FIR_RAW/171
|
-11.04% |
1.312 |
1.167 |
0.001 |
0.06% |
0.001 |
MicroBenchmarks/LCALS/SubsetALambdaLoops/lcalsALambda.test:BM_FIR_LAMBDA/171
|
-11.04% |
1.312 |
1.167 |
0.000 |
-0.00% |
0.000 |
MultiSource/Benchmarks/Rodinia/srad/srad
|
-10.72% |
2.165 |
1.933 |
0.064 |
-6.64% |
0.064 |
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint8_t>/127
|
-10.72% |
157.678 |
140.782 |
0.003 |
0.00% |
0.003 |
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint16_t>/127
|
-10.72% |
157.677 |
140.782 |
0.002 |
-0.00% |
0.002 |
SingleSource/Benchmarks/Shootout-C++/Shootout-C++-heapsort
|
-10.60% |
22.543 |
20.154 |
0.070 |
-9.56% |
0.070 |
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchAutoVec<uint32_t>/65
|
-10.39% |
96.362 |
86.348 |
0.082 |
0.00% |
0.082 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint32_t_To_uint16_t_
|
-10.26% |
48.806 |
43.800 |
0.000 |
-0.00% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint64_t_To_uint16_t_
|
-9.86% |
171.441 |
154.540 |
0.009 |
-0.00% |
0.009 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1LoopWithReductionTC3
|
-9.49% |
13.134 |
11.888 |
0.000 |
0.00% |
0.000 |
MultiSource/Benchmarks/TSVC/InductionVariable-flt/InductionVariable-flt
|
-9.08% |
11.386 |
10.353 |
0.050 |
-1.12% |
0.050 |
MicroBenchmarks/LCALS/SubsetBLambdaLoops/lcalsBLambda.test:BM_MULADDSUB_LAMBDA/171
|
-9.01% |
0.497 |
0.452 |
0.008 |
2.53% |
0.008 |
SingleSource/Benchmarks/BenchmarkGame/puzzle
|
-8.93% |
0.908 |
0.827 |
0.019 |
-4.92% |
0.019 |
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint8_t>/65
|
-8.82% |
106.371 |
96.985 |
0.002 |
-0.00% |
0.002 |
MicroBenchmarks/LoopVectorization/LoopEpilogueVectorizationBenchmarks.test:benchReductionAutoVec<uint16_t>/65
|
-8.82% |
106.365 |
96.983 |
0.001 |
-0.00% |
0.001 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForBigLoopWithReductionAutoVecTC32
|
-8.76% |
85.719 |
78.214 |
0.079 |
-0.00% |
0.079 |
SingleSource/Benchmarks/Polybench/linear-algebra/blas/trmm/trmm
|
-8.67% |
61.313 |
55.999 |
1.913 |
-7.66% |
1.913 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForBigLoopWithReductionAutoVecTC4
|
-7.87% |
55.685 |
51.305 |
0.002 |
0.00% |
0.002 |
MultiSource/Benchmarks/TSVC/GlobalDataFlow-dbl/GlobalDataFlow-dbl
|
-7.83% |
18.760 |
17.292 |
0.243 |
-6.03% |
0.243 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint32_t_To_uint8_t_
|
-7.77% |
64.446 |
59.441 |
0.003 |
0.00% |
0.003 |
SingleSource/Benchmarks/Shootout-C++/Shootout-C++-hash
|
-7.66% |
3.709 |
3.425 |
0.010 |
-7.49% |
0.010 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint64_t_To_uint8_t_
|
-7.65% |
204.595 |
188.953 |
0.004 |
0.00% |
0.004 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint64_t_To_uint32_t_
|
-7.54% |
132.648 |
122.643 |
0.005 |
-0.00% |
0.005 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint16_t_To_uint64_t_
|
-7.40% |
99.328 |
91.978 |
0.258 |
0.00% |
0.258 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForBigLoopTC1
|
-7.27% |
15.518 |
14.391 |
0.000 |
0.00% |
0.000 |
SingleSource/Benchmarks/BenchmarkGame/spectral-norm
|
-6.94% |
1.696 |
1.578 |
0.000 |
-7.60% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint8_t_To_uint32_t_
|
-6.82% |
82.592 |
76.961 |
0.003 |
0.00% |
0.003 |
SingleSource/Benchmarks/Shootout/Shootout-hash
|
-6.79% |
39.164 |
36.505 |
0.032 |
-5.62% |
0.032 |
MultiSource/Benchmarks/Ptrdist/ks/ks
|
-6.73% |
3.202 |
2.987 |
0.001 |
-0.01% |
0.001 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForBigLoopWithReductionAutoVecTC64
|
-6.64% |
141.402 |
132.018 |
0.011 |
0.00% |
0.011 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint32_t_To_uint64_t_
|
-6.58% |
106.209 |
99.223 |
1.353 |
0.38% |
1.353 |
SingleSource/Benchmarks/Polybench/medley/nussinov/nussinov
|
-6.48% |
292.271 |
273.344 |
0.945 |
-8.76% |
0.945 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4LoopWithReductionTC7
|
-6.26% |
28.781 |
26.980 |
0.258 |
-1.17% |
0.258 |
SingleSource/Benchmarks/CoyoteBench/fftbench
|
-6.25% |
10.351 |
9.705 |
0.159 |
-3.23% |
0.159 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForBigLoopWithReductionAutoVecTC128
|
-5.66% |
254.027 |
239.653 |
0.119 |
-0.02% |
0.119 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW1LoopWithReductionTC8
|
-5.60% |
23.490 |
22.175 |
0.045 |
-0.95% |
0.045 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC2
|
-5.57% |
14.595 |
13.783 |
0.009 |
0.03% |
0.009 |
MultiSource/Benchmarks/tramp3d-v4/tramp3d-v4
|
-5.51% |
1.120 |
1.058 |
0.016 |
-2.15% |
0.016 |
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_EOS_LAMBDA/171
|
-5.33% |
0.931 |
0.881 |
0.001 |
-0.07% |
0.001 |
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_EOS_RAW/171
|
-5.27% |
0.931 |
0.882 |
0.001 |
0.03% |
0.001 |
MicroBenchmarks/LoopInterchange/LoopInterchange.test:BENCHMARK_LI1
|
-5.19% |
1550.412 |
1469.887 |
26.101 |
-1.27% |
26.101 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC1
|
-5.17% |
11.888 |
11.273 |
0.191 |
-0.77% |
0.191 |
MicroBenchmarks/LCALS/SubsetARawLoops/lcalsARaw.test:BM_FIR_RAW/5001
|
-5.12% |
44.564 |
42.283 |
0.542 |
-2.02% |
0.542 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint8_t_To_uint64_t_
|
-5.07% |
98.860 |
93.851 |
0.101 |
0.00% |
0.101 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint64_t_To_uint16_t_
|
-4.88% |
111.161 |
105.741 |
0.004 |
0.00% |
0.004 |
MicroBenchmarks/LCALS/SubsetALambdaLoops/lcalsALambda.test:BM_FIR_LAMBDA/5001
|
-4.87% |
44.428 |
42.264 |
0.325 |
-3.43% |
0.325 |
MicroBenchmarks/harris/harris.test:BENCHMARK_HARRIS/256/256
|
-4.86% |
10188.463 |
9693.308 |
164.151 |
-2.98% |
164.151 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC3
|
-4.77% |
17.740 |
16.894 |
0.001 |
-0.00% |
0.001 |
MicroBenchmarks/LCALS/SubsetBRawLoops/lcalsBRaw.test:BM_INIT3_RAW/171
|
-4.74% |
0.469 |
0.446 |
0.007 |
0.09% |
0.007 |
SingleSource/Benchmarks/Shootout-C++/Shootout-C++-hash2
|
-4.73% |
9.042 |
8.615 |
0.010 |
-4.29% |
0.010 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForLoopWithReductionAutoVecTC64
|
-4.63% |
45.675 |
43.559 |
0.038 |
-0.19% |
0.038 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint16_t_To_uint64_t_
|
-4.55% |
143.897 |
137.344 |
0.005 |
0.00% |
0.005 |
SingleSource/Benchmarks/Polybench/medley/deriche/deriche
|
-4.52% |
16.064 |
15.338 |
0.194 |
-2.09% |
0.194 |
MultiSource/Applications/JM/lencod/lencod
|
-4.48% |
29.155 |
27.849 |
0.435 |
-3.22% |
0.435 |
SingleSource/Benchmarks/Polybench/stencils/adi/adi
|
-4.48% |
141.920 |
135.565 |
0.634 |
-2.71% |
0.634 |
MicroBenchmarks/LCALS/SubsetBLambdaLoops/lcalsBLambda.test:BM_INIT3_LAMBDA/171
|
-4.35% |
0.459 |
0.439 |
0.005 |
0.21% |
0.005 |
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_FIRST_DIFF_LAMBDA/5001
|
-4.29% |
21.222 |
20.312 |
0.143 |
-0.76% |
0.143 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC2
|
-4.22% |
15.017 |
14.384 |
0.003 |
0.00% |
0.003 |
MultiSource/Benchmarks/McCat/12-IOtest/iotest
|
-4.17% |
1.458 |
1.398 |
0.011 |
-1.44% |
0.011 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchAutoVecForLoopTC1
|
-4.17% |
15.020 |
14.394 |
0.001 |
0.00% |
0.001 |
MultiSource/Benchmarks/TSVC/NodeSplitting-dbl/NodeSplitting-dbl
|
-4.08% |
25.520 |
24.478 |
0.361 |
-13.02% |
0.361 |
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_PIC_1D_RAW/171
|
-3.86% |
7.935 |
7.629 |
0.002 |
0.03% |
0.002 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint8_t_To_uint16_t_
|
-3.85% |
48.807 |
46.927 |
0.000 |
-0.00% |
0.000 |
MultiSource/Benchmarks/MiBench/telecomm-gsm/telecomm-gsm
|
-3.81% |
0.684 |
0.658 |
0.002 |
-0.01% |
0.002 |
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_MAT_X_MAT_LAMBDA/5001
|
-3.80% |
1008152.663 |
969797.997 |
10554.537 |
-6.81% |
10554.537 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint8_t_To_uint16_t_
|
-3.64% |
34.412 |
33.160 |
0.002 |
0.00% |
0.002 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC4
|
-3.41% |
20.081 |
19.396 |
0.000 |
0.00% |
0.000 |
MultiSource/Applications/spiff/spiff
|
-3.41% |
9.995 |
9.654 |
0.098 |
-1.64% |
0.098 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1BigLoopWithReductionTC1
|
-3.33% |
18.770 |
18.145 |
0.001 |
0.00% |
0.001 |
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_HYDRO_1D_RAW/5001
|
-3.32% |
17.004 |
16.439 |
0.038 |
-2.99% |
0.038 |
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_EOS_LAMBDA/5001
|
-3.13% |
30.039 |
29.100 |
0.327 |
-1.37% |
0.327 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<31, LessThanZero, Mid>
|
-2.98% |
8048.964 |
7809.419 |
0.797 |
-0.01% |
0.797 |
MultiSource/Benchmarks/FreeBench/pcompress2/pcompress2
|
-2.97% |
0.407 |
0.395 |
0.003 |
-1.09% |
0.003 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<15, EqZero, First>
|
-2.89% |
6646.153 |
6453.819 |
0.239 |
-0.01% |
0.239 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopWithReductionTC1
|
-2.86% |
21.899 |
21.273 |
0.027 |
-0.17% |
0.027 |
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_HYDRO_1D_LAMBDA/171
|
-2.77% |
0.455 |
0.442 |
0.000 |
-0.05% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopWithReductionTC1
|
-2.76% |
21.900 |
21.294 |
0.008 |
0.05% |
0.008 |
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_HYDRO_1D_RAW/171
|
-2.70% |
0.455 |
0.442 |
0.000 |
-0.10% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopWithVW8From_uint8_t_To_uint16_t_
|
-2.65% |
70.701 |
68.826 |
0.002 |
-0.01% |
0.002 |
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_PIC_1D_LAMBDA/171
|
-2.62% |
7.883 |
7.677 |
0.007 |
-0.18% |
0.007 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopWithReductionTC3
|
-2.57% |
16.689 |
16.261 |
0.003 |
0.01% |
0.003 |
MicroBenchmarks/LCALS/SubsetCRawLoops/lcalsCRaw.test:BM_FIRST_SUM_RAW/5001
|
-2.45% |
27.661 |
26.984 |
0.225 |
-3.31% |
0.225 |
MultiSource/Benchmarks/TSVC/Searching-flt/Searching-flt
|
-2.41% |
4.912 |
4.794 |
0.044 |
1.13% |
0.044 |
MultiSource/Benchmarks/SciMark2-C/scimark2
|
-2.34% |
113.775 |
111.109 |
0.029 |
-1.84% |
0.029 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<31, EqZero, Last>
|
-2.32% |
8305.106 |
8112.113 |
12.297 |
0.03% |
12.297 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecWithAddInLoopFrom_uint8_t_To_uint64_t_
|
-2.31% |
144.756 |
141.407 |
0.006 |
-0.00% |
0.006 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<63, EqZero, Mid>
|
-2.31% |
4094.577 |
4000.118 |
28.361 |
-0.70% |
28.361 |
MultiSource/Applications/hexxagon/hexxagon
|
-2.18% |
12.220 |
11.953 |
0.003 |
-1.82% |
0.003 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1BigLoopWithReductionTC2
|
-2.18% |
28.782 |
28.156 |
0.001 |
-0.00% |
0.001 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint64_t_To_uint8_t_
|
-2.17% |
143.907 |
140.777 |
0.006 |
-0.00% |
0.006 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC7
|
-2.17% |
28.781 |
28.156 |
0.003 |
0.00% |
0.003 |
MicroBenchmarks/LCALS/SubsetCLambdaLoops/lcalsCLambda.test:BM_FIRST_SUM_LAMBDA/5001
|
-2.17% |
27.703 |
27.103 |
0.052 |
0.18% |
0.052 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<31, GreaterThanZero, First>
|
-2.11% |
3354.569 |
3283.942 |
0.202 |
-0.00% |
0.202 |
MicroBenchmarks/LCALS/SubsetARawLoops/lcalsARaw.test:BM_FIR_RAW/44217
|
-2.01% |
337.409 |
330.626 |
0.039 |
-0.02% |
0.039 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC8
|
-1.99% |
31.284 |
30.660 |
0.000 |
0.00% |
0.000 |
MicroBenchmarks/LCALS/SubsetALambdaLoops/lcalsALambda.test:BM_FIR_LAMBDA/44217
|
-1.99% |
337.379 |
330.667 |
0.034 |
0.02% |
0.034 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, LessThanZero, Mid>
|
-1.97% |
32652.410 |
32007.532 |
9.589 |
-0.07% |
9.589 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopWithReductionTC2
|
-1.96% |
31.910 |
31.285 |
0.019 |
-0.10% |
0.019 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC16
|
-1.89% |
33.162 |
32.536 |
0.197 |
-0.05% |
0.197 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, LessThanZero, Last>
|
-1.85% |
43460.042 |
42657.526 |
18.213 |
-0.05% |
18.213 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, GreaterThanZero, None>
|
-1.83% |
46071.787 |
45227.708 |
15.809 |
-0.03% |
15.809 |
MicroBenchmarks/LCALS/SubsetBRawLoops/lcalsBRaw.test:BM_TRAP_INT_RAW/171
|
-1.81% |
2.558 |
2.512 |
0.000 |
-0.00% |
0.000 |
MicroBenchmarks/LCALS/SubsetBLambdaLoops/lcalsBLambda.test:BM_TRAP_INT_LAMBDA/171
|
-1.79% |
2.558 |
2.512 |
0.000 |
0.00% |
0.000 |
MultiSource/Applications/sqlite3/sqlite3
|
-1.73% |
13.297 |
13.067 |
0.069 |
-1.89% |
0.069 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, GreaterThanZero, None>
|
-1.65% |
37145.521 |
36531.632 |
30.720 |
-0.00% |
30.720 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<4, LessThanZero, Last>
|
-1.61% |
35827.466 |
35249.037 |
1.396 |
-0.00% |
1.396 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1BigLoopWithReductionTC3
|
-1.61% |
38.791 |
38.166 |
0.004 |
-0.00% |
0.004 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopFrom_uint64_t_To_uint32_t_
|
-1.59% |
82.656 |
81.341 |
0.004 |
-0.00% |
0.004 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<64, EqZero, None>
|
-1.53% |
3027.672 |
2981.299 |
8.243 |
0.13% |
8.243 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, GreaterThanZero, Mid>
|
-1.52% |
25636.774 |
25245.891 |
0.237 |
-0.00% |
0.237 |
SingleSource/Benchmarks/Polybench/linear-algebra/solvers/gramschmidt/gramschmidt
|
-1.49% |
146.613 |
144.429 |
0.161 |
-2.06% |
0.161 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopWithReductionTC3
|
-1.49% |
41.919 |
41.295 |
0.061 |
-0.07% |
0.061 |
MicroBenchmarks/LoopVectorization/LoopVectorizationBenchmarks.test:benchForTruncOrZextVecInLoopWithVW8From_uint8_t_To_uint32_t_
|
-1.43% |
82.410 |
81.232 |
0.365 |
-0.36% |
0.365 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, EqZero, Mid>
|
-1.41% |
25608.939 |
25247.137 |
0.676 |
0.00% |
0.676 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC2VW4LoopTC4
|
-1.40% |
20.941 |
20.648 |
0.001 |
0.00% |
0.001 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, GreaterThanZero, Last>
|
-1.39% |
42667.145 |
42073.434 |
14.941 |
-0.00% |
14.941 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopTC128
|
-1.38% |
639.504 |
630.682 |
0.059 |
-0.00% |
0.059 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<64, GreaterThanZero, None>
|
-1.33% |
2968.176 |
2928.727 |
0.214 |
0.00% |
0.214 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<16, GreaterThanZero, Last>
|
-1.32% |
13155.374 |
12981.182 |
0.695 |
-0.00% |
0.695 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopWithReductionTC3
|
-1.30% |
41.920 |
41.374 |
0.002 |
0.01% |
0.002 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW1BigLoopWithReductionTC4
|
-1.28% |
48.803 |
48.178 |
0.000 |
-0.00% |
0.000 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4LoopTC15
|
-1.22% |
51.307 |
50.683 |
0.112 |
0.00% |
0.112 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC1VW4BigLoopWithReductionTC4
|
-1.18% |
42.421 |
41.921 |
0.001 |
0.00% |
0.001 |
MultiSource/Benchmarks/Olden/tsp/tsp
|
-1.18% |
4.733 |
4.677 |
0.019 |
-0.98% |
0.019 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopTC64
|
-1.15% |
326.597 |
322.855 |
0.005 |
-0.00% |
0.005 |
SingleSource/Benchmarks/Misc/ReedSolomon
|
-1.12% |
21.473 |
21.233 |
0.002 |
-0.00% |
0.002 |
MicroBenchmarks/ImageProcessing/Dither/Dither.test:BENCHMARK_ORDERED_DITHER/128/3
|
-1.10% |
217.797 |
215.395 |
0.863 |
-0.85% |
0.863 |
MicroBenchmarks/SLPVectorization/SLPVectorizationBenchmarks.test:benchmark_add_xor_runtime_checks_fail<4, int>
|
-1.09% |
16.805 |
16.621 |
0.040 |
-0.41% |
0.040 |
MicroBenchmarks/ImageProcessing/Dither/Dither.test:BENCHMARK_ORDERED_DITHER/128/2
|
-1.05% |
250.050 |
247.415 |
0.937 |
-0.95% |
0.937 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<3, LessThanZero, None>
|
-1.04% |
46062.689 |
45584.941 |
119.507 |
0.85% |
119.507 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopTC127
|
-1.04% |
723.922 |
716.419 |
0.018 |
0.00% |
0.018 |
MicroBenchmarks/LoopVectorization/LoopInterleavingBenchmarks.test:benchForIC4VW4BigLoopWithReductionTC4
|
-1.03% |
51.931 |
51.397 |
0.138 |
0.04% |
0.138 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<15, GreaterThanZero, None>
|
-1.01% |
20607.023 |
20399.665 |
0.673 |
-0.00% |
0.673 |
MicroBenchmarks/MemFunctions/MemFunctions.test:BM_MemCmp<5, GreaterThanZero, None>
|
-1.00% |
32289.960 |
31966.357 |
1.318 |
0.00% |
1.318 |