Running benchmarks... Threads: 8 QoS: Utility Determining FP64 Neon FMLA performance... Repetitions: 200000000 Duration (s): 1.4984 GOPS: 128.137 Determining FP32 Neon FMLA performance... Repetitions: 200000000 Duration (s): 1.47843 GOPS: 259.735 Determining FP16 Neon FMLA performance... Repetitions: 200000000 Duration (s): 1.48216 GOPS: 518.162 Determining BF16-BF16-FP32 BFMMLA Neon performance Repetitions: 40000000 Duration (s): 1.7223 GOPS: 178.366 Determining FP32 SSVE FMLA (Z accumulation) performance... Repetitions: 30000000 Duration (s): 10.2792 GOPS: 22.4141 Detemining FP64 SSVE FMLA (Z accumulation) performance... Repetitions: 30000000 Duration (s): 10.2801 GOPS: 11.2061 Determining FP32 AMX performance... Repetitions: 10000000 Duration (s): 2.28629 GOPS: 358.31 Determining FP32 SME FMOPA performance (1 tile)... Repetitions: 25000000 Duration (s): 9.14371 GOPS: 358.366 Determining FP32 SME FMOPA performance (2 tiles)... Repetitions: 25000000 Duration (s): 9.14624 GOPS: 358.267 Determining FP32 SME FMOPA performance (4 tiles)... Repetitions: 25000000 Duration (s): 9.14562 GOPS: 358.292 Determining FP32 SME predicated (8/16) FMOPA performance (4 tiles)... Repetitions: 25000000 Duration (s): 9.14308 GOPS: 179.196 Determining FP32 SME predicated (15/16) FMOPA performance (4 tiles)... Repetitions: 25000000 Duration (s): 9.14477 GOPS: 335.93 Determining FP32 SME FMOPA performance (4 tiles, reordering)... Repetitions: 25000000 Duration (s): 9.14364 GOPS: 358.369 Determining FP32 SME SMSTART-SMSTOP performance (8 instructions per block).. Repetitions: 60000000 Duration (s): 8.23116 GOPS: 238.858 Determining FP32 SME SMSTART-SMSTOP performance (16 instructions per block)... Repetitions: 60000000 Duration (s): 13.7159 GOPS: 286.686 Determining FP32 SME SMSTART-SMSTOP performance (32 instructions per block)... Repetitions: 60000000 Duration (s): 24.6901 GOPS: 318.522 Determining FP32 SME SMSTART-SMSTOP performance (64 instructions per block)... Repetitions: 60000000 Duration (s): 46.6365 GOPS: 337.26 Determining FP32 SME SMSTART-SMSTOP performance (128 instructions per block)... Repetitions: 60000000 Duration (s): 90.5466 GOPS: 347.415 Determining FP16-FP16-FP32 SME FMOPA performance... Repetitions: 15000000 Duration (s): 10.9758 GOPS: 358.257 Determining BF16-BF16-FP32 SME BFMOPA performance... Repetitions: 15000000 Duration (s): 10.9712 GOPS: 358.407 Determining FP64 SME FMOPA performance ... Repetitions: 25000000 Duration (s): 9.14605 GOPS: 89.5687 Determining I8-I8-I32 SME SMOPA performance... Repetitions: 15000000 Duration (s): 10.9718 GOPS: 716.775 Determining I16-I16-I32 SME FMOPA performance... Repetitions: 15000000 Duration (s): 10.9701 GOPS: 358.444 Determining FP32 SME FMLA performance... Repetitions: 50000000 Duration (s): 9.1464 GOPS: 179.131 Determining FP64 SME FMLA performance... Repetitions: 50000000 Duration (s): 9.14582 GOPS: 89.571 Determining BF16-BF16-FP32 SME BFDOT performance... Repetitions: 25000000 Duration (s): 9.14601 GOPS: 179.138