Google Axion CPU With GCE C4A vs. AWS Graviton4 Performance
Last week Google announced the general availability of their C4A instances powered by their in-house Axion processors. I delivered launch-day benchmarks looking at the Google Axion CPU performance with the C4A instances compared to other Google Cloud i...
AMX-AVX512 Support Merged For LLVM Clang 20 Compiler
As the latest on the compiler enablement front for Intel's next-gen Xeon "Diamond Rapids processors, LLVM Git has merged support for the AMX-AVX512 instructions for next spring's Clang 20 compiler release...
NVIDIA GH200 Grace CPU vs. AMD EPYC 9005 Turin CPU Performance
With the AMD EPYC 9005 "Turin" testing over the past month since launch I have looked at how well the new EPYC Turin CPUs compete against Intel Xeon, how Turin Dense dominates in performance and power efficiency to AmpereOne at 192 cores, and the gener...
OpenCL Headers & SDK Updated For OpenCL 3.0.17
Near the end of October OpenCL 3.0.17 was released as the newest maintenance update to this low-level compute API for cross-vendor GPUs and other accelerators. The OpenCL Headers and SDK have now been updated for the new revision...