Commit
[SPARK-50640][CORE][TESTS] Update `ChecksumBenchmark` by removing `PureJavaCrc32C` and setting `Adler32` as a baseline

### What changes were proposed in this pull request?

This PR aims to update `ChecksumBenchmark` by

- Removing `PureJavaCrc32C`
- Setting `Adler32` as a baseline

### Why are the changes needed?

Neither the Apache Spark nor the Apache Hadoop community uses that legacy code on Java 9+. Hadoop stopped using it in 2018, starting with Hadoop 3.1.0 (HADOOP-15033).

- apache/hadoop#291

We can save our resources by removing obsolete code usage and focusing on our available options.

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

Manual review. I also attached the Apple Silicon results.

**Java 17**

```
[info] OpenJDK 64-Bit Server VM 17.0.13+11-LTS on Mac OS X 15.3
[info] Apple M3 Max
[info] Checksum Algorithms:    Best Time(ms)   Avg Time(ms)   Stdev(ms)   Rate(M/s)   Per Row(ns)   Relative
[info] ------------------------------------------------------------------------------------------------------
[info] Adler32                          8689           8709          28         0.0     8485001.2       1.0X
[info] CRC32                            3201           3205           4         0.0     3125877.4       2.7X
[info] CRC32C                           3199           3205           5         0.0     3124264.6       2.7X
```

**Java 21**

```
[info] OpenJDK 64-Bit Server VM 21.0.5+11-LTS on Mac OS X 15.3
[info] Apple M3 Max
[info] Checksum Algorithms:    Best Time(ms)   Avg Time(ms)   Stdev(ms)   Rate(M/s)   Per Row(ns)   Relative
[info] ------------------------------------------------------------------------------------------------------
[info] Adler32                          9208           9226          20         0.0     8991732.4       1.0X
[info] CRC32                            3238           3357         105         0.0     3162007.9       2.8X
[info] CRC32C                           3224           3351         110         0.0     3147966.1       2.9X
```

### Was this patch authored or co-authored using generative AI tooling?

No.

Closes #49258 from dongjoon-hyun/SPARK-50640.

Authored-by: Dongjoon Hyun <[email protected]>
Signed-off-by: Dongjoon Hyun <[email protected]>
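
### Illustrative sketch (not part of the patch)

The comparison reported above can be approximated outside Spark with the three JDK classes involved. The following is a minimal sketch, not Spark's `ChecksumBenchmark`: the class name `ChecksumTimingSketch`, the helpers `checksumOf` and `bestTimeNanos`, the 64 MiB buffer, and the iteration count are all assumptions made for illustration. It assumes Java 9+ (for `java.util.zip.CRC32C`) and reports `Relative` as the `Adler32` best time divided by each algorithm's best time, which is consistent with the ratios in the tables above.

```java
import java.util.Random;
import java.util.function.Supplier;
import java.util.zip.Adler32;
import java.util.zip.CRC32;
import java.util.zip.CRC32C;
import java.util.zip.Checksum;

public class ChecksumTimingSketch {

    // Hypothetical helper: compute a checksum over the whole buffer.
    static long checksumOf(Supplier<Checksum> factory, byte[] data) {
        Checksum checksum = factory.get();
        checksum.update(data, 0, data.length);
        return checksum.getValue();
    }

    // Hypothetical helper: best wall-clock time (ns) over several runs.
    // This is a naive timer with no JIT warm-up control, unlike a real benchmark harness.
    static long bestTimeNanos(Supplier<Checksum> factory, byte[] data, int iterations) {
        long best = Long.MAX_VALUE;
        long sink = 0; // accumulate results so the work is not optimized away
        for (int i = 0; i < iterations; i++) {
            long start = System.nanoTime();
            sink += checksumOf(factory, data);
            best = Math.min(best, System.nanoTime() - start);
        }
        if (sink == 42) {
            System.out.println("unlikely sink value"); // keeps `sink` observable
        }
        return best;
    }

    public static void main(String[] args) {
        // Assumed workload: 64 MiB of pseudo-random bytes, 10 timed runs each.
        byte[] data = new byte[64 * 1024 * 1024];
        new Random(0).nextBytes(data);
        int iterations = 10;

        long adler = bestTimeNanos(Adler32::new, data, iterations); // baseline
        long crc32 = bestTimeNanos(CRC32::new, data, iterations);
        long crc32c = bestTimeNanos(CRC32C::new, data, iterations);

        System.out.printf("Adler32  %10d ns  %.1fX%n", adler, 1.0);
        System.out.printf("CRC32    %10d ns  %.1fX%n", crc32, (double) adler / crc32);
        System.out.printf("CRC32C   %10d ns  %.1fX%n", crc32c, (double) adler / crc32c);
    }
}
```

Timings from a sketch like this depend on the JVM, hardware, and warm-up, so they should not be expected to match the figures in the commit message.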