# random
w
For anyone more into Rust than me, what's the fastest way (lib or no-lib) to calculate the SHA256 of a file in Rust? As a POC, in Python I was using `hashlib.file_digest`, and then in Rust, using the `sha2` library, it took 2x as long. I swapped over to `ring` as per https://rust-lang-nursery.github.io/rust-cookbook/cryptography/hashing.html and that ended up being roughly on par with hashlib, as I would have expected. Ran this over 230k files and took the total run time as my comparison point
This is one of those rare cases where "as fast as possible" actually matters, as I'll be running it a few million times
I've played around with feature flags a bit, and it changes some of the timings, but not jaw droppingly. Haven't dug into the source code of these libs to see whether the intrinsics on my machines are fully supported or not
r
I'm not a Rust expert, but based on my experience with ripgrep, maybe you need to set some SIMD feature flags?
a
Can you post your actual code? It could be something simple - like, would wrapping the file in a buffered reader help?
w
Yeah, so for the `sha2` lib, I enabled some of the feature flags that brought it roughly in line with `ring` (like, close enough that I didn't care) - otherwise, the code I used for the hashing was pulled straight from https://rust-lang-nursery.github.io/rust-cookbook/cryptography/hashing.html#calculate-the-sha-256-digest-of-a-file - and I just played with the buffer size
Now, once I used rayon, it was a different story - the absolute time across the parallel runs was super fast
a
Bear in mind that a default build doesn't enable CPU-specific flags... Try throwing in `RUSTFLAGS="-C target-cpu=native"` (env var)
w
Yeah, I had tried a run with native enabled - small difference I think, nothing crazy
I mean, this might just be the lower-bound short of writing some crazy code, and that's totally fine. Was just wondering if I had obviously missed something attempting to use those two libs. And the python-equivalent is pretty streamlined, as it's basically C-calls, other than the act of passing data between C and Python 🤷
a
`--release` + native CPU + buffered IO are the only things that jump out to me 🙂
w
Yep! Thanks! All 3 covered 🙂
p
Late to the party, but you probably want to be able to do I/O and hashing in parallel and figure out if you're I/O bound or CPU bound. SHA256 is also probably not the best algorithm for these sorts of things unless you have to conform to some existing protocol.
w
Yeah, so after some more work - it’s pretty clear that the time due to reading in the data is substantially more than hashing or anything else. This is also intentionally not wholly optimized, as I was doing a 1:1 between a Python and Rust equivalent to see which language to write the tool in. e.g. once I ran it through rayon, the time dropped substantially, but again, not really the point of the exercise
I’ll be running other experiments looking at different hashing methodologies, partial file hashes, and all that fun parametric testing later
p
I am not an expert at I/O performance, but you will probably get performance benefits from parallelizing your reads, especially if you have a lot of small files
though it certainly depends on your hardware/filesystem too
w
Yep, thanks! The "actual" way I do this will be more clever. In the test I'm running, I'm intentionally hitting multi-gig files, small K files, symlinks, hardlinks, etc. Basically a worst-case folder design vs what I will actually run into in the future