# development
a
this is my thread for this afternoon
here
i'm probably gonna try drawing some things on paper
w
would profile it probably
a
yeah
xcode installed finally
on monday i think
now i have instruments
šŸ‘ 1
w
a
omfg
i love criterion
w
then you can run just your new benchmark with eg
./build-support/bin/native/cargo bench --manifest-path=src/rust/engine/Cargo.toml --package store -- $mybenchname
a
does that have profiling? or is that a command line i can give to instruments?
i am aware there is benchmarking but not sure if profiling is included
w
i’d launch it, and then attach instruments once it has started
šŸ‘ 1
if you fiddle with the measurement_time, you can get it to basically go in a loop for a while.
and then attach.
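a rough sketch of that knob, assuming a criterion benchmark roughly like this (the group/function names and the 120s window are made up, just to show where measurement_time goes):
```
use std::time::Duration;

use criterion::{criterion_group, criterion_main, Criterion};

fn bench_snapshot_subset(c: &mut Criterion) {
    let mut group = c.benchmark_group("snapshot_subset");
    // stretch the measurement window so the benchmark keeps looping,
    // leaving plenty of time to attach Instruments to the process
    group.measurement_time(Duration::from_secs(120));
    group.sample_size(10);
    group.bench_function("large_snapshot", |b| {
        b.iter(|| {
            // placeholder for the operation under test
            (0..1_000u64).sum::<u64>()
        })
    });
    group.finish();
}

criterion_group!(benches, bench_snapshot_subset);
criterion_main!(benches);
```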
a
this is great!
and i'm rebased to today's master, so no sweat about merge conflicts; this code is very separate
ok, cool. i can see how i can slot in a snapshot subset benchmark for the large henries snapshot here
will probably extract the materialize_directory() call to a private fn
took me a second to understand
let path_buf = clean_line.split_whitespace().collect::<PathBuf>();
gonna leave a comment
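for posterity, a tiny sketch of what that line does (the input string here is made up):
```
use std::path::PathBuf;

fn main() {
    // PathBuf collects path-like items into successive path components,
    // so the whitespace-separated tokens turn into a joined path
    let clean_line = "src rust engine fs.rs"; // made-up input
    let path_buf = clean_line.split_whitespace().collect::<PathBuf>();
    assert_eq!(path_buf, PathBuf::from("src/rust/engine/fs.rs"));
}
```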
i have MODE=debug in my environment....lol. gonna make the benchmark still but that would probably affect performance
i can definitely repro the issue without MODE=debug in a criterion benchmark
instruments is so nice to use
the filesystem traces don't work though; i'm gonna google that
w
i mostly use Time Profiler
haven’t had much luck with the rest.
a
yep that is what i am settling on
thanks
the time profiler very clearly points to ingest_directory_from_sorted_path_stats taking up most of the time and being incredibly recursive
which is what the comment on the method says as well
Screen Shot 2020-05-11 at 18.08.51.png
which aligns with our earlier hunches
so to clarify a bit more
it looks like merge_directories_recursive() may also be the issue, or another issue, in that it is recursive
and it appears to be used in the snapshot subsetting operation
oh no
i was right the first time
get_snapshot_subset() => from_path_stats() => ingest_directory_from_sorted_path_stats(), which is recursive
and there are many different threads of execution that took e.g. 800ms
i think this makes sense if there's massive contention at some point, either on lmdb or on a lock somewhere
but the join_alls don't necessarily seem to end
Screen Shot 2020-05-11 at 18.27.17.png
373 ms on cloning a hashbrown table, it seems
we might be able to remove the hashmap here and work with slices
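rough illustration of the slice idea (made-up data, not the actual pants code): keep the entries in a sorted Vec and hand each recursive call a borrowed sub-slice instead of cloning a map per level
```
fn process_dir(entries: &[(String, u64)]) -> u64 {
    // entries is borrowed, so there is no per-call clone of the whole table
    entries.iter().map(|(_, size)| size).sum()
}

fn main() {
    let mut entries = vec![
        ("a/b.txt".to_string(), 10),
        ("a/c.txt".to_string(), 20),
        ("d/e.txt".to_string(), 30),
    ];
    entries.sort();
    // split the sorted slice at a prefix boundary and recurse on sub-slices
    let split = entries.partition_point(|(path, _)| path.starts_with("a/"));
    let (under_a, rest) = entries.split_at(split);
    println!("{} {}", process_dir(under_a), process_dir(rest));
}
```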
i'm beginning to think more about another ShardedLmdb to cache Digest => Vec<FileContent> instead of recomputing it each time in Store::contents_for_directory() (to improve SnapshotSubset time). playing around with it now
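roughly the shape i have in mind, with totally made-up types (Digest, FileContent, and ContentsCache here are stand-ins, not the real ShardedLmdb/Store APIs):
```
use std::collections::HashMap;

// stand-in types, just to sketch the digest-keyed cache
type Digest = [u8; 32];

struct FileContent {
    path: String,
    bytes: Vec<u8>,
}

struct ContentsCache {
    by_digest: HashMap<Digest, Vec<FileContent>>,
}

impl ContentsCache {
    // only run the expensive recursive directory walk on a cache miss;
    // repeated subset operations over the same digest then hit the cache
    fn contents_for_directory(
        &mut self,
        digest: Digest,
        compute: impl FnOnce() -> Vec<FileContent>,
    ) -> &[FileContent] {
        self.by_digest.entry(digest).or_insert_with(compute)
    }
}
```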
w
so, from what i can tell, you have only selected that last portion of the profile
what was happening during the rest?
a
great question
w
headed to dinner, but: looking at only one portion of the time will lie a bit
a
i strongly agree
thank you!
w
recommend clicking the button at the top right of the "Heaviest Stack Trace" part
šŸ‘ 1