@hundreds-father-404: it's applying the concurrency constra... | Pants #development

witty-crayon-22786

05/15/2019, 4:27 AM

@hundreds-father-404: it's applying the concurrency constraint: can see it in

top -o cpu

locally

witty-crayon-22786

05/15/2019, 4:33 AM

The concurrency constraint only actually applies to process invokes: not

@rules
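To illustrate the distinction being drawn here, a minimal sketch follows. It is not the Pants engine: `rule`, `run_process`, and the limits are hypothetical names, and `asyncio` stands in for the real scheduler. It only shows how a bound on process invokes can coexist with an unbounded number of rule bodies in flight.

```python
import asyncio


async def run_process(sem, state):
    # Only this section is bounded; it stands in for a process invoke.
    async with sem:
        state["active_processes"] += 1
        state["peak_processes"] = max(state["peak_processes"], state["active_processes"])
        await asyncio.sleep(0.01)  # placeholder for real subprocess work
        state["active_processes"] -= 1


async def rule(sem, state):
    # The rule body is not bounded: every rule starts and holds its inputs at once.
    state["active_rules"] += 1
    state["peak_rules"] = max(state["peak_rules"], state["active_rules"])
    await run_process(sem, state)
    state["active_rules"] -= 1


async def main(num_rules=50, process_limit=4):
    # Hypothetical numbers chosen only for the demonstration.
    sem = asyncio.Semaphore(process_limit)
    state = dict(active_processes=0, peak_processes=0, active_rules=0, peak_rules=0)
    await asyncio.gather(*(rule(sem, state) for _ in range(num_rules)))
    return state


state = asyncio.run(main())
print(state["peak_rules"], state["peak_processes"])
```

In this sketch the peak number of concurrent processes stays at the limit, while the number of rules that have started (and are holding whatever they were given) grows with the number of rules requested.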

hundreds-father-404

05/15/2019, 4:43 AM

Hm wouldn’t that lead to too much memory usage? Each rule invocation for Pytest for example has to store

all_targets

in memory. Even if subprocesses are limited, you could feasibly have to store over 400 of those variables in memory at a given time, right?

witty-crayon-22786

05/15/2019, 4:45 AM

all_targets is a list containing n pointers to the same objects
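A rough sketch of what "pointers to the same objects" means in Python, assuming a hypothetical `Target` class and made-up counts (100 targets, 400 rule invocations); none of these names come from Pants itself. Each list costs roughly one pointer per element, and the objects themselves are not duplicated.

```python
import sys


class Target:
    """Hypothetical stand-in for a target; not the Pants class."""

    def __init__(self, address):
        self.address = address
        self.payload = "x" * 10_000  # pretend each target is fairly heavy


targets = [Target(f"src/python/example:{i}") for i in range(100)]

# One list per hypothetical rule invocation; each holds references, not copies.
per_rule_lists = [list(targets) for _ in range(400)]

# Every list points at the very same Target objects.
shared = all(lst[0] is targets[0] for lst in per_rule_lists)

# Size of one list object itself (pointer storage only, excluding the targets).
list_overhead = sys.getsizeof(per_rule_lists[0])

print(shared, list_overhead)
```

Holding 400 such lists adds pointer storage only, which is why the size of the referenced objects (and how many distinct ones are kept alive) matters more than the number of lists.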

witty-crayon-22786

05/15/2019, 4:46 AM

Holding it for only the roots should be fine (you don't want to hold it for the transitive graph though)

hundreds-father-404

05/15/2019, 4:52 AM

We do hold in memory all transitive deps. We have to do that per https://github.com/pantsbuild/pants/pull/7720. Regardless of what the list holds, though, my point is that the principle of not limiting the # of rule invokes seems to be a risk for memory shortages. Each rule invoke is non-trivial in memory usage. Given enough concurrent rule invokes, in principle you are bound to hit memory shortages like we’re seeing in CI. Further, some systems have more memory than others. Ideally we would not be sensitive to the amount of memory available.

witty-crayon-22786

05/15/2019, 4:54 AM

./pants list ::

in Twitter's repo results in ~500k rule invokes and nodes in the graph

witty-crayon-22786

05/15/2019, 4:55 AM

And it takes less memory than running the unit tests in pantsbuild/pants with v2 right now

witty-crayon-22786

05/15/2019, 4:55 AM

So: I suspect that the culprit is not the rules themselves, but rather what is being held by them.

witty-crayon-22786

05/15/2019, 4:56 AM

I don't have a good pointer for tracking it down tonight, but can take a look tomorrow.

witty-crayon-22786

05/15/2019, 4:58 AM

I get your point about capping concurrently executing rules: I just suspect that that is not the issue in this case.

hundreds-father-404

05/15/2019, 4:59 AM

Okay sounds good. I’m afk all night so no worries. Cc @average-vr-56795 if you have any ideas for how to investigate / profile what’s going on.

witty-crayon-22786

05/15/2019, 5:11 PM

regarding this issue: one thing that is potentially peculiar is that after applying all of the outstanding patches, even for running a single target,

--native-engine-visualize-to

ends up spending a huge amount of time in

__repr__

... ie, i think that there might be some very large things
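One way to check the "very large things" hunch on a small case is to measure how long `repr` takes and how much text it produces for a few suspect values. The sketch below is only an illustration: `repr_cost` and the sample values are hypothetical and stand in for whatever objects the rules actually hold.

```python
import time


def repr_cost(obj):
    """Return (length of repr text, seconds spent building it)."""
    start = time.perf_counter()
    text = repr(obj)
    elapsed = time.perf_counter() - start
    return len(text), elapsed


# Hypothetical values: a small one and a deliberately oversized nested one.
samples = {
    "small": tuple(range(10)),
    "large": tuple(tuple(range(1000)) for _ in range(200)),
}

report = {name: repr_cost(obj) for name, obj in samples.items()}

for name, (length, seconds) in report.items():
    print(f"{name}: {length} chars in {seconds:.6f}s")
```

A value whose repr runs to hundreds of thousands of characters is a reasonable candidate for what is making the visualization slow, and possibly for what is being held in memory.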

witty-crayon-22786

05/15/2019, 5:11 PM

@hundreds-father-404: would suggest comparing/contrasting a tiny test run before/after

witty-crayon-22786

05/15/2019, 5:12 PM

also, using https://github.com/benfred/py-spy to attach is very helpful

👍 1

hundreds-father-404

05/15/2019, 5:12 PM

Before and after all the patches?

witty-crayon-22786

05/15/2019, 5:15 PM

um, the potentially relevant ones?

witty-crayon-22786

05/15/2019, 5:15 PM

or each of them? could bisect?

👍 1

witty-crayon-22786

05/15/2019, 5:15 PM

after a run with

--native-engine-visualize-to

, i change into the relevant directory and then do:

export DOTFILE=graph.000.dot; dot -Tpdf -O "${DOTFILE}" && open "${DOTFILE}.pdf"

to view as PDF

👌 1

witty-crayon-22786

05/15/2019, 5:16 PM

but make sure it is a small usecase, because the result is large regardless

hundreds-father-404

05/15/2019, 5:56 PM

@witty-crayon-22786 how would I hook up Pyspy to

./pants

? We don’t directly call

python3.6 my_program.py

witty-crayon-22786

05/15/2019, 6:05 PM

sudo py-spy --pid $pid

👌 1

witty-crayon-22786

05/15/2019, 6:58 PM

any luck here?

hundreds-father-404

05/15/2019, 6:58 PM

Only now getting into it. Was making breakfast and taking a break

witty-crayon-22786

05/15/2019, 6:59 PM

whoops, sorry.
