# development
h
@witty-crayon-22786 would you expect the memory profile to keep expanding like this? https://gist.github.com/Eric-Arellano/cf886685f990eb5520c1a1d97a56dcc6
w
i don't know how that tool works, so not sure.
h
It profiles where memory blocks are.
`size` is the average size of a memory block, and `count` is the # of blocks allocated. Should the memory block for the `native.py` handles you just referenced continue to substantially grow both in size and in count throughout the rule, or should any memory be freed up along the way?
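For context on the size/count numbers being discussed: the profiling tool isn't named here, but the standard library's `tracemalloc` reports the same kind of per-allocation-site statistics (total bytes and block count). A minimal sketch, not necessarily the tool used above:

```python
import tracemalloc

tracemalloc.start()
# ... run the code being profiled (e.g. a pants invocation in-process) ...
snapshot = tracemalloc.take_snapshot()
for stat in snapshot.statistics("lineno")[:10]:
    # Each stat groups blocks by allocating source line:
    # stat.size is the total bytes, stat.count the number of blocks.
    print(stat)
```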
w
handles will grow and shrink, because not all handles end up "held" as Node values in the Graph: some are just temporary
so while the intermediate memory usage is interesting, you probably want to be looking at the final memory usage
one way to do that is to run with `--enable-pantsd`, and look at the memory usage of the process after running things
h
I think the issue is the intermediate memory, though. More memory keeps getting used as the rule goes on, and with enough concurrent rules, too much memory is used before any gets freed up
w
@hundreds-father-404: i don't think so.
the graph memoizes things, intentionally.
so it's very, very likely that if the wrong things end up memoized, that is the reason
things created in the bodies of `@rules` won't persist past the end of the run
but if you run with `--enable-pantsd`, you'll see that this persists.
h
> things created in the bodies of `@rules` won't persist past the end of the run
Agreed. I think the issue is that the rule body takes too much memory.
`test` is using 2.42 more megabytes than `list` during the rule, just for the memory allocated with `native.py`'s `to_value()`
w
i don't agree. but i don't know how to convince you.
so see what tools are available to experiment and see!
...my point about what persists being more important is that if you fix what persists after the run, then you will likely fix anything created during it as a side effect.
👍 1
and what persists is very easy to see... it's the stuff in the `handles` map.
h
how do you profile what's in that? You mentioned `__repr__` being in the outputted graph, but I didn't understand how you got to that insight
w
when i ran `--native-engine-visualize-to` for what should have been a relatively small graph, i could see in py-spy that it was spending a crazy amount of time in `__repr__` (minutes)
the visualization of the Graph is approximately the `repr` of everything that it holds (and in a stable state, should be ~equal to the content of the `_handles` map)
so, maybe either 1) figuring out what is huge by rendering the graph and spotting it, or 2) "analyzing" the handles to see what is heavy
Patrick previously used `objgraph`... possible that running an `objgraph` analysis on the handles map to find "large" things would work.
maybe: https://mg.pov.lt/objgraph/#reference-counting-bugs
`objgraph.show_most_common_types(objects=self._handles)` ...?
👍 1
(not because i suspect a reference counting bug: but just to get a summary of what we're holding onto)
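A minimal sketch of that summary, assuming access to the `ExternContext` instance that owns the `_handles` map referenced above (the function name is illustrative):

```python
import objgraph

def summarize_handles(context):
    """Print a type -> count table of everything currently in the handles map."""
    # objgraph prints the table directly and returns None.
    objgraph.show_most_common_types(objects=list(context._handles))
```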
gonna log off for a bit
h
Okay, I think I agree that one of my hypotheses about memory is proven wrong. I was hypothesizing that if we break up `run_python_test()` into multiple helper functions / rules, the memory usage would decrease, because `run_python_test()` would have fewer variables that it has to keep in scope throughout the lifetime of the function. Maybe I've been reading too much about how Rust works and this doesn't apply in Python, but I was thinking along the lines of a function keeping its local variables on the stack and then dropping them when it returns. If this were true, then I would expect the new `inject_init()` rule to result in the memory usage increasing less than before between the init setup and what comes right before it. But it actually slightly increases from before. I'm a bit at a loss on this memory investigation. Going to take a break and come back in an hour or so. Will reread everything you suggested
w
sounds good. would look at the content of `self._handles`, one way or the other
during a run, the coroutines/generators are themselves inside the handles map as well as the produced values... after the run, they'll be gone, and it should contain only the produced values that are held in the Graph
(and `objgraph` should show that)
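A small illustration of that lifecycle: while a generator (or `@rule` coroutine) is suspended, its frame keeps every local alive; once it completes, the locals are released. Names below are illustrative:

```python
import gc

class Payload:
    """Stand-in for a large intermediate value created inside a rule body."""

def count_payloads():
    return sum(isinstance(o, Payload) for o in gc.get_objects())

def rule_body():
    big = Payload()        # local in the generator's frame
    yield "intermediate"   # suspended here: `big` is still referenced
    yield big

gen = rule_body()
next(gen)                  # advance to the first yield and suspend
print(count_payloads())    # 1 -- kept alive by the suspended frame
gen.close()                # finishing the generator releases its locals
print(count_payloads())    # 0
```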
h
> would look at the content of `self._handles`, one way or the other
It's all just `CDataOwnGC`, which wasn't super helpful. I then had the idea to print the `obj` before it gets converted from CFFI into a `handle`. Wasn't very helpful, but now I'm thinking to instead use `sys.getsizeof(obj)`, which can show us what's so huge.
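A minimal sketch of that instrumentation: a hypothetical hook that could be called on each object passed through `to_value()` (the threshold and formatting are arbitrary, and `sys.getsizeof` only reports shallow size, so container contents aren't included):

```python
import sys

SIZE_THRESHOLD = 100  # bytes; arbitrary cutoff for "interesting" objects

def log_if_large(obj):
    """Hypothetical helper to call on each object passed through to_value()."""
    size = sys.getsizeof(obj)  # shallow size only
    if size > SIZE_THRESHOLD:
        print(f"{size:>8} bytes  {type(obj).__name__}  {repr(obj)[:80]}")
```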
By far the biggest objects are text files being materialized and the captured stdout. Gist of everything passed through `native.py`'s `to_value()` that's larger than 100 bytes: https://gist.github.com/Eric-Arellano/cf886685f990eb5520c1a1d97a56dcc6#file-size-of-objects-send Is this expected? We are materializing more now that we resolve all transitive deps
w
given a handle, you can get the thing it points to
...so i think before calling objgraph, you could look up all of the handles
hm... that link shows a bunch of 100-200 byte objects... is that what you meant to link?
...oh, and the file content of the tests themselves... that's sort of unexpected.
👍 1
not a smoking gun yet, because i think that even if we pulled all of the transitive code in, it still wouldn't add up to that much.
but yea
would recommend looking at the objects in the handles map itself after a run: `context.from_value` does the opposite of `to_value`
👍 1
h
Size of everything sent through `from_value` that's > 200 bytes: https://gist.github.com/Eric-Arellano/cf886685f990eb5520c1a1d97a56dcc6#file-size-of-objects-received That's a whole lot of `datatypes` being passed around. And not sure why we ever materialize files in either direction
w
i'm suggesting only looking at the handles after running
👍 1
h
Also I found that the `__init__.py` rule does make a difference, but I think primarily because it uses Daniel's new builtin rule to go from `Digest -> Snapshot`, so we avoid `Digest -> FilesContent`
w
ie `objgraph.show_most_common_types(objects=[self.from_value(h) for h in self._handles])`
👍 1
or whatever other analysis
mm, yea: avoiding the FileContent is significant, for sure.
h
how would you know in `native.py` when it's over and you should inspect `self._handles`? Maybe put it in `raise_or_return(self, pyresult)`, as I think that gets called near the end
w
i linked above a spot to insert the call
(can just reach in there and grab it, afaik)
h
Number of references after running `./pants list tests/python/pants_test/util:strutil`:
```
CDataOwn  985
str       60
function  50
partial   9
tuple     6
NoneType  5
ABCMeta   5
generator 4
Digest    3
int       3
None
```
After `./pants --no-v1 --v2 test tests/python/pants_test/util:strutil`:
```
CDataOwn  1320
str       182
function  50
int       40
tuple     38
Digest    34
NoneType  32
generator 28
bytes     9
Snapshot  9
None
```
w
Hm. CDataOwn is still the handle, right? Did you unwrap it with `from_value`...?
...?
h
No, the handle is `CDataOwnGC`. This is the snippet I added to `run_console_rules()`:
```python
from pants.engine.native import Native
import objgraph

native_context = Native().context
# show_most_common_types() prints its table and returns None,
# which is why a trailing "None" shows up in the output above.
print(objgraph.show_most_common_types(objects=[native_context.from_value(h) for h in native_context._handles]))
```
w
... that's unexpected
Afaik, `type(from_value(...))` should ~always match one of the Params or return values of a @rule
But you might consider using one of the other objgraph analysis methods... there are a lot
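A few of the other objgraph entry points that could be pointed at the unwrapped values (illustrative; `unwrapped` is assumed to be the `[from_value(h) for h in _handles]` list from the snippet above):

```python
import objgraph

def analyze(unwrapped):
    # Count instances of one suspicious type among the held values.
    digests = [o for o in unwrapped if type(o).__name__ == "Digest"]
    print(f"{len(digests)} Digest objects held")

    # Whole-process view: which types have grown since the previous call.
    objgraph.show_growth(limit=10)

    # Render what keeps a few of the held objects alive (writes a graphviz image).
    if digests:
        objgraph.show_backrefs(digests[:3], max_depth=3, filename="backrefs.png")
```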
h
fyi what's leftover in `ExternContext._handles` after the run: https://gist.github.com/Eric-Arellano/cf886685f990eb5520c1a1d97a56dcc6#file-leftover-in-handles The file content looks to be the culprit to me again. I ran the same on `./pants list` and found that it too has materialized file content, but the key difference is that it's O(1) for space. For `./pants test`, it's O(n), where n is the # of source files in the closure
w
Interesting.
I think you're looking at the right data now... maybe try scaling it up slowly and seeing if the distribution changes? Run with 1, then 2, etc?
💯 1
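A rough sketch of that scaling experiment, assuming the objgraph snippet above stays patched into `run_console_rules()` so each run prints its own handle summary (the target names are illustrative):

```python
import subprocess

# Hypothetical: the same util tests, added one at a time.
target_sets = [
    ["tests/python/pants_test/util:strutil"],
    ["tests/python/pants_test/util:strutil", "tests/python/pants_test/util:dirutil"],
]

for targets in target_sets:
    result = subprocess.run(
        ["./pants", "--no-v1", "--v2", "test", *targets],
        capture_output=True, text=True,
    )
    # The patched-in objgraph table is in stdout; compare how the counts
    # (especially Digest/Snapshot/bytes) scale with the number of targets.
    print(f"=== {len(targets)} target(s) ===")
    print(result.stdout[-2000:])
```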