# general
f
Is there a way to globally shut off python dependency inference? I thought there was one, but I only see it available https://www.pantsbuild.org/docs/reference-python-infer
b
The `imports` option controls it globally (at this point it's a bit of a misnomer, and likely should be renamed), IIRC
f
`[python-infer].imports`?
b
Yeah double check the code, but I think that's it
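For reference, disabling it in pants.toml would look something like this (a sketch assuming the option name from the `[python-infer]` scope on the docs page above; double-check against your Pants version):

```toml
# pants.toml -- sketch; assumes [python-infer].imports is the option
# that controls import-based dependency inference globally.
[python-infer]
imports = false
```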
f
I tried that and running `pants dependencies ::` got at least 2x slower
We're hitting some kinda performance bug that isn't related to actually scanning the modules I think
lmao
Executed in  120.79 secs      fish           external
   usr time  699.66 millis  513.00 micros  699.14 millis
   sys time  138.12 millis  134.00 micros  137.99 millis
I think there's a scheduler problem haha
120 sec but < 2 sec of actual work?
b
Can I ask why you wanna turn it off? Just curious
f
I don't, I'm trying to isolate why we get such abysmal performance with it
and I think I need to use `--no-pantsd` to get accurate metrics with `time`
❯ time ./pants --no-pantsd --no-python-infer-imports dependencies ::
...
________________________________________________________
Executed in  130.08 secs    fish           external
   usr time  171.09 secs  914.00 micros  171.09 secs
   sys time   21.74 secs    0.00 micros   21.74 secs
b
How many files do you have?
f
❯ ./pants --no-pantsd --no-python-infer-imports list :: | wc -l
7773
b
I think a `find` for `.py` might be a bit more accurate
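Something along these lines (a sketch; the path and any directories you'd want to exclude are up to you):

```shell
# Count .py files directly, instead of Pants targets, to estimate how
# many files dependency inference actually has to parse.
find . -name '*.py' | wc -l
```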
f
4465
b
Oh yeah. That's 4.5k processes then
(in the cold case)
If the cache is hot I wouldn't expect that to take 2 minutes though
f
The cache should be hot though. Without pantsd I know you lose memoization, but it should be able to read the inferred deps from the LMDB cache, right?
b
Correct
f
and turning it off shouldn't make it slower lol
b
Well I might be wrong there. It might do some other thing if off
I'd look at workunits and/or debug traces
f
how to look at those? I'm looking at `-ldebug` now, and I've grabbed some profiler data for similar issues before
b
I'm about to hop off, so I'll let someone else pick this up, like @witty-crayon-22786
w
@flat-zoo-31952: um, without being able to capture workunits, enabling `-ldebug` might be the best bet. we probably ought to add a generic workunit capture plugin, since it has all of this timing data.
f
17:40:00.11 [DEBUG] Completed: Find targets from input specs
17:40:57.82 [INFO] Long running tasks:
  60.41s        `dependencies` goal
17:41:27.84 [INFO] Long running tasks:
  90.43s        `dependencies` goal
17:41:56.75 [DEBUG] Completed: `dependencies` goal
not a lot of granular info there
w
and/or a `py-spy` trace
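A capture of the live process might look something like this (a sketch; `<PANTS_PID>` is a placeholder for the actual process ID, and py-spy has to be installed separately):

```shell
# Sample the running Pants process and write a flamegraph SVG.
py-spy record --pid <PANTS_PID> -o pants-profile.svg

# Or emit speedscope-format data instead, for https://speedscope.app
py-spy record --pid <PANTS_PID> --format speedscope -o pants-profile.json
```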
f
Yeah I think that's where we'll have to go with this. I should make a GH issue too.
cc @fresh-cat-90827
b
Does `-ltrace` give anything useful?
b
@witty-crayon-22786 I have one locally. I should just upstream it 🙈
f
`-ltrace` gave me a 170 MiB log
b
170MiB of pants gold... yeah, maybe not super useful.
c
Is there any instrumentation I can provide from `shorts` to help in comparing performance?
f
I think I'm hitting a Pants-specific bug, since it got worse with dependency inference turned off
https://github.com/pantsbuild/pants/issues/18911 ... I've included flamegraph and speedscope data
I've kept digging, but from where I'm sitting it really looks like a scheduler issue. With a cold cache it takes 4-5 min to perform inference. I'm happy to keep looking into what could be causing this, because it seems outlandishly slow