# general
a
Ugh. Is there any magic bullet for reducing pants' memory footprint? Doing a
pants --changed-since=$(git merge-base HEAD origin/master) --changed-dependents=transitive list
takes 23GB of memory in our repo 😞
😮 1
Smart solutions, like reducing the number of targets, aren't particularly useful, unfortunately; at least short term we're stuck here.
c
I’m not experienced in this area, but just out of curiosity how many targets do you have, roughly?
a
Many 😞
c
Also, this is a topic @flat-zoo-31952 has been wrestling with, I believe...
a
```
$ pants list :: | wc -l
28785
```
👍 1
So we're trying to upgrade to 2.21, and we have to move to lockfiles/resolves. But, we have to have basically the same resolve stuff 6 times. 😞
💯 1
Until we can either merge the dependencies or take the time to split them better...
It might be only tests that get one target per resolve, though, hm
Before adding the lockfiles, we had 10943 targets, and the same command peaked at 7GB of memory.
On 2.17
And, yeah, Josh and I talked about large graphs and other similar stuff in the past 🙂 back then, there wasn't really any solution to this.
f
I'm not really sure. I don't think the Pants project is in a position to fix these things tbh. I took a look at what it would require, and while it's probably fixable, it would require a ton of work in Pants core, and most of the devs who have the biggest knowledge there are focused on other projects at this point.
a
It's a bit sad to say "oh, go monorepo. but not too big" 😞
f
To be really honest, I've essentially decided that Pants' fine-grained dependency model is probably bad for large monorepos. Even if the perf/memory issues were fixed, I'm not sure it maps to how humans think about projects anyway. The number of vertices in the graph just doesn't fit in any normal person's head, which can make your code architecture really difficult to reason about. In our current work monorepo we're trying to adopt a coarse-grained dependency model that explicitly pushes developers to think about libraries, applications, artifacts and boundaries, but without forcing a split to a separate repo. Seems to be more like what JS monorepo tools do.
Pants is still used and will probably continue to be used, but probably not for every project, and certainly not as a single Pants install in the repo root
a
The problem is that we do define deps at a library level, but pants expands this to "oh, library X? you mean, files X-Y from library X, amirite?"
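(For context, a minimal sketch of the expansion being described; the paths and target names here are made up for illustration. A dependency declared on a library's `python_sources` generator is still expanded to its per-file generated targets internally:)
```python
# libs/X/BUILD — the "library": one target generator owning many files
python_sources(name="X")

# apps/app/BUILD — the dependency is declared at library level...
python_sources(
    name="app",
    dependencies=["libs/X:X"],
)
# ...but Pants materializes one generated target per source file
# (libs/X/foo.py:X, libs/X/bar.py:X, ...) and builds its internal graph
# at that per-file granularity — the fine-grained expansion being discussed.
```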
f
Yeah this is what I mean by fine-grained dependencies
It's a fundamental design decision in Pants that I'm pretty sure just does not scale well
a
Yeah, but what I mean is we don't use inference or anything, we actually do define these correctly.
f
Or if it does scale, it only does so in specific circumstances
It doesn't matter whether you use inference or not, Pants needs to build a graph for every single target
And it uses its own execution graph engine to build that dependency graph, and that's where the memory usage is
a
Yeah, but for inference, I'd understand the per-file thing. And it would definitely help us in many places, to prune the graph
but it'd only reduce the number of edges, not nodes
f
Pants wasn't built to not use inference
The option exists, but it doesn't result in any performance improvement
If your deps are all spelled out correctly manually, it seems like you could probably move to Bazel easier than most (although it is my understanding that Bazel hits this memory problem at some point too)
a
I think we're quite locked in to pants, even if we need 25GB of memory to run a fmt 🙂
If for no other reason than nobody in management will approve of us switching to another build system, wasting months of developer time on something that can be fixed by throwing more money at hardware
f
There's also Polylith https://pantsbuild.slack.com/archives/C046T6TA4/p1715508080981669 to look at, which I think uses the operating model I describe
Yeah for us, we have always been in this weird position of not really using Pants to its potential because we consume deps from RPM repos, not from PIP repos
So it's a lot easier to phase it out as we work on new tooling
a
Obviously, I couldn't even get my company to sponsor pants (we got acquired just as we were about to sign, and then... the mothership is ignoring us when we ask if we can get the budget for it 😞 ), so it's not like I can advocate for any change to this, but yeah
😕 1
f
We're finally doing a long-awaited great refactoring, so doing a bit of homespun build tooling is part of that
a
I know this is completely unrelated, but I'm not sure at what point people started thinking creating the virtualenv in your project's root is okay, but I hate it 😛
And, heh, I'm not sure if my alcohol fuelled yak shaving drove me to too much alcohol, but that combo won't actually fix any of the issues we're having
f
I yak-shaved on this myself quite a bit, ended up:
• trying to rewrite a bit of https://github.com/pantsbuild/pants/blob/1d62bbaf7ef8c5b2a8821a9af0ca519d590010b1/src/python/pants/engine/internals/graph.py#L4 but too much depends on it
• adding GC to the internal graph but I don't know rust well enough
• trying to create a benchmark repo for reproducing these issues but it really was never as bad as what I was seeing at work
What it comes down to is not having the time or expertise to solve or even really debug these problems
w
```
$ pants list :: | wc -l
28785
```
So, just under 1MB per target? That sounds like… a lot... @flat-zoo-31952 Were you seeing numbers like that too?
Ah, wait, conflating two things - the "transitive" call was what brings you up to 23GB?
a
Yes
w
And how long does the call take via `time` or similar?
a
Hm, give me a few minutes, it takes 2 minutes on the 28k one
It's a bit misleading though, since the 28k one errors out
w
Okay, and how are you tracking memory usage of the call (so I can test on some repos I have)?
a
ah, well, I noticed this on circleci, it OOM'd on a 16GB docker container, then I looked at Activity Monitor on my mac and htop.
👍 1
I can try it on a linux laptop if it matters, but I doubt it'll be different.
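(For anyone wanting a number that's easier to compare than eyeballing htop, here's a minimal sketch, stdlib only, for Linux/macOS. It assumes `--no-pantsd` so the memory isn't hiding in the daemon; the pants arguments are just the example from this thread.)
```python
# Hedged sketch: report peak memory of a pants run (standard library only).
# ru_maxrss covers children the interpreter has waited on, so run pants
# with --no-pantsd or the daemon's memory won't be counted.
import resource
import subprocess
import sys

cmd = [
    "pants", "--no-pantsd",
    "--changed-since=origin/master",          # or $(git merge-base HEAD origin/master) as above
    "--changed-dependents=transitive", "list",
]
subprocess.run(cmd, check=False)

usage = resource.getrusage(resource.RUSAGE_CHILDREN)
scale = 1 if sys.platform == "darwin" else 1024   # macOS reports bytes, Linux KiB
print(f"peak child RSS: {usage.ru_maxrss * scale / 2**30:.2f} GiB")
```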
anyway, it took 1:20 for the 11k targets one
w
And silly question, but relevant only because I think someone else had a similar issue a couple weeks ago, are there any symlinks in the repo?
a
I doubt it
Nope, no symlinks
w
Dag... This was what I was referencing, was hoping you just caught a stray like this thread https://pantsbuild.slack.com/archives/C046T6T9U/p1718188313286379
a
I can tell you, over the past 2 years, as we've upgraded pants, the memory usage went up significantly. I think it was 2.9->2.14 when we saw it, but yeah. We did increase the number of targets by a lot, but the memory increase wasn't linear, that's for sure
Tomorrow, I can try to run this on the repo from 2 years ago; I'm 100% sure it won't go above 2-3 GB
w
👍 Trying to find some comparables, but I'm not going much past 3-4GB on a repo with about 1/3rd the targets
So, yeah, definitely feels non-linear
Hmm, interestingly though - when I'm running in WSL, I seem to get multiple pantsd instances for the same repo - will need to dig into that, as they are not multiplying memory usage - but there are multiple PIDs
a
the behaviour doesn't change with --no-pantsd
w
Yeah, I figured - I wonder if this is a pants or wsl artifact - multiple PIDs, but I kill one, and it kills all - and we seem to only be using 1 Pant's worth of memory
a
I can ask a coworker to run this in WSL, if you think it'd give you any useful info 🙂
w
nah, there is something deeper going on - I just had a WSL sitting around
How many resolves/deps are you grabbing as well?
a
Hm, we have.... 21 resolves, 315 lines in our main requirements.txt
Not sure how to answer that exactly
13 if we exclude the tools resolves, btw
w
👍 And if you're able to succeed in running the command (big oof here):
```
pants --stats-memory-summary {the command}
```
I think an anonymized version of that information might be useful?
a
Where does that write the info?
w
it should write it to the command line,
a
Untitled
w
Bah, exception? Should look something like:
```
pants.git/call-by-name-cue-deb-make-sql % pants --stats-memory-summary list :: | wc -l
17:38:20.85 [INFO] Memory summary (total size in bytes, count, name):
  48            1               pants.backend.javascript.package_json.AllPackageJsonNames
  48            1               pants.backend.project_info.filter_targets.FilterSubsystem
  48            1               pants.backend.project_info.list_targets.List
  48            1               pants.backend.project_info.list_targets.ListSubsystem
  48            1               pants.backend.python.dependency_inference.subsystem.PythonInferSubsystem
  48            1               pants.backend.python.goals.lockfile.PythonSyntheticLockfileTargetsRequest
  48            1               pants.backend.python.subsystems.setup.PythonSetup
  48            1               pants.backend.scala.subsystems.scala.ScalaSubsystem
  48            1               pants.backend.scala.subsystems.scala_infer.ScalaInferSubsystem
```
a
Let me try it on the 2.17 one, the exception... gonna take a while to fix
Untitled
w
Whoa That's... a lot of `AddressInput`s? I'll take a look at this trace (probably tomorrow) and compare against some of my repos, but it's good to see where a bunch of the memory goes.
```
88377576		1052114		builtins.AddressInput
```
Would you call this a regression in memory, then? As in, the same repo, same command works on 2.17, but falls over on 2.21?
f
It’s been a while since I looked at this last, but 0.5-1 MB per target in the repo sounds about right. IIRC it reduced to anything hitting `AllDependenciesRequest` or something like that. So `pants dependencies --transitive ::` was one of the simpler ways to trigger that
I’m fairly certain the memory overuse is a bug of some sort, but I was never able to isolate what triggers it
For us the memory usage just gets kinda steadily worse each Pants version (and steadily worse as our repo grows in number of targets)
w
I can understand it getting linearly worse with number of targets (and if they’re tightly coupled, maybe the dependency graph gets nutty), but getting worse each Pants version isn’t cool. That’s something we should be watching for regressions on (including perf).
```
ValueError: The explicit dependency requirements/python#connexion of the target at aio/aio/__init__.py:../lib does not provide enough address parameters to identify which parametrization of the dependency target should be used.
```
The first trace actually had what seems to be a genuine error in it?
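(For reference, a hedged sketch of what that ValueError is asking for: when a `python_requirement` exists under more than one resolve, an explicit dependency has to pick a parametrization via the address suffix. The target names and resolves below are hypothetical, not the actual fix for this repo.)
```python
# aio/BUILD — hypothetical example, not the real repo layout
python_sources(
    name="lib",
    resolve="global_2",
    dependencies=[
        # instead of the ambiguous "requirements/python#connexion":
        "requirements/python#connexion@resolve=global_2",
    ],
)
```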
h
The dep granularity feedback would be really valuable on the new python backend wishlist
s
Since you mentioned 21 resolves https://github.com/pantsbuild/pants/issues/20568
💯 1
h
Feels like the graph representation should be rewritten to be far more compact
I mean, that information is a strict subset of the information in the source files, which presumably are a lot smaller than 23GB...
So something is gratuitously wasteful here
💯 2
w
Without having dug too far into it yet, I'm curious if the problem is more related to what Gregory linked to - rather than strictly first-party sources. Even on some of the larger projects I've worked on, with lots of first-party code (only), I haven't noticed anywhere near 23GB. But, start throwing in some AI libs, and if those are pulled in process at any point, I can see everything hitting the fan.
a
We have inference turned off, so that shouldn't matter.
🤨 1
f
When I was investigating this, I remember looking at some code paths and noting that inference is not really the problem. The code for building the python dependency graph is quite complex, and inference is really a small part of it. IIRC adding resolves had a huge impact on memory, but it's been so long since I've done this analysis it's hard to remember.
a
Also, not only does it spike at 23GB, but it's sloooooooow:
```
cbirzan@GP3CMXYV9V:~/PycharmProjects/cr_python$ pants --tag=-no_mono --changed-since=6ed4c2c0edf22f1e796f56dd57f0196113cbad5d --changed-dependents=transitive list
17:51:40.99 [INFO] Reading /Users/cbirzan/PycharmProjects/cr_python/.python-version to determine desired version for [python-bootstrap].search_path.
17:51:41.05 [INFO] Reading /Users/cbirzan/PycharmProjects/cr_python/.python-version to determine desired version for [python-bootstrap].search_path.
17:56:19.47 [INFO] Reading /Users/cbirzan/PycharmProjects/cr_python/.python-version to determine desired version for [python-bootstrap].search_path.
⠓ 1154.28s Map all targets to their dependents
```
w
You mentioned 28785 total targets - how many python files are in src in total?
a
```
$ find . -name '*.py' | wc -l
    7309
```
A few have the `no_mono` tag, but I guess those still go in for the graph.
Now it's just taking the piss...
```
18:19:15.53 [INFO] Filesystem changed during run: retrying `@rule(pants.backend.project_info.dependents.map_addresses_to_dependents())` in 500ms...
```
almost 30 minutes in, heh
okay, so, is there a way to upgrade pants without using lockfiles? 😄
f
I've wondered if the retries are where the memory leak is coming from
a
it's at around 11-13GB now
h
This seems pathologically wrong
a
That's the feeling I've been getting working on this PR 😄
h
@ancient-france-42909 this debugging is golden! I will dive into improving this if you can help come up with a smoking gun
I have a feeling something very stupid is happening under the covers
w
^^ Exactly, this seems so wildly off-kilter that there must be some sort of redundant work, recursion, or something equally wild. I feel like some sort of 3rd party deps are coming into proc unnecessarily, and those are big enough to easily wipe out your process memory. I was testing on Josh's clam diggers repo, and while it took a lot of clock time, I didn't see any particular crazy memory spikes
Or like... over memoization - like somehow we're re-memoizing the same data multiple times
f
It would be nice to be able to trace requests through the engine
Just need to wire it all up to a Prometheus backend 😄
🔥 1
But seriously, the Pants execution engine is complex and asynchronous enough that it has a lot more in common with a distributed system than a conventional program, at least when you're trying to understand things at this level of detail. Reading `-ldebug` logs didn't really get me anywhere, just because so many log statements are really really generic, so you can't really trace what's happening with a particular request
I too have a feeling something dumb is happening under the hood, but I was never really able to isolate it, and without publicly available reproducers, none of the other contributors can either.
"Clam Diggers" is really simplistic. Maybe activating resolves or adding some real 3rd party dependencies might induce the behavior.
w
Yep, that's what I was trying. First get a baseline, then start making it more complicated piece by piece. And then I fell asleep waiting for it to finish
😩 1
a
ugh, to get the apple profiler you need an apple id
Well, that's awkward. I added another resolve and now it errors out in 3 minutes 😄
w
Was it seqeval?
a
nah... So, the way I'm going about this: we have, basically, 6 sets of resolves that matter. We have the base one (`global`), that has the overwhelming majority of packages, and then a few overrides for specific libs. We're in the process of upgrading to `connexion` 3, so we have a `connexion_3` resolve that has the connexion 3 version, and also includes `global`. We then have a `global_2` resolve, that has `connexion` 2 inside. And all apps that use `connexion` 2 use `global_2`. So we have a default `resolve=parametrize('global', 'global_2', 'connexion_3')` at the top level. Anyway, I had forgotten to add one of these resolves to the parametrize, and it was taking 30+ minutes to print out the error message that it can't find a library (which, btw, is very confusing).
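(A minimal sketch of the parametrized-resolve field being described, using the resolve names from this thread; the top-level default they mention would set the same field via a BUILD-file default rather than on each target.)
```python
# BUILD — hedged sketch; real targets and names will differ
python_sources(
    name="lib",
    # the library is built once per resolve listed here; leaving a resolve
    # out of the parametrize is what produced the slow, confusing
    # "can't find a library" failure described above
    resolve=parametrize("global", "global_2", "connexion_3"),
)
```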
oh, wow, it worked.
It might be related to trying to find deps that don't exist, will test; let me run without pantsd to see how long it takes.
🤔 1
Okay, about 3 minutes, that's not too far off from how much it took before.
f
a
I'm not saying that's generally the issue, but it's suspicious it went from 30+ minutes to 3 when I fixed that
f
Oh yeah... I was just wondering about the memory consumption
a
Also, weirdly... When I tried to generate the lockfiles for all the resolves, it took 15 minutes, but each individual one takes ~3-4 minutes
oh, let me check memory
Peaks at 13, now it's down to 9 and still going down; before, it would go to 25 within a few seconds.
this is getting ridiculous
okay, let me try to break it
Okay, so, I reverted to a version where it cannot find dependencies because of resolves not being correct, and it's slower and uses more memory.
but these runs always error out
f
@fresh-cat-90827 maybe this could be related to our fake 3rd party deps
f
Catching up on the thread
f
The idea is that it's possible that missing dependencies could be a reason for extra memory usage. In George-Cristian's case they result in an error, but it's possible that ours don't, since we never really use those faked deps for anything anyway, but perhaps resources go towards trying to figure them out anyways.
a
When I say missing deps, I mean they exist in a different resolve than the one attributed to a python_sources target.
This is really weirdly inconsistent in the time it takes... On circleci it was taking 45 seconds to fail, vs 2 on my laptop, now it's taking 3 minutes to finish on my laptop but took 10 on the CI
But, yeah, with missing deps (pic 1), and without missing deps (pic 2)
Yeah, hm. Changed something and now it cannot figure out which of the 10 resolves it must use, memory went back up...
w
So, trying to grok the last few messages - you get an OOM when Pants is unable to figure out which resolve? requirement? to use - but when everything is using the correct resolve, no OOM?
a
I mean, it doesn't OOM, both my laptop and the CI have 32GB of memory
or, well, I have 36 because macs.
w
OOM ==> "memory explosion"?
a
Yeah
And it's slower too
But, now at least I have something to profile 🙂
🎉 1
It's somewhat interesting, but also misleading. Most of the time is spent in python, which is not necessarily unexpected, but at least something. Also, misleading because half of the python time is spent waiting for the GIL, heh.
If py-spy is to be believed, this is kind of how it looks:
Recording gives me something else...
py-spy seems to be unable to keep up at even 50 snapshots per second towards the end... And it slows down things a loooot
I captured about 10 minutes from both, but since they didn't finish...
Gonna let it finish and go write some feedback for the new python backend :)
```
py-spy> 52.45s behind in sampling, results may be inaccurate. Try reducing the sampling rate
```
been running for more than 30 minutes, heh
Speedscope trace of the slow one, 30 samples per second.
I think the very low sample rate makes these kind of useless. And I also think the high number of threads screws it up, would setting the max parallelism variable limit that?
f
I wonder if one of the things we could try would be to eliminate our "faked" dependencies and then allow uninferrable deps again and see if that has an effect
a
I might try to reproduce this, but this is... not a priority for us right now, so can't really spend that much time on it 😞 Plus, it turns out moving to resolves is a pain in the ass, I have to basically annotate every lib by hand 😞
so it being slow is not that important, the fact that I have to spend days on this is a bigger issue, heh
😩 2
@happy-kitchen-89482 Btw, is there any timeline for a new python backend? We were discussing this today, talking about whether it's worth spending the time to fix the issues we have, if we're gonna have to upgrade again and maybe have to re-do a lot of the work then
h
No timeline that you can rely on, no
What are the issues you're looking at?
a
Our biggest one is resolves being stupid to work with, and that type stubs thing, but that's easy to fix with, mostly, some sed magic. My main problem now is migrating a repo from no resolves to resolves, and it's just a game of whackamole. Oh, this is from the generic resolve, this is the generic without what we have overrides for, and then doing this for literally every target myself.
💯 1
💢 1
90% of the problem would go away if the default resolve would mean that any target with that resolve would use it, if it works with all its dependencies.
h
Not quite grokking this, can you give an example?
a
Okay, imagine you have an app that needs a new version of lib X, but the rest of your codebase cannot use it. In non-resolves pants, you can just add an override and it Just Works. In the new one, you have to make 3 resolves: 'generic without X', 'X old version', 'X new version'. You can have more than one requirements per resolve, so that's okay, but it's still annoying that you have to refer to it as `whatever:something-else#X` and `whatever:something-else2#X` vs `whatever:python#X`, since you cannot have more than one requirements with the same name, even if they're in different resolves, but that's okay-ish. But now comes the hard part: let's say a library depends on `X`. All your libs have to have a resolve of `generic` if they don't depend on `X`, and then `generic-with-old-X` if they depend on the old one, `generic-with-new-X` if they depend on the new one.
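(A hypothetical requirements BUILD file illustrating the naming constraint just described: two pins of the same lib live in different resolves but still need differently named `python_requirements` targets, so the dependency addresses end up looking like `whatever:something-else#X`. File names and resolve names here are illustrative.)
```python
# requirements/BUILD — made-up names and files for illustration
python_requirements(
    name="python",                       # -> deps look like requirements:python#X
    source="requirements.txt",           # pins the old version of X
    resolve="generic-with-old-X",
)
python_requirements(
    name="something-else",               # same lib, newer pin, different resolve,
    source="requirements-new-x.txt",     # but the target name must still be unique:
    resolve="generic-with-new-X",        # -> requirements:something-else#X
)
```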
And on a large repo, this becomes tricky, you can't even use sed or something to do it, you have to parse the AST of `BUILD` files and write a new one, since formatting can sometimes put function calls on the next line, and that's just painful (not that I agree with very long lines 😄 )
Back when we upgraded through the version that made it so you have to put `conftest.py` in another target, I did just that, but it was painful
Was that okay, or should I try to give an example? 😄
s
I just read through this thread, and +1000 to @ancient-france-42909, I arrived at the same thought myself a bit ago. To have a "neutral" resolve or `None` resolve, so that a target could be shared between other resolves, would be so helpful. https://github.com/pantsbuild/pants/issues/21194#issuecomment-2253596097
https://pantsbuild.slack.com/archives/C046T6T9U/p1719940826887709?thread_ts=1719856232.324909&cid=C046T6T9U Is there any good way to debug something like this? After enabling resolves, `Map all targets to their dependents` now takes 20+ minutes 😕