Another odd case from one of our engs They report touching a Pants #general

Another odd case from one of our engs. They report...

bitter-ability-32190

03/22/2022, 9:23 PM

Another odd case from one of our engs. They report touching a test file, then seeing us run the lint tools on the one file. Then a few pants runs (with the same exact cmd) later they see us run the lint tools on the "bucket" of files that test file is in.

bitter-ability-32190

03/22/2022, 9:24 PM

@witty-crayon-22786 if need be I can share the workunit log JSON privately. My only suspicion so far is it has to do with pantsd invalidation, but I don't know why that would matter

witty-crayon-22786

03/22/2022, 9:26 PM

hm, so: as it stands, the bucketing won’t coarsen up into larger buckets than have been specified. so if you ask to lint one file, one file will be linted (we won’t expand to a larger group)

witty-crayon-22786

03/22/2022, 9:27 PM

so i think that that is currently expected.

happy-kitchen-89482

03/22/2022, 9:29 PM

Assuming other files in that bucket were modified, that is

witty-crayon-22786

03/22/2022, 9:31 PM

Assuming other files in that bucket were modified, that is

no: regardless of that. the single file in the bucket was modified. there are two cache keys: one for the run with the single file, and one for the run with the bucket

witty-crayon-22786

03/22/2022, 9:31 PM

we don’t convert the former into the latter

bitter-ability-32190

03/22/2022, 9:32 PM

So if there's 2 files modified, what happens?

bitter-ability-32190

03/22/2022, 9:39 PM

Actually better question is if I only ever used the same --changed commands for specs, when would it bucket vs not

witty-crayon-22786

03/22/2022, 9:53 PM

Actually better question is if I only ever used the same --changed commands for specs, when would it bucket vs not

we always bucket: but we bucket the inputs, rather than expanding the inputs into some larger set of inputs and then bucketing those.

witty-crayon-22786

03/22/2022, 9:54 PM

so if the input set is two files, we’ll bucket two files (not go and find larger buckets which contain those two files)

happy-kitchen-89482

03/22/2022, 9:55 PM

Oh right

happy-kitchen-89482

03/22/2022, 9:56 PM

This gets back to the larger potential project of splitting batched processes into individual "virtual processes" and caching the latter as-if they had actually run

bitter-ability-32190

03/22/2022, 10:00 PM

So the only way we'd run the lint tools on 300-ish files is if pants thinks that many have been changed

witty-crayon-22786

03/22/2022, 10:00 PM

correct

bitter-ability-32190

03/22/2022, 10:00 PM

Now to figure out why that's our spec 😂

bitter-ability-32190

03/23/2022, 1:21 PM

Would that be possible to suss out from the workunit JSON?

witty-crayon-22786

03/23/2022, 4:14 PM

um, maybe not from the workunits, but from the raw run data, or debug logs maybe? at a fundamental level, all

--changed

is doing finding some “roots” which directly changed, and then (if

--…=transitive

) including the transitive dependees

witty-crayon-22786

03/23/2022, 4:15 PM

i think that what you will see in the raw run data is the specs after the changed calculation though, unfortunately

bitter-ability-32190

03/23/2022, 4:18 PM

I noticed a file being fingerprinted that was unexpected in the workunits, and has lots of transitive deps. My current suspicion is for some reason Pants is using it to calculate specs

witty-crayon-22786

03/23/2022, 4:19 PM

if you have direct access, i would suggest:

./pants --changed=.. list

to see what it thinks have been directly changed

✅ 1

witty-crayon-22786

03/23/2022, 4:19 PM

(without the transitive)

4 Views

Open in Slack

Previous Next