I finished my PoC for having a processes be batched for effe Pants #development

bitter-ability-32190

05/05/2022, 6:14 PM

I finished my PoC for having processes be batched for efficiency but split for the cache. It...... works! The code bakes in a lot of assumptions/hacks, and the biggest issue/question is how to handle the one process result vs. many process results (how to split/merge stdout/stderr). But... for a PoC it lays a neat foundation. First run:

./pants --no-pantsd -ldebug --stats-log fmt --only=black ::

(I only retrofitted black for the PoC) shows `local_cache_requests_uncached: 1048` (vs 14 on `main`). When I then run

./pants --no-pantsd -ldebug --stats-log fmt --only=black src/python/pants/backend/python/lint/black/rules.py

I don't see any processes being run for `black` (there are uncached requests/processes for getting the ICs). https://github.com/thejcannon/pants/tree/synthcacheproc

🙌 2

👏 2

bitter-ability-32190

05/05/2022, 6:16 PM

CC @witty-crayon-22786 @happy-kitchen-89482 I've had a severely reduced capacity for Pants contributions lately. I've had 2 doctors tell me I should focus getting more sleep, so there goes my hobby time. Therefore it'll be a PoC for a while, but... pretty neat stuff 😈

💜 1

😴 1

bitter-ability-32190

05/05/2022, 6:24 PM

The TL;DR:
• Client opts into a new type. For the PoC it's just a tuple of `Process`es, but in the future it would carry more granular info. There are tradeoffs to be made when you opt into the new type.
• The `cache.rs` command runner is responsible for taking the batch and querying the cache for each `process` object. It collects the uncached process objects to be merged and run as one (TBD on whether `cache.rs` or `bounded.rs` does the merging-and-running; the PoC has it in `bounded`).
• To merge, we basically just combine the input digests into one and chain all the files to append to the "core" argv.
• If the process was successful, store the individual process runs in the cache (TBD on the output info).
• (TBD: collate the uncached batch run + cached results into a final result object.)

👍 1
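A minimal Python sketch of the TL;DR flow, to make the steps concrete. This is not the PoC's code (that lives in Rust): `Proc`, `Result`, and `run_batch` are hypothetical names, the cache is a plain dict, and input digests are modeled only as file names, so digest merging is not shown.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass(frozen=True)
class Proc:
    """Hypothetical stand-in for a `Process`: a "core" argv plus the files it covers."""
    core_argv: Tuple[str, ...]
    files: Tuple[str, ...]


@dataclass(frozen=True)
class Result:
    exit_code: int
    stdout: str = ""


def run_batch(
    batch: List[Proc],
    cache: Dict[Proc, Result],
    run: Callable[[Tuple[str, ...]], Result],
) -> Dict[Proc, Result]:
    # 1. Query the cache once per process in the batch.
    results = {p: cache[p] for p in batch if p in cache}
    uncached = [p for p in batch if p not in cache]
    if not uncached:
        return results

    # 2. Merge: one "core" argv with every uncached file appended, run as one.
    merged_argv = uncached[0].core_argv + tuple(f for p in uncached for f in p.files)
    merged = run(merged_argv)

    # 3. Store per-process entries only when the merged run succeeded.
    for p in uncached:
        results[p] = merged  # PoC-style: each value is a copy of the merged output
        if merged.exit_code == 0:
            cache[p] = merged
    return results
```

With a batch of three single-file `Proc`s and an empty cache, `run` is called once with all three files appended; a later call for any one of those files is served from the cache without calling `run`.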

hundreds-father-404

05/05/2022, 7:53 PM

I've had 2 doctors tell me I should focus getting more sleep, so there goes my hobby time.

please do take care of yourself!! I've had to practice this a lot too this past month. Programming is a particularly addicting hobby w/ the sense of accomplishment 😱

happy-kitchen-89482

05/05/2022, 8:48 PM

Sleep is glorious and I can highly recommend it 🙂

happy-kitchen-89482

05/05/2022, 8:49 PM

this PoC is glorious too, but not worth sacrificing your health for

➕ 1

bitter-ability-32190

05/05/2022, 8:54 PM

Well the culprit is my son, whom neither I nor my Drs are able to convince to sleep better 🙂

bitter-ability-32190

05/05/2022, 8:56 PM

On topic tho: I think one way to handle stdout/stderr is to write a function that gets passed down and called in Rust (if possible) to split/collate the output. It's brittle though, because it depends on both the tool's output decisions and its verbosity level.
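As a rough illustration of what such a splitting function might look like, here is a sketch that assumes the tool prefixes each diagnostic line with `path:`. That line format and the name `split_by_path_prefix` are assumptions for the example, not something the PoC or any particular tool defines.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def split_by_path_prefix(stdout: str, files: Iterable[str]) -> Dict[str, str]:
    """Assign each `path:...` line of merged output to the file it names."""
    known = set(files)
    per_file: Dict[str, List[str]] = defaultdict(list)
    for line in stdout.splitlines():
        path, sep, _ = line.partition(":")
        if sep and path in known:
            per_file[path].append(line)
        else:
            # Summary or banner lines belong to no single file; they are
            # collected under the empty key instead of being guessed at.
            per_file[""].append(line)
    return {path: "\n".join(lines) for path, lines in per_file.items()}
```

The unattributed bucket is where the brittleness shows up: a different verbosity level or a changed summary line shifts what lands there.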

happy-kitchen-89482

05/06/2022, 1:12 AM

Can you clarify the "store the individual process runs in the cache" part? How do you split a result of a merged run into individual per-file "processes" ?

bitter-ability-32190

05/06/2022, 1:14 AM

The key part is already taken care of since (in the PoC) we have the process objects (in the future I think we'd synthesize the processes from base info + per-process-info). The value is... well for the PoC it's hand-wavy

bitter-ability-32190

05/06/2022, 1:15 AM

See `black/rules.py`: https://github.com/thejcannon/pants/blob/57a324d90ef4cf4f05f8c27e207e00ffce00d6eb/src/python/pants/backend/python/lint/black/rules.py#L98 I have N processes which I bundle into a batch

bitter-ability-32190

05/06/2022, 1:16 AM

Here's the Rust side for caching: https://github.com/thejcannon/pants/blob/57a324d90ef4cf4f05f8c27e207e00ffce00d6eb/src/rust/engine/process_execution/src/cache.rs#L128 Specifically the cache key is the process, the value is just a copy of the output from the run-with-all-files run 🙈

bitter-ability-32190

05/06/2022, 1:18 AM

The strategy of "batch together" instead of "split apart" ensures I don't have to split inputs, just outputs.

happy-kitchen-89482

05/06/2022, 1:46 AM

by "we have the process objects" I take it you mean the process inputs, but what do you put in the process result that is cached? Just a fake exit code of 0 and no other outputs?

bitter-ability-32190

05/06/2022, 3:53 AM

• Only cache if exit code is 0
• Output info is a copy of the batched process's output (for the PoC; would need to be smarter for an actual solution)

witty-crayon-22786

05/19/2022, 12:08 AM

sorry for taking so long to look at this!

witty-crayon-22786

05/19/2022, 12:13 AM

the rough shape looks reasonable… the biggest question around the whole thing is just whether the user-space / `@rule` API can be simple enough to make it worthwhile, including making splitting of outputs simple… i don’t know of any linters with enough JSON output to split safely, but our built in tools like dependency extraction could probably

witty-crayon-22786

05/19/2022, 12:14 AM

fwiw, we had a splitting/merging strategy for a tool in v1, and it was the biggest source of bugs in the whole system (admittedly, it was being used on a compiler, where inter-dependencies are the norm, but)

witty-crayon-22786

05/19/2022, 12:16 AM

“if all of these files fail/succeed together then they will fail/succeed independently” is not a guarantee (processes are not necessarily associative), so even caching the error code and doing no splitting will require caution
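A toy example of that non-associativity, under made-up assumptions: `check` is a hypothetical checker that fails when a file imports a module it cannot see in the same invocation, and `SOURCES` is invented data.

```python
from typing import Dict, List, Set

# Hypothetical sources: module name -> names of the modules it imports.
SOURCES: Dict[str, Set[str]] = {"app": {"util"}, "util": set()}


def check(files: List[str]) -> int:
    """Toy checker: exit 1 if any file imports a module missing from this invocation."""
    visible = set(files)
    return 0 if all(SOURCES[f] <= visible for f in files) else 1


batched = check(["app", "util"])                    # both files visible together
alone = {f: check([f]) for f in ("app", "util")}    # each file on its own
```

Here the batched run succeeds while `app` fails when checked alone, so writing a success entry for `app` from the batched run would cache a result that the standalone process would never produce.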

witty-crayon-22786

05/19/2022, 12:16 AM

so… if there are enough usecases that can actually live within those constraints and gain some benefit, then maybe.

witty-crayon-22786

05/19/2022, 12:17 AM

but as pointed out on the batching-inference ticket, `pytest` is probably not one of the ones where this is the case… tests can definitely have side effects on one another. so would need to be disabled by default.

➕ 1

witty-crayon-22786

05/19/2022, 12:18 AM

(not to mention the fact that it would be a huge refactor of the `test` goal)

witty-crayon-22786

05/19/2022, 12:22 AM

for my part, i will continue to focus on lowering per-process overheads, because there continues to be low hanging fruit there, and fixing it allows `@rule` code to be written in a readable and cache-friendly way

witty-crayon-22786

05/19/2022, 12:23 AM

(for example: getting `immutable_inputs` stable and used for PEXes would drop a lot of input overhead)

bitter-ability-32190

05/19/2022, 12:34 AM

Yeah I suspect this gets opted into per-tool and by the user (we shouldn't force them into this; as Benjy likes to say, we can provide a "slider"). Then we choose the tools. I think it's really things where output on success doesn't matter much and we're sure the files don't affect each other: fmt/lint/check. Because we can punt on splitting output if we only cache success, and then just toss the output out the window (we already kinda toss it for formatters).

happy-kitchen-89482

05/19/2022, 6:21 AM

A big one for this might be dep inference

happy-kitchen-89482

05/19/2022, 6:21 AM

Instead of running 1000 processes to infer on 1000 files, run, say, 10

happy-kitchen-89482

05/19/2022, 6:21 AM

And since we control the output, we can make it really easy to split
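One way this could look, as a sketch only: assume the batched inference process prints a single JSON object keyed by file path. That output shape and the name `split_inference_output` are assumptions for illustration, not the format Pants' dep inference actually emits.

```python
import json
from typing import Dict, List


def split_inference_output(stdout: str, files: List[str]) -> Dict[str, List[str]]:
    """Split one batched run's JSON output into a per-file result for each input."""
    parsed = json.loads(stdout)
    missing = [f for f in files if f not in parsed]
    if missing:
        # Refuse to produce per-file entries if the batch did not report on every file.
        raise ValueError(f"no inference output for: {missing}")
    return {f: sorted(parsed[f]) for f in files}
```

Each value in the returned dict could then be stored under its own per-file cache key, which is the split that is hard to do safely with free-form linter output.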

bitter-ability-32190

05/19/2022, 8:20 AM

Well I think v1 will have us ignoring output, since it's safe and easy. But yeah hopefully over time we can loosen that and dep inference can be batched and per-file-cached
