Is there a way I can easily run multiple `experimental run s Pants #general

famous-river-94971

01/05/2023, 7:36 PM

Is there a way I can easily run multiple `experimental_run_shell_command`s in parallel? For example, both of these `experimental_run_shell_command`s were identified as changed:

 ./pants --changed-since=HEAD --changed-dependees=transitive --filter-tag-regex='^cdk$' list
aws/projects/project_1:cdk
aws/projects/project_2:cdk

But if I try to `run` them, I get an error:

 ./pants --changed-since=HEAD --changed-dependees=transitive --filter-tag-regex='^cdk$' run
12:32:43.46 [ERROR] 1 Exception encountered:

  TooManyTargetsException: The `run` goal only works with one valid target, but was given multiple valid targets:

  * aws/projects/project_1:cdk
  * aws/projects/project_2:cdk

Please select one of these targets to run.

I can hack something with `xargs` like this:

export PANTS_CONCURRENT=True && ./pants --changed-since=HEAD --changed-dependees=transitive --filter-tag-regex='^cdk$' list | xargs -L1 -P 2 ./pants run

but I was wondering if there's a "better"/more `pants`-y way to do this?

famous-river-94971

01/05/2023, 7:44 PM

Looks like maybe GNU `parallel` would be better than `xargs` because it groups each job's output, so the result reads as if the commands had run sequentially.
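A minimal sketch of that output-grouping idea, for cases where GNU `parallel` isn't installed. It starts one command per target in the background, captures each one's output in a temp file, and prints the results in input order as each job finishes. The function name `run_grouped` is hypothetical, and unlike `xargs -P` this sketch does not cap the number of concurrent jobs.

```shell
# Hypothetical helper (not part of Pants): reads one target per line on stdin
# and runs "<command> <target>" for each target in parallel.
run_grouped() {
  tmpdir=$(mktemp -d) || return 1
  n=0
  while IFS= read -r target; do
    [ -n "$target" ] || continue
    n=$((n + 1))
    # Start the job in the background, capturing stdout and stderr together.
    "$@" "$target" >"$tmpdir/$n.out" 2>&1 &
    echo "$!" >"$tmpdir/$n.pid"
  done
  status=0
  i=1
  while [ "$i" -le "$n" ]; do
    # Wait for job i, then print its output as one uninterrupted block.
    wait "$(cat "$tmpdir/$i.pid")" || status=1
    cat "$tmpdir/$i.out"
    i=$((i + 1))
  done
  rm -rf "$tmpdir"
  # Non-zero if any job failed.
  return "$status"
}
```

Piping the `list` output from above into `run_grouped ./pants run` (with `PANTS_CONCURRENT=True` exported) would be the rough equivalent of the `xargs` one-liner, with per-target output kept together.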

bitter-ability-32190

01/05/2023, 7:56 PM

Interactive Processes can only be run serially. So running multiple pants commands (without using the daemon) is the only way.

famous-river-94971

01/05/2023, 7:57 PM

Interactive Processes

My command doesn't need to take any input - does that mean it's non-interactive and I should be using a different method other than `experimental_run_shell_command`?

bitter-ability-32190

01/05/2023, 8:00 PM

That's the internal name for anything being `run`. What do your processes do? AFAIK we don't have a mechanism to have the user request multiple processes run in parallel on-demand and without caching 🤔

famous-river-94971

01/05/2023, 8:02 PM

Ah, gotcha. That's helpful context. My script calls out to `cdk` (the AWS CDK infrastructure-as-code tool) to run a command (`diff`/`deploy` infrastructure). That command needs the Python context from Pants since the CDK code itself is defined in Python.

famous-river-94971

01/05/2023, 8:03 PM

So the command looks like this in the CDK project's `BUILD` file:

experimental_run_shell_command(
  name="cdk",
  tags=["cdk"],
  command="../../scripts/cdk-deploy.sh",
  dependencies=["aws/projects/project_1:project_1"],
  workdir="aws/projects/project_1",
)

and then the script (simplified) looks like this:

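#!/bin/bash
# Assumption: SCRIPT_DIR is this script's own directory (its definition was omitted from the simplified version).
SCRIPT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
# Activate the virtualenv created by `pants export`.
source "$SCRIPT_DIR/../../dist/export/python/virtualenvs/cdk_dependencies/3.8.16/bin/activate"

# Make the project sources importable by the CDK app.
export PYTHONPATH="$SCRIPT_DIR/../projects:$PYTHONPATH"
npx -y cdk synth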

famous-river-94971

01/05/2023, 8:04 PM

So it's a little hacky. I have to `pants export` to get the virtualenv and set the `PYTHONPATH` myself. There might be a better way.

famous-river-94971

01/05/2023, 8:05 PM

CDK is kinda funky: the CDK "binary" is written in Node, so I need to use `npx` (NPM execute) to run it. Sadly, I can't just "run a Python file".

bitter-ability-32190

01/05/2023, 8:05 PM

Hmmm you could maybe plug into the `package` command. There's a `deploy` one as well. That'd involve a plugin today, but could also be extended

👀 1

famous-river-94971

01/05/2023, 8:06 PM

Let me look at the docs for those. IIRC I looked at `deploy` and I thought it was helm-specific.

famous-river-94971

01/05/2023, 8:06 PM

I don't think I looked into `package` at all for this

bitter-ability-32190

01/05/2023, 8:06 PM

It is today, but anything is pluggable 😌

famous-river-94971

01/05/2023, 8:10 PM

Ah, so via `experimental-deploy` is how I'd plug in? https://www.pantsbuild.org/docs/reference-experimental-deploy (not really any docs, but maybe I can reference the Helm impl for an example)

bitter-ability-32190

01/05/2023, 8:10 PM

There's a bit of a paradigm shift going on in regard to shell processes. I think we're ooching towards really opening up the floodgates with those. I could easily see, from recent changes, a way to specify an `experimental_shell_publish_command` which can run in parallel, as part of `publish`.

CC @ancient-vegetable-10556 /@happy-kitchen-89482 /@witty-crayon-22786 while we're splashing at `shell` stuff

👍 1

famous-river-94971

01/05/2023, 8:11 PM

Sure. For some more perspective, I'm coming from `yarn`/`lerna` monorepos and they can `run` any arbitrary `script` in the `package.json` with a concurrency flag. It's super nice!

yarn lerna run --since '' --concurrency 10 cdk -- deploy '**'

bitter-ability-32190

01/05/2023, 8:12 PM

(Oh my bad, I mean `publish` not `package`. brain fog got me there)

👀 1

ancient-vegetable-10556

01/05/2023, 8:15 PM

Yes, it would probably look a lot like https://github.com/pantsbuild/pants/commit/6d603dd1e487e495f28160f7d76166455cd96a36

bitter-ability-32190

01/05/2023, 8:18 PM

^ Yup that was my mental reference

ancient-vegetable-10556

01/05/2023, 8:20 PM

For what it’s worth, and I haven’t spent too much time reading whether this needs to run outside the sandbox, but you can do this:

experimental_shell_command(name="a", command="first_command")

experimental_shell_command(name="b", command="second_command")

experimental_run_shell_command(name="c", dependencies=[":a", ":b"], command="/bin/true")

and then `./pants run path/to:c`

ancient-vegetable-10556

01/05/2023, 8:20 PM

but the above assumes that `first_command` and `second_command` can be run inside the sandbox

ancient-vegetable-10556

01/05/2023, 8:20 PM

but if so, they’d run in parallel

bitter-ability-32190

01/05/2023, 8:20 PM

They'd also be cached, which isn't ideal 😕

ancient-vegetable-10556

01/05/2023, 8:21 PM

They’re only cached so long as their input dependencies don’t change

famous-river-94971

01/05/2023, 8:22 PM

I think this might be the behavior I want. If none of the following change:
• the Python source code (that defines the infrastructure)
• the underlying 3rdparty dependencies
• the shared "library" methods
then the CDK shell command(s) should not be run at all.

ancient-vegetable-10556

01/05/2023, 8:23 PM

then try a format like the above; noting that you’ll probably need some hackery to handle reverts

ancient-vegetable-10556

01/05/2023, 8:25 PM

(specifically: reverts could set the state of the repo back to one where the results of those tasks were cached, and therefore wouldn’t run)

famous-river-94971

01/05/2023, 8:26 PM

Hmm, gotcha. Yeah the caching kinda scares me. With infra-as-code, I'd rather run "too often" and let the provisioning engine determine it's a no-op than fail to trigger when we should.

famous-river-94971

01/05/2023, 8:27 PM

I think the way I have it now, using GNU `parallel`, is probably safer in that respect?

ancient-vegetable-10556

01/05/2023, 8:29 PM

Certainly.

famous-river-94971

01/05/2023, 8:30 PM

Alright, cool. I think I'll go that route for now. @ancient-vegetable-10556 and @bitter-ability-32190, thank you for taking the time to help me out! I hope that, in exchange, this use-case is useful for consideration (it sounds like maybe you're thinking about this anyway).

ancient-vegetable-10556

01/05/2023, 8:31 PM

I have considered making it possible to mark `experimental_shell_command`s as non-cacheable, but that is for a later date!

👍 1

famous-river-94971

01/05/2023, 8:32 PM

FWIW, I looked at `bazel` before `pants` and `pants` was so much more understandable for me!

🙌 3

famous-river-94971

01/05/2023, 8:32 PM

I got about halfway through a `bazel` tutorial video and I was like...

https://media.giphy.com/media/wYyTHMm50f4Dm/giphy.gif

😄 1

bitter-ability-32190

01/05/2023, 8:33 PM

I migrated our monorepo and haven't looked back lol

🙌 1

busy-vase-39202

01/05/2023, 10:17 PM

@famous-river-94971 glad it's working out for you! Would it be okay to quote you, with or without attribution, on Twitter?

famous-river-94971

01/05/2023, 10:31 PM

Hey Carina - sure, feel free! I don't want to ruffle any feathers with Bazel lovers, so just use the quote, no attribution necessary.

famous-river-94971

01/05/2023, 10:31 PM

I'm happy to tell you all how I really feel in our Pants slack safe-space, but not trying to upset folks RE: bazel. Every tool has its place and purpose 🙂

busy-vase-39202

01/05/2023, 10:49 PM

Sure!

busy-vase-39202

01/05/2023, 10:50 PM

We very much agree, by the way. We're rooting for people to have whatever tool best fits their use case.

bitter-ability-32190

01/07/2023, 12:37 PM

There are definitely situations where bazel is a better choice for build+test. And even in those cases pants can still help out with fmt+lint+check. While we were migrating, that's the boat we were in.
