Why does the `AdhocProcessRequest` construction specify the Pants #development

Why does the `AdhocProcessRequest` construction sp...

bitter-ability-32190

05/16/2023, 4:50 PM

Why does the

AdhocProcessRequest

construction specify the

shell_command.address.spec

after the command? https://github.com/pantsbuild/pants/blob/d12d7e2026225159219b21aba159bd0b6c3f8d39/src/python/pants/backend/shell/util_rules/shell_command.py#L147 So it looks like:

Copy code

run_shell_command(
    name = "foo",
    command=f"echo hi",
    workdir="/",
)

turns into:

Copy code

/usr/bin/bash -c $'echo hi' $'//:foo'

bitter-ability-32190

05/16/2023, 4:52 PM

CC @ancient-vegetable-10556

ancient-vegetable-10556

05/16/2023, 4:52 PM

@bitter-ability-32190 because

bash -c 'blah'

doesn’t inject a

$0

bitter-ability-32190

05/16/2023, 4:53 PM

So it isn't actually passing the arg to the command?

ancient-vegetable-10556

05/16/2023, 4:53 PM

so the first param needs to be

$0

, so that you can use other positional parameters.

$0

is also what bash shows on errors, so it makes the error messages friendlier

ancient-vegetable-10556

05/16/2023, 4:53 PM

The command does receive it as

$0

ancient-vegetable-10556

05/16/2023, 4:55 PM

the alternative would be to write out the

command

into a shell script and executing that, but that would be similarly obtuse (and less efficient)

bitter-ability-32190

05/16/2023, 4:55 PM

(I'm running

jq

in a sandbox and trying to figure out how to make these not reference a dead path:

Copy code

jq - commandline JSON processor [version 1.6]

Usage:  /tmp/pants-sandbox-zLIAv5/3rdparty/tools/jq-linux64 [options] <jq filter> [file...]
        /tmp/pants-sandbox-zLIAv5/3rdparty/tools/jq-linux64 [options] --args <jq filter> [strings...]
        /tmp/pants-sandbox-zLIAv5/3rdparty/tools/jq-linux64 [options] --jsonargs <jq filter> [JSON_TEXTS...]

jq is a tool for processing JSON inputs, applying the given filter to
its JSON text inputs and producing the filter's results as JSON on
standard output.

The simplest filter is ., which copies jq's input to its output
unmodified (except for formatting, but note that IEEE754 is used
for number representation internally, with all that that implies).

For more advanced filters see the jq(1) manpage ("man jq")
and/or <https://stedolan.github.io/jq>

Example:

        $ echo '{"foo": 0}' | jq .
        {
                "foo": 0
        }

For a listing of options, use /tmp/pants-sandbox-zLIAv5/3rdparty/tools/jq-linux64 --help.

bitter-ability-32190

05/16/2023, 4:56 PM

Also, assuming

$0

worked here, should we switch it to putting

f"{bin_name() run {spec}"

ancient-vegetable-10556

05/16/2023, 4:56 PM

how are you invoking

jq

, there’s not a world in which this should matter

ancient-vegetable-10556

05/16/2023, 4:57 PM

(yes,

{bin_name()} run {spec}

could be an improvement

👍 1

bitter-ability-32190

05/16/2023, 4:58 PM

(OK filed that issue and self-assigned. back on topic)

ancient-vegetable-10556

05/16/2023, 4:58 PM

so there’s likely something confusing going on with passing args into

run_shell_command

that we haven’t explored fully yet. This is generally why I prefer

adhoc_tool

and would prefer to make these runnable

bitter-ability-32190

05/16/2023, 4:59 PM

Remind me the difference?

bitter-ability-32190

05/16/2023, 4:59 PM

Oh CWD is the difference

ancient-vegetable-10556

05/16/2023, 4:59 PM

adhoc_tool

directly invokes processes, which means you don’t have to compensate for bash being obscene

bitter-ability-32190

05/16/2023, 5:00 PM

Is it runnable? Let me see

ancient-vegetable-10556

05/16/2023, 5:00 PM

Not presently

bitter-ability-32190

05/16/2023, 5:00 PM

Ah yeah so that's the razor

bitter-ability-32190

05/16/2023, 5:00 PM

So, my invocation... Here's the sandbox and command:

Copy code

josh@cephandrius:/tmp/pants-sandbox-zLIAv5$ tree .
.
├── 3rdparty
│   └── tools
│       └── jq-linux64
└── __run.sh

2 directories, 2 files

bitter-ability-32190

05/16/2023, 5:01 PM

Copy code

#!/bin/bash
# This command line should execute the same process as pants did internally.
export <a bunch of env vars>
cd /home/josh/work/<repo>
/usr/bin/bash -c $'/tmp/pants-sandbox-zLIAv5/3rdparty/tools/jq-linux64' $'3rdparty/tools:jq' --help

bitter-ability-32190

05/16/2023, 5:01 PM

(and FWIW looks like

jq

is using

argv[0]

https://github.com/stedolan/jq/blob/cff5336ec71b6fee396a95bb0e4bea365e0cd1e8/src/main.c#L292)

bitter-ability-32190

05/16/2023, 5:06 PM

I'm guessing this works for

$0

ina shell script, but not other programs?

Copy code

$ /usr/bin/bash -c $'echo $0' $'3rdparty/tools:jq'
3rdparty/tools:jq

ancient-vegetable-10556

05/16/2023, 5:07 PM

I’ll look here shortly

ancient-vegetable-10556

05/16/2023, 5:19 PM

So the short of it is that

jq

shouldn’t be receiving

argv[0]

from

bash

. Bash should only be using the

argv[0]

for its internal purposes

ancient-vegetable-10556

05/16/2023, 5:19 PM

are you passing

$@

into

jq

or something like that?

bitter-ability-32190

05/16/2023, 5:20 PM

You have the

__run.sh

and the tree

bitter-ability-32190

05/16/2023, 5:20 PM

I'm laid bare

ancient-vegetable-10556

05/16/2023, 5:20 PM

reads

ancient-vegetable-10556

05/16/2023, 5:21 PM

OK, I see the

__run.sh

, but in this case, I’ll actually need to see the target definition from your

BUILD

file, and possibly also your pants invocation

bitter-ability-32190

05/16/2023, 5:22 PM

Macros 🙈

ancient-vegetable-10556

05/16/2023, 5:22 PM

because the input from the target definition and the pants invocation are what determine the contents of the

__run.sh

file

ancient-vegetable-10556

05/16/2023, 5:22 PM

There’s no reason why $0 should be in

__run.sh

unless you pass it in

bitter-ability-32190

05/16/2023, 5:23 PM

Why not use

exec -a "string" <command>

According to

exec --help

Copy code

Options:
      -a name   pass NAME as the zeroth argument to COMMAND
      -c        execute COMMAND with an empty environment
      -l        place a dash in the zeroth argument to COMMAND

ancient-vegetable-10556

05/16/2023, 5:23 PM

bash -c "exec -a \"shell name\" shlexed_command"

bitter-ability-32190

05/16/2023, 5:23 PM

I think so? I'm gonna bang on it

ancient-vegetable-10556

05/16/2023, 5:24 PM

if you can somehow figure out what the value of

command

is and spit it out to the terminal, that will really help

bitter-ability-32190

05/16/2023, 5:25 PM

Copy code

file(
        name=f"downloaded-{name}",
        source = http_source(
            len = len,
            sha256 = sha256,
            url = url,
        ),
    )
    filename = url.rsplit("/", 1)[-1]
    command  = f"chmod +x {filename}"
    if filename.endswith(".tar.gz"):
        command = f"tar -xf {filename} &&" + command
    shell_command(
        name = f"sandboxed-{name}",
        command=command,
        tools = ["tar", "chmod"],
        execution_dependencies=[f":downloaded-{name}"],
        output_directories = ["."],
        #root_output_directory = f"unpacked-{name}",
    )
    run_shell_command(
        name = name,
        command=f"{{chroot}}/{build_file_dir() / exe}",
        execution_dependencies=[f":sandboxed-{name}"],
        workdir="/",
    )

bitter-ability-32190

05/16/2023, 5:25 PM

Let me try and

peek

and see if that dumps it easily

bitter-ability-32190

05/16/2023, 5:26 PM

Copy code

{
  "address": "3rdparty/tools:jq",
  "target_type": "run_shell_command",
  "command": "{chroot}/3rdparty/tools/jq-linux64",
  "dependencies": [
    "3rdparty/tools:sandboxed-jq"
  ],
  "description": null,
  "execution_dependencies": [
    ":sandboxed-jq"
  ],
  "runnable_dependencies": null,
  "tags": null,
  "workdir": "/"
}

ancient-vegetable-10556

05/16/2023, 5:26 PM

no args. So you’re just invoking pants with

--

bitter-ability-32190

05/16/2023, 5:26 PM

With

Copy code

{
  "address": "3rdparty/tools:downloaded-jq",
  "target_type": "file",
  "dependencies": [],
  "dependencies_raw": null,
  "description": null,
  "source_raw": {
    "url": "<https://github.com/stedolan/jq/releases/download/jq-1.6/jq-linux64>",
    "len": 3953824,
    "sha256": "af986793a515d500ab2d35f8d2aecd656e764504b789b66d7e1a0b727a124c44",
    "filename": "jq-linux64"
  },
  "sources": [],
  "sources_fingerprint": "e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855",
  "tags": null
}

and

Copy code

{
  "address": "3rdparty/tools:sandboxed-jq",
  "target_type": "shell_command",
  "_sources_raw": null,
  "command": "chmod +x jq-linux64",
  "dependencies": [
    "3rdparty/tools:downloaded-jq"
  ],
  "description": null,
  "environment": "__local__",
  "execution_dependencies": [
    ":downloaded-jq"
  ],
  "extra_env_vars": null,
  "log_output": false,
  "output_dependencies_raw": null,
  "output_directories": [
    "."
  ],
  "output_files": [],
  "outputs": null,
  "root_output_directory": "/",
  "runnable_dependencies": null,
  "sources": [],
  "sources_fingerprint": "e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855",
  "tags": null,
  "timeout": 30,
  "tools": [
    "tar",
    "chmod"
  ],
  "workdir": "."
}

bitter-ability-32190

05/16/2023, 5:27 PM

Ah the pants command is

pants run <thing> -- --help

. Sorry there a small bit of messaging of the files before I posted here

bitter-ability-32190

05/16/2023, 5:27 PM

Let me get you a working

BUILD

file, one moment

bitter-ability-32190

05/16/2023, 5:33 PM

Copy code

file(
    name="downloaded-jq",
    source = http_source(
        len = 3953824,
        sha256 = "af986793a515d500ab2d35f8d2aecd656e764504b789b66d7e1a0b727a124c44",
        url = "<https://github.com/stedolan/jq/releases/download/jq-1.6/jq-linux64>",
    ),
)
shell_command(
    name = "sandboxed-jq",
    command="chmod +x jq-linux64",
    tools = ["chmod"],
    execution_dependencies=[":downloaded-jq"],
    output_directories = ["."],
)
run_shell_command(
    name = "jq",
    command=f"{{chroot}}/{build_file_dir() / 'jq-linux64'}",
    execution_dependencies=[":sandboxed-jq"],
    workdir="/",
)

bitter-ability-32190

05/16/2023, 5:38 PM

This did the right thing, although I suspect the escaping is going to be an issue:

Copy code

/usr/bin/bash -c "exec -a $'//:jq' /tmp/pants-sandbox-AqjSjG/jq-linux64 --help"

bitter-ability-32190

05/16/2023, 5:38 PM

Copy code

Usage:  //:jq [options] <jq filter> [file...]
        //:jq [options] --args <jq filter> [strings...]
        //:jq [options] --jsonargs <jq filter> [JSON_TEXTS...]

ancient-vegetable-10556

05/16/2023, 5:39 PM

I will take some time to figure this out. I think this is something that should work like being able to

run

system_binary

target (which I believe you can do).

bitter-ability-32190

05/16/2023, 5:39 PM

Oh hey, TIL

ancient-vegetable-10556

05/16/2023, 5:39 PM

The indirection of

bash

is always going to mean there’s some fuckery going on that’s impossible to inspect (this is why I prefer people to not use

shell_command

etc)

ancient-vegetable-10556

05/16/2023, 5:39 PM

The thing about

system_binary

is that it has to be on your system

ancient-vegetable-10556

05/16/2023, 5:40 PM

but you could probably make a target that looks very very similar, and makes a

file

dependency executable

ancient-vegetable-10556

05/16/2023, 5:40 PM

run_system_binary

is very easy to understand, and I’d welcome a pull request 🙂

ancient-vegetable-10556

05/16/2023, 5:40 PM

adhoc_tool

exists entirely to make it easier to reason about how a process is getting executed

bitter-ability-32190

05/16/2023, 5:43 PM

system_binary

has no

dependencies

field 😭

bitter-ability-32190

05/16/2023, 5:43 PM

(makes sense, but damn)

ancient-vegetable-10556

05/16/2023, 5:45 PM

I don’t think you should extend

system_binary

, but I think some concept of a

file_as_binary

would make sense; in case you have an arbitrary binary in your codebase, or need to download it

bitter-ability-32190

05/16/2023, 5:45 PM

I had that in a PR once 😅

ancient-vegetable-10556

05/16/2023, 5:45 PM

there’s a subsystem now, where that’s a coherent idea 🙂

bitter-ability-32190

05/16/2023, 5:46 PM

https://github.com/pantsbuild/pants/pull/17277

ancient-vegetable-10556

05/16/2023, 5:47 PM

I started working on

adhoc_tool

the following month 🙃

bitter-ability-32190

05/16/2023, 5:47 PM

Ideally we bugfix

2.16.x

though.

ancient-vegetable-10556

05/16/2023, 5:48 PM

Sure. Just pointing out that everything that messes around with

bash

is going to continue having bugs, and directing people away from indirection through bash is the way of the future 😉

ancient-vegetable-10556

05/16/2023, 5:48 PM

do not want buggy software before that happens though 🙂

bitter-ability-32190

05/16/2023, 5:48 PM

I'm all for: • Less intermediate targets (see my 3 tagrets) • Less bash

ancient-vegetable-10556

05/16/2023, 5:49 PM

Once I’m out of meetings I should be able to come up with an

exec

-based alternative that fixes your issue

ancient-vegetable-10556

05/16/2023, 5:49 PM

unless you get there first

bitter-ability-32190

05/16/2023, 5:49 PM

🙂

ancient-vegetable-10556

05/16/2023, 5:50 PM

but the requirement is that for

command="blah"

blah

is executed inside a bash shell

bitter-ability-32190

05/16/2023, 5:50 PM

Carry that thought over into the tests executed in PR, because otherwise there's no way to be sure we got it all 🙂

ancient-vegetable-10556

05/16/2023, 5:51 PM

well sure

bitter-ability-32190

05/16/2023, 6:15 PM

The escaping is horrific 😱

bitter-ability-32190

05/16/2023, 8:23 PM

Oh, the answer is staring us in the face...

bitter-ability-32190

05/16/2023, 8:26 PM

https://github.com/pantsbuild/pants/pull/19019

ancient-vegetable-10556

05/16/2023, 8:28 PM

Nice. That’ll do.

bitter-ability-32190

05/16/2023, 8:28 PM

Yeah, so obsessed with Pants running

exec -a

I realized I should be running it 🤦‍♂️

bitter-ability-32190

05/16/2023, 8:30 PM

OK here's the other one:https://github.com/pantsbuild/pants/pull/19020

Open in Slack

Previous Next