< red balloon 89377> < average vr 56795> interested in your Pants #development

<@UBV8BCQPJ>, <@U6YPB4SJX>: interested in your inp...

witty-crayon-22786

10/02/2019, 9:49 PM

@red-balloon-89377, @average-vr-56795: interested in your input on https://docs.google.com/document/d/1Hn73YlhTPROlULTMa_3A-Fdv7hAPiHFII8rvXri5l7E/edit#bookmark=id.swk5himqweo4 if you have time to give it tomorrow

witty-crayon-22786

10/02/2019, 10:32 PM

@hundreds-breakfast-49010: i'll review 30mins to an hour, sorry ... meetings.

witty-crayon-22786

10/02/2019, 10:33 PM

including enough context there for them or for @early-needle-54791 to come up to speed on it would be helpful though

hundreds-breakfast-49010

10/02/2019, 10:33 PM

no worries 🙂

hundreds-breakfast-49010

10/02/2019, 10:33 PM

yeah I'll add some clarifying context

witty-crayon-22786

10/02/2019, 11:44 PM

i'll look tonight.

witty-crayon-22786

10/02/2019, 11:44 PM

you should have edit access now, so can reload and switch to Edit mode if you want

witty-crayon-22786

10/03/2019, 3:45 AM

i added a biiiig comment expanding on the "environment variable" approach to portability... i think it might work, but let me know what you think.

hundreds-breakfast-49010

10/03/2019, 9:38 PM

I'm looking at everyone's comments now

hundreds-breakfast-49010

10/03/2019, 9:39 PM

@witty-crayon-22786 I think your idea about a new "exclusivity" param to

ExecuteProcessRequest

is reasonable

hundreds-breakfast-49010

10/03/2019, 9:40 PM

although I'm not sure we gain anything out of allowing a process to run locally but in a tmpdir, as opposed to running locally in the workspace

hundreds-breakfast-49010

10/03/2019, 9:41 PM

as far as running user binaries in the workspace - would this involve materializing the pex to

dist

, and then running it from there, just as if we had run

python binary

and then invoked

$ ./dist/<binary_name>

hundreds-breakfast-49010

10/03/2019, 9:46 PM

it looks like what the current pants run is doing is calling

.run()

on a pex: https://github.com/pantsbuild/pants/blob/77aaa1e4afb1b1ed0ae08aa9e87a9718cd5028ea/src/python/pants/backend/python/tasks/python_run.py#L46

witty-crayon-22786

10/03/2019, 9:48 PM

although I'm not sure we gain anything out of allowing a process to run locally but in a tmpdir, as opposed to running locally in the workspace

almost all things you will execute will have "n or more" files that they need somewhere.

witty-crayon-22786

10/03/2019, 9:48 PM

pexes are self contained... others won't necessarily be. JVM bundles, for example.

witty-crayon-22786

10/03/2019, 9:49 PM

as far as running user binaries in the workspace - would this involve materializing the pex to
dist
, and then running it from there, just as if we had run
python binary
and then invoked
$ ./dist/<binary_name>
?

um, for some designs, yea. but not for the "exclusivity" approach.

witty-crayon-22786

10/03/2019, 9:49 PM

the idea behind the tmp dir is that you don't actually put the process inputs from the Snapshot in the workspace.

hundreds-breakfast-49010

10/03/2019, 9:59 PM

right now the pex we make lives inside the

digest

on a

Pex

object, that's the change I got merged at the beginning of the week

hundreds-breakfast-49010

10/03/2019, 9:59 PM

so no matter what, if we want to run that pex we have to materialize that

digest

somewhere

hundreds-breakfast-49010

10/03/2019, 10:00 PM

the current code for ExecuteProcessRequest is underlyingly materializing the contents of a digest into a tmpdir, and then running the arguments listed in the

argv

parameter, and this is managed by the v2 engine in ways I haven't dug into deeply

hundreds-breakfast-49010

10/03/2019, 10:01 PM

so if we want to have the semantics that

pants run

runs a file in the workspace, we need to either materialize the pex there (or in general whatever the executable unit is for any language), or we need to materialize it into a tmpdir and then change the working directory, right?

hundreds-breakfast-49010

10/03/2019, 10:02 PM

python

subprocess

gives us the tools to do this, so if we wanted to we could bypass the engine

hundreds-breakfast-49010

10/03/2019, 10:03 PM

@red-balloon-89377’s concern was that this will block and it will be awkward to pass control of the tty from rust to python

hundreds-breakfast-49010

10/03/2019, 10:03 PM

but I think those are solveable problems, and blocking is desirable in the case of a user program

hundreds-breakfast-49010

10/03/2019, 10:04 PM

ah, your suggestion was that we materialize the build artifacts to a tmpdir, but then give the full path to that tmpdir in the argv, so that the process's current directory is still the workspace

hundreds-breakfast-49010

10/03/2019, 10:05 PM

python subprocess definitely lets us set the cwd arbitrarily: https://docs.python.org/3.6/library/subprocess.html?highlight=subprocess#module-subprocess

witty-crayon-22786

10/03/2019, 10:05 PM

https://pantsbuild.slack.com/archives/C0D7TNJHL/p1570140110401700?thread_ts=1570052945.363400&cid=C0D7TNJHL no. that was the point of my $HERE suggestion.

witty-crayon-22786

10/03/2019, 10:05 PM

right

witty-crayon-22786

10/03/2019, 10:06 PM

i would say that "the ability to grab the console" is less important that "the ability to run in the buildroot" right now

hundreds-breakfast-49010

10/03/2019, 10:06 PM

it sounds like, if we decided to run this via the v2 engine with the ExecuteProcessRequest abstraction, we'd have to implement whatever python subprocess does to change the current working directory ourselves

witty-crayon-22786

10/03/2019, 10:07 PM

~all process execution APIs let you specify the cwd

witty-crayon-22786

10/03/2019, 10:08 PM

so, in effect, adding support for the "$HERE" approach to the v2 engine would mean 1) defining/using variable in a few places, 2) setting the cwd to something other than the tmpdir when you invoke the process, 3) adding the option to EPR

hundreds-breakfast-49010

10/03/2019, 10:09 PM

yeah that makes sense to me

hundreds-breakfast-49010

10/03/2019, 10:09 PM

do you know offhand where in the v2 engine the actual process invocation happens?

hundreds-breakfast-49010

10/03/2019, 10:09 PM

I assume this uses some rust std lib subprocess API that in principle lets us set the cwd, and we're just not doing that yet

witty-crayon-22786

10/03/2019, 10:09 PM

... and sorry, i missed the second half of:

so if we want to have the semantics that
pants run
runs a file in the workspace, we need to either materialize the pex there (or in general whatever the executable unit is for any language), or we need to materialize it into a tmpdir and then change the working directory, right?

that's correct.

witty-crayon-22786

10/03/2019, 10:10 PM

intentionally, yes. this part of what we mean by "hermetic". it's supposed to be challenging by default to depend on things that you haven't declared deps on.

witty-crayon-22786

10/03/2019, 10:11 PM

src/rust/engine/process_execution/src/local.rs

hundreds-breakfast-49010

10/03/2019, 10:12 PM

sure, and that's maybe another argument for using a type other than

ExecuteProcessRequest

for this

hundreds-breakfast-49010

10/03/2019, 10:12 PM

because just adding a

cwd

option to EPR would make this available to any user of EPR, in principle

hundreds-breakfast-49010

10/03/2019, 10:15 PM

the other hting I"m not sure about is how to implement: "Cause any concurrent @rule execution to be serialized - or synchronize on exclusive Console access (including anything using Logger)."

hundreds-breakfast-49010

10/03/2019, 10:15 PM

if we just have a

yield Get(ExecuteProcessResult, ExecuteProcessRequest(exclusivity=CONSOLE_AND_WORKSPACE))

hundreds-breakfast-49010

10/03/2019, 10:16 PM

(or how to avoid caching this)

witty-crayon-22786

10/03/2019, 10:21 PM

witty-crayon-22786

10/03/2019, 10:21 PM

so the suggestion wasn't to add

cwd

. it was to add an option like

exclusivity

witty-crayon-22786

10/03/2019, 10:22 PM

about caching: ... yes. that does actually related to John's point on the Workspace ticket.

hundreds-breakfast-49010

10/03/2019, 10:22 PM

so if exclusivity is the hook that affects the cwd in

<http://local.rs|local.rs>

, that's probably not a problem then, since it doesn't give rule-writers the ability to change the cwd arbitrarily

witty-crayon-22786

10/03/2019, 10:22 PM

right

hundreds-breakfast-49010

10/03/2019, 10:23 PM

the comment about the 'cacheable' bit?

witty-crayon-22786

10/03/2019, 10:24 PM

i think that you could punt on touching caching in the first PR?

witty-crayon-22786

10/03/2019, 10:24 PM

yea.

hundreds-breakfast-49010

10/03/2019, 10:24 PM

yeah in the workspace PR I punted on touching caching

hundreds-breakfast-49010

10/03/2019, 10:26 PM

in general, if we have a type like

Workspace

that is only available in a

@console_rule

, then we can guarantee that any functionality that type provides for you isn't cached

hundreds-breakfast-49010

10/03/2019, 10:26 PM

(I know that we don't acutally enforce this yet)

hundreds-breakfast-49010

10/03/2019, 10:27 PM

it makes sense to me that if we have some functionality we don't want to ever cache - and running a user's code is this sort of functionality - then we can always implement it as a method called on a type that's only available in

@console_rule

hundreds-breakfast-49010

10/03/2019, 10:28 PM

and that would imply abandoning the modified-EPR strategy and going with the

LocalRunner

strategy I suggested in the doc

hundreds-breakfast-49010

10/03/2019, 10:29 PM

this is maybe a broader conversation about exactly what guarantees we do and do not want rules to be able to encode for us

witty-crayon-22786

10/03/2019, 10:32 PM

it might. but i'm hesitant about giving up the ability to effectively "toggle" running something in the foreground. i think that's super powerful.

witty-crayon-22786

10/03/2019, 10:32 PM

for the debug case.

witty-crayon-22786

10/03/2019, 10:33 PM

for example: how would you refactor our currently

pytest

rules to allow for debugging in a repl?

witty-crayon-22786

10/03/2019, 10:33 PM

with foreground/exclusivity: bam.

witty-crayon-22786

10/03/2019, 10:34 PM

but yea. all conversations about caching and sideeffects should be very considered, heh 😃

hundreds-breakfast-49010

10/03/2019, 10:34 PM

I'm also under a bit of deadline pressure for the v2 version of this rule 🙂

hundreds-breakfast-49010

10/03/2019, 10:35 PM

if we went with foreground/exclusivity on EPR, then we'd probably need to add the rule graph check @enough-analyst-54434 was talking bout

hundreds-breakfast-49010

10/03/2019, 10:35 PM

I dunno how hard that is to implement

witty-crayon-22786

10/03/2019, 10:40 PM

understood. as mentioned above, you could punt on considering caching in the first iteration.

hundreds-breakfast-49010

10/03/2019, 10:41 PM

legit, I'll probably punt and make an issue to come back to that question hten

hundreds-breakfast-49010

10/03/2019, 11:20 PM

@happy-kitchen-89482 brought up the idea that maybe the right behavior is for pants to just exit on run, which I think we could do with

os.execvp

hundreds-breakfast-49010

10/03/2019, 11:20 PM

although there are maybe complications around Console-handing if we go that route

witty-crayon-22786

10/03/2019, 11:22 PM

@happy-kitchen-89482: See the discussion around

foreground

here and in the doc

witty-crayon-22786

10/03/2019, 11:23 PM

For a few different kinds of processes (mostly tests), you want to be able to toggle running in the foreground

Open in Slack

Previous Next