re <https github com pantsbuild pants pull 8369 issuecomment Pants #development

Join Slack

re: <https://github.com/pantsbuild/pants/pull/8369...

# development

hundreds-breakfast-49010

10/02/2019, 8:26 PM

re: https://github.com/pantsbuild/pants/pull/8369#issuecomment-537661179

hundreds-breakfast-49010

10/02/2019, 8:26 PM

i think the implication of this comment

hundreds-breakfast-49010

10/02/2019, 8:26 PM

is that these two lines: https://github.com/pantsbuild/pants/pull/8369/commits/aa01f0a95d1d8a4f13d6cd470e951c582f7e4953#diff-ee3f54397b3e17d711851e17e92d07d9R82-R83 in that PR

hundreds-breakfast-49010

10/02/2019, 8:26 PM

are using an incorrect abstraction for running user code (whihc should not be cached and should be able to do anything at all on the system)

hundreds-breakfast-49010

10/02/2019, 8:27 PM

and we don't currently have the correct abstraction built, although it should probably look similar to

ExecuteProcessRequest

ExecuteProcessResult

hundreds-breakfast-49010

10/02/2019, 8:28 PM

so, we need a rule whose input is something like an

ExecuteProcessRequest

(maybe

LocalExecuteProcessRequest

?) that never caches the result (since a user program should be completely nondeterministic from pants' perspective)

witty-crayon-22786

10/02/2019, 8:29 PM

it's less about running "user code" (tests are user code, for example)

witty-crayon-22786

10/02/2019, 8:30 PM

but yes

witty-crayon-22786

10/02/2019, 8:30 PM

witty-crayon-22786

10/02/2019, 8:31 PM

so, the design doc describes one approach here.

witty-crayon-22786

10/02/2019, 8:31 PM

BUT, there is an aspect it didn't consider, which is the "should be able to modify the workspace" aspect

witty-crayon-22786

10/02/2019, 8:32 PM

and if not "modify", then "read from" at least

hundreds-breakfast-49010

10/02/2019, 8:32 PM

by "design doc" do you mean the google doc mentioned in that thread? if so I need to be granted access to it

witty-crayon-22786

10/02/2019, 8:32 PM

and that... makes it a pretty different case from what we were thinking initially.

witty-crayon-22786

10/02/2019, 8:32 PM

@hundreds-breakfast-49010: it's available to pants-devel@

witty-crayon-22786

10/02/2019, 8:32 PM

https://www.pantsbuild.org/community.html

witty-crayon-22786

10/02/2019, 8:34 PM

that doc talks about processes running in the "foreground". but i don't think we'd considered that they would also potentially need access to the workspace

witty-crayon-22786

10/02/2019, 8:34 PM

and that... is interesting. because i'm not exactly sure how to do it, heh.

hundreds-breakfast-49010

10/02/2019, 8:36 PM

still trying to figure out how to give myself access to that doc, but a user process should have access to everything on the system, right?

witty-crayon-22786

10/02/2019, 8:36 PM

@hundreds-breakfast-49010: join the pants-devel@googlegroups.com mailing list

hundreds-breakfast-49010

10/02/2019, 8:36 PM

just as if you had run it from the console with

python3 <whatever>

witty-crayon-22786

10/02/2019, 8:37 PM

ping me when you've taken a look

hundreds-breakfast-49010

10/02/2019, 8:41 PM

ok so that doc suggests adding a new flag on

ExecuteProcessRequest

(rahter than making a new type)

hundreds-breakfast-49010

10/02/2019, 8:41 PM

and that flag would force local execution and grab exclusive access to the console

hundreds-breakfast-49010

10/02/2019, 8:42 PM

which I think implies that any

@rule

that takes as one of its inputs an

ExecuteProcessRequest

with that flag, will act like a

@console_rule

in terms of not running concurrently with any other

@console_rules

witty-crayon-22786

10/02/2019, 8:45 PM

yep

witty-crayon-22786

10/02/2019, 8:45 PM

BUT, the new aspect here that we didn't consider is the filesystem access part

witty-crayon-22786

10/02/2019, 8:47 PM

not all usecases will require that... for example, i think that in a "please allow me to debug `pytest`" case, it's actually important that we're in the sandbox

hundreds-breakfast-49010

10/02/2019, 8:47 PM

yeah, we deliberately don't want to sandbox a python executable the user runs with

pants run

hundreds-breakfast-49010

10/02/2019, 8:48 PM

I don't know how hard that would be to implement, given how the

ExecuteProcessRequest

rule is currently implemented

witty-crayon-22786

10/02/2019, 8:48 PM

i think that something like

./pants repl

could go either way.. in a sandbox could be totally fine.

hundreds-breakfast-49010

10/02/2019, 8:48 PM

yeah

repl

is a different story (maybe)

witty-crayon-22786

10/02/2019, 8:48 PM

@hundreds-breakfast-49010: ...hardish, i think.

hundreds-breakfast-49010

10/02/2019, 8:49 PM

so maybe we don't want to use that abstraction at all

witty-crayon-22786

10/02/2019, 8:49 PM

because the sandbox might contain things like the pex itself

hundreds-breakfast-49010

10/02/2019, 8:49 PM

and instead actually shell out to run the pex, or call the native pex

run

method

hundreds-breakfast-49010

10/02/2019, 8:49 PM

which is what the v1

run

is doing I believe

witty-crayon-22786

10/02/2019, 8:50 PM

so a strawman approach might be extract all inputs into a sandbox, but invoke the paths as absolute

witty-crayon-22786

10/02/2019, 8:50 PM

(possible, but a bit awkward, because paths are currently relative** to the sandbox)

witty-crayon-22786

10/02/2019, 8:51 PM

err, fixed the above.

witty-crayon-22786

10/02/2019, 8:52 PM

could potentially require env vars that the process executor could set...? like

$HERE/my-pex

hundreds-breakfast-49010

10/02/2019, 8:52 PM

I made this `@rule`: https://github.com/pantsbuild/pants/pull/8369/commits/aa01f0a95d1d8a4f13d6cd470e951c582f7e4953#diff-ee879be9f22a6db58ca6f4de315b8de3R119-R136 in that PR

witty-crayon-22786

10/02/2019, 8:52 PM

yep. and that will run in the sandbox, without access to the workspace.

hundreds-breakfast-49010

10/02/2019, 8:52 PM

what if instead of yielding an

ExecuteProcessRequest

it took the

argv

string and digest that it puts into the

ExectuteProcessRequest

object, and just ran it with python subprocess?

hundreds-breakfast-49010

10/02/2019, 8:53 PM

(and then returned some other type than

ExecuteProcessRequest

, and was also marked as a

@console_rule

witty-crayon-22786

10/02/2019, 8:54 PM

@console_rule

is only at the top of the graph

witty-crayon-22786

10/02/2019, 8:55 PM

doing something similar to

Workspace

and running synchronously in the foreground would be an option. but the downside of that is that it doesn't address the "debug a

pytest

in the foreground" case i don't think.

hundreds-breakfast-49010

10/02/2019, 8:55 PM

okay, then that rule would return

ExecuteProcessRequest

with a local flag, and then the

@console_rule\ndef run()

rule that the PR also defines could be in charge of running it in the foreground, as a python subprocess

witty-crayon-22786

10/02/2019, 8:56 PM

if it was "definitely always" going to run in the foreground, i don't think you'd do

ExecuteProcessRequest

... something else that is

run

specific, maybe

hundreds-breakfast-49010

10/02/2019, 8:56 PM

so maybe a new type called

Runner

(that only

@console_rule

would be able to access, eventually) - that takes in an

ExecuteProcessRequest

+ local flag, and runs it

hundreds-breakfast-49010

10/02/2019, 8:56 PM

or equivalently a new type that isn't

ExecuteProcessRequest

witty-crayon-22786

10/02/2019, 8:57 PM

hm, possibly

witty-crayon-22786

10/02/2019, 8:57 PM

one approach would basically be to use

Workspace

to materialize something into the

dist

directory, and then expose a simpler API for running it

witty-crayon-22786

10/02/2019, 8:58 PM

@hundreds-breakfast-49010: i've gotta run, but this feels like it would be worth creating a 1 design doc for.

witty-crayon-22786

10/02/2019, 8:58 PM

back in a bit

hundreds-breakfast-49010

10/02/2019, 8:58 PM

ah so maybe

Workspace

itself could have a method on it that does this running

hundreds-breakfast-49010

10/02/2019, 8:58 PM

witty-crayon-22786

10/02/2019, 8:59 PM

yea, possibly. but we'd want to think about the potential connections to the "debug something in the foreground" case. maybe there isn't any.

witty-crayon-22786

10/02/2019, 8:59 PM

but should run through the

run/repl/binary/test

cases in the doc... it was a big oversight to forget about

run

like that, sorrry

hundreds-breakfast-49010

10/02/2019, 9:38 PM

I wrote up some thoughts at the end of the design doc: https://docs.google.com/document/d/1Hn73YlhTPROlULTMa_3A-Fdv7hAPiHFII8rvXri5l7E/edit#

hundreds-breakfast-49010

10/02/2019, 9:38 PM

I'm not 100% sure what the debugging

pytest

case that you're talking about actually would entail

witty-crayon-22786

10/02/2019, 9:46 PM

@hundreds-breakfast-49010: adding a

pdb.set_breakpoint()

or whatever, and then using the repl that that launches

witty-crayon-22786

10/02/2019, 9:46 PM

in a java test debugging case, it would be "the thing opens up a socket and waits for me", so that socket would need to be local

witty-crayon-22786

10/02/2019, 9:48 PM

@hundreds-breakfast-49010: made you an editor.

hundreds-breakfast-49010

10/02/2019, 9:59 PM

@witty-crayon-22786 if you run an arbitrary process using python

subprocess

, and that process spawns a repl, that repl will be available from the python process that called

subprocess.run

hundreds-breakfast-49010

10/02/2019, 10:00 PM

I just confirmed that this works on my system with a quick and dirty python script that invokes

pdb.set_trace

, and also with

ghci

, which spawns its own repl

hundreds-breakfast-49010

10/02/2019, 10:01 PM

anyway does the strategy I wrote up in that doc seem reasonable?

Open in Slack

Previous Next