OK I m going to rephrase a question I asked earlier I want p Pants #general

OK. I'm going to rephrase a question I asked earli...

proud-dentist-22844

05/13/2021, 3:06 PM

OK. I'm going to rephrase a question I asked earlier. I want pants to be the primary entry point for development, replacing the Makefile & supporting scripts. One responsibility of the Makefile (and a couple of scripts) is setting up a local development environment with services like mongo, rabbitmq, and redis. How can I do setup like that from within pants? Would I need to write a pants plugin? Something that makes sure the services are running when running tests? How would service requirements (mongo, rmq, redis, etc) interact with remote execution (REAPI)? (which is something I'm interested in using eventually)

hundreds-father-404

05/13/2021, 3:13 PM

@happy-kitchen-89482 you've started work on documenting things like this for Django, for example, iirc. Thoughts?

happy-kitchen-89482

05/13/2021, 3:49 PM

Hmm this is always a good question and really depends on specific setup. For example, in Toolchain's internal repo we set up test databases in code: the thing that tests call to get a handle to a db will actually launch postgres if there is no db already running at that port. So that's one way, which has the advantage of being seamless and doesn't involve Pants at all. That assumes you have postgres installed on your machine of course, but that's reasonable.

happy-kitchen-89482

05/13/2021, 3:51 PM

For a custom plugin to work I think we'd need to do a little rewiring to make sure that tests depend on the custom setup, and that they are not treated as cacheable. Generally only special Pants rules can have side effects, and "starting a database" is one hell of a side effect.

🤣 1

➕ 1

hundreds-father-404

05/13/2021, 3:52 PM

hat assumes you have postgres installed on your machine of course

Reasonable for a private org's codebase, possibly not as reasonable for an open source project. But perhaps the experience could be somewhat improved by eagerly erroring when it's not yet installed and giving instructions

happy-kitchen-89482

05/13/2021, 3:52 PM

With remote execution, you'd need those binaries to exist in the image, and there are some more complications because of the side-effecting nature of this.

happy-kitchen-89482

05/13/2021, 3:53 PM

And there needs to be some reasoning about whether any state needs to persist across individual test invocations

happy-kitchen-89482

05/13/2021, 3:54 PM

One other way to go is to have a pool of running services and each remote invocation is assigned one from the pool, with pants ensuring that no two concurrently access the same pooled service.

happy-kitchen-89482

05/13/2021, 3:55 PM

We already do slot management like this for local concurrency, we could extend it to remote

happy-kitchen-89482

05/13/2021, 3:57 PM

Basically each process runs with PANTS_CONCURRENCY_SLOT=<some small integer>

happy-kitchen-89482

05/13/2021, 3:57 PM

See https://www.pantsbuild.org/docs/reference-pytest#section-execution-slot-var

happy-kitchen-89482

05/13/2021, 3:57 PM

and your code can use that to pick a pool service to talk to, no matter where it runs, assuming it can access that service over the network

happy-kitchen-89482

05/13/2021, 3:58 PM

If the services have to be local to the machine the process is executing on, then concurrency isn't an issue but setup is

proud-dentist-22844

05/13/2021, 3:58 PM

I'm reading and processing your comments 🙂 will respond shortly...

happy-kitchen-89482

05/13/2021, 3:58 PM

No worries

proud-dentist-22844

05/13/2021, 4:04 PM

> that assumes you have postgres installed on your machine of course

Reasonable for a private org's codebase, possibly not as reasonable for an open source project. But perhaps the experience could be somewhat improved by eagerly erroring when it's not yet installed and giving instructions

Ooh, yeah. Erroring out with instructions would be a very good experience. Then the instructions could be tailored to the active platform (CentOS, Ubuntu, Gentoo (because that's what I like to use), Mac OS X, ...)

💯 1

proud-dentist-22844

05/13/2021, 4:08 PM

Currently, installing & running these services is very different for CI vs local development. For CI, we don't run the tests until the env is setup. So, I'm really thinking about how to improve the local dev experience without making the CI story more difficult. Adding DBs on the fly in a pytest fixture makes perfect sense, actually starting the services (super crazy side effect) is very sticky because CI vs local differs ...

proud-dentist-22844

05/13/2021, 4:12 PM

So, maybe I need two plugins: • one that injects some service availability assertions as a prerequisite to running tests (or a subset of tests? Not all of them need those services). This would provide instructions on installation and any ./pants <goal> that can be used to help with setup. • one that adds a new goal for setting up the local dev environment - for integration tests especially, we actually need to run stackstorm itself before running the tests, this goal could start that. (Hmm - is that kosher for a goal to start "background" tasks under pantsd that continue to run while running other pants goals?)

proud-dentist-22844

05/13/2021, 4:19 PM

a pool of running services and each remote invocation is assigned one from the pool

We already do slot management like this for local concurrency, we could extend it to remote

PANTS_CONCURRENCY_SLOT=<some small integer>

your code can use that to pick a pool service to talk to, no matter where it runs, assuming it can access that service over the network

So, remote execution in this way would require extra infrastructure to provide those pooled resources. The REAPI service wouldn't provide that itself.

If the services have to be local to the machine the process is executing on, then concurrency isn't an issue but setup is

This is where I imagine a pants plugin might come in? Do the pants plugins get sent over REAPI? Or would there have to be some kind of dependency on a script that would get executed to setup the REAPI environment before running the target (eg) pytest command?

proud-dentist-22844

05/13/2021, 4:22 PM

That service availability assertion would also catch issues in CI where the service container doesn't actually start correctly. Cool. I think that's where I'll need to start.

enough-analyst-54434

05/13/2021, 5:19 PM

Do the pants plugins get sent over REAPI?

The REAPI only gets sent the description of a process to run: {args, env, CWD & an optional input blob that gets materialized as the file system tree in CWD on the remote machine} and a description of the output to capture (blob paths).

enough-analyst-54434

05/13/2021, 5:20 PM

So pants doesn't run on a remote machine, just some process - like ["python", "--version"]

proud-dentist-22844

05/13/2021, 5:39 PM

using services with remote execution doesn't sound very promising. I'll probably use just the caching aspect of REAPI (once I get there) then.

enough-analyst-54434

05/13/2021, 5:54 PM

Yeah, if the service interaction time is << service startup time there is alot of work to do on the API and implementations of it. If that is reversed though, just having the services started up as part of the REAPI image specifed's init (all implementations I know of allow an image to be specified for each execution) should work just fine.

happy-kitchen-89482

05/13/2021, 6:45 PM

I'm not sure if service availability assertions is a Pants plugin or just something your tests call (say in a pytest hook, or implicitly when they try to acquire a handle to the service in question)?

➕ 1

proud-dentist-22844

05/13/2021, 7:02 PM

I guess I'm going for a fail-fast thing to warn as early as possible if someone is trying to test and they haven't got their env setup right. If we wait until pytest is running, then we'll end up with one failure for every file, wouldn't we? Is there any way to pair that down and get pants to bail asap?

👍 1

proud-dentist-22844

05/13/2021, 7:04 PM

(I'm also chatting about the implementation of a plugin in the #C01CQHVDMMW channel - there were several topics in here, so I was trying to pair down the thread's topic)

hundreds-father-404

05/13/2021, 7:07 PM

Ah, great point Jacob. You're right that Pants would not error eagerly if the error came from Pytest. But! I do think we can add a

--test-fail-fast

option, which could be a useful feature in general Only, I'm not sure you'd want that permanently toggled for your repo. I personally really like seeing all my test failures when iterating

proud-dentist-22844

05/13/2021, 7:09 PM

Yes. I like running all the tests and seeing all the failures. Occassionally I'll turn on fail fast when I'm working on a particular set of tests, but I wouldn't want that on by default. If there is an error in a pants plugin, then pants exits quickly, right?

happy-kitchen-89482

05/13/2021, 7:10 PM

Yeah, the fail-fast thing is a good point

happy-kitchen-89482

05/13/2021, 7:10 PM

in a plugin you can get pants to fail eagerly

👍 2

hundreds-father-404

05/13/2021, 7:14 PM

Occassionally I'll turn on fail fast

Using Pytest options w/o Pants, right? Trying to determine if we should add

--test-fail-fast

to Pants. It's pretty trivial to implement (like 10 lines), only that again we're careful to add new features w/o vetting them

proud-dentist-22844

05/13/2021, 7:17 PM

I would reach for passing the options straight to pytest, but I guess that doesn't actually help since pants is running one instance per file. I can see the utility, but maybe sit on that one for awhile longer. 🙂

✔️ 1

enough-analyst-54434

05/13/2021, 7:33 PM

In this case it would be adding back a long-term feature. In Pants v1 you could generically say

./pants --fail-fast ...

for any combination of goals.

➕ 1

2 Views

Open in Slack

Previous Next