# general
g
Is there a way to limit the number of tests that kick off in parallel but doesn't limit other things like parallelization of building pex files? I used --process-execution-local-parallelism thinking it would limit just tests, but it's limiting everything
w
Take a search through slack history for stuff like "serialize" "concurrent" and pytest - I know variants of this question have been covered for years, but I don't recall the specifics as each question had its own spin https://pantsbuild.slack.com/archives/C046T6T9U/p1658910657563539
g
Yeah I'm already using that trick.
My issue is that the resources required for the specific Spark test fixture are quite large. We have 32-core machines, but want to prevent that particular fixture from being spun up more than 8x concurrently on a single host.
I can achieve that by setting --process-execution-local-parallelism=8, but then Pants as a whole never uses more than 8 cores. (The processes Pants spins up can each use more, but Pants itself gets limited.)
w
There might be a setting for testing specifically, or pytest even more specifically using some of those pytest-xdist, or execution slots, or whatever, but does it work well enough if you split into two steps?
pants mygoal :: && pants test --process-execution...
g
Well, the thing in particular that I don't want to throttle is the creation of all of the pexes for the test execution runners.
Can I create those without running tests, and then run the tests?
w
Are your tests run on pexes?
g
Yes, it's just using the out-of-the-box experience for pants test, which seems to build a dedicated pex for each test run.
w
I guess what I was asking is: are you running tests on the output of your pex_binary?
g
No, I'm letting pants handle it end-to-end.
I just call pants test ::
I'm just using native functionality with python_sources/python_tests targets
w
Yep, okay, cool, that's what I thought - I just got confused there for a sec
👍 1
g
That is exactly what I'm using, but the issue is that there's no limit on how many slots Pants hands out, other than limiting everything via --process-execution-local-parallelism.
So Pants spits out 32 slots (I don't know the literal number, just using it as an example). Now I could technically say: if you're slot 1-8, go forward, but if you're greater than 8, wait. But that feels wrong.
I'd even be willing to exit non-zero from the fixture (when the slot is above 8) and then just depend on the Pants cache for test results, but shoot, that starts to feel hacky as crap and not super supportable.
w
Someone else might be able to speak better about the process granularity you're going for. Because you want to use all 32 cores for the setup of the test goal, but then jump that down to 8 for the running of the tests themselves? To me, that feels a bit custom or run-timey
g
exactly what I want.
Well, to be clear, I think I was being overly prescriptive 🤣 -- I don't actually care about the details; my goal is to limit the number of concurrent Spark fixtures to 8. How I achieve that, I don't particularly care. I'm an open book.
Isn't there a way in pants to say, you're node 1/3, 2/3, 3/3 so to divide tests amongst three hosts?
w
Sharding?
g
yes
w
Also worth checking out the CI, as there is some craziness there too https://github.com/pantsbuild/pants/blob/main/.github/workflows/test.yaml
g
coo.
w
For example, you can run three shards with --shard=0/3, --shard=1/3, --shard=2/3.
💯 1
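Sharding splits the test set deterministically across invocations. A minimal sketch of what stable-hash partitioning looks like (illustrative only; Pants' real assignment scheme may differ):

```python
import hashlib

def shard_of(test_address: str, num_shards: int) -> int:
    """Map a test address to a shard via a stable hash.
    Illustrative sketch -- not Pants' actual partitioning scheme."""
    digest = hashlib.sha256(test_address.encode()).hexdigest()
    return int(digest, 16) % num_shards

# `pants test --shard=k/3 ::` keeps only the tests assigned to shard k (0-indexed)
assignments = {t: shard_of(t, 3) for t in ["tests/test_a.py", "tests/test_b.py"]}
```

Because the hash depends only on the test's identity, every host computes the same assignment without coordinating.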
I just read the GitHub PR; apparently that's not a typo.
w
Seems to match - 0-index
How are you using execution_slot_var right now? What's the problem you're running into? More precisely, are you running any of your own process logic?
g
I'm just using it so that I can spin up spark on a different port number so they don't conflict.
👍 1
w
I'm also guessing you've checked out whether pytest has any native fixture functionality that can help?
g
I haven't looked deeply into that yet.
This is what I'm doing
```python
import os
import re

# Base port for Spark UI
base_port = 4040
execution_slot = int(os.environ.get("PANTS_EXECUTION_SLOT", 0))

# Check if the USER environment variable matches the expected pattern 'agent-N'
user_name = os.environ.get("USER", "")
match = re.match(r"agent-(\d+)", user_name)
if match:
    agent_number = int(match.group(1))
    # Calculate port offset using both agent number and execution slot if pattern matches
    port_offset = ((agent_number - 1) * 100) + execution_slot
else:
    # Use a simpler scheme if no match, to still avoid conflicts with multiple execution slots
    port_offset = execution_slot

spark_ui_port = base_port + port_offset
```
w
I'm sure there's some sort of concurrency/batch math you could play with in pytest and pytest-xdist to get something like this working. Personally, this feels like the kind of thing I might try to do in code -- kinda like you've described above.
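The in-code cap described above could be prototyped inside the fixture itself. A minimal sketch using a pool of POSIX advisory file locks (the names, path, and cap here are assumptions for illustration, not Pants APIs):

```python
# Sketch: cap concurrent Spark fixtures across all test processes on one host
# with a pool of POSIX advisory file locks. MAX_SPARK and LOCK_DIR are
# assumptions, not Pants settings.
import fcntl
import os
import time
from contextlib import contextmanager

MAX_SPARK = 8                              # assumed per-host cap
LOCK_DIR = "/tmp/spark-fixture-locks"      # assumed shared, writable path

@contextmanager
def spark_slot(max_slots=MAX_SPARK, lock_dir=LOCK_DIR):
    """Block until one of `max_slots` lock files can be acquired, then yield
    the slot index (usable as a port offset, like PANTS_EXECUTION_SLOT)."""
    os.makedirs(lock_dir, exist_ok=True)
    while True:
        for slot in range(max_slots):
            fd = os.open(os.path.join(lock_dir, f"slot-{slot}.lock"),
                         os.O_CREAT | os.O_RDWR)
            try:
                fcntl.flock(fd, fcntl.LOCK_EX | fcntl.LOCK_NB)
            except BlockingIOError:
                os.close(fd)   # slot taken by another process; try the next one
                continue
            try:
                yield slot     # caller spins up Spark bound to this slot's ports
                return
            finally:
                fcntl.flock(fd, fcntl.LOCK_UN)
                os.close(fd)
        time.sleep(0.5)        # all slots busy; wait and retry
```

Each test process grabs the first free slot or waits, so at most `max_slots` Spark fixtures run per host regardless of how many execution slots Pants hands out.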
g
I think you're right.
w
g
yes, playing with it all 🙂
I'm using small batches and grouping things with batch_compatibility_tag.
It's better for sure.
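The batching setup mentioned above might look like this in a BUILD file; batch_compatibility_tag is a real python_tests field, though the target name and tag value here are illustrative:

```python
# BUILD (sketch): tests sharing a batch_compatibility_tag may be run in the
# same pytest process, so one Spark fixture startup is amortized across the batch.
python_tests(
    name="spark_tests",
    batch_compatibility_tag="spark-fixture",  # illustrative tag value
)
```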
w
I've never used Spark -- so I'm speaking out of turn -- but is there a substantial overall time benefit for unit tests when adding more instances, given the cost of startup?
g
hell no, no benefit at all. That's why I'm batching.
s
Could you run two invocations of pants test? One for the Spark tests with --process-execution-local-parallelism and one for everything else?
g
Potentially with tags, but oof that would be a lot of work.
maybe I could use pants dependents for the fixture