I have a few unit tests in my Python repo that use a lot of Pants #general

I have a few unit tests in my Python repo that use...

numerous-pharmacist-91083

10/25/2024, 12:18 AM

I have a few unit tests in my Python repo that use a lot of RAM (they run some big ML models). If I run

pants test

they sometimes get run in parallel causing an OOM and then random processes on my machine get killed. Is there a way to do any of the following (in order of preference): • Test

pants

that if RAM usage is > X it should stop launching new tests until other tests have finished. • Mark individual tests as needing to be run alone without any other tests running in parallel. • Limit the test parallelism.

PANTS_TEST_BATCH_SIZE

seems like maybe it'd do that but I see more tests running than I've specified as the batch size.

loud-stone-80561

10/25/2024, 12:25 AM

• Mark individual tests as needing to be run alone without any other tests running in parallel.

In my experience w/ both pants & bazel this seems like the safest bet - tagging tests as

heavy

/`high-memory` etc & then excluding them in your regular pants test invocation, ie

pants --tag=-high-memory test ::

, and then serially executing the high-mem is reasonable

numerous-pharmacist-91083

10/25/2024, 12:31 AM

Thanks for the suggestion!

then serially executing the high-mem

Does that mean I have to remember which tests were marked high-mem and then manually invoke

pants test

for each one, or is there a way to say "run all tests with this tag without any parallelism"?

loud-stone-80561

10/25/2024, 12:37 AM

I'm not sure if there's a pants-native way to do that, but my naive approach would be:

Copy code

#!/usr/bin/env bash

TARGETS=`pants --filter-target-type=python_test --tag=high-memory list ::`

for TARGET in $TARGETS;
do
    pants test "$TARGET"
done

loud-stone-80561

10/25/2024, 12:37 AM

Not totally ideal, but simple enough for starters if it's baked into your CI checks

numerous-pharmacist-91083

10/25/2024, 12:40 AM

yeah, that works pretty well. Thanks!! I'll give that a shot.

broad-processor-92400

10/25/2024, 3:38 AM

https://www.pantsbuild.org/stable/reference/global-options#process_execution_local_parallelism might be handy too

pants --process-execution-local-parallelism=1 test --tag=high-memory ::

or similar

broad-processor-92400

10/25/2024, 3:40 AM

(downside of this approach is that'll run any required set-up like codegen or getting requirements serially too, unlike the

for

loop approach)

elegant-florist-94385

10/25/2024, 10:23 AM

for convenience, add

Copy code

[cli.alias]
--no-parallel = "--process-execution-local-parallelism=1"

to your

pants.toml

, and then run your tests with

pants --no-parallel test --tag=high-memory

numerous-pharmacist-91083

10/25/2024, 7:49 PM

Nice! Both of those suggestions are helpful. I didn't know about the

[cli.alias]

section; that's cool!

👍 1

3 Views

Open in Slack

Previous Next