Is it possible to control the degree of parallelism that `te Pants #general

Is it possible to control the degree of parallelis...

rapid-exabyte-76685

03/30/2022, 5:09 AM

Is it possible to control the degree of parallelism that

test

runs with (ideally to 1, since I have problems with tests running in parallel that I will solve in the medium-term, but not in the short-term)

fast-nail-55400

03/30/2022, 5:15 AM

--process-execution-local-parallelism

might be useful

rapid-exabyte-76685

03/30/2022, 5:20 AM

Ah ok, I've found that

./pants test --debug ::

also works, but doesn't process the same summary output that

./pants test ::

does. I'll give this a go

rapid-exabyte-76685

03/30/2022, 5:20 AM

Yep,

./pants --process-execution-local-parallelism=1 test

gives me the nice summary output

rapid-exabyte-76685

03/30/2022, 5:27 AM

Ok, I've added a documentation suggestion for this as I think its handy to know about when you are trying to adopt Pants in a new codebase where the test (i.e. integration tests) may not be friendly for being run in parallel.

rapid-exabyte-76685

03/30/2022, 5:52 AM

maybe the ability to add metadata to a test file specifying whether it can be run in parallel with other test files would be handy?

👍 1

happy-kitchen-89482

03/30/2022, 5:55 PM

Interesting idea. Or some sort of parallelism label, where tests with the same label cannot run concurrently with each other (so a label could represent a database or some shared resource)

💯 1

happy-kitchen-89482

03/30/2022, 5:55 PM

Rather than a boolean, which may be too restrictive

rapid-exabyte-76685

03/31/2022, 7:20 AM

so I should be able to do this with tags and multiple executions of

pants test

with appropriate combinations of the

--tag

filter and the

--process-execution-local-parallelism=1

parameter?

rapid-exabyte-76685

03/31/2022, 7:20 AM

I've tried this, in so far as attempting to exclude the tests that are causing me problems when run in parallel for the moment...

rapid-exabyte-76685

03/31/2022, 7:22 AM

in the directory with the troublesome tests I have a build file like this...

Copy code

python_tests(
  name="tests",
  overrides={
    "test_a.py": {"tags": ["run_serially"]),
    "test_b.py": {"tags": ["run_serially"]),

    # And the above can be expressed as?...
    ("test_a.py", "test_b.py"): {
      "tags": ["run_serially"]
    },
  }
)

rapid-exabyte-76685

03/31/2022, 7:23 AM

and then run

./pants --tag='-run_serially' test ::

but I still see

test_a.py

and

test_b.py

running and therefore blocking completion due to their race condition

rapid-exabyte-76685

03/31/2022, 7:26 AM

hmm, for fun I changed to

--tag='+run_serially'

and being told no files or targets specified, so maybe I'm not applying the tag correctly

rapid-exabyte-76685

03/31/2022, 7:38 AM

ok not sure what happened, there... maybe I mistyped the tag for the

--tag='run_serially'

example I'm finding that I can

./pants --tag='sometag' test ::

to include only tests WITH the tag, but

./pants --tag='-sometag' test ::

to EXCLUDE tests with the tag is not working, and there are still being included.

rapid-exabyte-76685

03/31/2022, 7:42 AM

ah, I've tripped over this I think https://github.com/pantsbuild/pants/issues/11123

rapid-exabyte-76685

03/31/2022, 7:43 AM

maybe? I'm invoking with

::

which is an address spec? not a file address? 🤷

rapid-exabyte-76685

03/31/2022, 10:00 AM

Looks like

+tag

and

-tag

work for

list

but only

+tag

works for

test

happy-kitchen-89482

03/31/2022, 3:16 PM

Sorry for the issues. Tag-based selection happens outside of any specific goals so I would have expected it to work the same for

list

and for

test

. Weird.

happy-kitchen-89482

03/31/2022, 3:16 PM

Let me see if I can reproduce

happy-kitchen-89482

03/31/2022, 3:19 PM

There are right-parens instead of right-braces in the BUILD file snippet above, but I assume that's not the issue ?

happy-kitchen-89482

03/31/2022, 3:21 PM

OK, I can reproduce this, so that's good

🙌 1

happy-kitchen-89482

03/31/2022, 8:37 PM

I think you've found a proper bug!

happy-kitchen-89482

03/31/2022, 8:37 PM

Am digging further

rapid-exabyte-76685

03/31/2022, 8:54 PM

Correct, the right-parens vs right-braces issue was just a typing error in slack

happy-kitchen-89482

03/31/2022, 9:32 PM

https://github.com/pantsbuild/pants/issues/14977

happy-kitchen-89482

03/31/2022, 9:32 PM

@witty-crayon-22786 thoughts on this?

rapid-exabyte-76685

03/31/2022, 9:52 PM

The workaround for this in the short term… having two

python_tests

targets in the relevant directory and including/excluding via

sources

directly?

witty-crayon-22786

03/31/2022, 9:56 PM

commented on the ticket: you can adjust your selection code slightly

witty-crayon-22786

03/31/2022, 9:57 PM

@rapid-exabyte-76685: also, i accepted your docs update: thanks! i added a bit more, because there is another facility for this: https://www.pantsbuild.org/v2.11/docs/troubleshooting#controlling-test-parallelism

✅ 1

witty-crayon-22786

03/31/2022, 11:08 PM

@rapid-exabyte-76685: does the

execution_slot_var

setting make sense for your usecase? we haven’t done a great job of explaining it

rapid-exabyte-76685

03/31/2022, 11:32 PM

If I understand correctly... if there are 4 cores in the machine, and

pants.toml

has...

Copy code

[pytest]
PANTS_PYTEST_EXECUTION_SLOT_VAR = "MY_SLOT_VAR"

... then the test code could do ...

Copy code

os.environ['MY_SLOT_VAR']

and it would get a value between 0-3 (or 1-4?) depending on which core/slot it is running in?

witty-crayon-22786

03/31/2022, 11:33 PM

correct.

witty-crayon-22786

03/31/2022, 11:33 PM

then you can have initialization code in your tests assume that it “owns” the slot on localhost, and safely

drop $database

(or the non-SQL equivalent) and recreate

rapid-exabyte-76685

03/31/2022, 11:34 PM

In my CI, can I ask pants what the range of values that it will be returning is? e.g. if I wanted to run some setup process to create

database_0

through

database_3

- where the setup is running outside of pants

witty-crayon-22786

03/31/2022, 11:35 PM

you can by querying the computed value of

--process-execution-local-parallelism

, yea. but if it’s possible to run the setup inside tests, that might be more self contained

witty-crayon-22786

03/31/2022, 11:36 PM

you can by querying the computed value of
--process-execution-local-parallelism

which you can do with something like

./pants help-all | jq …

✅ 1

rapid-exabyte-76685

03/31/2022, 11:46 PM

You would need to combine this with filtering tests by tag or target I think, to say: run all of these tests that use this resource at the same time, but they get their own copy of the resource

witty-crayon-22786

03/31/2022, 11:48 PM

i might be misunderstanding, but i don’t think so…? if you ran across all of your tests, tests which didn’t need “the resource” would run fine. they would just tie up a slot (such that that resource slot wasn’t used while they were running)

witty-crayon-22786

03/31/2022, 11:49 PM

but i suppose that it depends on the breakdown between “tests that need the resource” and “tests that don’t need the resource”, and the cost of having the resource be idle

rapid-exabyte-76685

04/01/2022, 1:09 AM

Yep, now that I've thought about this closer, you are correct

rapid-exabyte-76685

04/01/2022, 1:11 AM

which you can do with something like
./pants help-all | jq …

I think this is what I want?

Copy code

./pants help-all | jq -r  '.scope_to_help_info ."" .advanced[] | select (.env_var == "PANTS_PROCESS_EXECUTION_LOCAL_PARALLELISM") .value_history .ranked_values[] | select (.rank == "HARDCODED") .value'

rapid-exabyte-76685

04/01/2022, 1:11 AM

Another one for the recipe book? https://github.com/pantsbuild/pants/issues/14969 (link to message above added as a comment in this issue)

rapid-exabyte-76685

04/01/2022, 1:16 AM

That I'm plucking something out called

HARDCODED

gives me pause but it looks to return the correct core count across two different machines... although on my M1 Pro MacBook, which has 8 performance cores and 2 efficiency cores, it returns 10.

rapid-exabyte-76685

04/01/2022, 1:28 AM

My thinking here being... do you want to run build processes on efficiency cores or restrict them, if possible, to performance cores only?

witty-crayon-22786

04/01/2022, 3:20 AM

that value is computed via https://github.com/pantsbuild/pants/blob/0b5a5c514a790450965028f457956f7dba09b0c7/src/python/pants/util/osutil.py#L16-L29 : if there is a better default, we’d be happy to hear about it. but i expect that the efficiency cores are the first to be used, rather than the last…?

eager-dress-66405

04/08/2022, 6:27 AM

We use

python_tests

overrides

to apply tags and use the following in CI to run just the select tests. Same could work with

xargs -0

. Critical part that I don't see mentioned was the

--granularity=file

arg on filter, which without will end up including the entire expansion of

python_tests

in that directory.

Copy code

readarray -d '' targets < <(./pants \
        --tag=-skip_ci \
        --changed-dependees=transitive \
        --changed-since="$PANTS_CHANGED_SINCE_REF" \
        filter \
        --sep="\0" \
        --granularity=file)
./pants test "${targets[@]}"

eager-dress-66405

04/14/2022, 10:02 PM

One gotcha with the --granularity=file is that docker images will no longer be matched, so be careful if you use the same codepath for calculating targets

4 Views

Open in Slack

Previous Next