# general
a
Hey, Pants gurus, I need help šŸ™‚ I need to run a shell command (like `coverage combine && coverage xml`, to join coverage data files from different GitHub Actions workers) within the venv created by Pants (I saw no `test`/`pytest`/`coverage-py` names in the cache folder, only hashes). Is there any way to invoke this command in a Pants way?
h
Hi, welcome! Double checking if you saw https://www.pantsbuild.org/docs/python-test-goal#coverage? Pants can run coverage for you and run those exact commands
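For reference, a minimal sketch of what those docs describe; the option values here are illustrative and may vary by Pants version:
```toml
# pants.toml: let Pants run coverage itself during `test`.
[coverage-py]
# Emit both a terminal summary and an XML report (e.g. for SonarCloud).
report = ["console", "xml"]
```
Running `./pants test --use-coverage ::` would then produce the reports, typically under `dist/coverage/python/`.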
h
Ah, but Pants combines coverage across processes in a single run, not across sharded runs
But I think it wouldn't be hard to add that, since Pants already knows how to download and install the coverage tool, and how to invoke it with the right args
It would be a bit of refactoring and then I guess adding a custom goal
@aloof-tent-90836 how would you be fetching these coverage files off the various worker machines?
There is an open question on how to model all this. Pants is normally used to working on files in the repo; these are files on other machines, so we need to figure out the best way to model this. For example, do these files get downloaded by some process outside of Pants? Or are they at well-known URLs that Pants can reference? Do we throw them all in a local directory and then just tell Pants to merge everything in that dir?
The work here will be more about figuring that out; the actual implementation would be straightforward, and we can guide you through it
šŸ™Œ 1
a
@happy-kitchen-89482 I've managed to split the test workload across machines using GitHub Actions and the "matrix" feature. This job has steps that rename `.coverage` with hash suffixes (to keep them distinct) and upload artifacts using actions/upload-artifact@v3, with a dedicated downstream job running actions/download-artifact@v3 (the main trick is not providing any args; then all coverage files land in the same destination). The Pants cache is also shared between jobs within the same workflow. So, I've combined the coverage results with this command sequence:
• `coverage combine`
• `coverage report`
• `coverage xml` (to be compatible with pushing data to SonarCloud; a "push results" feature would probably be a nice addition to `coverage-py`)
But, in order to stay compatible, I've manually pinned the coverage version, and it can eventually drift out of sync because now we have two places to lock dependencies. It would be nice to have some ability to reuse the same coverage executable that was used in the testing stage. P.S. feel free to ask again if the above did not answer your questions šŸ™‚
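A hedged sketch of the flow described above; the job names, shard count, and paths are illustrative assumptions rather than the exact setup:
```yaml
jobs:
  test:
    runs-on: ubuntu-latest
    strategy:
      matrix:
        shard: [0, 1, 2]
    steps:
      # ... checkout, bootstrap Pants, run this shard's tests with coverage ...
      - name: Rename coverage data so shards don't collide
        run: mv .coverage .coverage.${{ matrix.shard }}
      - uses: actions/upload-artifact@v3
        with:
          name: coverage-data   # same artifact name: v3 merges every shard's upload
          path: .coverage.${{ matrix.shard }}

  combine:
    runs-on: ubuntu-latest
    needs: test
    steps:
      # With v3, downloading a named artifact with no `path` drops the files
      # into the working directory, where `coverage combine` will find them.
      - uses: actions/download-artifact@v3
        with:
          name: coverage-data
      - name: Combine shard data and emit XML
        run: |
          coverage combine
          coverage report
          coverage xml
```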
h
Thanks, that is helpful! A further question is, where do you want to run this merging? On some GitHub Actions job that depends on those other jobs it sounds like? And it downloads them via GHA artifacts?
I think I have a sense of what this should look like then
Something like a custom `merge-coverage` goal to which you pass paths that are not targets.
@hundreds-father-404 do we now have the ability to interpret CLI args as file paths that are not wrapped by targets, and may not even be in the repo?
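For the record, the skeleton of such a custom goal as an in-repo plugin might look roughly like this; every name below is hypothetical, and the actual merge logic (reusing the coverage tool the `coverage-py` backend already installs) is left as a placeholder:
```python
# pants-plugins/merge_coverage/register.py (hypothetical plugin skeleton)
from pants.engine.console import Console
from pants.engine.goal import Goal, GoalSubsystem
from pants.engine.rules import collect_rules, goal_rule


class MergeCoverageSubsystem(GoalSubsystem):
    name = "merge-coverage"
    help = "Merge coverage data files produced by sharded CI runs."


class MergeCoverage(Goal):
    subsystem_cls = MergeCoverageSubsystem
    # Newer Pants versions also require an `environment_behavior` class attribute.


@goal_rule
async def merge_coverage(console: Console) -> MergeCoverage:
    # Placeholder: resolve the coverage tool as the test goal does, then run
    # `coverage combine` / `coverage xml` over the collected shard data files.
    console.print_stdout("merge-coverage: not implemented yet")
    return MergeCoverage(exit_code=0)


def rules():
    return collect_rules()
```
The plugin would then be enabled by adding its directory to `pythonpath` and its module to `backend_packages` in pants.toml.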
a
> On some GitHub Actions job that depends on those other jobs it sounds like? And it downloads them via GHA artifacts?
Yes, correct.
> Something like a custom `merge-coverage` goal to which you pass paths that are not targets.
Thank you, I will look into it. It would be helpful if you could point me to a relevant guide.
s
Reviving this a little bit:
1. IMO, Pants shouldn't be responsible for moving files across the network. That's a slippery slope, and the files could be stored anywhere.
2. Pants should (by default) name coverage files by test shard index (i.e. `coverage_0.xml`).
3. `coverage-py` could add a goal, i.e. `merge-coverage`, which merges all `*.xml` files in `[coverage-py].output_dir` (sketched below).
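One caveat on point 3: coverage itself combines raw data files rather than XML reports, so a manual approximation of that goal today would be something like this (the directory name is hypothetical):
```bash
# Rough manual equivalent of the proposed goal, assuming the per-shard
# data files were collected into a local shards/ directory:
coverage combine shards/      # merge all .coverage.* data files found there
coverage report               # terminal summary of the combined data
coverage xml -o coverage.xml  # single XML report, e.g. for SonarCloud
```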