Hello Toolchain has been using V2 linters for two months now Pants #development

Hello. Toolchain has been using V2 linters for two...

hundreds-father-404

02/18/2020, 4:33 PM

Hello. Toolchain has been using V2 linters for two months now and, frankly, we’ve found the experience to be much worse than V1 due to the performance. We wrote up a proposal to allow sticking with the current caching scheme (which does have certain benefits) but to also allow using a different caching scheme for much better performance with colder caches, via a new option

--fmt-per-target-caching

and

--lint-per-target-caching

. Medium term, we wrote a proposal to replace that option with an implementation that has the benefits of both caching schemes I’d appreciate a look if you have a moment! https://docs.google.com/document/d/1Tdof6jx9aVaOGeIQeI9Gn-8x7nd6LfjnJ0QrbLFBbgc/edit#

witty-crayon-22786

02/20/2020, 10:58 AM

Will look, but: are you using

--changed

flags in this usecase?

witty-crayon-22786

02/20/2020, 10:59 AM

I don't think that per-target caching was intended to be a replacement for only operating on changed targets when you know about them.

hundreds-father-404

02/20/2020, 5:12 PM

We’re not using

--changed

. We want to be able to tell users to simply run

./pants fmt ::

. But even if we did, we expect per-target caching to not be worth it if 2-3+ targets have changed, depending on the tool and the size of the codebase. Linters are unique compared to Pytest in that the cost of running one additional file with a linter is incredibly low, whereas the overhead of starting the linter fresh is quite high -- There are downsides to losing per-target caching, which is why the option

--per-target-caching

is only temporary and we explain a medium-term proposal to get the best of both schemes, i.e. have fine-grained caching while avoiding the overhead of starting the tool multiple times

witty-crayon-22786

02/20/2020, 7:27 PM

How much of that overhead is pex overhead?

witty-crayon-22786

02/20/2020, 7:27 PM

Because the pex overhead is quite significant right now...

witty-crayon-22786

02/20/2020, 7:27 PM

And addressing that addresses a lot of things

hundreds-father-404

02/20/2020, 7:27 PM

See the design doc. That benchmark was running the tools directly, without Pants or Pex, and showed 5-30x worse performance, depending on the tool (on cold caches).

witty-crayon-22786

02/20/2020, 7:28 PM

Thanks, will look

hundreds-father-404

02/20/2020, 7:29 PM

The reason we proposed for the short-term to have it be an option is so that Twitter or anyone who wants to keep using per-target caching can continue to do so. Meanwhile, Toolchain can use the caching scheme that gets us better performance In the medium term, everyone will benefit from the combined caching scheme proposal

witty-crayon-22786

02/20/2020, 7:33 PM

So, is the way to interpret the "per file" column: "time to run for X files" sequentially?

witty-crayon-22786

02/20/2020, 7:33 PM

So 10 files -> 3 seconds for bandit, for example?

hundreds-father-404

02/20/2020, 7:33 PM

Yes, so it runs

black f1.py; black f2.py; black f3.py

, rather than

black f1.py f2.py f3.py

witty-crayon-22786

02/20/2020, 7:33 PM

Ok, so then there are two other variables here

witty-crayon-22786

02/20/2020, 7:34 PM

1. average target size

witty-crayon-22786

02/20/2020, 7:34 PM

2. parallelism

witty-crayon-22786

02/20/2020, 7:34 PM

That significantly changes the break even point, and might make --changed viable.

witty-crayon-22786

02/20/2020, 7:35 PM

Because we've done the "invoke in a batch and then try to tease everything back apart" thing before, and it's not pretty

hundreds-father-404

02/20/2020, 7:35 PM

1. average target size

Yes, 1-1-1 proves to be a good pattern with linters

witty-crayon-22786

02/20/2020, 7:36 PM

And a fourth foil is that you cannot enforce dependencies properly with batches

witty-crayon-22786

02/20/2020, 7:36 PM

(Doesn't matter for pure formatters, matters for things that need deps)

hundreds-father-404

02/20/2020, 7:39 PM

(Doesn’t matter for pure formatters, matters for things that need deps)

We do have one example of this now:

pylint

, which needs direct dependencies (but not transitive dependencies)

hundreds-father-404

02/20/2020, 7:41 PM

Because we’ve done the “invoke in a batch and then try to tease everything back apart” thing before, and it’s not pretty

Is there somewhere I can read up on this? On the other side, John mentioned Python Ants used to run things per-target and then added a mechanism to try batching things for these similar performance concerns. https://docs.google.com/document/d/1Tdof6jx9aVaOGeIQeI9Gn-8x7nd6LfjnJ0QrbLFBbgc/edit?disco=AAAAGOanEo8

witty-crayon-22786

02/20/2020, 7:42 PM

Zinc and Scala

witty-crayon-22786

02/20/2020, 7:42 PM

Benjy knows

👍 1

happy-kitchen-89482

02/20/2020, 7:48 PM

Yeah, so splitting results was awful in that case, because we had to write complex to take the result of running zinc, which includes .class files and analysis data, and figure out which bit belonged to each individual target. In the case of the analysis data that was very hard to get right.

👍 1

happy-kitchen-89482

02/20/2020, 7:49 PM

But linters are different, in that we only need to cache the single bit "this file has no lint"

👍 1

happy-kitchen-89482

02/20/2020, 7:49 PM

In this specific case, it's easy.

happy-kitchen-89482

02/20/2020, 7:49 PM

So if the file does have lint, we don't split

👍 1

happy-kitchen-89482

02/20/2020, 7:52 PM

So for example, in a clean build,

./pants fmt ::

will run on 1000 files in one pass, and say 10 of them fail. We fix those and run

./pants fmt ::

, which again runs on 1000 files, but they all pass. Now we cache the fact "this file is lint free" for each of the 1000 files individually. Now we pull a change that touched 8 files.

./pants fmt ::

will consult the cache for 1000 files, notice that 992 of them have no lint, and run only on the 8 remaining, in one pass.

👍 1

happy-kitchen-89482

02/20/2020, 7:54 PM

We can do even better if a specific linter happens to make it easy to figure out which files passed vs which failed (say because it emits this information to a JSON file). Then that second

./pants fmt ::

run could only operate on the 10 files, not the entire 1000.

happy-kitchen-89482

02/20/2020, 7:54 PM

But that's a nice-to-have enhancement

happy-kitchen-89482

02/20/2020, 7:54 PM

The key different here is that there is no complicated splitting logic when files pass lint, the information is just a bit.

👍 1

happy-kitchen-89482

02/20/2020, 7:55 PM

We could not do this easily for, say, tests, or compiles, or anything that has output.

witty-crayon-22786

02/20/2020, 7:56 PM

Ah, yea: John's note refers to the same thing

2 Views

Open in Slack

Previous Next