<https github com pantsbuild pants pull 8450> < hundreds fat Pants #development

<https://github.com/pantsbuild/pants/pull/8450> <@...

aloof-angle-91616

05/04/2020, 3:10 AM

https://github.com/pantsbuild/pants/pull/8450 @hundreds-father-404 this PR will go green again in a bit, wanted to mention that upon merging this we might be able to kill

list_targets_old.py

entirely?

witty-crayon-22786

05/05/2020, 4:21 PM

between this and

query

, my feeling is that query is probably a higher priority. i’m concerned that our existing goals are limited with regard to things like hydration and fields, and i think that a query API might provide a better foundation to build upon.

👍 1

witty-crayon-22786

05/05/2020, 4:24 PM

my fundamental concern moving forward is that the concept of “dependencies” is very limited, and i’m worried about proceeding to quickly on ports of v1 goals without having more of this in place

witty-crayon-22786

05/05/2020, 4:25 PM

the primary thing is that inference and codegen allow your dependencies to change based on how you are viewed

aloof-angle-91616

05/05/2020, 4:25 PM

if your concern is about

dependencies

, then we can remove that field from the json

aloof-angle-91616

05/05/2020, 4:25 PM

i've been trying to merge this after being asked by multiple users many times to get this in, over multiple objections. i'll do the work to make it mergeable, as usual, but this is definitely useful

witty-crayon-22786

05/05/2020, 4:26 PM

@aloof-angle-91616: the issue is just that the fingerprint is very likely to be wrong, and i don’t know how we’ll make it right.

aloof-angle-91616

05/05/2020, 4:27 PM

would you like to justify that? the fingerprint comes from the exact

_key()

method that we use to calculate the hash

witty-crayon-22786

05/05/2020, 4:28 PM

which is gone in v2 =/

👍 1

aloof-angle-91616

05/05/2020, 4:28 PM

???

witty-crayon-22786

05/05/2020, 4:28 PM

because how fingerprints are calculated is via process executions

aloof-angle-91616

05/05/2020, 4:28 PM

i'm referring to the v2

Struct

_key()

method

witty-crayon-22786

05/05/2020, 4:28 PM

right, that’s not the fingerprint of the target

aloof-angle-91616

05/05/2020, 4:28 PM

then why does it exist?

witty-crayon-22786

05/05/2020, 4:28 PM

(you’re right: it’s not gone. but it’s not used as the fingerprint)

aloof-angle-91616

05/05/2020, 4:29 PM

what do you mean by fingerprint?

witty-crayon-22786

05/05/2020, 4:29 PM

process executions are how you hit the cache, anything else is in memory

aloof-angle-91616

05/05/2020, 4:29 PM

yes, that is the goal of the fingerprint

witty-crayon-22786

05/05/2020, 4:29 PM

so we don’t use the python structures as a “fingerprint” in a strong hashing sense: only as in-memory memoization

witty-crayon-22786

05/05/2020, 4:30 PM

but in general: “what i depend on” and “what my fingerprint is” are 100% rule dependent in v2

aloof-angle-91616

05/05/2020, 4:30 PM

this feature is one users have specifically asked for, and the current implementation solves the issues. i had to push a very hacky v1 console task inside of twitter last week because this wasn't merged yet

witty-crayon-22786

05/05/2020, 4:30 PM

it’s not a question of “what is the fingerprint of a target”, it’s “what is the fingerprint of the target when used in this rule/process execution”

aloof-angle-91616

05/05/2020, 4:31 PM

if you're going to block features that users actually want on "the output doesn't conform entirely to my personal platonic ideal of what a 'fingerprint' is", which is something you're extremely fond of doing, i'm not going to waste time fighting you again and again

witty-crayon-22786

05/05/2020, 4:34 PM

if we land something that we can’t maintain moving forward, we anger users in a different way

witty-crayon-22786

05/05/2020, 4:34 PM

or that breaks on them

witty-crayon-22786

05/05/2020, 4:34 PM

i’m trying to explain why this is likely to be hard to maintain, or to break

👍 1

aloof-angle-91616

05/05/2020, 4:40 PM

i'm pretty sure that this solution just invalidates too coarsely as opposed to not invalidating correctly. are you able to describe why that's not the case?

aloof-angle-91616

05/05/2020, 4:40 PM

also, this is build graph information that by definition is separate from any v2 rule. this is intended to be consumable by external tools -- those external tools by definition are not running v2 rules.

aloof-angle-91616

05/05/2020, 4:41 PM

i really think some thought on how this is supposed to be used would be useful here.

witty-crayon-22786

05/05/2020, 4:41 PM

so, i think that is related to the difference between bazel

query

and

aquery

cquery

: rules have different inputs and configurations at each phase

aloof-angle-91616

05/05/2020, 4:41 PM

or really just any specifics on how to make this conform more to the platonic ideal so that i can merge it

aloof-angle-91616

05/05/2020, 4:41 PM

or just a clear statement that you don't see this being mergeable so i can close it

witty-crayon-22786

05/05/2020, 4:42 PM

we don’t have those three graphs. instead, we have various graphs of rules based on the context

witty-crayon-22786

05/05/2020, 4:42 PM

the digest of a target for

test

will be different from the digest of a target for

binary

aloof-angle-91616

05/05/2020, 4:42 PM

i'm pretty sure that this solution just invalidates too coarsely as opposed to not invalidating correctly. are you able to describe why that's not the case?

witty-crayon-22786

05/05/2020, 4:42 PM

if the goal is to determine whether “binary will redo something”, you cannot use the digest for

test

aloof-angle-91616

05/05/2020, 4:42 PM

aloof-angle-91616

05/05/2020, 4:43 PM

but you do know the static build graph deps

witty-crayon-22786

05/05/2020, 4:43 PM

they do not include things like pytest

aloof-angle-91616

05/05/2020, 4:43 PM

and you can calculate a stable hash of the structs

witty-crayon-22786

05/05/2020, 4:43 PM

or the version of your codegenerator

hundreds-father-404

05/05/2020, 4:43 PM

or just a clear statement that you don’t see this being mergeable so i can close it

Regardless of this discussion, the proposed first PR to refactor the

--provides

and

--documented

options is definitely mergeable and very valuable on its own.

aloof-angle-91616

05/05/2020, 4:43 PM

the version of pytest changing does not affect the consumer of this information

witty-crayon-22786

05/05/2020, 4:43 PM

@aloof-angle-91616: how do you know that though? that’s consumer specific

aloof-angle-91616

05/05/2020, 4:44 PM

i made this PR specifically for Metals

witty-crayon-22786

05/05/2020, 4:44 PM

i think that in this case folks are trying to use this for

binary

, right?

witty-crayon-22786

05/05/2020, 4:44 PM

ah, ok. so in that case it depends on a whole other set of things… which version of coursier, etc

witty-crayon-22786

05/05/2020, 4:45 PM

but that’s my point. this is only the targets, and doesn’t include any of the extra information needed to do something with them in some context

witty-crayon-22786

05/05/2020, 4:45 PM

if Metals is running export, it needs an export-specific digest

witty-crayon-22786

05/05/2020, 4:46 PM

so this fingerprint is something, but it’s a subset, rather than superset

👍 1

aloof-angle-91616

05/05/2020, 4:46 PM

this is purely to calculate file invalidation

aloof-angle-91616

05/05/2020, 4:47 PM

we don't have v2 export yet so i'm not sure why you would expect metals would be consuming something like export

aloof-angle-91616

05/05/2020, 4:47 PM

right now metals tracks invalidation itself

aloof-angle-91616

05/05/2020, 4:47 PM

you'll note olaf signed off on this

aloof-angle-91616

05/05/2020, 4:49 PM

if you don't plan to accept it understanding that this format is a first step to an actual v2 export, please state that on the ticket so people can stop asking me about it

aloof-angle-91616

05/05/2020, 4:52 PM

i don't have the time to devote to a whole design doc about this, i was trying to add a feature people were asking from pants for a while and that there's a clear use case for. i don't think it should be blocked. the only reason i'm pushing back is not ideological, i literally just do not have the time to have this discussion.

witty-crayon-22786

05/05/2020, 4:55 PM

i’m just not sure how to explain to users what that fingerprint is safe to use for. the reason i raised query is that “fingerprint of raw inputs” is different from “fingerprint in some other context”, and query would allow for that subtlety. i’m not sure how to explain this one…

file_fingerprint

with a big docstring…?

witty-crayon-22786

05/05/2020, 4:58 PM

going to snooze for a bit to try and get some other things done.

Open in Slack

Previous Next