Is there anyone who loves fixing Python typing issues If so Pants #development


wide-midnight-78598

02/20/2025, 4:10 PM

Is there anyone who loves fixing Python typing issues? If so, I'd be wildly appreciative of anyone who wants to tackle typing non-bare `@rule`

wide-midnight-78598

02/20/2025, 4:10 PM

This works great

wide-midnight-78598

02/20/2025, 4:11 PM

Call-by-name migration with all the types hidden is a nightmare, or I suppose... a daymare

proud-dentist-22844

02/20/2025, 7:47 PM

Yeah. I've done something like this. The migration makes which rule to use more explicit, but the typing of `@rule` is very complex, so pycharm (and VSCode? It looks like your screenshot is of VSCode) have a hard time inferring the types. So, I end up looking up the rule and importing the rule's result to use in a type hint - at least while I'm developing. Sometimes, I then remove it to make mypy happy because the type checkers don't agree on inference logic.

wide-midnight-78598

02/20/2025, 7:53 PM

Yeah, trying to avoid that mental/keyboard overhead - as there are a LOT more backends to migrate

wide-midnight-78598

02/20/2025, 7:53 PM

I think ParamSpec will be my friend here, as that seems to be what is good for typing decorators. I've never needed to use it, but 🤷 always a first time

proud-dentist-22844

02/20/2025, 7:55 PM

Ooh. There's hope? Nice!

wide-midnight-78598

02/20/2025, 7:59 PM

I'll report back in emojis

😄 1

wide-midnight-78598

02/20/2025, 10:41 PM

❓

wide-midnight-78598

02/20/2025, 10:41 PM


from typing import Any, Callable, TypeVar, overload

from pants.util.logging import LogLevel

F = TypeVar("F", bound=Callable[..., Any])

@overload
def rule(func: F, /) -> F: ...

@overload
def rule(*, desc: str = "", level: LogLevel = LogLevel.DEBUG) -> Callable[[F], F]: ...

Sorta works
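The overloads above are only signatures. A self-contained sketch of the same bare/non-bare pattern with a runtime implementation might look like the following. `toy_rule` and the `rule_desc` attribute are hypothetical stand-ins for illustration, not the real Pants decorator, and the `level` argument is dropped to avoid importing Pants.

```python
from __future__ import annotations

from typing import Any, Callable, TypeVar, overload

F = TypeVar("F", bound=Callable[..., Any])


@overload
def toy_rule(func: F, /) -> F: ...  # bare usage: @toy_rule
@overload
def toy_rule(*, desc: str = "") -> Callable[[F], F]: ...  # non-bare: @toy_rule(desc=...)


def toy_rule(func: F | None = None, /, *, desc: str = "") -> F | Callable[[F], F]:
    def register(f: F) -> F:
        # Hypothetical bookkeeping: stash the description on the function object.
        f.rule_desc = desc  # type: ignore[attr-defined]
        return f

    # Bare usage passes the function directly; non-bare returns the real decorator.
    return register(func) if func is not None else register


@toy_rule
def bare() -> int:
    return 1


@toy_rule(desc="with options")
def non_bare() -> int:
    return 2
```

Because each overload returns `F` itself, the decorated function keeps its exact signature in both forms, which is the property that makes hover and completion work in editors.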

wide-midnight-78598

02/21/2025, 1:06 AM

Here's something I didn't clue into - how are these rules not async def?

proud-dentist-22844

02/21/2025, 1:22 AM

I believe there is some magic in `@rule` that allows the function to be either sync or async.

proud-dentist-22844

02/21/2025, 1:23 AM

Requiring async might simplify some of the typing there

proud-dentist-22844

02/21/2025, 1:23 AM

But that might have backwards compatibility issues

happy-kitchen-89482

02/21/2025, 2:23 AM

Yeah, the `async` is purely informational, since we’re not executing rules in a python event loop

happy-kitchen-89482

02/21/2025, 2:23 AM

But it would be fine to require `async` if it helps with typing

wide-midnight-78598

02/21/2025, 7:55 AM

I’ll give it a shot - it feels just wrong to allow either async or no async, and hide that behind complicated typing. I stripped down the typing to whatever the simplest thing was, and then all these async errors popped up (and some others, which I need to fix). But the number of red squiggles I was able to make disappear was pretty impressive. I think once I can handle a generic version too - we’re in a pretty good place.

wide-midnight-78598

02/21/2025, 1:46 PM

> the `async` is purely informational,

But, we still require `await` for call-by-name, right? At the very least due to the trampoline back into rust code and using the tokio executor? Would that imply that not having `async` is kinda technically incorrect? Or does it just get wrapped and fired off as a coroutine regardless of whether it's considered a Python coroutine or not?

wide-midnight-78598

02/21/2025, 2:25 PM

Okay, yeah, it's basically here:


def rule_decorator(func: SyncRuleT | AsyncRuleT, **kwargs) -> AsyncRuleT:
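A rough sketch of what a "sync or async in, async out" signature like that implies, under stated assumptions: the aliases `SyncFn`/`AsyncFn` and the helper `ensure_async` are hypothetical (the thread names `SyncRuleT` and `AsyncRuleT` but not their definitions), and `asyncio.run` is used only to make the example runnable. As noted earlier in the thread, Pants does not execute rules in a Python event loop.

```python
import asyncio
import functools
import inspect
from typing import Any, Callable, Coroutine, Union

# Hypothetical aliases standing in for SyncRuleT / AsyncRuleT.
SyncFn = Callable[..., Any]
AsyncFn = Callable[..., Coroutine[Any, Any, Any]]


def ensure_async(func: Union[SyncFn, AsyncFn]) -> AsyncFn:
    """Return `func` unchanged if it is already `async def`, otherwise wrap it."""
    if inspect.iscoroutinefunction(func):
        return func

    @functools.wraps(func)
    async def wrapper(*args: Any, **kwargs: Any) -> Any:
        return func(*args, **kwargs)

    return wrapper


def sync_double(x: int) -> int:
    return 2 * x


async def async_double(x: int) -> int:
    return 2 * x


sync_result = asyncio.run(ensure_async(sync_double)(21))
async_result = asyncio.run(ensure_async(async_double)(21))
```

The loose `Callable[..., Any]` aliases are what hide parameter and return types from editors; requiring `async def` removes the union and lets the signature be typed precisely.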

wide-midnight-78598

02/21/2025, 4:01 PM

Ah, so lame - PyRight likes the typing I have, but MyPy struggles

🫣 1

proud-dentist-22844

02/21/2025, 4:06 PM

Vscode uses pyright, right?

proud-dentist-22844

02/21/2025, 4:06 PM

So there are 3 type checkers that need to agree: mypy pyright PyCharm

proud-dentist-22844

02/21/2025, 4:07 PM

😅

wide-midnight-78598

02/21/2025, 4:08 PM

Oh man, pycharm - forgot

wide-midnight-78598

02/21/2025, 4:09 PM

I'm hoping maybe just the mypy we use is old, and an upgrade fixes everything - because I'm literally using what mypy recommends

🙏 1

wide-midnight-78598

02/21/2025, 4:15 PM

Nope... Womp womp

wide-midnight-78598

02/21/2025, 4:25 PM

Oh good lord. Order matters in mypy with overloads apparently...
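A small illustration of why order matters: type checkers resolve a call against the overloads from top to bottom and use the first one that matches, so the narrower signature has to come first. The function `pick` is a hypothetical example unrelated to the actual `@rule` overloads.

```python
from typing import overload


@overload
def pick(x: int) -> int: ...  # narrower parameter type first
@overload
def pick(x: object) -> object: ...  # broad fallback last


def pick(x: object) -> object:
    return x


as_int = pick(1)    # matches the first overload: inferred as int
as_obj = pick("a")  # falls through to the second overload: inferred as object

# If the two overloads were swapped, mypy would report that the second signature
# "will never be matched" because the first one's parameter type is broader,
# and pick(1) would be inferred as object instead of int.
```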

wide-midnight-78598

02/21/2025, 4:38 PM

Alright, I think the remaining 30 mypy errors are now ACTUALLY errors that got uncovered with proper typing of rules… I guess we also have warnings as errors on, because I’m getting redundant cast warnings and things

wide-midnight-78598

02/21/2025, 5:04 PM

@happy-kitchen-89482 @fast-nail-55400 This was early in my Pants time (or before). What should be the intended type for _masked_types?


@rule(desc="Find targets from input specs", level=LogLevel.DEBUG, _masked_types=[EnvironmentName])

_masked_types: tuple[type, ...]

fails with

No overload variant of "rule" matches argument type "list[type[EnvironmentName]]"

Do we particularly care that this is a tuple, per se? Or just any iterable (as it suggests downstream)? I can change the callsites to tuples, or just expand the typing to Iterable to fix this - but 🤷

fast-nail-55400

02/21/2025, 5:11 PM

I didn't even know `_masked_types` was a thing until now.

fast-nail-55400

02/21/2025, 5:12 PM


masked_types: tuple[type, ...] = tuple(kwargs.get("_masked_types", ()))

wide-midnight-78598

02/21/2025, 5:12 PM

Yeah, usually used for `[EnvironmentName]`

fast-nail-55400

02/21/2025, 5:12 PM

It gets converted to tuple, so maybe `Iterable` is appropriate?

wide-midnight-78598

02/21/2025, 5:12 PM

That's what I have for now, just wasn't sure if I should go through the codebase and tuple-ize

fast-nail-55400

02/21/2025, 5:13 PM

Is there a performance benefit from avoiding list -> tuple conversions? Seems like a rainy day refactor more than anything.

fast-nail-55400

02/21/2025, 5:13 PM

i.e., seems fine as is for now

wide-midnight-78598

02/21/2025, 5:14 PM

I mean, I'm already digging through this - there might be some, but I'm more wondering if using a mutable type vs a type that can't change would make a difference, as it does while running the rules

fast-nail-55400

02/21/2025, 5:14 PM

🤷

fast-nail-55400

02/21/2025, 5:15 PM

could be, guess we'd have to measure to truly know

fast-nail-55400

02/21/2025, 5:17 PM

(but not suggesting any measurements, just a thought)

wide-midnight-78598

02/21/2025, 5:18 PM

👍 Just changing the incoming type then, saves me a hassle
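A sketch of the "accept any iterable, normalize to a tuple" approach settled on here. `ToyRuleKwargs`, `normalize_masked_types`, and the stand-in `EnvironmentName` class are hypothetical; only the `tuple(kwargs.get("_masked_types", ()))` line mirrors what was quoted from `rules.py`. `total=False` is used in place of `NotRequired` so the example runs without `typing_extensions`.

```python
from typing import Any, Iterable, TypedDict


class ToyRuleKwargs(TypedDict, total=False):
    """Hypothetical keyword arguments accepted by a rule-like decorator."""

    desc: str
    _masked_types: Iterable[type[Any]]  # widened from tuple[type, ...]


class EnvironmentName:
    """Stand-in for the real Pants class, for illustration only."""


def normalize_masked_types(kwargs: ToyRuleKwargs) -> tuple[type, ...]:
    # Callers may pass a list or a tuple; downstream code always sees a tuple.
    return tuple(kwargs.get("_masked_types", ()))


from_list = normalize_masked_types({"_masked_types": [EnvironmentName]})
from_tuple = normalize_masked_types({"_masked_types": (EnvironmentName,)})
missing = normalize_masked_types({"desc": "no masking"})
```

Since the value is copied into a tuple at decoration time, a caller mutating its list afterwards would not affect what the decorator stored.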

wide-midnight-78598

02/21/2025, 5:58 PM

https://github.com/pantsbuild/pants/pull/21987

wide-midnight-78598

02/21/2025, 6:01 PM

Leaving this in draft a bit, while I do some "before"/"after" pyright checking, but so far, so much better

happy-kitchen-89482

02/22/2025, 5:21 AM

I have no idea what `_masked_types` even is

happy-kitchen-89482

02/22/2025, 5:21 AM

First I’m seeing it

wide-midnight-78598

02/22/2025, 5:38 AM

Whelp. I'm glad none of us really know what it is 🙂

wide-midnight-78598

02/22/2025, 5:39 AM

Allows callers to prevent the given list of types from being included in the identity of a `@rule`

happy-kitchen-89482

02/23/2025, 5:15 PM

And callers would want to do that… why?

wide-midnight-78598

02/23/2025, 5:39 PM

shrug

wide-midnight-78598

02/23/2025, 5:40 PM

I haven't dug into usages, I just think they're mostly `EnvironmentName` - and I'm not in the know about Environment stuff

wide-midnight-78598

02/23/2025, 5:40 PM


pants.git/call-by-name-shell % rg _masked_types
src/python/pants/engine/rules.py
193:    "_masked_types",
214:    _masked_types: NotRequired[Iterable[type[Any]]]
252:    masked_types: tuple[type, ...] = tuple(kwargs.get("_masked_types", ()))

src/python/pants/engine/internals/specs_rules.py
130:@rule(_masked_types=[EnvironmentName])
157:@rule(_masked_types=[EnvironmentName])
225:@rule(_masked_types=[EnvironmentName])
246:@rule(_masked_types=[EnvironmentName])
256:@rule(_masked_types=[EnvironmentName])
268:@rule(desc="Find targets from input specs", level=LogLevel.DEBUG, _masked_types=[EnvironmentName])
279:@rule(_masked_types=[EnvironmentName])

src/python/pants/engine/internals/graph.py
114:@rule(_masked_types=[EnvironmentName])
521:@rule(_masked_types=[EnvironmentName])
593:@rule(_masked_types=[EnvironmentName])
638:@rule(desc="Find all targets in the project", level=LogLevel.DEBUG, _masked_types=[EnvironmentName])
653:    _masked_types=[EnvironmentName],
808:@rule(desc="Resolve transitive targets", level=LogLevel.DEBUG, _masked_types=[EnvironmentName])
881:@rule(_masked_types=[EnvironmentName])
886:@rule(desc="Resolve coarsened targets", level=LogLevel.DEBUG, _masked_types=[EnvironmentName])
1077:@rule(desc="Find which targets own certain files", _masked_types=[EnvironmentName])
1494:@rule(desc="Resolve direct dependencies of target", _masked_types=[EnvironmentName])

witty-crayon-22786

03/10/2025, 4:47 PM

callers would want to do that because they do not want their identities to ever accidentally be dependent on the environment.

witty-crayon-22786

03/10/2025, 4:49 PM

for example: the build graph: as it stood, it felt reasonable to never allow the build-graph to be re-computed per-environment, as it could lead to wacky non-determinism of goals like `pants dependencies`, etc. depending on which environment you were running them in. obviously the rest of the graph might change, but preventing it in the build graph "felt" like a good idea. who knows!

witty-crayon-22786

03/10/2025, 4:50 PM

the build-graph masks any inbound environment, and instead always explicitly runs itself in the local environment.
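As a toy analogy only (this is not how the Pants engine implements masking), the idea of keeping a type out of a rule's identity can be pictured as a memoizer that drops masked parameter types from its cache key. Every name here, including `ToyMemoizer`, `find_targets`, and the stand-in `EnvironmentName`, is hypothetical.

```python
from dataclasses import dataclass
from typing import Any, Callable, Iterable


@dataclass(frozen=True)
class EnvironmentName:
    """Stand-in for the real Pants class, for illustration only."""

    val: str


class ToyMemoizer:
    """Caches results keyed on the parameters, minus any masked types."""

    def __init__(self, func: Callable[..., Any], masked_types: Iterable[type] = ()) -> None:
        self.func = func
        self.masked_types = tuple(masked_types)
        self.cache: dict[tuple[Any, ...], Any] = {}
        self.runs = 0

    def __call__(self, *params: Any) -> Any:
        # Masked parameters are neither part of the identity nor passed through.
        key = tuple(p for p in params if not isinstance(p, self.masked_types))
        if key not in self.cache:
            self.runs += 1
            self.cache[key] = self.func(*key)
        return self.cache[key]


def find_targets(spec: str) -> tuple[str, ...]:
    return (f"{spec}:lib", f"{spec}:tests")


masked = ToyMemoizer(find_targets, masked_types=[EnvironmentName])
masked("src/app", EnvironmentName("local"))
masked("src/app", EnvironmentName("docker"))  # same identity: served from cache

unmasked = ToyMemoizer(lambda spec, env: (spec, env.val))
unmasked("src/app", EnvironmentName("local"))
unmasked("src/app", EnvironmentName("docker"))  # different identity: recomputed
```

With masking, the result is computed once regardless of environment; without it, each environment triggers a separate computation.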

wide-midnight-78598

03/10/2025, 4:51 PM

Ah, interesting. I would have thought the environment would trigger a re-build of the graph - but I guess I never gave it more than a cursory thought

witty-crayon-22786

03/10/2025, 4:52 PM

well: there are (at least?) two graphs: the build-graph is the graph produced by rules in `graph.py`, and rendered using the introspection goals... `dependencies`, `list`, etc. the "graph graph" is the graph of rule invocations

witty-crayon-22786

03/10/2025, 4:53 PM

the former thing is masked... the latter thing is definitely mostly re-computed per-environment, depending on which subgraphs actually consume the environment
