# development
h
does anyone have any experience with submitting changes to the bazel remote build protobufs and getting them merged? context is https://github.com/pantsbuild/pants/pull/9198#pullrequestreview-371104078
I'm inclined to think that getting a notion of platform defined in the bazel protobufs and accepted by those maintainers (google, right?) is a lot of overhead
but I'm curious if anyone has tried to do this before
a
there’s a weekly remote execution api working group where these sorts of extensions are proposed
👍 1
trying to find it now idk what to search + headache but i’ll try to find it later
sorry
we’ve proposed changes before based on implementation decisions in scoot
a
We’ve made some changes in the past, people are generally pretty reasonable
There’s already a notion of platform standardised somewhere, so I think it would “just” be a matter of re-using it…
h
I think that would amount to a one-line addition here: https://github.com/bazelbuild/remote-apis/blob/1b16ed76965afa8bb229ba22b90745c546416443/build/bazel/remote/execution/v2/remote_execution.proto#L792 , but I also think that just wrapping this ourselves (at least initially) will take less time than any kind of cycle of getting a patch merged upstream, even if they're prompt about it
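For concreteness, that kind of one-line addition might look something like this (purely illustrative - the containing message, field name, and field number are all guesses, not what's actually at that line of the proto):

```protobuf
message ActionResult {
  // ... existing fields elided ...

  // Hypothetical addition: record which platform the action actually
  // executed on, re-using the Platform message already defined in
  // remote_execution.proto.
  Platform execution_platform = 11;
}
```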
a
Sounds like doing both in parallel would be pretty reasonable 🙂
That said, we should maybe think about versioning, if we’re going to change the on-disk format…
h
hm, that's a good point. have we ever changed the on-disk format before?
I don't know a whole lot about the internals of lmdb
a
We haven’t, as far as I know
In LMDB, within a file, databases have names, so we could switch the name to be the name of the format
☝️ 1
a
i like that idea
a
In the `ShardedLmdb` constructor, you’re looking at the `Some("content")` and `Some("leases")` arguments
👍 2
Those I think are database names
h
ah yeah we're using a specific git checkout of the rust lmdb bindings. is there a reason for that?
a
until the rust ecosystem is full of mission-critical high-quality open source crates, someone has to make the crates mission-critical. some of these projects have unclear maintenance, and it wasn’t clear whether we’d need to make further changes down the line, so we didn’t try to close the loop yet. i think it’s probably correct to call it tech debt, but there’s no immediate reason to fix it yet
h
so, hm, if I just changed that name, then once the change hit master, pants would ignore the database name that currently existed and always try to create a new one
so we'd invalidate any existing db and nothing would clean it up
but I don't think that would break anything?
a
as in, is that what the code does right now, or are you asking whether that is what would happen?
h
I think that's what would happen if I did this naively
or, hm, what if I created a new `ShardedLmdb` object on `CommandRunner`, just for platforms?
a
there are also recurrent concerns in #general and from users within twitter that our lmdb usage doesn’t gc file descriptors as much as it could, and that we hit against limits during normal use. if we’re considering cleaning up other parts of the impl, i’d also consider taking a look at that
creating a new ShardedLmdb on a CommandRunner doesn’t sound bad actually, but i would usually expect to add the table and then access it through the `store` crate instead of making a new one in the process_execution crate itself
h
`Store` is meant to store file contents, it looks like. is it adaptable to store the new `Platform` type?
right now when we construct the `cache.rs` `CommandRunner` we pass in a `ShardedLmdb` to the `new` method
fyi @aloof-angle-91616 trying to upgrade the `lmdb` crate used in `sharded_lmdb` to the latest version of https://crates.io/crates/lmdb results in some API changes around cloning the `env`
a
hm ok
and thank you for mentioning we just pass in a sharded db to the new method, that should be fine to do here as well, forget what i said
API changes that you’ve figured out or that are causing compile errors?
h
specifically https://github.com/pantsbuild/pants/blob/master/src/rust/engine/sharded_lmdb/src/lib.rs#L255 <- there's no longer an `EnvironmentCopyFlags` type
and i'm not 100% sure what this code is meant to do to begin with
I think I'll just create another lmdb instance called "processes" or something, and pass it in in `context.rs`
I'm a little concerned that this isn't the cleanest way to do this, and we're only doing it because we don't have a good lmdb schema upgrade path
a
i remember the sharded lmdb code being pretty difficult to understand in a lot of places, especially the boolean lease parameter
i don’t understand the concern you’re expressing
h
is it a problem to create arbitrarily many instances of `ShardedLmdb`?
performance-wise or otherwise?
it looks like every instance of ShardedLmdb maps to a new directory in `~/.cache/pants/lmdb_store`
I'm not sure how much overhead this implies
a
we are already not cleaning up file descriptors and people are seeing issues during normal pants operation as a result
if you’re adding a new feature, i generally don’t worry about performance until there’s a need identified to make it faster
h
hm, I just realized - there's no way around handling this as a schema upgrade
a
if you’re saying we’re considering not creating a new db because of possible performance concerns, i don’t think that’s an issue and i think the separation of logic by creating a new db is what we would want
h
if we're going to start tracking Platform in lmdb, what do we do for entries stored before this commit, that just don't have this information?
a
i’m extremely partial to just killing every entry from older schemas
h
if this change necessarily breaks every existing pants lmdb store out there, then there's no reason I think not to just store the new `Platform` parameter in the existing `ShardedLmdb`
I wonder what the user experience for that is gonna look like though
`store` and `load` in `cache.rs` are just gonna fail sometimes if they read an older entry or can't properly deserialize one from bytes
a
does it make sense to have a platform parameter in every shardedlmdb instance?
which is why making a new shardedlmdb sounds very reasonable
h
well, in every shardedlmdb instance that currently stores a bazel `ExecuteResponse`
hm, I guess I could write the `lookup` function there to try to deserialize the new version of the type - with a `Platform` on it - from the raw `Bytes` we store in lmdb, and just return None if it can't, same as if there was just no cache entry for this yet
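a lenient lookup along those lines might look roughly like this (a sketch with made-up names and a toy encoding, not the actual `sharded_lmdb` code):

```rust
// Hypothetical sketch (not the real pants code): a lenient `lookup`
// that tries to decode the newer entry format - a leading platform tag
// byte followed by the serialized response - and treats anything it
// can't decode as a plain cache miss instead of an error.

#[derive(Debug, PartialEq)]
pub enum Platform {
    Linux,
    Darwin,
}

#[derive(Debug, PartialEq)]
pub struct CachedResult {
    pub platform: Platform,
    pub payload: Vec<u8>,
}

/// Returns None for empty, old-format, or otherwise undecodable
/// entries, which callers then handle exactly like a cache miss.
pub fn lookup(bytes: &[u8]) -> Option<CachedResult> {
    let (&tag, payload) = bytes.split_first()?;
    let platform = match tag {
        0 => Platform::Linux,
        1 => Platform::Darwin,
        // Unknown tag: probably written under the old schema.
        _ => return None,
    };
    Some(CachedResult {
        platform,
        payload: payload.to_vec(),
    })
}
```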
so then the user experience would be, having to re-run every old EPR that was made before they upgraded to the version of pants with this change
but I think that might happen anyway?
if you upgrade pants in general doesn't that invalidate lmdb?
a
Some unstructured thoughts:
1. The reason we’re on a forked version of lmdb is because the maintainer has been ignoring pull requests for about two years. We’re on exactly master + https://github.com/danburkert/lmdb-rs/pull/43
2. No, we don’t invalidate things on pants releases - one of the nice things about the process execution API is that it doesn’t matter what version of pants you used, if it’s running the same process, we’ll hit the same cache.
3. Deleting the old LMDB and just using a new one with a new schema would be reasonable. But we haven’t built any mechanism to do this, and sometimes people go backwards in time in terms of pants releases, as well as forwards. The nearest equivalent would be to pick a new name for the file and ignore the old one, or to pick a new database name. We can leave the deleting to users, but also, the process execution lmdb should be trivially small for everyone right now, as it’s only really used for cloc (except by us pants developers)
4. It occurs to me that we actually may want platform in the cache key rather than value. Or both. I’m not sure whether Platform is already mixed into the cache key, in terms of a platform property. But if I want to ask for “a pex built on Linux” I can’t really ask for that if it’s not in the key…
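Sketching point 4 for concreteness (hypothetical code, not anything that exists in pants today):

```rust
// Hypothetical sketch: mix the platform into the *key*, not just the
// value, so a consumer can ask for "this action's result, as executed
// on Linux" directly.

#[derive(Clone, Copy)]
pub enum Platform {
    Linux = 0,
    Darwin = 1,
}

/// Append one platform byte to the 32-byte action fingerprint,
/// yielding a 33-byte LMDB key.
pub fn platform_key(action_fingerprint: &[u8; 32], platform: Platform) -> [u8; 33] {
    let mut key = [0u8; 33];
    key[..32].copy_from_slice(action_fingerprint);
    key[32] = platform as u8;
    key
}
```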
h
@average-vr-56795 if we're going to start introducing some kind of notion of schema version, it might be good to do that in a single commit that also renames the lmdb filename, so we never have to deal with trying to read incompatible data from the pre-schema-version version of the store
a
One of the joys of protobuf’s design is that if we’re using a protobuf as a wrapper, that is a form of schema versioning, as long as we’re happy with its constraints…
But if we’d like another mechanism for versioning a schema, I agree that rolling it into this PR sounds good 🙂
h
yeah, I don't think it's a good idea to always count on protobuf versioning; and in any case I don't think there's any way to implement the Platform change we need without some kind of schema-breaking change
even if we use our own protobuf for this, we'll still break things once
a
Yeah 🙂 Sounds good
h
did that suggestion of creating a global constant integer `SCHEMA_VERSION` in the `sharded_lmdb` code and hashing that integer into something that can be XOR'd with the 32 bytes of the existing `Fingerprint` seem reasonable?
a
I’d be tempted to do it as a prefix byte to the 32 bytes, rather than an xor - makes it easier for us to remove stale entries. But the general approach sounds reasonable, yeah 🙂
h
hm, that's easier to implement too
is there any lmdb-related reason why we can't just add one or two bytes, have a 33 or 34 byte key?
a
No reason at all 🙂
Treat LMDB as just a mapping from arbitrary byte-array to arbitrary byte-array :)
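e.g., treated that way, the prefix-byte scheme is just (a minimal sketch - the names are made up, not the real pants code):

```rust
// Sketch of the prefix-byte idea: bump SCHEMA_VERSION whenever the
// on-disk format changes, and entries written under an older schema
// simply never match. Stale prefixes are also easy to scan for and
// delete later.

pub const SCHEMA_VERSION: u8 = 1;

/// Prefix the schema version byte onto the 32-byte fingerprint,
/// producing the 33-byte key actually handed to LMDB.
pub fn versioned_key(fingerprint: &[u8; 32]) -> [u8; 33] {
    let mut key = [0u8; 33];
    key[0] = SCHEMA_VERSION;
    key[1..].copy_from_slice(fingerprint);
    key
}
```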
h
awesome