so in order to fully remove the `Bytes` `stderr` and `stdout`... | Pants #development

hundreds-breakfast-49010

06/03/2020, 8:55 PM

so in order to fully remove the `Bytes` `stderr` and `stdout` fields from `FallibleProcessResultWithPlatform`, I think we need to change the flow on the `store` method in `cache.rs`

hundreds-breakfast-49010

06/03/2020, 8:56 PM

right now one of the things `store()` does is read the bytes from the `FallibleProcessResultWithPlatform`, create new digests by storing them in `self.file_store`, and then put those digests into `self.process_execution_store`

witty-crayon-22786

06/03/2020, 8:56 PM

not quite, i don’t think

witty-crayon-22786

06/03/2020, 8:56 PM

the process_execution_store holds a protobuf struct

witty-crayon-22786

06/03/2020, 8:57 PM

which has digest fields

hundreds-breakfast-49010

06/03/2020, 8:57 PM

right

witty-crayon-22786

06/03/2020, 8:57 PM

… so i think that in the new flow you just pass them in.

witty-crayon-22786

06/03/2020, 8:57 PM

the per-file digests do not go in the process_execution_store.

witty-crayon-22786

06/03/2020, 8:57 PM

just the protobuf, afaik.

witty-crayon-22786

06/03/2020, 8:58 PM

but

witty-crayon-22786

06/03/2020, 8:59 PM

“does is read the bytes from the `FallibleProcessResultWithPlatform`, create new digests by storing them in `self.file_store`”

part of the reason this might be misleading is that these aren’t “new” digests

witty-crayon-22786

06/03/2020, 8:59 PM

they’ll always be identical for the same bytes. so there is no new/old

witty-crayon-22786

06/03/2020, 8:59 PM

and so, if you already have the digest, you’re good to go.
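The point that a digest depends only on the bytes can be shown with a minimal sketch. The `Digest` struct and `digest_of` function below are hypothetical stand-ins, not the Pants types, and `DefaultHasher` is swapped in for the real cryptographic fingerprint only so the example needs nothing beyond the standard library.

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::Hasher;

/// Hypothetical stand-in for a content digest: a fingerprint plus the content length.
/// ASSUMPTION: a real digest uses a cryptographic hash; `DefaultHasher` is used here
/// only to keep the sketch std-only.
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub struct Digest {
    pub fingerprint: u64,
    pub len: usize,
}

/// Computes a digest purely from the bytes, without consulting any store.
pub fn digest_of(bytes: &[u8]) -> Digest {
    let mut hasher = DefaultHasher::new();
    hasher.write(bytes);
    Digest {
        fingerprint: hasher.finish(),
        len: bytes.len(),
    }
}
```

Because nothing but the bytes goes into the computation, storing the same stdout twice cannot produce a "new" digest; a caller that already holds the digest has nothing further to compute.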

hundreds-breakfast-49010

06/03/2020, 8:59 PM

the thing I'm confused about is that, digests only make sense with respect to a specific store, right?

witty-crayon-22786

06/03/2020, 9:00 PM

@hundreds-breakfast-49010: ish.

hundreds-breakfast-49010

06/03/2020, 9:00 PM

like, if you have a digest that was saved in one store, and try to read it from a different store, it will fail

witty-crayon-22786

06/03/2020, 9:00 PM

but what the process_execution_store stores is only a protobuf

witty-crayon-22786

06/03/2020, 9:00 PM

not the files or directories

hundreds-breakfast-49010

06/03/2020, 9:00 PM

so that means that the `hashing::Digest` type is being stored as just the 40 hex characters within the protobuf, right?

witty-crayon-22786

06/03/2020, 9:01 PM

the protobuf has a type for a digest

witty-crayon-22786

06/03/2020, 9:01 PM

it’s the same protobuf as is used in remote execution

hundreds-breakfast-49010

06/03/2020, 9:02 PM

hm, and when we deserialize a `FallibleProcessResultWithPlatform` in `lookup()`, we'll just get back a `Digest` that points at actual data stored in the store in the underlying CommandRunner, right?

hundreds-breakfast-49010

06/03/2020, 9:02 PM

so actually, we shouldn't even need `self.file_store` anymore?

witty-crayon-22786

06/03/2020, 9:03 PM

um, let me confirm some of what i’m saying. i expect that the process_execution_store is a cache from process request to response

witty-crayon-22786

06/03/2020, 9:04 PM

…yea.

witty-crayon-22786

06/03/2020, 9:04 PM

it’s a cache from the digest of a Request to an actual Response object (which was stored as protobuf)

witty-crayon-22786

06/03/2020, 9:05 PM

but in the protocol, neither Request nor Response actually “contain” files/directories… instead, they have digests for them

witty-crayon-22786

06/03/2020, 9:05 PM

and those file/directory digests are stored in our capital S Store

witty-crayon-22786

06/03/2020, 9:06 PM

so: process_execution_store is only a cache from request to response, and the capital S Store is only a store of files/directories
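The split described above can be sketched as two independent maps. Every name here (`ProcessExecutionCache`, `FileStore`, `Response`, the `u64`-based `Digest`) is a hypothetical stand-in chosen for illustration; the real code uses a `ShardedLmdb` and the protobuf response type.

```rust
use std::collections::HashMap;

/// Hypothetical stand-in for a digest.
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub struct Digest(pub u64);

/// Hypothetical response record: it names its outputs by digest, never by content.
#[derive(Clone, Debug, PartialEq)]
pub struct Response {
    pub exit_code: i32,
    pub stdout_digest: Digest,
    pub stderr_digest: Digest,
}

/// Stand-in for `process_execution_store`: digest of a request -> response.
#[derive(Default)]
pub struct ProcessExecutionCache {
    entries: HashMap<Digest, Response>,
}

impl ProcessExecutionCache {
    pub fn store(&mut self, request: Digest, response: Response) {
        self.entries.insert(request, response);
    }

    pub fn lookup(&self, request: Digest) -> Option<&Response> {
        self.entries.get(&request)
    }
}

/// Stand-in for the capital-S `Store`: digest of a file -> the file's bytes.
#[derive(Default)]
pub struct FileStore {
    blobs: HashMap<Digest, Vec<u8>>,
}

impl FileStore {
    pub fn store_bytes(&mut self, digest: Digest, bytes: Vec<u8>) {
        self.blobs.insert(digest, bytes);
    }

    pub fn load_bytes(&self, digest: Digest) -> Option<&[u8]> {
        self.blobs.get(&digest).map(|b| b.as_slice())
    }
}
```

In this sketch a cache hit yields only digests; turning `stdout_digest` into bytes is a separate lookup against the file store, which is why the cache itself never needs to hold file content.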

witty-crayon-22786

06/03/2020, 9:06 PM

@hundreds-breakfast-49010: does that make sense?

hundreds-breakfast-49010

06/03/2020, 9:07 PM

right. `process_execution_store` is a `ShardedLmdb`, `file_store` is a `Store`

hundreds-breakfast-49010

06/03/2020, 9:07 PM

so what I'm saying is, I don't think there's any reason for `file_store` to exist anymore

witty-crayon-22786

06/03/2020, 9:07 PM

there is.

witty-crayon-22786

06/03/2020, 9:07 PM

…oh. i see what you mean.

hundreds-breakfast-49010

06/03/2020, 9:07 PM

unless there's a way that the `Store` holding the data that a given `Digest` refers to can be lost somehow

witty-crayon-22786

06/03/2020, 9:08 PM

because

crate::remote::populate_fallible_execution_result(
  self.file_store.clone(),
  execute_response,
  vec![],
  platform,
)

…doesn’t need to use the file store to load the content anymore?

witty-crayon-22786

06/03/2020, 9:08 PM

yea, that makes sense.

witty-crayon-22786

06/03/2020, 9:09 PM

@hundreds-breakfast-49010: ahm, re: https://pantsbuild.slack.com/archives/C0D7TNJHL/p1591218466108400?thread_ts=1591217721.101200&cid=C0D7TNJHL … yes, that is still sort of a thing. although a change i’m about to make will make it much less likely

hundreds-breakfast-49010

06/03/2020, 9:09 PM

I think so? i.e. my change would entail removing `store: Store` as an argument from `populate_fallible_execution_result`

hundreds-breakfast-49010

06/03/2020, 9:09 PM

but yeah, I think the data structures that we have kind of divorce a `Digest` from the `Store` that actually contains the data that `Digest` fingerprints

hundreds-breakfast-49010

06/03/2020, 9:10 PM

and at some point if we're passing around a `Digest` we'll need to get the `Bytes` for it out of a `Store`, and if it's the wrong `Store` that operation will just fail
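The failure mode described here, a valid `Digest` presented to a store that never received the bytes, can be sketched as follows. The `Store` type and its `store_bytes` / `load_bytes` methods are hypothetical simplifications for illustration, not the real Pants API.

```rust
use std::collections::HashMap;

/// Hypothetical stand-in for a digest.
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub struct Digest(pub u64);

/// Hypothetical in-memory store mapping digests to bytes.
#[derive(Default)]
pub struct Store {
    blobs: HashMap<Digest, Vec<u8>>,
}

impl Store {
    pub fn store_bytes(&mut self, digest: Digest, bytes: &[u8]) {
        self.blobs.insert(digest, bytes.to_vec());
    }

    /// The digest alone carries no content: loading fails unless this
    /// particular store was given the bytes at some point.
    pub fn load_bytes(&self, digest: Digest) -> Result<Vec<u8>, String> {
        self.blobs
            .get(&digest)
            .cloned()
            .ok_or_else(|| format!("{:?} not found in this store", digest))
    }
}
```

With two such stores, a digest saved into one loads successfully from it and returns an `Err` from the other, even though the digest value is identical in both calls.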

witty-crayon-22786

06/03/2020, 9:10 PM

@hundreds-breakfast-49010: yes to digests being used for multiple things.

witty-crayon-22786

06/03/2020, 9:11 PM

but sort of no: the process_execution_store is not `digest->content` … it’s `digest->content_of_a_response`

witty-crayon-22786

06/03/2020, 9:11 PM

anyway… the `cache.rs` file should be almost the only thing using digests for processes.

witty-crayon-22786

06/03/2020, 9:12 PM

@hundreds-breakfast-49010: so, yes: i think you can remove the file_store from `cache.rs`

hundreds-breakfast-49010

06/03/2020, 9:12 PM

ok that makes sense to me

hundreds-breakfast-49010

06/03/2020, 9:13 PM

and that means that `populate_remote_execution_result` also needs to not have a `store` argument anymore

hundreds-breakfast-49010

06/03/2020, 9:13 PM

tom's recent commit calls that function in `streaming.rs`

witty-crayon-22786

06/03/2020, 9:13 PM

yea. all of this will move to somewhere where we lift the result for python.

witty-crayon-22786

06/03/2020, 9:35 PM

…so, @hundreds-breakfast-49010: can i take back what i said above about removing the store from those codepaths?

hundreds-breakfast-49010

06/03/2020, 9:35 PM

yeah

hundreds-breakfast-49010

06/03/2020, 9:35 PM

especially if this is about the way that `remote.rs` uses `extract_output_files` 🙂

witty-crayon-22786

06/03/2020, 9:35 PM

not because it’s necessary now (i don’t think it is), but because the change i’m making is going to use it.

hundreds-breakfast-49010

06/03/2020, 9:36 PM

hm, what change?

witty-crayon-22786

06/03/2020, 9:36 PM

basically, when we hit the cache, we have a `FalliblePRWP`

witty-crayon-22786

06/03/2020, 9:36 PM

for https://github.com/pantsbuild/pants/issues/9942

witty-crayon-22786

06/03/2020, 9:36 PM

things can be garbage collected out of the store. and to avoid that, we hold “leases” on files

hundreds-breakfast-49010

06/03/2020, 9:36 PM

that's exactly the class of error I was concerned about

witty-crayon-22786

06/03/2020, 9:36 PM

right.

witty-crayon-22786

06/03/2020, 9:37 PM

i don’t think that you have to worry about it, but in my change i’m going to add lease extension when we hit that cache

witty-crayon-22786

06/03/2020, 9:37 PM

so if you could leave it (marked unused, if need be) that would be appreciated.
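A rough sketch of why the cache would keep a handle on the store: if entries are garbage collected once their lease lapses, a cache hit has to extend the leases of the digests it returns. Everything below (`LeasedStore`, `extend_lease`, `garbage_collect`, the integer clock) is a hypothetical model for illustration and is not taken from the planned change.

```rust
use std::collections::HashMap;

/// Hypothetical stand-in for a digest.
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub struct Digest(pub u64);

struct Entry {
    bytes: Vec<u8>,
    lease_expires_at: u64,
}

/// Hypothetical store whose entries survive garbage collection only while leased.
/// ASSUMPTION: time is modelled as a plain integer supplied by the caller.
#[derive(Default)]
pub struct LeasedStore {
    entries: HashMap<Digest, Entry>,
}

impl LeasedStore {
    pub fn store_bytes(&mut self, digest: Digest, bytes: Vec<u8>, now: u64, lease: u64) {
        self.entries.insert(digest, Entry { bytes, lease_expires_at: now + lease });
    }

    pub fn load_bytes(&self, digest: Digest) -> Option<&[u8]> {
        self.entries.get(&digest).map(|e| e.bytes.as_slice())
    }

    /// Extends the lease; returns false if the content has already been collected.
    pub fn extend_lease(&mut self, digest: Digest, now: u64, lease: u64) -> bool {
        match self.entries.get_mut(&digest) {
            Some(entry) => {
                entry.lease_expires_at = now + lease;
                true
            }
            None => false,
        }
    }

    /// Drops every entry whose lease has expired at `now`.
    pub fn garbage_collect(&mut self, now: u64) {
        self.entries.retain(|_, e| e.lease_expires_at > now);
    }
}
```

In this model a cache hit that calls `extend_lease` on the stdout and stderr digests keeps them loadable past the next collection, while a digest whose lease was never extended disappears and later loads fail.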

hundreds-breakfast-49010

06/03/2020, 10:38 PM

okay, so the last place we need to think about `Store` concerns is in `intrinsics.rs`, in `multi_platform_process_request_to_process_results`

hundreds-breakfast-49010

06/03/2020, 10:39 PM

oh wait, that has a `Core` and `Core` has a `Store`. so maybe that's fine

hundreds-breakfast-49010

06/03/2020, 10:39 PM

assuming that's the right store

witty-crayon-22786

06/03/2020, 10:39 PM

there is only one.

witty-crayon-22786

06/03/2020, 10:39 PM

(capital S Store)
