I m still struggling with Pants restarting the scheduler whe Pants #general

I'm still struggling with Pants restarting the sch...

acoustic-library-86413

03/14/2024, 7:57 AM

I'm still struggling with Pants restarting the scheduler whenever any file changes across all of the machines in my developer team. It seems like all of my source roots are being appended to the Pants invalidation globs and file changes results in an exception that kills the scheduler. My configuration file is not modifying the pythonpath or the pants sys.path directly. Can anyone think of a way to troubleshoot this issue?

broad-processor-92400

03/14/2024, 8:47 AM

Often the pantsd log file will have information about which paths are causing the invalidation, so one step would be identifying some exact paths that are causing this. If you have any in-repo plugins, understanding their dependencies will be relevant too.

acoustic-library-86413

03/14/2024, 9:17 AM

We do not have any in-repo plugins. The pants.d logs show the following two messages in order whenever a file is edited:

Copy code

10:09:04.00 [ERROR] saw filesystem changes covered by invalidation globs: content changed (Digest('e43cc803ec7b0f7f52a9e9f9bfb93d8dc27120e6d3fe25b2f3d69f1108b3d54f', 170) fs Digest('82c2fe5500aea967af7a1d6d9996b7e64335b998edbb122d376808b2b2cd35ac', 170)). terminating the daemon.
...
10:09:04.65 [ERROR] service failure for <pants.pantsd.service.scheduler_service.SchedulerService object at 0x7bdff57c7970>.

I haven't been able to map this digest back to a specific filename, but the digest changes on every run.

broad-processor-92400

03/14/2024, 9:19 AM

Huh, that is unhelpful. Does increasing the log level to debug (https://www.pantsbuild.org/2.20/reference/global-options#level) provide any more info about that?

acoustic-library-86413

03/14/2024, 9:34 AM

Ugh. It's difficult to say. Every time I change logging level the whole Pants daemon goes haywire and plays by its own rules for subsequent runs. It did log some additional information when I set the level to trace, but unfortunately when I tried to set it back to

info

it just stopped showing logs entirely, even when restarting the scheduler. Running a single unit test takes ~5 seconds, so I'm assuming it's still restarting the scheduler behind the scenes, and just not printing it.

broad-processor-92400

03/14/2024, 9:35 AM

Ugh! sorry for the trouble. What version of pants?

acoustic-library-86413

03/14/2024, 9:35 AM

2.17.0. I'll try upgrading to the latest version now, shouldn't cause any issues 🙂 Thanks for the patience!

acoustic-library-86413

03/14/2024, 9:43 AM

I'm not able to make it print the "Reinitializing scheduler..." message, but running a single unit test takes 4.838630 seconds. I also notice a ton of these on subsequent runs:

Copy code

10:42:50.73 [DEBUG] Dependency SessionValues of Some("@rule(pants.engine.internals.build_files.parse_address_family(projects/some-folder))") changed.

Even if there are no file changes. Does that make sense?

broad-processor-92400

03/14/2024, 9:45 AM

To definitely force a restart of the daemon, either

pkill -f pantsd

rm -rf .pants.d/pids

should do the trick.

acoustic-library-86413

03/14/2024, 9:50 AM

Hmm. Is there a reason some messages aren't appearing in

pants.log

? I saw the

[INFO] Initializing scheduler...

in stdout, but it's not shown in any of the run_tracker logs or the main log file

pants.log

.pants.d

broad-processor-92400

03/14/2024, 9:52 AM

Yeah, I think the

pants.log

file is only messages from the daemon itself. I believe some of the messages visible in the terminal are from the client (that talks to the daemon), but and so aren't written to the file.

👍 1

acoustic-library-86413

03/14/2024, 9:52 AM

Ah, okay that makes sense.

acoustic-library-86413

03/14/2024, 9:55 AM

It seems like my issue may have been caused my multiple Pants invocations attempting to run at the same time. I'm running a file watcher that will trigger a docker rebuild. The watcher glob was a bit greedy, so it would also trigger a rebuild on changes to test-files. After modifying it I'm no longer able to reproduce the issue it seems. I was only able to see this due to the logging option that you suggested though!

👍 1

acoustic-library-86413

03/14/2024, 10:34 AM

But to be honest, I don't understand why a Docker build would kill the scheduler. I'll have to do some more digging later.

5 Views

Open in Slack

Previous Next