For a long time suspected that the batching was off - I had some junit tests that would periodically report as failing 2x but with different results.
They were also some of the cruftiest tests in our repo (tldr hardcoded to a localhost port) so I ended up forcing TestSerial and moving on.