We could more easily see the effects of crossover, mask errors, and the process in general if two sets of negatives and prints were made, one with the standard process, and one with the diluted process, and compared them side-by-side, with skin tones and a gray scale included in the subject matter. Any tests made of this or any non-standard process should always include a grayscale, IMO. It easily shows any crossover.
In my view this sort of test is the most obvious way to somewhat conclusively prove things out. I say "somewhat" because the lighting conditions are not specified nor are any reference colors in the scene, etc., other than a gray ramp. Now, if a gray scale shows a visual color shift this is pretty conclusive that there is a color cross. But a good gray scale does not prove the absence of ANY color cross.
I recall as a youngster being involved with such tests and finding significant color crosses in the skin tones, but NOT in a gray scale. At the time it seemed surprising, but when one thinks about the situation it is apparent that the gray scale is using a more or less equal balance between the cyan, magenta, and yellow dye layers of the film. Whereas with a flesh tone this relationship between the film dye layers is significantly unbalanced. So if interactions between dye layers are used as a means of controlling color reproduction then one can see how the two color scales (light to dark) might have different color cross characteristics.
A good point, RPC, in respect of your suggested test.. My problem which you may have gathered is that what I look at governs what I see and what I see depends on what I look at.
And I would say these are good points. So as I consider what I think you are looking at, ok there are original scenes that you presumably did not see personally. So you don't know for sure what they looked like. Someone photographed them, developed the film, and made some darkroom prints. Then they photographed the PRINTS with a digital camera, which we presumably don't know the spectral characteristics of that camera ( but they're almost certainly not a match for the human eye, which is what photo printscare really intended for). Then the prints and negs were sent to a different person who substantially repeated the process. Then the digital images were uploaded, and you viewed them on a monitor (with its own spectral characteristics). That's what you are using to make a judgment, right?
Now personally I would not wanna make too many conclusions from this sort of data. Now, if someone specifically wanted me to, I'd probably ask for a somewhat universal color reference, such as a Macbeth ColorChecker with its known spectral characteristics, to be included in the scene. Then I could at least sort of qualify the digital images (I wouldn't do it visually; I'd look at the RGB values in whatever color space is used, then convert these to CIELAB color, then compare against what the Macbeth chart nominally is. Without this, I wouldn't want to make any hard judgments about the film results.
I've spent a lot of years working with color printing (large scale, overseeing QC and solving lab color problems), and I know how touchy it is getting consistent good color reproduction in portrait work. So if the process varies significantly from spec I take it almost as a "given" that there will be unwanted color crosses. For reference I described some of the testing in this post:
https://www.photrio.com/forum/posts/2192714/
Now, I've been specifically trying to keep my nose out of this thread, as I recognize that a group of people are trying to explore this situation, and come to their own understanding of how such things work. But I think that RPC's position should be taken seriously, so I'm just weighing in a bit.
Ps, I should point out that the sort of testing I was involved with was specifically directed at studio portrait work. If we had been involved in photographing brightly colored cars, or perhaps foliage and flowers, etc., the test scenes and methodology would have probably been quite a lot different.