Another article from the BTZS link, is "In Defense of Testing." In this excerpt, Davis is clearly familiar with an argument which frequently appears in the APUG threads.
"To avoid these unpleasant surprises, many photographers — especially those who choose to work in black-and-white with large-format cameras — test their materials in one way or another Most follow the traditional zone system test methods that involve in-camera exposure of the test films, more or less arbitrarily assigned film development times, a standardized printing method, and eye-match appraisal of the print results.
When done with care these tests can provide general guidance for the field use of the materials, but these empirical methods are neither very reliable nor very efficient for a number of reasons: for example, there’s no convenient way to calibrate the individual increments of film exposure with any accuracy. In addition, although visual appraisal of print grays can provide some indication of the overall effect of the processes, it doesn’t permit very reliable analysis of the characteristics of the individual materials. In other words, we’re not likely to know for sure what we’ve done to the materials, nor can we know for sure how they’ve reacted. Finally of course, this approach to testing is very wasteful of both time and materials.
Traditionalists defend this testing method — some vehemently — on the grounds that involving the camera in the test simulates the conditions of practical use and is, therefore, not only convenient but desirable, Similarly, they are apt to argue emphatically that, after all, the purpose of this whole thing is to produce prints, so appraising print values must therefore be the most appropriate way to judge the materials’ performance.
In fact, that’s a technical non sequitur. These traditional testing procedures can’t supply material-specific information any more than driving your car around the block can inform you about the comparative quality of your motor oil, You can obviously tell whether the car runs satisfactorily or not, but you can’t know for sure what part the oil has played in that performance. There are simply too many unrecognized or uncontrolled variables in the procedure; there is no accurate way to quantify the results of such subjective tests, and you have no logical basis for assuming that the conclusions drawn are valid."