Well thanks for the response anyway. My curiosity has been somewhat sated.
Fwiw I don't think that either is strictly "correct." Rather they are given as empirical methods where sometimes one seems to be a better predictor than the other.
My three opinions:
1. The modulation transfer functions of two parts of the system can be multiplied to get the MTF of the composite system. That's how MTFs are intended to work. If you do that, and define resolution as "the frequency at which the MTF is some value (like 20% or whatever)", then you will probably get a formula similar to: 1/f_total = 1/f1 + 1/f2, where the f's are frequency numbers such as 30 lp/mm. So if your two parts resolve at f1 = 50 lp/mm and f2 = 33 lp/mm, the composite resolution is f_tot ~= 20 lp/mm.
2. If you define resolution as spot size, then you can convolve (not multiply) the spot diameters of two parts of the system to approximate the spot diameter of the total. Convolution means that the diameters are added in quadrature. This is described by: d_tot^2 = d1^2 + d2^2. So for example if the spot sizes produced by the two parts of the system were d1=0.02mm (1/50 mm) and d2=0.03mm (1/30 mm), the composite spot diameter is d_tot ~= 0.036 mm (1/27 mm).
If you figure that the spot size is about the reciprocal of the resolution frequency, d ~ 1/f, you can see that these two formulas work out fairly similar, though not exactly the same. It is my guess that the formula cited by Erwin Puts was a garbled version of argument #2.
3. My third opinion is that you don't need to know any of this stuff for pictorial use. I am a research scientist and do optical and angular resolution calculations for work, so I have to think about it at work. But for pictorial use, it would be better to practice nailing your focus, stopping down to f/5.6-8 as needed, and using a tripod where possible. I don't stay up late worrying about the resolving power of film.