The comprehensive evaluation already exists. It’s called film study. Luckily we can access it for free on youtube from former NFL quarterbacks such as JT OSullivan and Kurt Warner. There are also people like Greg Cosell and Brian Baldinger at NFL Network.
Ultimately any omnibus number that tries to rank a QB’s overall play against the other, even if it has a number of variables feeding into an equation with an effect size as an output, will strip valuable context from the discussion. A QB on one team may look terrible in one place but amazing in another or vice versa (see Tua, Geno Smith, and Watson.) Some QBs are exceptional in some areas but severely lack efficacy in others (See Cam Newton). To put it in statistical terms, football is a gnarly mess of difficult to measure interaction effects.
I would advocate for qualitatively assessing traits and only using quantitative assessments as supplements. With any statistical analysis we need to understand the context in which the numbers exist or we’re prone to misinterpretation.
Chasing down a rabbit hole for a single way to rank quarterbacks in 2023, with our currently measured variables, is unlikely to yield a practically valuable outcome.
It is not possible for PFF to do a good qualitative analysis the way they are doing it. They are assigning an omnibus number to a multifaceted construct with serious myriad interaction effects. Their rankings are asking a question that’s inherently leading us away from useful information.
My beef isn’t just about the evaluation. The stats are the stats and I trust that you’ve done them appropriately. It’s that the entire approach, even downstream. Even if you added every single variable available on football reference, we still wouldn’t have truly actionable information. Good outcomes come from schemes that fit with good player traits and the stats on PFR do a poor job at measuring both those things.