The evidence shows that, under controlled conditions, LLM judges can align closely with clinician judgments on concrete, ...