When we evaluate classifiers, it is tempting to treat metrics (accuracy, ROC AUC, precision, recall, and so on) as neutral tools, just different ways of measuring the same thing. But that intuition is misleading. A metric does not simply measure performance; it defines what we mean by good performance in the first place.
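To make this concrete, here is a minimal sketch in plain Python. Everything in it is an illustrative assumption rather than data from a real system: the toy labels, the two hypothetical models (`pred_a`, `pred_b`), and the helper functions `accuracy` and `recall`. It shows one way two metrics can disagree about which model is "better" on the same predictions.

```python
# Hypothetical toy data: 100 examples, 10 positives (1) and 90 negatives (0).
y_true = [1] * 10 + [0] * 90

# Hypothetical model A: always predicts the majority (negative) class.
pred_a = [0] * 100

# Hypothetical model B: finds 8 of the 10 positives, misses 2,
# and raises 15 false alarms among the 90 negatives.
pred_b = [1] * 8 + [0] * 2 + [1] * 15 + [0] * 75


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def recall(y_true, y_pred):
    """Fraction of true positives that the model actually finds."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    return tp / (tp + fn) if (tp + fn) else 0.0


for name, pred in [("A", pred_a), ("B", pred_b)]:
    print(f"model {name}: accuracy={accuracy(y_true, pred):.2f}, "
          f"recall={recall(y_true, pred):.2f}")
```

Under these assumptions, accuracy ranks model A first (0.90 against 0.83), while recall ranks model B first (0.80 against 0.00). Neither ranking is wrong; each metric encodes a different notion of what counts as good.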