Clubcard data-quality lesson — multiple users on one card produced false positives in mining (the "garbage in" warning from the best-documented winner)

Summary

Claim. Even in the best-documented loyalty-card win, data-quality failures bit the analytics: multiple distinct shoppers using one Clubcard produced false positives in segmentation and propensity mining.

Source. mugleston.co.uk summary of Humby et al., Scoring Points (2004), accessed 2026-06-21.

Confidence. Verified — documented in the participant-authored book itself; a candid acknowledgement, not an external criticism.

Caveats. Documented as anecdote rather than measured error-rate; the precise size of the bias is not stated.

Implication / use. Concrete instantiation of the "garbage in" principle for any information-edge argument. If it happened to Tesco at Clubcard's analytical sophistication, expect it for any SMB. Supports Rule: three thresholds before claiming an information edge — volume to clear noise, clean data, an actual decision someone will act on (and timeliness before commoditisation) (clean data is the second of three thresholds).