Every week, we run a calibration drill on our inspection team. It takes about 45 minutes. It is the single most valuable thing we do to keep the grading rubric honest, and it costs us nothing but the time.
What we do
On Monday morning we pull 25 Gaylord boxes from inbound. We deliberately bias the sample toward edge cases: a couple of Grade B boxes that almost-but-not-quite slip to C, a Grade C that has been repaired and might honestly be a B, a single Grade D that is wet and obvious, a couple of textbook Grade As. We do not tell the inspection team which is which.
Each inspector grades the 25 boxes independently, writing their grade on a clipboard. They do not see each other's answers.
Then we sit at a table, look at the boxes one by one, and read out the grades.
What we look for
Drift. Specifically, we look for any inspector who is more than half a letter grade off the team mean on three or more boxes in the set. That is the signal that their internal rubric has slipped, usually unconsciously.
When drift shows up, we go back to the photo standards. We have a folder of about 200 reference photos, organized by grade, that everyone trained against initially. We pull the relevant references and walk through where the drift happened.
Drift goes both ways. Sometimes an inspector is grading too generously. Sometimes too strictly. Both are corrected the same way.
Why it matters
Without calibration, grading standards slowly soften. The yard gets busy. Someone is in a hurry. A box that should be C gets called B because it is faster to move on. The next inspector sees the B and adjusts their own internal sense of B downward to match. Within a month the standard has drifted half a letter and the customer who has been buying B from you for two years suddenly notices the load is different.
The drill catches this before customers do.
What it does not catch
It does not catch deliberate fraud. We have never had that issue. It does not catch macro-level mis-calibration — if our entire team has drifted in the same direction, the drill will not flag it. For that we use a quarterly external audit: a customer or third party comes through and grades a load with us, and we compare.
Why we are writing it down
Because we get asked. And because if every yard in the country adopted some version of this, the used-corrugate market would be more trustworthy for everyone, including our competitors. We would like that. A market built on consistent grading is one where more customers feel confident buying used, which grows the addressable market for all of us.