How Scoring Works

Updated Apr 2026

It's more complicated than you might think.

The short version

We don't just compare slider values. Both colors are converted into a color space designed to match human vision, then the distance between them is measured using CIEDE2000 — a formula from color science that quantifies how different two colors look, not how far apart their numbers are. It corrects for known issues where older formulas scored greens, blues, and purples unfairly. That distance is shaped into a game score that rewards getting the right color family — because remembering "it was a warm orange" is the hardest part, and that should count. Five rounds, 0-10 per round, max 50.

Why not just compare the slider values?

The game picker uses three sliders: Hue, Saturation, and Brightness (HSB). It's tempting to just measure how far off each slider is and call it a day. But that would produce scores that feel unfair — because human eyes don't see color the way numbers work.

Same 10% brightness shift — dark

Same 10% brightness shift — bright

Same 40° hue shift — saturated

Same 40° hue shift — gray

The same numerical difference on a slider can look dramatic or invisible depending on context. A scoring system based on raw slider math would reward and punish the wrong things. So we use color science instead.

The Scoring Pipeline

Four stages: convert to a perceptual color space, measure distance, shape it into a score, then adjust for hue accuracy.

1. Color Space Conversion

Both colors are converted from HSB to CIELAB — a color model specifically designed so that equal distances correspond to equal perceived differences. It was created by the International Commission on Illumination (CIE) and is the standard model in color science for measuring how colors look to people.

CIELAB has three axes:

Lightness — 0 is black, 100 is white. Each step looks equally different to the eye.

Green-Red axis. Negative values are green, positive values are red.

Blue-Yellow axis. Negative values are blue, positive values are yellow.

2. Measuring the Difference

With both colors in CIELAB, we measure the perceptual distance between them using CIEDE2000 — the most accurate Delta E formula in color science. Unlike the simpler CIE76 (which just measures straight-line distance in Lab), CIEDE2000 applies corrections for lightness, chroma, and hue that match how human vision actually works. It's the industry standard in manufacturing, printing, and display calibration.

The formula accounts for the fact that we're more sensitive to differences in some color regions than others — a hue shift near green looks bigger to us than the same shift near blue. CIE76 ignores this, which is why the original version of the game scored greens and purples unfairly harshly.

What does a CIEDE2000 distance actually mean?

Imperceptible to most people.

1-5

Slight difference. You'd notice it if you looked closely.

5-15

Clearly not the same color.

15-50

Wrong color family entirely.

50+

Unrelated colors.

3. Turning Distance into a Score

A raw CIEDE2000 value is a distance, not a game score. We need to map it to 0-10 in a way that feels fair. A straight line wouldn't work — it would be too generous for mediocre guesses and too harsh near the top. Instead, the scoring uses an S-shaped curve that's generous for close matches, punishing for misses, and steepest in the middle where differentiation matters most:

base score = 10 / (1 + (dE / 25.25)^1.55)

The two constants control the shape:

25.25

Midpoint. At a CIEDE2000 distance of 25.25, the score crosses 5/10. CIEDE2000 produces smaller distances than CIE76 for the same color pairs, so this is equivalent to the old CIE76 midpoint of 38.

1.55

Steepness. How sharply the curve drops off. Higher values make it more all-or-nothing. 1.55 gives a gradual falloff that rewards incremental precision.

This means precision matters most above 7/10 — you need to be very close to earn a high score, but bad guesses all compress toward 1-2.

Score (0-10) vs CIEDE2000 perceptual distance. White dot tracks your guess in the demo below.

4. Rewarding Color Memory

The base score treats all perceptual errors equally. But in a memory game, remembering the right color family is the hardest part and the most satisfying to get right. If you remembered "it was a warm orange" and nailed the hue but were off on brightness, that should count for something. Two adjustments tilt the score toward hue accuracy.

Hue Recovery

If you got the hue right (within about 25°), you earn back some of the points you lost from saturation or brightness errors:

hue accuracy = max(0, 1 - (hueDiff / 25)^1.5) sat weight = min(1, avgSat / 30) recovery = (10 - base) * hueAccuracy * satWeight * 0.25

Recovery is lighter than it was under the old CIE76 system (0.25 vs 0.50) because CIEDE2000 already handles hue-region differences more accurately. The bonus is still meaningful — nailing the hue when brightness or saturation are off can recover 1-2 points. On grays (saturation under 30%), recovery fades to zero — because hue is visually meaningless on desaturated colors.

Hue Penalty

If your hue is off by more than 30°, you take a penalty — but only on vivid colors where that difference is actually visible:

hue penalty factor = max(0, (hueDiff - 30) / 150) sat weight = min(1, avgSat / 40) penalty = base * huePenFactor * satWeight * 0.15

The penalty is much lighter than the old CIE76 system (0.15 vs 0.4) because CIEDE2000 already produces high distances for wrong-hue guesses — no need to double-count. The 30° dead zone means small hue errors are never penalized. And guessing the wrong hue on a gray costs nothing.

Final Score

round score = clamp(base + recovery - penalty, 0, 10)

Hue recovery (green) and penalty (red) by hue difference, at high saturation. Recovery rewards getting close; penalty kicks in after 30°.

Interactive See the scoring live

Adjust the sliders and watch the score update in real time. The white dot on the S-curve above tracks your position.

Target

Your Guess

— / 10

—

Delta E

—

Hue Recovery

—

Hue Penalty

Hue 180°

Saturation 50%

Brightness 60%

What Changed (Apr 2026)

The game originally used CIE76 (from 1976), the simplest Delta E formula — a straight-line distance in Lab space. It worked, but it didn't treat all colors equally. A 20° hue shift on green produced a CIE76 distance 3x larger than the same shift on blue. That meant greens, purples, and cyans were scored unfairly harshly for the same quality of guess. We switched to CIEDE2000, which corrects for this. The comparison tool below still works — you can see the difference for yourself.

Compare them yourself

Adjust the pick color and see how both systems score the same pair. Try the presets for the most interesting cases where they disagree.

Target

Your Pick

H 180

S 50

B 50

Old System (CIE76) higher = better

0.00

/ 10 per round

0.0

CIE76

0.00

Base

0.00

Adj.

Current System (CIEDE2000) higher = better

0.00

/ 10 per round (CIEDE2000 + hue adj.)

0.0

dE00

—

Feel

Component Differences

Hue

Saturation

Brightness

CIE76 dE

CIEDE2000

Most of the time, they agree

For small errors and obvious misses — the majority of real gameplay — both systems produce nearly identical scores.

Old (CIE76)/10

Current/10

Hue recovery vs honest distance

Both systems reward remembering the right color family. But the old CIE76 system gave heavy hue recovery (up to 50% of lost points) which could mask large brightness/saturation errors. CIEDE2000 keeps hue recovery lighter (25%) and lets the perceptual distance speak more honestly.

Old (CIE76)/10

Current/10

Uneven across the color wheel

CIE76 didn't measure all colors equally. A 20° hue shift on green produced a CIE76 distance 3x larger than the same shift on blue. This wasn't a design choice — it was a known limitation of the color space. CIEDE2000 corrects for it, which is why we switched.

Old (CIE76)/10

Current/10

What the data shows

We ran both systems across every hue at multiple error sizes. The results were clear.

For small errors, they agree

When the pick is close to the target — the most common gameplay scenario — both systems produce nearly identical scores. Across 12 hues with a typical small error, 8 pairs were within 0.3 points. For the range where most games are decided, it doesn't matter which system you use.

CIEDE2000 is more uniform across colors

A 20° hue shift on green produces a CIE76 distance of 19.0 but a CIEDE2000 distance of just 6.2 — a 3:1 ratio. The same shift on red is 22.9 vs 15.1 — only 1.5:1. The old system was significantly harsher on greens and purples than on reds and oranges for the same size error. There was no gameplay reason for that — it was an artifact of CIE76's known non-uniformity. CIEDE2000 corrects for this.

Hue recovery works with either foundation

Hue recovery — the game-design layer that rewards remembering the right color family — works with either foundation. When you nail the hue but miss the brightness, recovery adds 1-2 points. When you get the hue wrong, a light penalty applies. This carried over to CIEDE2000 with retuned constants.

The result

CIEDE2000 is now live. The game-design layer — hue recovery, hue penalty, the S-curve — carried over with recalibrated constants. The S-curve midpoint moved from 38 (CIE76 scale) to 25.25 (CIEDE2000 scale), and the hue adjustments were reduced (recovery 0.50 → 0.25, penalty 0.4 → 0.15) because CIEDE2000 already handles the cross-region fairness that the old adjustments were partially compensating for.

The overall difficulty is unchanged — average scores across 50,000 random color pairs are within 0.001 of the old system. But greens, blues, and purples are no longer penalized for being in the wrong part of the color wheel.