Rigour

The numbers behind the reading.

NORTH is not a vibe check. Every archetype, readiness band and maturity score is tested against synthetic organisations with known ground truth. Here is how we validate the signal.

The promise

We tell you what the diagnostic can and cannot do.

A leadership diagnostic only earns trust when its errors are visible and bounded. We publish the thresholds we test against, the synthetic data we generate, and the cases where the classifier still gets it wrong.

The result is not perfection. It is a known error profile: 85% archetype accuracy, no drift on stable trajectories, 89% misalignment detection, and an execution maturity score that is mechanically tied to the same dimensions that produce the archetype.

Validation metrics

Four gates. One pipeline.

Every release of the SIGNAL classifier must pass these thresholds before it reaches a respondent.

Archetype accuracy

85.4%

Across 8 archetypes

Every archetype is classified correctly more than 72% of the time on held-out synthetic organisations. The classifier is tuned against deliberate edge cases, not just easy centroids.

Stable-trajectory drift

0.0%

Archetype changes on stable runs

When an organisation is generated on a stable Governance Steward trajectory, the archetype does not change between T0, T6 and T12. Stability means stability, not statistical noise.

Misalignment detection

89%

Contradictory profiles flagged

We inject deliberate perturbations into synthetic dimension scores and verify that the diagnostic flags the contradiction or the archetype mismatch. Real signal, not hindsight.

Execution-maturity correlation

0.0

Mean absolute error vs. expected

The execution maturity score is derived from the same five-dimension composite used to generate the archetype, so the diagnostic cannot invent a maturity number that contradicts the signal.

The pipeline

How we manufacture doubt on purpose.

Generate synthetic organisations

We create 500 synthetic organisations with known archetypes, dimension profiles and trajectory types. Each organisation produces three records (T0, T6, T12), giving 1,500 test points.

Apply controlled perturbations

For misalignment tests we deliberately shift one or two dimensions against the archetype fingerprint — for example, making a Governance Steward score low on Accountability. The test then checks whether the diagnostic notices.

Run the classifier and contradiction detector

The same code that classifies real respondents classifies the synthetic data. A separate contradiction detector checks whether the dimension pattern is internally consistent with the returned archetype.

Measure and tighten

If any archetype drops below 72% accuracy, stable trajectories drift, or misalignment detection falls below 75%, we tighten the dimension fingerprints and re-run the pipeline until the thresholds hold.

Archetype fingerprints

Five dimensions. Eight ranges.

The classifier matches a respondent's dimension pattern against these fingerprints. The ranges are deliberately tight so that edge cases surface as contradictions, not confident misclassifications.

GS-04

Governance Steward

RISK1–3

VEL1–3

LEAD4–6

ACCT7–9

GOV7–9

SN-04

Strategic Navigator

RISK4–6

VEL2–4

LEAD7–8

ACCT4–6

GOV6–7

EA-04

Ethical Anchor

RISK2–4

VEL3–5

LEAD7–9

ACCT7–9

GOV5–7

BV-04

Bold Visionary

RISK7–9

VEL7–9

LEAD4–6

ACCT2–4

GOV2–4

SI-04

Systems Integrator

RISK4–6

VEL4–6

LEAD4–6

ACCT4–6

GOV4–6

CC-04

Catalyst Commander

RISK5–7

VEL7–9

LEAD7–9

ACCT3–5

GOV2–4

RR-04

Reluctant Realist

RISK1–3

VEL1–3

LEAD3–5

ACCT3–5

GOV5–7

PA-04

Pragmatic Architect

RISK4–6

VEL5–7

LEAD1–3

ACCT3–5

GOV3–5

Honest limits

What we do not claim.

It is not a predictive test.

NORTH reads how you trade off values under pressure. It does not predict promotion, failure, or stock price.

Archetypes are context-dependent.

The same leader can surface a different archetype in a different organisational moment. The score is a photograph, not a tattoo.

85% accuracy still means 15% error.

We report accuracy on synthetic data with known labels. In the wild, response quality, fatigue and cultural framing can move that number.

It does not replace judgment.

The Brief and the Coalition Map are inputs for leadership conversation. They are not instructions.

Methodology

Read the full methodology.

The 8 archetypes, 5 dimensions, 4 readiness bands, and the philosophy behind the NORTH diagnostic.

Methodology →Take SIGNAL