The evidence hierarchy we use, and why

Wellness & AI

METHOD

The evidence hierarchy we use, and why

Strong, promising, anecdotal — labelled out loud. The three-tier hierarchy that prevents both naïve scientism and uncritical wellness.

By Sabin · Wellness & AI3 March 20263 min read

Most wellness claims sit somewhere between a press release and a pilot study. You already know that. The problem is we too often treat everything as either proven or bogus.

Labeling evidence—out loud—changes what you choose, and how you change it. The three-tier hierarchy we use does that. Strong, promising, anecdotal—each label means something precise.

why a simple hierarchy matters more than another scale

Long taxonomies and weighted scoring systems sound rigorous. They often hide judgment beneath complexity. A three-tier system clarifies decisions. It resists both naïve scientism and uncritical wellness optimism. That tension shows up in trials of digital interventions and lifestyle programs alike (Lancet, 2024).

the three tiers defined—what each label actually signals

strong: replicated, controlled evidence that aligns with mechanism and clinical plausibility. Think multiple randomized trials or large meta-analyses. Strong means you can treat the intervention as a candidate for routine use (Cochrane review, 2024).

promising: limited trials, plausible mechanism, or consistent observational data. Promising means worth testing in your personal ledger—tracked, time-limited, reversible. It is not a prescription, but it is structured (BMJ Open, 2023).

anecdotal: case reports, testimonials, early prototypes, or single-site pilots. Use these for curiosity and hypothesis generation. Label them honestly. They are the least reliable, but often the most innovative (Hashimoto et al., 2025).

how the hierarchy sits inside the 3-Layer Stack

The hierarchy lives across Research, Ledger, Protocol. The research model surfaces evidence and assigns the label. The ledger model records your trial and outcomes. The protocol model turns promising steps into time-limited experiments. That division prevents evidence from being misread as mandate.

For example: a promising breathing app becomes a 30-day protocol in your ledger. You predefine outcomes. You stop or escalate based on data. That method respects both curiosity and safety [RCT, 12 weeks].

a short playbook you can use tonight

Find the evidence label before you commit. Ask: strong, promising, or anecdotal.
If strong — integrate with routine care and monitor for rare harms.
If promising — run an N=1 protocol in your ledger for 4–12 weeks with objective measures.
If anecdotal — read as a hypothesis. Do not spend money or abandon proven treatments.

This ordered playbook keeps you honest. It reduces decision friction. It also creates a reusable habit: label, track, decide.

practical signals that move an item from one tier to another

Look for replication, sample size, control conditions, and biological plausibility. Also look for harms reporting—absence of harm data is not evidence of safety. Small, consistent effect sizes across diverse cohorts push something from promising toward strong (JAMA, 2022).

replication across teams
pre-registered endpoints or registered trials
transparent adverse-event data
plausible mechanism and dose–response

Rank evidence as strong / promising / anecdotal in your notes. Use the ledger model to timestamp decisions. It’s a small ritual with outsized returns for clarity and consent.

The point: label evidence before you believe it.

ethical and practical limits—what the hierarchy does not do

This hierarchy is a tool, not a verdict. It cannot replace shared clinical judgment or context. You should not use it to short-circuit informed consent or to justify ignoring comorbidities. Nor does it eliminate uncertainty—only frames it (meta-analysis, n=4,200).

“Evidence labeled honestly wins trust. Claims dressed as certainty lose it.”

Strong evidence can still be misapplied. Promising findings often fail larger trials. Anecdote can mislead. Naming these states helps you steward risk and curiosity responsibly (Lancet, 2024).

Why it matters: you preserve agency and reduce harm by treating evidence as a map, not a mandate.

If you use free chat tools as part of your Research layer, ask them to return the label—strong, promising, or anecdotal—and the reasons. Force the model to say its confidence and the key gaps. That simple prompt changes outcomes and consent.

Evidence is not binary. The three-tier hierarchy keeps you between blind faith and sterile skepticism. It favors clarity. It invites judgement, documented and reversible. Use it across Research, Ledger, Protocol—and label everything out loud.

Recommended next

Three things to read next.

See all →

Keep reading usRSS feed Follow on Google News Save to Pocket

why a simple hierarchy matters more than another scale

the three tiers defined—what each label actually signals

how the hierarchy sits inside the 3-Layer Stack

a short playbook you can use tonight

practical signals that move an item from one tier to another

ethical and practical limits—what the hierarchy does not do

Three things to read next.

Find your starting point

Glossary · 5 terms

More personalised picks

Suggested for you