Evidence, Guidelines, Registries, and Living-Textbook Method

Part 1/Chapter 1/5-min read

Evidence, Guidelines, Registries, and Living-Textbook Method

How to read vascular evidence at the bedside: which trial, registry, or guideline statement is strong enough to change management for this patient, this lesion, and this anatomy. The chapter frames evidence appraisal, guideline trustworthiness, and registry use so that recommendations stay tied to current source support.

Edited by Tal M. Hörer MD, PhD · David T. McGreevy MD, PhD·Reviewed May 2026·6 sections · 26 references

Evidence-methods explainer: A journal-club style map of how evidence, guidelines, registries, and living updates should be read.

General medical education, not patient-specific advice.

Choose the hosts

Evidence appraisal and methodological standards

Evidence-based vascular care integrates the best current evidence with patient anatomy, operative risk, life expectancy, symptom burden, care goals, and local procedural performance .

GuidelinesEvidence appraisal tools by study design

Evidence type	Primary appraisal focus	Prescribed tool
Parallel-group randomized trial	Reporting completeness and bias domains	CONSORT 2010, RoB 2
Observational intervention study	Reporting transparency and confounding	STROBE, ROBINS-I
Diagnostic accuracy study	Reporting completeness and patient-selection bias	STARD 2015, QUADAS-2
Prognostic or prediction model	Transparency, calibration, and discrimination	TRIPOD
Systematic review and meta-analysis	Search rigour and synthesis credibility	PRISMA 2020, AMSTAR 2

Parallel-group randomized trial

Primary appraisal focus: Reporting completeness and bias domains
Prescribed tool: CONSORT 2010, RoB 2
Citation

Observational intervention study

Primary appraisal focus: Reporting transparency and confounding
Prescribed tool: STROBE, ROBINS-I
Citation

Diagnostic accuracy study

Primary appraisal focus: Reporting completeness and patient-selection bias
Prescribed tool: STARD 2015, QUADAS-2
Citation

Prognostic or prediction model

Primary appraisal focus: Transparency, calibration, and discrimination
Prescribed tool: TRIPOD
Citation

Systematic review and meta-analysis

Primary appraisal focus: Search rigour and synthesis credibility
Prescribed tool: PRISMA 2020, AMSTAR 2
Citation

Randomized trials are evaluated for reporting completeness using CONSORT 2010 and for methodological bias using RoB 2 . Observational evidence, including cohort, case-control, and registry analyses, is evaluated for reporting transparency using STROBE and for risk of bias using ROBINS-I . Diagnostic accuracy studies underpinning vascular imaging and physiologic tests are evaluated for reporting completeness using STARD 2015 and for methodological quality and applicability using QUADAS-2 . Systematic reviews and meta-analyses are evaluated for reporting completeness using PRISMA 2020 and for methodological credibility using AMSTAR 2 .

Guideline development and recommendation strength

Guidelines translate evidence into structured practice recommendations. The GRADE methodology provides the standard vocabulary for rating evidence certainty and guidance strength . GRADE rates certainty of evidence as high, moderate, low, or very low. Randomized-trial evidence starts at high certainty and observational evidence at low certainty; certainty is rated down for risk of bias, inconsistency, indirectness, imprecision, and publication bias, and rated up for a large effect magnitude, a dose-response gradient, or plausible residual confounding that would bias against the observed effect. The Evidence to Decision framework incorporates clinical priority, expected effects, evidence certainty, patient values, resources, cost-effectiveness, equity, acceptability, and feasibility .

A strong recommendation applies to most patients and supports default practice unless contraindicated by patient-specific factors. A conditional recommendation requires explicit individual selection based on benefit-harm trade-offs and patient preferences . ACC/AHA guidelines grade every recommendation by Class of Recommendation and Level of Evidence . Class I means benefit far outweighs risk (is recommended), Class IIa that benefit outweighs risk (is reasonable), Class IIb that benefit equals or marginally exceeds risk (may be considered), and Class III captures either no benefit or harm. Level A rests on high-quality evidence from more than one randomized trial or meta-analyses of high-quality trials, B-R on moderate-quality evidence from one or more randomized trials, B-NR on nonrandomized or observational studies, C-LD on limited data, and C-EO on expert opinion.

Guideline trustworthiness and methodology are appraised using AGREE II, Guidelines 2.0, and NASEM standards, which assess scope, stakeholder involvement, development rigour, conflict management, and external review . Recommendation strength reflects the balance of benefits and harms rather than citation volume alone .

Registries and routinely collected data

Registries capture real-world practice, uncommon presentations, device use, and outcomes in anatomic subgroups typically excluded from randomized trials. Observational databases remain vulnerable to selection bias, unmeasured confounding, changing definitions, missing data, and variation in follow-up. Studies using electronic health records, claims data, and disease registries are reported according to the RECORD extension to STROBE . Registry analyses used to infer causal intervention effects require explicit bias assessment with ROBINS-I .

Artificial intelligence and prediction models

Prediction scores, risk calculators, and diagnostic classifiers require transparent reporting, external validation, and calibration to the local target population. The TRIPOD statement governs the reporting of multivariable prediction models, while the TRIPOD+AI extension applies these standards to artificial-intelligence and machine-learning models . Randomized trials of AI-supported interventions are assessed using the CONSORT-AI extension .

Clinical integration and living updates

Clinical integration matches the patient-specific uncertainty to the corresponding evidence tool. Follow-up plans and surveillance tests rely on diagnostic accuracy and prediction-model calibration proven in similar patient populations. Evidence application is fundamentally constrained by false-positive findings, low prior probability, inadequate statistical power, bias, and multiple testing . Avoidable research waste stems from poor question prioritization, weak methodology, and incomplete reporting .

Living systematic reviews and guidelines continuously integrate emerging evidence, reacting to practice-changing trials, regulatory actions, and safety warnings . Updating methodologies require explicit triggers and defined thresholds to prevent unwarranted practice shifts from single studies. A topic enters living mode when three conditions hold together: it is a priority for decision-making, current certainty in the evidence is low or moderate, and new evidence is emerging fast enough to plausibly change the conclusion. Living mode then runs continual active surveillance with searches typically re-run monthly, against pre-specified criteria for when a newly identified study triggers re-synthesis and a guideline or textbook update.

Areas of controversy

The translation of observational registry data into causal treatment effects remains methodologically controversial, despite formal frameworks such as ROBINS-I and RECORD . The criteria for adopting artificial-intelligence models into routine vascular decision-making lack universal consensus, specifically regarding the required degree of external validation versus local calibration . The threshold at which new evidence triggers a living guideline update requires balancing rapid integration against the risk of overreaction to isolated findings .

References

1.
Evidence based medicine: what it is and what it isn't. 1996.
PubMed-indexed article1996
Evidence based medicine: what it is and what it isn't. 1996. doi:10.1136/bmj.312.7023.71.
PubMed DOI
2.
CONSORT 2010 reporting standard for parallel-group RCTs.
PubMed-indexed article2010
PubMed DOI Open PDF
3.
RoB 2: a revised tool for assessing risk of bias in randomised trials. 2019.
PubMed-indexed article2019
RoB 2: a revised tool for assessing risk of bias in randomised trials. 2019. doi:10.1136/bmj.l4898.
PubMed DOI
4.
STROBE statement for observational studies. 2007.
DOI publisher route2007
STROBE statement for observational studies. 2007. doi:10.1371/journal.pmed.0040296.
DOI
5.
ROBINS-I: a tool for assessing risk of bias in non-randomised studies of interventions. 2016.
PubMed-indexed article2016
ROBINS-I: a tool for assessing risk of bias in non-randomised studies of interventions. 2016. doi:10.1136/bmj.i4919.
PubMed DOI
6.
STARD 2015: an updated list of essential items for reporting diagnostic accuracy studies. 2015.
PubMed-indexed article2015
STARD 2015: an updated list of essential items for reporting diagnostic accuracy studies. 2015. doi:10.1136/bmj.h5527.
PubMed DOI
7.
QUADAS-2: A Revised Tool for the Quality Assessment of Diagnostic Accuracy Studies. 2011.
PubMed-indexed article2011
QUADAS-2: A Revised Tool for the Quality Assessment of Diagnostic Accuracy Studies. 2011. doi:10.7326/0003-4819-155-8-201110180-00009.
PubMed DOI
8.
Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): The TRIPOD Statement. 2015.
PubMed-indexed article2015
Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): The TRIPOD Statement. 2015. doi:10.7326/m14-0697.
PubMed DOI
9.
The PRISMA 2020 statement. 2021.
PubMed-indexed article2020
The PRISMA 2020 statement. 2021. doi:10.1136/bmj.n71.
PubMed DOI
10.
AMSTAR 2: a critical appraisal tool for systematic reviews that include randomised or non-randomised studies of healthcare interventions, or both. 2017.
PubMed-indexed article2017
AMSTAR 2: a critical appraisal tool for systematic reviews that include randomised or non-randomised studies of healthcare interventions, or both. 2017. doi:10.1136/bmj.j4008.
PubMed DOI
11.
PRISMA 2020 explanation and elaboration: updated guidance and exemplars for reporting systematic reviews. 2021.
PubMed-indexed article2020
PRISMA 2020 explanation and elaboration: updated guidance and exemplars for reporting systematic reviews. 2021. doi:10.1136/bmj.n160.
PubMed DOI
12.
Grading quality of evidence and strength of recommendations. 2004.
PubMed-indexed article2004
Grading quality of evidence and strength of recommendations. 2004. doi:10.1136/bmj.328.7454.1490.
PubMed DOI
13.
GRADE: an emerging consensus on rating quality of evidence and strength of recommendations. 2008.
PubMed-indexed article2008
GRADE: an emerging consensus on rating quality of evidence and strength of recommendations. 2008. doi:10.1136/bmj.39489.470347.ad.
PubMed DOI
14.
GRADE Evidence to Decision frameworks for clinical practice guidelines. 2016.
DOI publisher route2016
GRADE Evidence to Decision frameworks for clinical practice guidelines. 2016. doi:10.1136/bmj.i2089.
DOI
15.
AGREE II: advancing guideline development, reporting and evaluation in health care. 2010.
DOI publisher route2010
AGREE II: advancing guideline development, reporting and evaluation in health care. 2010. doi:10.1503/cmaj.090449.
DOI
16.
Guidelines 2.0: systematic development of a comprehensive checklist for a successful guideline enterprise. 2013.
PubMed-indexed article2013
Guidelines 2.0: systematic development of a comprehensive checklist for a successful guideline enterprise. 2013. doi:10.1503/cmaj.131237.
PubMed DOI
17.
Clinical Practice Guidelines We Can Trust. 2011.
NCBI Bookshelf2011
Source
18.
Tricoci ACC AHA Guideline Evidence 2009.
PubMed-indexed article2009
Tricoci ACC AHA Guideline Evidence 2009. doi:10.1001/jama.2009.205. PMID:19244190.
PubMed DOI Source
19.
RECORD statement for routinely collected observational health data. 2015.
DOI publisher route2015
RECORD statement for routinely collected observational health data. 2015. doi:10.1136/bmj.h214.
DOI
20.
TRIPOD+AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods. 2024.
PubMed-indexed article2024
TRIPOD+AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods. 2024. doi:10.1136/bmj-2023-078378.
PubMed DOI
21.
Reporting guidelines for clinical trial reports for interventions involving artificial intelligence: the CONSORT-AI Extension. 2020.
PubMed-indexed article2020
Reporting guidelines for clinical trial reports for interventions involving artificial intelligence: the CONSORT-AI Extension. 2020. doi:10.1136/bmj.m3164.
PubMed DOI
22.
Why Most Published Research Findings Are False. 2005.
PubMed-indexed article2005
Why Most Published Research Findings Are False. 2005. doi:10.1371/journal.pmed.0020124.
PubMed DOI
23.
How to increase value and reduce waste when research priorities are set. 2014.
PubMed-indexed article2014
How to increase value and reduce waste when research priorities are set. 2014. doi:10.1016/s0140-6736(13)62229-1.
PubMed DOI
24.
Living systematic review framework (Macfarlane et al 2017).
PubMed-indexed article2017
PubMed DOI
25.
Optimizing AHA/ACC guidelines for the digital age. 2024.
PubMed-indexed article2024
Optimizing AHA/ACC guidelines for the digital age. 2024. doi:10.1161/cir.0000000000001294.
PubMed DOI
26.
Further Evolution of the ACC/AHA Clinical Practice Guideline Recommendation Classification System. 2016.
PubMed-indexed article2016
Halperin JL, Levine GN, Al-Khatib SM, et al. Further Evolution of the ACC/AHA Clinical Practice Guideline Recommendation Classification System: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines. Circulation. 2016;133(14):1426-1428. doi:10.1161/CIR.0000000000000312.
PubMed DOI

Educational use only

AI assists this editorial workflow. Published updates are human-reviewed before publication.

Not intended to diagnose, monitor, predict, prognose, treat, or alleviate disease.

Verify clinically relevant information against primary sources and current guidelines.

AI Disclosure Privacy Terms Cautions

1.1Evidence appraisal and methodological standards

1.2Guideline development and recommendation strength

1.3Registries and routinely collected data

1.4Artificial intelligence and prediction models

1.5Clinical integration and living updates

1.6Areas of controversy