|
1
|
- Jan S. Krouwer, Ph.D.
Krouwer Consulting
- *:
info@KrouwerConsulting.com
|
|
2
|
- Introduction
- Problems with current evaluations
- Suggested improvements
- Better way to estimate total error
- Outliers
- Process capability
- Understanding published evaluations
- Remedies to improve assays
|
|
3
|
|
|
4
|
- Diagnostic assay quality has improved
- At the same time, clinicians rely more on test results to make clinical
decisions
- Elevated cholesterol → statin therapy
- Elevated PSA → prostate biopsy
- Elevated troponin → ACS treatment
|
|
5
|
- Manufacturers evaluate their assays
- Huge amounts of data are collected internally and externally
- Design improvements
- Leads to error prevention algorithms
- Package insert lists performance claims
- FDA reviews part of manuf. data (510(k), PMA, CBER)
- Hospital labs are required to evaluate assays
- Most evaluations are short “verification” assays
- Some hospital labs publish larger evaluation results
- But, …
|
|
6
|
- There are still problems
- Lifescan SureStep glucose meter
- 2700 complaints → 61 hospitalizations → 105 million
dollars in criminal and civil fines and settlements
- University of Washington hCG assay
- Unnecessary treatment due to test error, (not the only hCG case) →
16 million dollar lawsuit
- St. Agnes Prothrombin assay
- Lab error leads to wrong coumadin dose for 920 patients → 5
deaths
- Weekly product recalls are listed on the FDA web site
- http://www.fda.gov/opacom/Enforce.html
|
|
7
|
- How can manufacturers, hospital labs, and regulators improve evaluations
to better detect problems?
- We need to:
- ask better questions
- provide better data analysis
|
|
8
|
|
|
9
|
- Total error (TE) = average bias + 1.96 x imprecision
- Total error is calculated from a model which is wrong
- Implies that if you meet bias and precision goals, then you will meet
total error, not true
- In general, this analysis won’t detect the aforementioned problems
- NCEP (National Cholesterol Education Program) Bachorik PS and Ross JW. Clin Chem 1995 41: 1414-1420
- Peterson PH, Stöckl D, Westgard JO Clin Chem Lab Med 2001;39:589-595
|
|
10
|
|
|
11
|
|
|
12
|
- Many approaches to simulations (and software such as Simscript, SAS,
Excel add-ins)
- GUM approach is recent, international guideline
- Guide to the Expression
of Uncertainty in Measurement, (ISBN 92-67-10188-9) ISO, Geneva (1995).
|
|
13
|
- Glucose meter model is too simple! (2 error sources)
- “Worst case”: 3 sd errors occur for both error sources
- Tempted to improve assay for both error sources
- But, two worst case errors at the same time only 0.09%
- Better models have more than 2 error sources
- Must use probabilities to account for frequency of error
- Means that occasional 3 sd error is ok for an error source
- If error reduction is needed, use cost allocation
- Reduce error in source that is least costly – not equal reduction for
all sources
- Boyd JC and Bruns DE Clin Chem 2001 47: 209-214.
- Krouwer JS Clin Chem 2001 47: 1329-1330
|
|
14
|
- Effect of last slide
- Also says simulation is not easy: requires
- modeling
- characterizing distributions
- programming
- verifying results
|
|
15
|
- Krouwer JS Clin Chem 2002;48:919-927
|
|
16
|
- Correctly asses total error
- Estimate outlier rate
- And we (manufacturers, hospital labs, and regulators) can do this with
the same amount of data
|
|
17
|
- Total error accounts for 95% (or 99%) of differences
- This leaves 5% (or 1%) of differences largely unspecified
- For a busy lab that reports 1 million results, this could mean unknown
quality for 50,000 (or 10,000) results per year
- That’s why we need outlier limits and rates
|
|
18
|
- Outliers often get a special designation (as in, “Oh that error is due
to an outlier”)
- Outliers often get special treatment
- They are removed from many parameter estimates (that’s ok) but never
show up anywhere else (that’s not ok)
- Outliers are errors just like all other errors, they are simply larger
errors
|
|
19
|
|
|
20
|
- But there is no reference for our assay
- There is always a reference, although it may not be an accepted
reference
- True reference
- Existing commercial (field) assay used by clinicians
- But, we don’t know which assay is correct. True, but clinicians will
observe the difference
- Clinician’s expectation of patient result
- Replicating the reference result
is needed
|
|
21
|
- Bland – Altman method
- Get differences from reference method
- Transform to normal distribution, if needed
- Get standard deviation of differences
- TE = average difference ± t x sd of differences (t is approximately 2
for 95% total error)
- Bland JM and Altman DG Lancet 1986; 307-10
|
|
22
|
- Mountain plot method
- Get differences from reference method
- TE = desired percentiles from ordered distribution of differences (2.5
and 97.5 for 95% total error)
- Krouwer JS and Monti KL Eur J. Clin. Chem. Biochem. 1995;33:525-27
- Proposed Guideline NCCLS EP21P. NCCLS, 771 E. Lancaster Ave. Villanova,
PA, 2002
|
|
23
|
|
|
24
|
|
|
25
|
|
|
26
|
|
|
27
|
- Note what isn’t done
- Scatter plot
- r (correlation coefficient)
- Slope and intercept
- Nothing wrong in getting these but, they don’t answer the total error
question
|
|
28
|
|
|
29
|
|
|
30
|
|
|
31
|
- Two methods to estimate outlier rates
- Modeling GUM = Guide to Uncertainty in Measurement and others
- Guide to the Expression of Uncertainty in Measurement, (ISBN
92-67-10188-9) ISO, Geneva (1995).
- Counting
- Krouwer JS Clin Chem 2002;48:919-927
- EP20 (NCCLS guideline on outliers – in preparation)
|
|
32
|
|
|
33
|
- Cpm – A unitless measure of distance from target
- Allows for comparison among assays
- Can be estimated during evaluations with patient samples or during
routine assay use with controls
- Equivalent to total error scaled by specification limits
- Visualized with mountain plot
- Specification limits are medically acceptable limits
|
|
34
|
|
|
35
|
|
|
36
|
- Let’s define some types of financial support
- Completely independent – Site buys all materials
- Partially independent – Site receives free materials from
manufacturer(s). Study design, data collection, analysis, and report
done by site.
- Not independent – Manufacturer gives site materials. Study design, data
collection, analysis, and report done by some combination of site and
manufacturer.
|
|
37
|
- Conclusion: “Accu TnI is a sensitive and precise assay for the
measurement of cTnI”
- But, inside the article…
- Assay fails ESC / ACC recommendations for precision at medical decision
limit
- In 3 two way comparisons with other assays, 6%, 9%, and 9% of the time,
the 2 assays give results on different sides of the medical decision
limit
- Uettwiller-Geiger, Wu, Apple, et. al., Clin Chem 2002;48:869-876
|
|
38
|
- Manufacturer:
- Hospital lab:
- Imprecision – run more replicates
- Bias – re-standardize assay
- Interferences – Check for interfering substance in patient sample
- HAMA – Add blocking agent to sample and rerun
- High dose hook effect – dilute and rerun
- Krouwer JS. Arch Pathol Lab Med 1992;116:726-731
|
|
39
|
- Lab: Run more replicates (duplicates) for troponin
- Costs
- Assay cost is (at least) double – if all results are replicated
- Workflow is disrupted
- Benefits
- Some positives may be classified as negative, saving treatment costs
- Some negatives may be classified as positive, possibly saving lives
and malpractice lawsuits
- Evaluation: Find out how many “some” is and which costs more
- Leape LL Clin. Chem. 2002 48: 1871-1872
|
|
40
|
|
|
41
|
|