Skip to main content
MedCalc
Mail a PDF copy of this page to:
(Your email address will not be added to a mailing list)
working

Method comparison: Comparison of multiple methods

Description

Comparison of multiple methods is an extension of the Bland-Altman plot (Bland & Altman, 1986 and 1999) for more than two methods. For each method, the differences with a reference method are plotted against the values of this reference method (Krouwer, 2008).

The procedure produces multiple bias plots in one single display with all axes aligned to facilitate comparison of the different methods.

If duplicate or multiple measurements (with two methods) were performed per subject, you should use Bland-Altman plot with multiple measurements per subject.

Required input

Dialog box for multiple method comparison

Variables

First select the variables for the methods you want to compare.

Method

Select one of the following 3 methods.

  • Parametric (conventional): plots the differences or ratios between measurements against their average. Assumes constant bias and homoscedasticity (Bland & Altman 1986).
  • Non-Parametric: uses ranks or quantiles to assess agreement without assuming normality or constant variance (Bland & Altman 1999).
  • Regression-Based: models bias and limits of agreement as functions of the measurement magnitude. Useful when heteroscedasticity is present (Bland & Altman 1999).For details on the regression-based method, see Bland-Altman plot (section regression).

Options

  • First method is the reference method: the measurements of first selected method are the reference values (informational, this option is fixed).
  • Plot differences or ratios
    • Plot differences: Differences are calculated as measurement‑reference (*) so a positive difference is an overestimation and a negative difference is an underestimation.
    • Plot differences as %: With this option the differences will be expressed as percentages of the values on the axis (i.e. proportionally to the magnitude of measurements). This option is useful when there is an increase in variability of the differences as the magnitude of the measurement increases.
    • Plot ratios: When this option is selected then the ratios of the measurements will be plotted instead of the differences (avoiding the need for logarithmic transformation). This option as well is useful when there is an increase in variability of the differences as the magnitude of the measurement increases. Ratios are calculated as measurement/reference (*) so a ratio > 1 indicates an overestimation and a ratio < 1 indicates an underestimation.
    (*) You can reverse this by selecting the option Reference-Variable.
  • Draw line of equality: useful for detecting a systematic difference.
  • 95% CI of mean difference (*): the 95% Confidence Interval of the mean difference illustrates the magnitude of the systematic difference. If the line of equality is not in the interval, there is a significant systematic difference.
  • 95% CI of limits of agreement: shows error bars representing the 95% confidence interval for both the upper and lower limits of agreement.
  • Draw regression line of differences (*): this regression line may help to detect a proportional difference. Optionally, you can select to show the 95% confidence interval of this regression line.
  • Click Subgroups if you want to identify subgroups in the plots. A new dialog box is displayed in which you can select a categorical variable. The graph will use different markers for the different categories in this variable. Note that a legend cannot be displayed in this plot. To identify the subgroups, double-click on one of the observations to see its identification in the spreadsheet.

(*) or ratios when this option was selected.

Results for Parametric (conventional) and Non-Parametric method

The graphs and reports for the parametric and non-parametric methods are similar. In the parametric method, the mean difference and the standard deviation of the differences are used to define the limits of agreement.

In the parametric approach, the limits of agreement (LoA) are calculated as the mean difference ± 1.96 times the standard deviation of the differences. In the non-parametric approach, the LoA are defined by the 2.5th and 97.5th percentiles of the differences. Both methods aim to define an interval within which 95% of the differences between measurements are expected to lie.

For details on the regression-based method, see Bland-Altman plot (section regression).

Report

The results panel displays the following information:

Method comparison - Multiple methods

Reference method

Method1

Methodology: Parametric (conventional) (Bland & Altman, 1986)

Systematic differences

Variable

n

Differences (Variable - Reference)

Mean

SD

95% CI

Method2

85

-1.3082

8.2327

-3.0840 to 0.4675

Method3

85

-2.1894

9.7012

-4.2819 to -0.09691

Method4

85

0.4729

7.1568

-1.0707 to 2.0166

Method5

85

6.6212

7.2190

5.0641 to 8.1783

Limits of agreement

Variable

n

Limits of agreement

Lower Limit of Agreement

95% CI

Upper Limit of Agreement

95% CI

Method2

85

-17.4444

-20.4911 to -14.3977

14.8280

11.7813 to 17.8746

Method3

85

-21.2038

-24.7939 to -17.6136

16.8249

13.2348 to 20.4151

Method4

85

-13.5544

-16.2030 to -10.9059

14.5003

11.8518 to 17.1488

Method5

85

-7.5281

-10.1997 to -4.8566n>

20.7705

18.0989 to 23.4420

Regression

Variable

Intercept

95% CI

Slope

95% CI

P

Method2

2.2897

-5.2372 to 9.8165

-0.02799

-0.08490 to 0.02891

0.3307

Method3

7.2383

-1.4269 to 15.9036

-0.07335

-0.1389 to -0.007837

0.0287

Method4

-1.8633

-8.4232 to 4.6965

0.01818

-0.03142 to 0.06777

0.4681

Method5

5.8733

-0.7626 to 12.5091

0.005819

-0.04435 to 0.05599

0.8181

Absolute percentage error

Variable

n

Median

95% CI

95th Percentile

95% CI

Method2

85

3.97%

2.92 to 5.38

13.00%

10.78 to 17.66

Method3

85

5.41%

4.31 to 6.86

15.80%

13.05 to 18.81

Method4

85

3.72%

2.96 to 4.63

9.98%

9.04 to 15.73

Method5

85

6.71%

5.98 to 7.70

15.85%

12.49 to 19.01

  • Identifier for the reference value: the variable for the reference method.
  • Systematic differences: n (sample size), mean, SD and 95% CI of the differences.
  • Limits of agreement: the lower and upper limits of agreement with 95% CI.
  • Parameters of the regression of the differences against the reference value: intercept and slope with 95% CI, and P-value for slope.
  • Absolute percentage error: the absolute percentage error (APE) is calculated as 100 x ABS[(yref)/ref] where y is the observation and ref is the reference value. MedCalc calculates the median APE (MdAPE) and the 95th percentile of the absolute percentage error. The 95th percentile APE is interpreted as follows: the percentage difference between a measurement and the reference value is not expected, with 95% certainty, to exceed this value. MedCalc also reports the 95% confidence intervals for both statistics, if sample size is large enough.

Graph

The graphical display consists of multiple frames with Bland & Altman plots using the selected options.

Multiple method comparison graph (Bland-Altman plots)

Unlike other MedCalc graphs, this graphical display has limited editing possibilities:

  • you cannot add reference lines or draw boxes, arrows or add text frames.
  • the scaling of the axes is the same in each frame.
  • the titles are the same for each frame. To create variations in the titles for the different frames, you can use the following symbolic fields:
    • !#!: this will insert the frame number in the title
    • !a!: this will insert a character A, B, ..., corresponding with the frame number, in the title
    • !var!: this will insert the variable name in the title

Confidence intervals

Optionally, confidence intervals may be displayed for the average difference and for the limits of agreement. These confidence intervals can be represented as error bars or horizontal lines. Right-click on the error bar to set formatting options.

Graph

The example shows the regression line with its 95% confidence interval and the 2 limits of agreement curves.

Regression-based Bland-Altman plot

Literature

  • Bland JM, Altman DG (1986) Statistical method for assessing agreement between two methods of clinical measurement. The Lancet i:307-310. PubMed
  • Bland JM, Altman DG (1995) Comparing methods of measurement: why plotting difference against standard method is misleading. The Lancet 346:1085-1087. PubMed
  • Bland JM, Altman DG (1999) Measuring agreement in method comparison studies. Statistical Methods in Medical Research 8:135-160. PubMed
  • Hanneman SK (2008) Design, analysis, and interpretation of method-comparison studies. AACN Advanced Critical Care 19:223-234. PubMed
  • Krouwer JS (2008) Why Bland-Altman plots should use X, not (Y+X)/2 when X is a reference method. Statistics in Medicine 27:778-780. PubMed

See also