An introduction to the Stochastic Search Inconsistency Factor Selection method using the ssifs package
Georgios Seitidis
Source:vignettes/An_introduction_to_ssifs.Rmd
An_introduction_to_ssifs.Rmd
Quick Revision
Stochastic Search Inconsistency Factor Selection (SSIFS) is the extension of Stochastic Search Variable Selection (SSVS) (George and McCulloch 1993) for identifying inconsistencies in Network Meta-Analysis (NMA). SSIFS is a two-step method, where in the first step inconsistency factors are specified, and in the second step, variable selection on inconsistency factors is performed using the SSVS method.
Inverse-variance NMA model
The inverse-variance random-effects NMA model adjusted to include \(\ell =1, 2, \ldots, p\) inconsistency factors is described by the following equation \[ \boldsymbol{y = X\mu + \beta +bZ + \epsilon}, \quad \boldsymbol{\epsilon} \sim N(\boldsymbol{0}, \boldsymbol{\Sigma} ) \quad and \quad \boldsymbol{\beta} \sim N(\boldsymbol{0}, \boldsymbol{\Delta})\]
where \(\boldsymbol{y}\) denotes the treatments’ effect, \(\boldsymbol{X}\) the design matrix, \(\boldsymbol{\mu}\) the underlying basic contrasts, \(\boldsymbol{\beta}\) the normally distributed random-effects, \(\boldsymbol{b}\) the effect of inconsistency factors, \(\boldsymbol{Z}\) the inconsistency factor’s index matrix, and \(\boldsymbol{\epsilon}\) the normally distributed sampling errors. Correlation matrix \(\boldsymbol{\Delta}\) is a block diagonal matrix, assuming common heterogeneity across treatment comparisons, while covariance matrix \(\boldsymbol{\Sigma}\) is assumed known and obtained from the data based on Franchini et al. (2012). Matrix \(\boldsymbol{Z}\) contains as elements values 1, -1 and 0, indicating in which comparisons inconsistency factor is added. Among the choices that may be considered for the specification of the \(\boldsymbol{Z}\) matrix are the Lu and Ades model (Lu and Ades 2006), the design-by-treatment model (Higgins et al. 2012), and the random-effects implementation of the design-by-treatment model (Jackson et al. 2014).
Variable Selection
In SSIFS the effect of an inconsistency factor \(\ell\) is described from a mixture of two normal distributions, which can be written as \[b_{\ell} | \gamma_{\ell} \sim (1-\gamma_{\ell}) N(0, \psi_{\ell}^{2}) + \gamma_\ell N(0, c^{2}\psi_\ell^{2})\] where \(b_{\ell}\) is the effect of the inconsistency factor, \(\gamma_{\ell}\) is an auxiliary variable indicating if the inconsistency factor is included in the NMA model, and \(c\), \(\psi_{\ell}\) are tuning parameters controlling the mixing ability of the method.
In matrix notation SSIFS is written as \[\boldsymbol{b | \gamma} \sim N( \boldsymbol{0}, \boldsymbol{D_\gamma R D_\gamma} ) \] where \[ \boldsymbol{D}_{\boldsymbol{\gamma}}= \begin{pmatrix} a_1 \psi_1 & 0 & \dots & 0 \\ 0 & a_2 \psi_2 & \dots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \dots & a_\text{p} \psi_\text{p} \end{pmatrix} ,\qquad a_\ell= \begin{cases} 1, \quad \gamma_\ell=0\\ c, \quad \gamma_\ell=1 \end{cases}. \] Matrix \(\boldsymbol{R}\) denotes the prior correlation between the inconsistency factors. We can assume that inconsistency factors are independent by setting \(\boldsymbol{R} = \boldsymbol{I}\), or we can assume a dependency between inconsistency factors by using a Zellner g-prior as described bellow \[\boldsymbol{R}=g \boldsymbol{ (Z^{'}Z) } \sigma^{2}, \quad \pi (\sigma^{2} ) \propto \frac{1}{\sigma^2}.\] For the specification of the parameter \(g\), the unit information criterion (Kass and Wasserman 1995) is used, which translates in SSIFS to the total number of observed comparisons in the network.
Prior inclusion probabilities of the inconsistency factors are specified as \[\gamma_\ell \sim Bernoulli(1 - \pi_{con}^{ \frac{1}{p}}), \quad \ell = 1, 2, \ldots, p\] where \(\pi_{con}\) is the probability to have a consistent network and reflects our prior believes on how likely is to have a consistent network. In a review of 201 networks, 44 networks were found to be globally inconsistent (Veroniki et al. 2021). Thus, \(\pi_{con} \sim Beta(157, 44)\) is proposed.
Tuning
Tuning is crucial in SSIFS in order to ensure a good mixing of the method. Ideally, the effect of an inconsistency factor when it is included in the NMA model should lie in an area close to zero, and far away from this area when it is not included in the NMA model. Regarding parameter \(c\), values between 10 and 100 usually perform well in most cases (George and McCulloch 1993; Perrakis and Ntzoufras 2015). Possible values of parameter \(\psi_\ell\) could be obtained from a pilot MCMC run of the NMA model as the standard deviation of the inconsistency factors.
Minimum value of inconsistency
By properly tuning parameters \(c\) and \(\psi_\ell\), a difference between direct and indirect evidence that is of practical significance (say \(\omega\)) can be defined. Thus, an inconsistency factor with a coefficient larger than \(\omega\) in absolute values (\(|b_{\ell}| > \omega\)), should be included in the NMA model (\(\gamma_\ell = 1\)). In the case where \(\boldsymbol{R = I}\), the inconsistency factor will have higher probability to be included in the NMA model when \[|b_\ell |> \psi_\ell \sqrt{\xi(c)}, \quad \xi(c) = \frac{2c^2 \log{c}}{c^2 -1}.\] For example, if a difference above 0.2 is considered important, one possible parameterization is to set \(c = 10\) and \(\psi_\ell = \frac{0.2}{\sqrt{\xi(10)}} \approx 0.1.\)
Inconsistency Detection
Inconsistency in SSIFS is evaluated by examining the posterior inclusion probabilities of the inconsistency factors, the posterior model probabilities, the posterior model odds and the Bayes factor of the consistent NMA model over the inconsistent NMA model.
Posterior Inclusion probabilities
Posterior inclusion probabilities estimated as the average of times the inconsistency factor was included in the NMA model in the MCMC draws. Estimates above 0.5 indicates local inconsistency, which cause global inconsistency to the network.
Posterior Model Odds
Posterior model odds are obtained as the ratio of the posterior model probabilities which are estimated as \[f\hat{(m|\boldsymbol{y})}=\frac{1}{M-B} \sum_{t=B+1}^{M}{I(m^{(t)}=m)}, \quad m(\boldsymbol{\gamma})=\sum_{\ell = 1}^{p}{\gamma_\ell 2^{\ell-1}}\] where \(M\) is the number of MCMC iterations, \(B\) the burn-in period and \(m^{(t)}\) a model indicator which transforms the \(\boldsymbol{\gamma}\) to a unique decimal number. By examined the posterior odds of the consistent NMA model (\(m(\boldsymbol{\gamma}) = 0\)) over the inconsistent NMA models, we can evaluate the consistency assumption. Also, the comparison between the consistent NMA model (\(m(\boldsymbol{\gamma}) = 0\)) over all the other observed inconsistent NMA models (\(m(\boldsymbol{\gamma}) \ne 0\)), indicates if the NMA model is globally consistent.
Implementation through ssifs
Installation
You can install the development version of ssifs like so:
install.packages("devtools")
devtools::install_github("georgiosseitidis/ssifs")
Data
ssifs
requires the contrast-based data used for the NMA
model. Also, in the multi-arm studies, all possible comparisons must be
provided. In the case where the network is disconnected,
ssifs
keeps only those studies that belong to the largest
sub-network in order to maintain one connected network.
Example
Load the brief alcohol intervention dataset from the
ssifs
package. The dataset is from a published NMA (Seitidis et al. 2022; Hennessy et al.
2019) and contains 37 studies evaluating the comparative
effectiveness of brief alcohol interventions on preventing hazardous
drinking in college students.
Prepare the data for the ssifs
.
TE <- Alcohol$TE
seTE <- Alcohol$seTE
study <- Alcohol$studyid
treat1 <- Alcohol$treat1
treat2 <- Alcohol$treat2
Run the ssifs
function, using AO-CT as a
reference intervention.
set.seed(12)
m <- ssifs(TE = TE, seTE = seTE, studlab = study, treat1 = treat1, treat2 = treat2, ref = "AO-CT")
The function by default for the specification of the matrix \(\boldsymbol{Z}\) uses the
design-by-treatment model. You can use the Lu & Ades model by
setting the argument method = "LuAdes"
, or the
random-effects implementation of the design-by-treatment model by
setting method = "Jackson"
. Also, the function by default
specifies the correlation matrix \(\boldsymbol{R}\) by using a Zellner
g-prior. You can assume that inconsistency factors are independent by
setting the argument zellner = FALSE
.
Regarding the prior inclusion probabilities, the function assumes
that \(\pi_{con} \sim Beta(157, 44)\)
(argument rpcon = TRUE
). By setting the argument
rpcon = FALSE
you can set the probability of \(\pi_{con}\) fixed. If
rpcon = FALSE
, the function assumes that \(\pi_{con} = 0.5\). You can change this
probability from the argument pcons
.
Detection of inconsistency
Posterior inclusion probabilities
Posterior inclusion probabilities can be obtained like so:
m$Posterior_inclusion_probabilities
#> Comparison Design PIP b b.lb b.ub
#> 1 Alc101 ; BASICS Alc101BASICS 0.0221 0.0002 -0.1490 0.1464
#> 2 AO-CT ; Alc101 AO-CTAlc101 0.0222 0.0011 -0.1098 0.1518
#> 3 e-CHUG ; BASICS e-CHUGBASICS 0.0234 -0.0007 -0.1738 0.1423
#> 4 Active-CT ; THRIVE Active-CTTHRIVE 0.0236 -0.0025 -0.2077 0.1106
#> 5 AO-CT ; AlcEdu AO-CTAlcEdue-CHUG 0.0237 0.0018 -0.0992 0.1671
#> 6 AO-CT ; e-CHUG AO-CTAlcEdue-CHUG 0.0216 0.0015 -0.1069 0.1623
#> 7 AO-CT ; e-CHUG AO-CTe-CHUGBASICS 0.0237 -0.0013 -0.1443 0.1080
#> 8 AO-CT ; BASICS AO-CTe-CHUGBASICS 0.0232 0.0034 -0.0882 0.1957
#> 9 AO-CT ; AlcEdu AlcEduAO-CTAlc101 0.0227 -0.0027 -0.1424 0.0662
#> 10 AO-CT ; Alc101 AlcEduAO-CTAlc101 0.0219 0.0016 -0.0998 0.1475
The first two columns refer to comparisons where inconsistency factors are added. For example, the first row refers to the inconsistency factor that added to the comparisons between interventions Alc101 and BASICS, obtained from the two-arm studies that compare these interventions. The fifth row refers to the inconsistency factor that added to the comparison between interventions AO-CT and AlcEdu, obtained from the multi-arm studies that compare the interventions AO-CT, AlcEdu and e-CHUG. Column PIP refers to posterior inclusion probability, while columns b, b.lb and b.ub to the inconsistency factors effect estimates with the corresponding 95% credible interval.
If method = "LuAdes"
, the column Design
is NA
. This is because the Lu & Ades model accounts
only for loop inconsistencies.
In this example, the posterior inclusion probabilities suggest that the network is globally and locally consistent, since there are not any significant local inconsistencies that causes global inconsistency to the network (posterior inclusion probabilities \(\approx 0 < 0.5\)). Also, note that the corresponding effect estimates are not significant and close to zero.
Posterior model odds
The posterior model odds can be obtained like so:
head(m$Posterior_Odds)
#> IFs Freq f(m|y) PO_IFCONS
#> 1 No IFs 63543 0.7943 1.0000
#> 2 Active-CT ; THRIVE 1549 0.0194 41.0219
#> 3 AO-CT ; e-CHUG_AO-CTe-CHUGBASICS 1545 0.0193 41.1282
#> 4 AO-CT ; BASICS_AO-CTe-CHUGBASICS 1519 0.0190 41.8321
#> 5 AO-CT ; AlcEdu_AO-CTAlcEdue-CHUG 1517 0.0190 41.8873
#> 6 e-CHUG ; BASICS 1496 0.0187 42.4753
Column IFs refers to the model observed in the MCMC
draws (Inconsistency factors are separated by the symbol
,
), Freq refers to the number of times the
model was observed in the MCMC draws, f(m|y) denotes
the posterior model probability and PO_IFCONS to the
posterior odds of the consistent NMA model (NO IFs) over the
model in the corresponding row. An estimate over 1 favors the consistent
NMA model.
In this example, the first row refers to the consistent NMA model (NO IFs), showing that the posterior model probability of the consistent NMA model is 0.79. The posterior odds is 1 as expected, since \[PO = \frac{f(m_{con}|\boldsymbol{y})}{f(m_{con}|\boldsymbol{y})}=\frac{0.7943}{0.7943} = 1.\] The third row refers to the inconsistent NMA model (say \(m_{3}\)) where inconsistency factor was added in the comparisons between interventions AO-CT and e-CHUG, obtained from the multi-arm studies that compare the interventions AO-CT, e-CHUG and BASICS. The posterior odds calculated as \[PO_{m_{con}m_{3}} = \frac{f(m_{con}|\boldsymbol{y})}{f(m_{3}|\boldsymbol{y})} =\frac{0.7942875}{0.0193125} = 41.1282.\]
By looking the posterior model probabilities and the posterior model odds, we conclude that the consistent NMA is the most dominant model since \(\hat{f(m|y)}=0.79\). Also, the posterior odds clearly favors the consistent NMA model. Thus, we conclude that the network is both globally and locally consistent, because significant local inconsistencies that causes globally inconsistency to network were not observed.
Global test of inconsistency
The global test of inconsistency is conducted by calculating the Bayes factor of the consistent NMA model over the rest inconsistent NMA models. Thus, the posterior model probabilities of the inconsistent NMA models are summed. An estimate above 1 indicates that the network is globally consistent.
The global test of inconsistency can be obtained like so:
m$Bayes_Factor
#> [1] 1.0754
In the example, the Bayes factor estimated above 1, suggesting that the network is globally consistent.