PLoS ONE (ISSN 1932-6203). Public Library of Science, San Francisco, USA.
PMID: 24466211. PMCID: 3900587. Manuscript: PONE-D-13-27137. DOI: 10.1371/journal.pone.0086721

Research Article

Subject areas: Mathematics; Statistics; Biostatistics; Medicine; Clinical Immunology; Immune System; Cytokines; Immunologic Techniques; Immunoassays; Antigen Processing and Recognition; Immune Response; Diagnostic Medicine; Clinical Laboratory Sciences; Test Evaluation; Epidemiology; Biomarker Epidemiology; Clinical Epidemiology; Epidemiological Methods; Infectious Diseases; Bacterial Diseases; Mycobacterium; Tuberculosis; Infectious Disease Control; Public Health; Preventive Medicine; Pulmonology; Respiratory Infections

Variability of the QuantiFERON®-TB Gold In-Tube Test Using Automated and Manual Methods

Running title: Automated vs. Manual QuantiFERON-GIT Variability

William C. Whitworth 1*, Donald J. Goodwin 2¤a, Laura Racster 2¤b, Kevin B. West 3, Stella O. Chuke 1,4¤c, Laura J. Daniels 1,5¤d, Brandon H. Campbell 1,4, Jamaria Bohanon 2,5, Atheer T. Jaffar 2,5¤e, Wanzer Drane 6, Paul A. Sjoberg 7, Gerald H. Mazurek 1

1 Division of Tuberculosis Elimination, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
2 Epidemiology Services Branch, United States Air Force School of Aerospace Medicine, Brooks City-Base, Texas, United States of America
3 Department of Occupational Medicine/TB Prevention/Deployment Medicine, Wilford Hall Medical Center, Reid Clinic, Lackland Air Force Base, Texas, United States of America
4 Northrop Grumman Information Systems Sector, Atlanta, Georgia, United States of America
5 CDC Foundation, Atlanta, Georgia, United States of America
6 Professor Emeritus of Biostatistics, University of South Carolina, Columbia, South Carolina, United States of America
7 Epidemiology Consult Services, United States Air Force School of Aerospace Medicine, Wright-Patterson Air Force Base, Dayton, Ohio, United States of America

Editor: Joan A. Caylà, Public Health Agency of Barcelona, Spain

* E-mail: wcw2@cdc.gov

Competing Interests: Two authors (Chuke and Campbell) were employed by and affiliated with Northrop Grumman Information Systems Sector during the study. One author (Daniels) was employed by and affiliated with Locum Tenens, Inc., within five years after completion of this study. These affiliations do not alter the authors’ adherence to all the PLOS ONE policies on sharing data and materials. All authors have declared that no competing interests exist.

Conceived and designed the experiments: DJG PAS GHM. Performed the experiments: WCW DJG ATJ GHM. Analyzed the data: WCW GHM BHC WD. Wrote the paper: WCW GHM DJG KBW LR LJD SOC BHC JB ATJ WD PAS.

¤a Current address: Epidemiology Consult Services, United States Air Force School of Aerospace Medicine, Wright-Patterson Air Force Base, Dayton, Ohio, United States of America

¤b Current address: Healthcare Informatics Division/SG6H, United States Air Force, Lackland Air Force Base, Texas, United States of America

¤c Current address: Division of HIV/AIDS Prevention/Macro International, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America

¤d Current address: Locum Tenens, Alpharetta, Georgia, United States of America

¤e Current address: Geneva Foundation, Akimeka Division, San Antonio, Texas, United States of America

Published 23 January 2014. PLoS ONE 9(1): e86721. Received 1 July 2013; accepted 13 December 2013.

This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.

Background

The QuantiFERON®-TB Gold In-Tube test (QFT-GIT) detects Mycobacterium tuberculosis (Mtb) infection by measuring release of interferon gamma (IFN-γ) when T-cells (in heparinized whole blood) are stimulated with specific Mtb antigens. The amount of IFN-γ is determined by enzyme-linked immunosorbent assay (ELISA). Automation of the ELISA method may reduce variability. To assess the impact of ELISA automation, we compared QFT-GIT results and variability when ELISAs were performed manually and with automation.

Methods

Blood was collected into two sets of QFT-GIT tubes and processed at the same time. For each set, IFN-γ was measured in automated and manual ELISAs. Variability in interpretations and IFN-γ measurements was assessed between automated (A1 vs. A2) and manual (M1 vs. M2) ELISAs. Variability in IFN-γ measurements was also assessed on separate groups stratified by the mean of the four ELISAs.

Results

Subjects (N = 146) had two automated and two manual ELISAs completed. Overall, interpretations were discordant for 16 (11%) subjects. Excluding one subject with indeterminate results, 7 (4.8%) subjects had discordant automated interpretations and 10 (6.9%) subjects had discordant manual interpretations (p = 0.17). Quantitative variability was not uniform; within-subject variability was greater with higher IFN-γ measurements and with manual ELISAs. For subjects with mean TB Responses ±0.25 IU/mL of the 0.35 IU/mL cutoff, the within-subject standard deviation for two manual tests was 0.27 (CI95 = 0.22–0.37) IU/mL vs. 0.09 (CI95 = 0.07–0.12) IU/mL for two automated tests.

Conclusion

QFT-GIT ELISA automation may reduce variability near the test cutoff. Methodological differences should be considered when interpreting and using IFN-γ release assays (IGRAs).

The authors have no support or funding to report.
Introduction

The QuantiFERON®-TB Gold In-Tube test (QFT-GIT) was designed to detect Mycobacterium tuberculosis (Mtb) infection by quantifying the amount of interferon-γ (IFN-γ) released when whole blood is stimulated with specific Mtb antigens [1]. The amount of IFN-γ released is determined by enzyme-linked immunosorbent assay (ELISA). QFT-GIT and other IFN-γ release assays (IGRAs) are alternatives to the tuberculin skin test (TST) for detecting Mtb infection, both latent infection (LTBI) and infection manifesting as active disease. However, variability may limit QFT-GIT utility. Serial testing of healthcare workers (HCW) has demonstrated higher than expected QFT-GIT conversion and reversion rates in low-prevalence settings [2]–[17]. In addition, comparisons of simultaneously performed second- and third-generation QuantiFERON IGRAs have demonstrated greater than expected interpretative discordance [18], [19]. Assessments of QFT-GIT repeatability and reproducibility have demonstrated appreciable amounts of variability [20], [21].

Estimates of variability have varied widely among studies that used different methods of performing QFT-GIT, different indices to assess variability, and different study populations with varied prevalence of Mtb infection and risk of infection. QFT-GIT variability in published studies has been attributed to temporal biologic fluctuations within subjects due to new Mtb infection [2], [22], progression or treatment of human immunodeficiency virus (HIV) infection [23], response to treatment [24]–[27], differences in testing methods (such as differences in delay to incubation, duration of incubation, or incubation temperature) [14], [28], [29], and nonspecific test fluctuations due to random variation [2]–[4], [21], [30]. Determining the background variability (the noise, beyond which a change represents a “true” change) is challenging, especially near the cutoff separating positive and negative test interpretations. This is of critical importance in detecting new infection.

QFT-GIT is a complex test and may be prone to nonspecific random variation. Technical errors attributable to test complexity appear to contribute to IGRA variability [19]. Few studies have assessed the nonspecific random variability of QFT-GIT when repeated on the same samples or samples collected at the same time using identical methods. When QFT-GIT was repeated on the same sample, discordance in interpretation was approximately 3.6% in different ELISAs [2], [10] and 8.0% to 8.3% in the same ELISA [29], [31].

Although the development and initial evaluation of QFT-GIT relied on manual ELISA methods, automation may reduce QFT-GIT variability. Of the 126 measurements required for one QFT-GIT, 115 are automatable (Goodwin et al., manuscript in preparation). To our knowledge, a comparison of variability between tests performed manually and between tests performed using an automated workstation has not been reported. To assess the impact of ELISA automation on QFT-GIT, we compared test results and measured variability when tests were performed with manual and automated methodologies.

Methods

Ethics Statement

The Centers for Disease Control and Prevention (CDC) and Wilford Hall Medical Center human subjects institutional review boards approved this study. All subjects provided written informed consent.

Subject Selection

After obtaining approval from human subjects review boards at the Centers for Disease Control and Prevention (CDC, Protocol # 5078) and Wilford Hall Medical Center (U.S. Air Force (USAF), Protocol # FWH20080002H), subjects were recruited from among CDC and USAF staff located in Atlanta, Georgia, and San Antonio, Texas, respectively, as part of a larger study investigating QFT-GIT variability. To increase the proportion of subjects with positive QFT-GIT results and to assess subjects with a continuous range of IFN-γ measurements (including those with IFN-γ measurements near the cutoff separating positive and negative interpretations), only persons with self-reported prior positive TST results were recruited. Prior unpublished assessments among a similar cohort found that 40% to 50% of persons with self-reported prior positive TST results were positive by QFT-GIT. Exclusion criteria were age of less than 18 years or a history of a severe TST reaction (e.g., blistering, scarring, or anaphylaxis). All subjects provided informed written consent and completed a detailed study questionnaire.

QFT-GIT Procedure

Blood from each subject was collected at one morning visit into two sets of QFT-GIT tubes (Set 1 and Set 2) so that an automated ELISA and a manual ELISA could be performed from each set of tubes. Tubes were purchased from Cellestis Limited (Carnegie, Victoria, Australia), and each set of tubes included a Nil tube, a TB antigen tube, and a Mitogen tube. Each tube was labeled with a number and a barcode that (1) identified the specimen, (2) identified the tube type (i.e., Nil tube, TB antigen tube, or Mitogen tube), and (3) linked the specimen to subject and collection information. One mL of blood was collected into each tube and tube contents were mixed with a Stuart rock and roll mixer (SciTech Instruments, Inc., Franklin, NJ) for 3 minutes at 33 RPM. Within one hour of blood collection, tubes were incubated at 37±0.5°C for 23 to 24 hours and then centrifuged at 3,000 g for 10 minutes.

IFN-γ concentrations in plasma from Nil tubes (Nil), TB antigen tubes (TB), and Mitogen tubes (Mitogen) were determined by ELISAs performed on the day after blood collection using reagents included in QFT-GIT kits. ELISAs were performed with the aid of an automated ELISA workstation (automated ELISA) or without the aid of an automated ELISA workstation (manual ELISA). Triturus automated ELISA workstations (Grifols, USA, Inc., Miami, FL) were used in CDC and USAF labs. For manual ELISAs, reagents were dispensed with Rainin LTS single and multichannel pipetters (Rainin Instrument, LLC, Oakland, CA); plates were washed with a Biotrak II Microplate washer (Biochrom, Ltd., Cambridge, UK) in the CDC lab and a Dynex Ultrawash Plus Microplate washer (Dynex Technologies, Chantilly, VA) in the USAF lab; and optical densities (ODs) were measured with a Thermo Scientific Multiskan Ascent (Waltham, MA) in the CDC lab and a BioTek ELX800 microplate reader (BioTek Instruments, Inc., Winooski, VT) in the USAF lab. IFN-γ standards from QFT-GIT kits were serially diluted and eight IFN-γ concentrations (i.e., 8, 4, 2, 1, 0.5, 0.25, 0.125, and 0 IU/mL) were used in duplicate to create a standard curve for each ELISA. OD values were imported electronically, and plasma IFN-γ concentrations were determined using a Microsoft Access database (Microsoft, Inc., Seattle, WA) developed at CDC. ELISAs not meeting quality specifications as defined by the manufacturer [32] were immediately repeated. TB Responses were calculated by subtracting Nil from TB, and Mitogen Responses were calculated by subtracting Nil from Mitogen.

Test results were interpreted as indicated in the CDC guidelines and Cellestis package insert [1], [32]. The interpretation was “positive” if the Nil was ≤8.0 IU/mL and the TB Response was ≥0.35 IU/mL and ≥25% of the Nil. The interpretation was “negative” if the Nil was ≤8.0 IU/mL, the Mitogen Response was ≥0.5 IU/mL, and the TB Response was <0.35 IU/mL or <25% of the Nil. The interpretation was “indeterminate” if (1) the Nil was >8.0 IU/mL or (2) the Nil was ≤8.0 IU/mL, the Mitogen Response was <0.5 IU/mL, and the TB Response was <0.35 IU/mL or <25% of the Nil.
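The interpretation rules above can be sketched as a small function. This is only an illustrative sketch of the criteria as stated in this section, not the software used in the study; the function name `interpret_qft_git` and the Mitogen values in the usage lines are hypothetical.

```python
def interpret_qft_git(nil, tb, mitogen):
    """Classify one QFT-GIT result from IFN-gamma concentrations in IU/mL.

    Hypothetical helper that follows the criteria stated in the text:
    responses are the antigen or mitogen value minus the Nil value.
    """
    tb_response = tb - nil
    mitogen_response = mitogen - nil

    # A Nil above 8.0 IU/mL makes the test indeterminate regardless of the rest.
    if nil > 8.0:
        return "indeterminate"

    # Positive: TB Response at least 0.35 IU/mL and at least 25% of the Nil.
    if tb_response >= 0.35 and tb_response >= 0.25 * nil:
        return "positive"

    # Not positive: an adequate Mitogen Response gives a negative result,
    # a low Mitogen Response gives an indeterminate result.
    if mitogen_response >= 0.5:
        return "negative"
    return "indeterminate"


# Nil and TB values from subject 35 in Table 3; the Mitogen value is made up.
print(interpret_qft_git(nil=0.125, tb=0.490, mitogen=5.0))  # positive
print(interpret_qft_git(nil=0.153, tb=0.363, mitogen=5.0))  # negative
```

Checking the Nil first mirrors the order of the written criteria, so a high Nil can never be reported as positive.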

Statistical Methods

Variability in test interpretations was assessed by calculating the percentage of subjects with any discordance among the four ELISAs. Additionally, positive agreement, negative agreement, and agreement beyond chance (Cohen’s kappa statistic, κ) were calculated for each pair of ELISAs. To assess variability in IFN-γ measurements (i.e., Nil, TB, and TB Response), distributions were compared using the Wilcoxon signed-rank test. Five additional indices of quantitative variability were examined for each pair of ELISAs, the last two of which were derived from the standard deviation of the differences (SDdiff): (1) within-subject coefficient of variation (W-S CV%), (2) intraclass correlation coefficient (ICC), (3) mean difference (bias), (4) the smallest detectable difference (SDD), and (5) the within-subject standard deviation (W-S SD). SDD = 1.96 × SDdiff is the smallest change in a subsequent measurement that must occur to detect a change beyond the variability (e.g., noise) with 95% certainty [33], [34]. W-S SD = ±(SDdiff/√2) [35] represents 68% of the variation expected around the true value [36]. Limits of agreement (LOA) = bias ± SDD encompass the range around the bias that contains 95% of within-subject differences [37]. ICCs were calculated using the SAS macro ICC_SAS [38]. W-S CV% was calculated as described by Bland (root mean square approach) [39] for Nil and TB and estimated for TB Response using the root sum square method for estimating aggregate uncertainty:

$$\text{W-S CV\%}_{\text{TB Response}} = \sqrt{\left(\text{W-S CV\%}_{\text{TB}}\right)^{2} + \left(\text{W-S CV\%}_{\text{Nil}}\right)^{2}}$$

The W-S CV%s for the TB Response could not be directly determined due to inflation caused by zeros and negative mean values in the denominator (because some TB Response values were ≤0). A confidence level of 0.95 was used in all hypothesis tests. Stratified analyses for quantitative indices were performed on groups stratified by mean TB Response from all four ELISAs. SAS v9.2 (SAS, Cary, NC) and “Analyse-It” v2.22 for Excel (Analyse-It Software, Ltd., Leeds, UK) were used to perform the analyses.
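As a rough illustration of the quantitative indices defined above, the following sketch computes bias, SDdiff, SDD, W-S SD, LOA, and a root mean square W-S CV% for paired measurements. It is not the SAS or Analyse-It code used in the study; the function names and the sample values are hypothetical, and the W-S CV% helper assumes exactly two measurements per subject with non-zero pair means.

```python
import math
from statistics import mean, stdev


def paired_variability(first, second):
    """Bias, SDdiff, SDD, W-S SD and 95% LOA for paired measurements."""
    diffs = [b - a for a, b in zip(first, second)]
    bias = mean(diffs)
    sd_diff = stdev(diffs)             # sample SD of the paired differences
    sdd = 1.96 * sd_diff               # smallest detectable difference
    ws_sd = sd_diff / math.sqrt(2)     # within-subject standard deviation
    return {
        "bias": bias,
        "sd_diff": sd_diff,
        "sdd": sdd,
        "ws_sd": ws_sd,
        "loa": (bias - sdd, bias + sdd),
    }


def ws_cv_percent(first, second):
    """Root mean square within-subject CV% for duplicate measurements."""
    ratios = []
    for a, b in zip(first, second):
        pair_mean = (a + b) / 2
        pair_var = (a - b) ** 2 / 2    # variance of two observations
        ratios.append(pair_var / pair_mean ** 2)
    return 100 * math.sqrt(mean(ratios))


def root_sum_square(cv_tb, cv_nil):
    """Aggregate two CV% values by the root sum square method."""
    return math.hypot(cv_tb, cv_nil)


# Hypothetical paired TB Responses (IU/mL), for illustration only.
test_1 = [0.2, 0.4, 0.6]
test_2 = [0.3, 0.3, 0.9]
print(paired_variability(test_1, test_2))
```

Returning the indices together keeps the dependence of SDD, W-S SD, and LOA on a single SDdiff value explicit.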

Results

Subject Characteristics

Study participation is depicted in Figure 1. Of the 268 people asked to participate, 55 declined and 55 were not eligible. Of the 158 persons enrolled, 146 had four ELISAs completed (one automated and one manual ELISA for the first set of QFT-GIT tubes, and one automated and one manual ELISA for the second set of QFT-GIT tubes, referred to as A1, M1, A2, and M2, respectively). Characteristics of the study subjects are shown in Table 1.

Figure 1. Study participation diagram.

Table 1. Subject characteristics.

| Characteristic | Category | n (%) |
|---|---|---|
| Age, yr | 20–29 | 16 (11.0%) |
| | 30–39 | 32 (21.9%) |
| | 40–49 | 41 (28.1%) |
| | 50–59 | 37 (25.3%) |
| | ≥60 | 20 (13.7%) |
| Gender | M | 65 (44.5%) |
| | F | 81 (55.5%) |
| Race/Ethnicity | White, non-Hispanic | 70 (48.0%) |
| | Black, non-Hispanic | 36 (24.7%) |
| | Asian/Pacific | 18 (12.3%) |
| | Hispanic | 13 (8.9%) |
| | Native American | 1 (0.7%) |
| | Other | 8 (5.5%) |
| Year of Last Positive TST | 1950–1959 | 1 (0.7%) |
| | 1960–1969 | 8 (5.5%) |
| | 1970–1979 | 10 (6.9%) |
| | 1980–1989 | 16 (11.0%) |
| | 1990–1999 | 60 (41.1%) |
| | 2000–2009 | 51 (34.9%) |
| Received Therapy for TB | Yes | 3 (2.1%) |
| | No | 143 (98.0%) |
| Received Therapy for LTBI | Yes | 106 (72.6%) |
| | No/Unknown | 40 (27.4%) |
| Known Exposure to Active TB | Yes | 55 (37.7%) |
| | No/Unknown | 91 (62.3%) |
| Received BCG Vaccine | Yes | 30 (20.5%) |
| | No/Unknown | 116 (79.5%) |
| Region of Birth | United States and Canada | 103 (70.5%) |
| | Asia | 14 (9.6%) |
| | Central America/Caribbean | 12 (8.2%) |
| | Africa | 6 (4.1%) |
| | Europe/Russia | 4 (2.7%) |
| | Pacific | 3 (2.1%) |
| | Southeast Asia | 2 (1.4%) |
| | Middle East | 2 (1.4%) |
| Years Lived Outside USA | None | 62 (42.5%) |
| | 1–10 | 56 (38.4%) |
| | 11–20 | 13 (8.9%) |
| | 21–30 | 12 (8.2%) |
| | 31–40 | 3 (2.1%) |

Qualitative Results

QFT-GIT interpretations are summarized by ELISA method and tube set in Table 2. Among the four tests, interpretations were concordantly positive for 24 (16%) subjects, concordantly negative for 106 (72.6%) subjects, and discordant for 16 (11%) subjects. Forty subjects (27.4%) had at least one positive interpretation. Two subjects (1.4%) had three positive interpretations, eight subjects (5.5%) had two positive interpretations, and five subjects (3.4%) had one positive interpretation. One subject had three indeterminate interpretations with low Mitogen Responses of 0.249 to 0.474 IU/mL and one negative interpretation with a Mitogen Response of 0.55 IU/mL. Nil, TB, and TB Response values for the 15 subjects with discordant results among the four tests (excluding the one subject with three indeterminate results) are shown in Table 3. Results are grouped as either single discordant (one discordant/three concordant) or double discordant (two opposing pairs of concordant results) and additionally categorized into eight groups according to the specific nature of the discordance. Twelve subjects (categories 1–6) were discordant between first and second tests. Two subjects had both automated tests positive and both manual tests negative (category 7), and one had both automated tests negative and both manual tests positive (category 8).

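Table 2. QFT-GIT interpretations when ELISAs were performed with automated and manual methods.

| Result | Automated 1 | Automated 2 | Manual 1 | Manual 2 |
|---|---|---|---|---|
| Positive | 29 (19.9%) | 30 (20.6%) | 33 (22.6%) | 31 (21.2%) |
| Negative | 117 (80.1%) | 115 (78.8%) | 112 (76.7%) | 114 (78.1%) |
| Indeterminate | 0 | 1 (0.7%) | 1 (0.7%) | 1 (0.7%) |

Table 3. Fifteen subjects discordant among any of the four tests.

A1 and A2 = Automated 1 and 2; M1 and M2 = Manual 1 and 2.

| Group | Category* | ID | A1 TB | A1 Nil | A1 TB Resp. | A1 Interp. | A2 TB | A2 Nil | A2 TB Resp. | A2 Interp. | M1 TB | M1 Nil | M1 TB Resp. | M1 Interp. | M2 TB | M2 Nil | M2 TB Resp. | M2 Interp. |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Single Discordant | 1 | 35 | 0.363 | 0.153 | 0.210 | Neg | 0.439 | 0.147 | 0.292 | Neg | 0.490 | 0.125 | 0.365 | Pos | 0.289 | 0.112 | 0.177 | Neg |
| Single Discordant | 1 | 113 | 0.228 | 0.310 | −0.082 | Neg | 0.149 | 0.100 | 0.049 | Neg | 0.970 | 0.549 | 0.421 | Pos | 0.355 | 0.360 | −0.005 | Neg |
| Single Discordant | 1 | 127 | 0.519 | 0.234 | 0.285 | Neg | 0.527 | 0.279 | 0.248 | Neg | 1.497 | 0.359 | 1.138 | Pos | 0.581 | 0.447 | 0.134 | Neg |
| Single Discordant | 1 | 133 | 0.140 | 0.097 | 0.043 | Neg | 0.092 | 0.090 | 0.002 | Neg | 2.120 | 0.413 | 1.707 | Pos | 0.762 | 0.469 | 0.293 | Neg |
| Single Discordant | 2 | 104 | 0.076 | 0.082 | −0.006 | Neg | 0.125 | 0.105 | 0.020 | Neg | 0.860 | 0.611 | 0.249 | Neg | 1.127 | 0.514 | 0.613 | Pos |
| Single Discordant | 3 | 96 | 0.366 | 0.035 | 0.331 | Neg | 0.612 | 0.073 | 0.539 | Pos | 0.509 | 0.066 | 0.443 | Pos | 0.766 | 0.069 | 0.697 | Pos |
| Single Discordant | 4 | 63 | 0.775 | 0.184 | 0.591 | Pos | 0.496 | 0.171 | 0.325 | Neg | 0.685 | 0.206 | 0.479 | Pos | 0.555 | 0.186 | 0.369 | Pos |
| Double Discordant | 5 | 32 | 0.363 | 0.166 | 0.197 | Neg | 0.559 | 0.171 | 0.388 | Pos | 0.361 | 0.208 | 0.153 | Neg | 0.725 | 0.262 | 0.463 | Pos |
| Double Discordant | 5 | 129 | 0.338 | 0.046 | 0.292 | Neg | 0.675 | 0.044 | 0.631 | Pos | 0.336 | 0.143 | 0.193 | Neg | 1.055 | 0.117 | 0.938 | Pos |
| Double Discordant | 5 | 136 | 0.440 | 0.116 | 0.324 | Neg | 0.633 | 0.075 | 0.558 | Pos | 0.903 | 0.867 | 0.036 | Neg | 3.943 | 0.804 | 3.139 | Pos |
| Double Discordant | 6 | 100 | 0.589 | 0.186 | 0.403 | Pos | 0.388 | 0.138 | 0.250 | Neg | 0.678 | 0.164 | 0.514 | Pos | 0.452 | 0.133 | 0.319 | Neg |
| Double Discordant | 6 | 135 | 1.934 | 0.107 | 1.827 | Pos | 0.068 | 0.075 | −0.007 | Neg | 2.322 | 0.288 | 2.034 | Pos | 0.317 | 0.502 | −0.185 | Neg |
| Double Discordant | 7 | 78 | 0.651 | 0.080 | 0.571 | Pos | 0.542 | 0.078 | 0.464 | Pos | 0.337 | 0.078 | 0.259 | Neg | 0.267 | 0.085 | 0.182 | Neg |
| Double Discordant | 7 | 101 | 1.163 | 0.306 | 0.857 | Pos | 1.051 | 0.145 | 0.906 | Pos | 1.466 | 1.637 | −0.171 | Neg | 1.383 | 1.185 | 0.198 | Neg |
| Double Discordant | 8 | 102 | 0.131 | 0.053 | 0.078 | Neg | 0.145 | 0.079 | 0.066 | Neg | 0.599 | 0.195 | 0.404 | Pos | 0.847 | 0.381 | 0.466 | Pos |

*Categories: (1) 1st manual positive/others negative, (2) 2nd manual positive/others negative, (3) 1st automated negative/others positive, (4) 2nd automated negative/others positive, (5) 1st test negative/2nd test positive, (6) 1st test positive/2nd test negative, (7) automated positive/manual negative, (8) automated negative/manual positive.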

Indices of interpretation variability between pairs of ELISAs are shown in Table 4. Seven (4.8%) subjects had discordant results with automated ELISAs compared to 10 (6.9%) subjects with manual ELISAs (p = 0.17). Results from the 15 subjects with discordant results are depicted in Figure 2. Five of the 7 subjects discordant with the two automated tests (71%) had both TB Responses within ±0.25 IU/mL of the QFT-GIT cutoff (0.1 to 0.6 IU/mL, gray dot-dashed lines) vs. 3 of 10 (30%) subjects discordant with the two manual tests.
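The borderline-zone counts in the preceding paragraph can be rechecked from the TB Responses listed in Table 3. The sketch below is illustrative only: the names are hypothetical, and discordance is inferred from the 0.35 IU/mL cutoff alone, which matches the tabulated interpretations for these 15 subjects but omits the Nil and Mitogen criteria.

```python
CUTOFF = 0.35            # IU/mL
ZONE = (0.10, 0.60)      # borderline zone, 0.35 +/- 0.25 IU/mL

# TB Responses (IU/mL) from Table 3: subject ID -> (A1, A2, M1, M2)
TB_RESPONSES = {
    35: (0.210, 0.292, 0.365, 0.177),
    113: (-0.082, 0.049, 0.421, -0.005),
    127: (0.285, 0.248, 1.138, 0.134),
    133: (0.043, 0.002, 1.707, 0.293),
    104: (-0.006, 0.020, 0.249, 0.613),
    96: (0.331, 0.539, 0.443, 0.697),
    63: (0.591, 0.325, 0.479, 0.369),
    32: (0.197, 0.388, 0.153, 0.463),
    129: (0.292, 0.631, 0.193, 0.938),
    136: (0.324, 0.558, 0.036, 3.139),
    100: (0.403, 0.250, 0.514, 0.319),
    135: (1.827, -0.007, 2.034, -0.185),
    78: (0.571, 0.464, 0.259, 0.182),
    101: (0.857, 0.906, -0.171, 0.198),
    102: (0.078, 0.066, 0.404, 0.466),
}


def in_zone(value):
    """True when a TB Response lies inside the borderline zone."""
    return ZONE[0] <= value <= ZONE[1]


def summarize(pairs):
    """Return (discordant pairs with both values in the zone, discordant pairs)."""
    discordant = [(x, y) for x, y in pairs if (x >= CUTOFF) != (y >= CUTOFF)]
    both_in_zone = sum(in_zone(x) and in_zone(y) for x, y in discordant)
    return both_in_zone, len(discordant)


automated = summarize([(r[0], r[1]) for r in TB_RESPONSES.values()])
manual = summarize([(r[2], r[3]) for r in TB_RESPONSES.values()])
print(automated)  # (5, 7)
print(manual)     # (3, 10)
```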

Figure 2. Comparison of TB Responses for subjects with discordant test interpretations.

*A = automated, M = manual; squares = first test, circles = second test; 0.35 IU/mL cutoff shown by black dashed line; 0.1 to 0.6 IU/mL borderline zone (0.35±0.25 IU/mL) shown by grey dot-dashed line.

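Table 4. Variability in QFT-GIT interpretations between ELISAs.

| Results Compared (Group 1 vs. Group 2) | Both Positive (n) | Both Negative (n) | Positive*/Negative (n) | Negative*/Positive (n) | % Agreement, Overall | % Agreement, Positive | % Agreement, Negative | % Discordant, Overall | Kappa |
|---|---|---|---|---|---|---|---|---|---|
| A1 vs. A2 | 26 | 112 | 3 | 4 | 95.2 | 78.8 | 94.1 | 4.8 | 0.85 |
| M1 vs. M2 | 27 | 108 | 6 | 4 | 93.1 | 73.0 | 91.5 | 6.9 | 0.80 |
| A1 vs. M1 | 27 | 110 | 2 | 6 | 94.5 | 77.1 | 93.2 | 5.5 | 0.84 |
| A1 vs. M2 | 25 | 110 | 4 | 6 | 93.1 | 71.4 | 91.7 | 6.9 | 0.73 |
| A2 vs. M1 | 25 | 107 | 5 | 8 | 91.0 | 65.8 | 89.2 | 9.0 | 0.74 |
| A2 vs. M2 | 28 | 112 | 2 | 3 | 96.6 | 84.8 | 95.7 | 3.4 | 0.90 |

*Group 1/Group 2; subject with indeterminate results not included.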
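The agreement indices in Table 4 can be derived from the four cell counts of each pairwise comparison. The sketch below is an illustration rather than the authors' code: the function name is hypothetical, and the definitions of positive and negative agreement (concordant results divided by all subjects with at least one such result) are assumptions chosen because they reproduce the tabulated A1 vs. A2 row.

```python
def agreement_indices(both_pos, both_neg, pos_neg, neg_pos):
    """Percent agreement, percent discordance and Cohen's kappa for a 2x2 table."""
    n = both_pos + both_neg + pos_neg + neg_pos
    discordant = pos_neg + neg_pos

    observed = (both_pos + both_neg) / n
    # Assumed definitions: concordant results over subjects with any such result.
    positive = both_pos / (both_pos + discordant)
    negative = both_neg / (both_neg + discordant)

    # Agreement expected by chance, from the marginal positive proportions.
    p1_pos = (both_pos + pos_neg) / n
    p2_pos = (both_pos + neg_pos) / n
    expected = p1_pos * p2_pos + (1 - p1_pos) * (1 - p2_pos)
    kappa = (observed - expected) / (1 - expected)

    return {
        "overall_pct": 100 * observed,
        "positive_pct": 100 * positive,
        "negative_pct": 100 * negative,
        "discordant_pct": 100 * discordant / n,
        "kappa": kappa,
    }


# Cell counts for A1 vs. A2 from Table 4.
a1_a2 = agreement_indices(both_pos=26, both_neg=112, pos_neg=3, neg_pos=4)
print({name: round(value, 2) for name, value in a1_a2.items()})
```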

Quantitative Results

Means, medians, and ranges for Nil, TB, and TB Response are shown in Table 5. There were no significant distributional differences between the two automated tests or between the two manual tests, but TB and Nil values in manual tests were significantly greater than in automated tests (p<0.03). There were no significant differences in TB Response between manual and automated tests. ICCs and W-S CV%s are shown in Table S1. Difference (Bland-Altman) plots for TB Response, shown in Figure 3, show an increase in variation as the mean of the paired measurements increased.

Figure 3. Bland-Altman plot of TB Responses.

X-axis (mean of paired TB Responses) shown on log scale. Points between 0 and +0.001 and between 0 and −0.001 not shown.

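Table 5. IFN-γ means, medians, and ranges for the four tests (IU/mL).

| Test | TB Mean | TB Median | TB Range | Nil Mean | Nil Median | Nil Range | TB Response Mean | TB Response Median | TB Response Range |
|---|---|---|---|---|---|---|---|---|---|
| A1 | 0.89 | 0.12 | 0.03 to 20.17 | 0.12 | 0.08 | 0.03 to 1.99 | 0.77 | 0.05 | −1.19 to 20.04 |
| A2 | 0.87 | 0.12 | 0.04 to 18.07 | 0.11 | 0.07 | 0.03 to 1.43 | 0.76 | 0.02 | −0.69 to 17.99 |
| M1 | 0.91 | 0.18 | 0.04 to 17.88 | 0.21 | 0.12 | 0.03 to 1.71 | 0.70 | 0.03 | −0.83 to 17.78 |
| M2 | 0.92 | 0.19 | 0.05 to 16.11 | 0.22 | 0.11 | 0.04 to 1.71 | 0.70 | 0.03 | −1.36 to 15.99 |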

Analyses were performed examining variation within seven strata of mean TB Response, based on the mean of the four tests. Bias and 95% LOA are shown in Figure 4. The relatively large variability seen for the first stratum (<0.1 IU/mL) is due to grouping subjects with negative means, many of whom had large differences (which also may be seen in Figure 3). The fourth stratum (0.2 IU/mL to 0.499 IU/mL) shows variability in a range surrounding the QFT-GIT cutoff (0.35±0.15 IU/mL). In this category, bias and LOA for manual tests were greater than for automated tests. As shown in Table S2, significantly higher W-S SDs were observed within this range for manual tests than for automated tests, as demonstrated by non-overlapping 95% confidence intervals (95% CI). SDDs for this range were also significantly higher for the manual tests than for the automated tests. When this range was expanded to 0.1 IU/mL to 0.6 IU/mL (0.35±0.25 IU/mL), W-S SDs remained significantly higher for manual tests (0.27, 95% CI: 0.22–0.37) than for automated tests (0.09, 95% CI: 0.07–0.12). SDDs were also significantly higher for manual tests (0.75, 95% CI: 0.61–1.03) than for automated tests (0.25, 95% CI: 0.19–0.33) for this broader range.
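Because both indices are defined from SDdiff in the Statistical Methods, SDD and W-S SD are linked by SDD = 1.96 × √2 × W-S SD. The short sketch below applies that relation to the W-S SD values reported for the 0.1 to 0.6 IU/mL range; the function name is hypothetical, and because the published inputs are rounded, agreement with the reported SDDs is only approximate.

```python
import math


def sdd_from_ws_sd(ws_sd):
    """Convert a within-subject SD to the smallest detectable difference."""
    sd_diff = ws_sd * math.sqrt(2)   # invert W-S SD = SDdiff / sqrt(2)
    return 1.96 * sd_diff            # SDD = 1.96 * SDdiff


print(round(sdd_from_ws_sd(0.27), 2))  # manual tests: 0.75
print(round(sdd_from_ws_sd(0.09), 2))  # automated tests: 0.25
```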

Figure 4. 95% Limits of agreement and bias in TB Responses.

*A = automated, M = manual.

Discussion

This study assessed the precision of the QFT-GIT using both automated and manual ELISA methods. We determined repeatability of QFT-GIT when performed manually on two blood samples collected at the same time and when performed with the aid of an automated ELISA workstation on two blood samples collected at the same time. We observed discordance of 4.8% between two automated tests and 6.9% between two manual tests. Additionally, we evaluated reproducibility of QFT-GIT when one test was performed manually and one test was performed with the aid of an automated ELISA workstation on blood samples collected at the same time. We observed discordance of 3.4% to 9.0% for automated versus manual paired combinations. Eleven percent of subjects (including the one subject with one negative result and three indeterminate results) had at least one discordant result among the four tests. Quantitative indices of variability showed that variation in TB Response near the cutoff separating positive and negative test interpretations was significantly greater with the manual method than with the automated method.

Our discordance rates of 4.8% for two repeated automated QFT-GITs and 6.9% for two repeated manual QFT-GITs are slightly higher than those from two similar studies in which ELISAs were repeated on blood collected at the same time [2], [10]. Discordance rates of 3.6% were reported in both studies; however, the ELISA methods used in these studies were not described. QFT-GIT is a complex assay, but investigators rarely specify details for performing the ELISAs.

Our estimates of QFT-GIT reproducibility, expressed as discordance when the test was performed in the same lab using automated or manual methods, ranged from 3.4% to 9.0%. Prior estimates of QFT-GIT reproducibility when ELISAs were performed in different labs using automated methods ranged from 3.3% to 6.6% [21]. Our finding of greater variability when the QFT-GIT ELISA is performed manually than when performed with the aid of an automated ELISA workstation is not surprising, given the complexity of the assay. In a prior study, we reported that a reduction in the number of steps required for QFT-GIT compared to QFT-G was associated with a significant reduction in the number of unusual measurements [19].

We and others have previously suggested the need for a zone of uncertainty surrounding the 0.35-IU/mL cutoff currently used to separate positive and negative QFT-GIT results [2]–[4], [6], [13], [21], [40]. Clinicians may need to repeat testing when initial results are within a borderline zone to increase diagnostic certainty. However, there is no consensus on the size of the zone, and different sizes have been suggested or applied. Our finding of greater variability when the QFT-GIT ELISA is performed manually than when aided by an automated workstation suggests that a broader borderline zone would be needed when using manual methods. Use of a broader borderline zone may, in turn, necessitate more repeat testing. Greater precision may justify the cost of an automated ELISA workstation.

Our study has several limitations. First, the small sample size for some strata resulted in large confidence intervals for estimates of variability. Despite the small sample size, differences in variability between automated and manual TB Response in the stratum surrounding the cutoff were significant. Second, we only studied TB Response in persons who reported a prior positive TST. While other populations may have different proportions of negative, positive, and borderline TB Response values, this limitation would not be expected to alter variability within strata of TB Response values.

In conclusion, automation of the QFT-GIT ELISA may reduce variability near the cutoff separating positive and negative interpretations. Methodological differences should be considered when interpreting and using IGRAs.

Supporting Information

Table S1. ICC and W-S CV%, total population. *95% confidence interval. (DOC)

Table S2. W-S SD and SDD for TB Response, total population and stratified. *95% confidence interval. (DOC)

Acknowledgments

The authors would like to express their gratitude to the subjects for their participation in this study; to Matthew Crum and David Temporado for logistical support; to Michelle Owen, Clyde Hart, Tammy Evans-Strickfaden, and Davis Lupo for laboratory space and technical advice; to Erin Justen for administrative support; and to Eva Bozeman for assisting with blood collection.

References

1. Mazurek GH, Jereb J, Vernon A, LoBue P, Goldberg S, et al. (2010) Updated guidelines for using Interferon Gamma Release Assays to detect Mycobacterium tuberculosis infection - United States, 2010. MMWR Recomm Rep 59: 1–25. PMID: 20577159
2. Veerapathran A, Joshi R, Goswami K, Dogra S, Moodie EE, et al. (2008) T-cell assays for tuberculosis infection: deriving cut-offs for conversions using reproducibility data. PLoS ONE 3: e1850. PMID: 18365006
3. Perry S, Sanchez L, Yang S, Agarwal Z, Hurst P, et al. (2008) Reproducibility of QuantiFERON-TB gold in-tube assay. Clin Vaccine Immunol 15: 425–432. PMID: 18199741
4. Pai M, Joshi R, Dogra S, Mendiratta DK, Narang P, et al. (2006) Serial testing of health care workers for tuberculosis using interferon-gamma assay. Am J Respir Crit Care Med 174: 349–355. PMID: 16690977
5. Pollock NR, Campos-Neto A, Kashino S, Napolitano D, Behar SM, et al. (2008) Discordant QuantiFERON-TB Gold test results among US healthcare workers with increased risk of latent tuberculosis infection: a problem or solution? Infect Control Hosp Epidemiol 29: 878–886. PMID: 18713053
6. van Zyl-Smit RN, Pai M, Peprah K, Meldau R, Kieck J, et al. (2009) Within-subject Variability and Boosting of T Cell IFN-γ Responses Following Tuberculin Skin Testing. Am J Respir Crit Care Med 180: 49–58. PMID: 19342414
7. Baker CA, Thomas W, Stauffer WM, Peterson PK, Tsukayama DT (2009) Serial testing of refugees for latent tuberculosis using the QuantiFERON-gold in-tube: effects of an antecedent tuberculin skin test. Am J Trop Med Hyg 80: 628–633. PMID: 19346390
8. Torres Costa J, Silva R, Sa R, Cardoso MJ, Ribeiro C, et al. (2010) Comparison of interferon-gamma release assay and tuberculin test for screening in healthcare workers. Rev Port Pneumol 16: 211–221. PMID: 20437000
9. Belknap R, Kelaher J, Wall K, Daley C, Schluger N, et al. (2009) Diagnosis of Latent Tuberculosis Infection in U.S. Health Care Workers: Reproducibility, Repeatability and 6 Month Follow-Up with Interferon-gamma Release Assays (IGRAs). American Journal of Respiratory and Critical Care Medicine 179: A4101.
10. Detjen AK, Loebenberg L, Grewal HM, Stanley K, Gutschmidt A, et al. (2009) Short-term reproducibility of a commercial interferon gamma release assay. Clin Vaccine Immunol 16: 1170–1175. PMID: 19535542
11. Ringshausen FC, Nienhaus A, Schablon A, Schlosser S, Schultze-Werninghaus G, et al. (2010) Predictors of persistently positive Mycobacterium-tuberculosis-specific interferon-gamma responses in the serial testing of health care workers. BMC Infect Dis 10: 220. PMID: 20653946
12. Zwerling A, van den HS, Scholten J, Cobelens F, Menzies D, et al. (2011) Interferon-gamma release assays for tuberculosis screening of healthcare workers: a systematic review. Thorax 67: 62–70. PMID: 21228420
13. Ringshausen FC, Nienhaus A, Torres CJ, Knoop H, Schlosser S, et al. (2011) Within-subject Variability of Mycobacterium-tuberculosis-specific Interferon-gamma Responses in German Health Care Workers. Clin Vaccine Immunol 18: 1176–1182. PMID: 21593237
14. Doberne D, Gaur RL, Banaei N (2011) Preanalytical delay reduces sensitivity of QuantiFERON-TB Gold In-Tube for detection of latent tuberculosis infection. Journal of Clinical Microbiology 49: 3061–3064. PMID: 21697332
15. Zwerling A, Cloutier-Ladurantaye J, Pietrangelo F, Behr M, Schwartzman K, et al. (2009) Conversions and Reversions in Health Care Workers in Montreal, Canada Using QuantiFERON-TB-Gold In-Tube. Am J Respir Crit Care Med 179: A1012.
16. Park JS, Lee JS, Kim MY, Lee CH, Yoon HI, et al. (2012) Monthly follow-ups of interferon gamma release assays among healthcare workers in contact with patients with TB. Chest 142: 1461–1468. PMID: 22556318
17. Pai M, Elwood K (2012) Interferon-gamma release assays for screening of health care workers in low tuberculosis incidence settings: Dynamic patterns and interpretational challenges. Can Respir J 19: 81–83. PMID: 22536575
18. Mahomed H, Hughes EJ, Hawkridge T, Minnies D, Simon E, et al. (2006) Comparison of mantoux skin test with three generations of a whole blood IFN-gamma assay for tuberculosis infection. Int J Tuberc Lung Dis 10: 310–316. PMID: 16562712
19. Powell RD III, Whitworth WC, Bernardo J, Moonan PK, Mazurek GH (2011) Unusual Interferon Gamma Measurements with QuantiFERON-TB Gold and QuantiFERON-TB Gold In-Tube Tests. PLoS ONE 6: e20061. PMID: 21687702
20. van Zyl-Smit RN, Zwerling A, Dheda K, Pai M (2009) Within-subject variability of interferon-g assay results for tuberculosis and boosting effect of tuberculin skin testing: a systematic review. PLoS ONE 4: e8517. PMID: 20041113
21. Whitworth WC, Hamilton LR, Goodwin DJ, Barrera C, West KB, et al. (2012) Within-Subject Interlaboratory Variability of QuantiFERON-TB Gold In-Tube Tests. PLoS ONE 7: e43790. PMID: 22970142
22. Diel R, Goletti D, Ferrara G, Bothamley G, Cirillo D, et al. (2011) Interferon-gamma release assays for the diagnosis of latent Mycobacterium tuberculosis infection: a systematic review and meta-analysis. Eur Respir J 37: 88–99. PMID: 21030451
23. Santin M, Munoz L, Rigau D (2012) Interferon-γ release assays for the diagnosis of tuberculosis and tuberculosis infection in HIV-infected adults: a systematic review and meta-analysis. PLoS ONE 7: e32482. PMID: 22403663
24. Katiyar SK, Sampath A, Bihari S, Mamtani M, Kulkarni H (2008) Use of the QuantiFERON-TB Gold In-Tube test to monitor treatment efficacy in active pulmonary tuberculosis. Int J Tuberc Lung Dis 12: 1146–1152. PMID: 18812044
25. Pai M, Joshi R, Dogra S, Mendiratta DK, Narang P, et al. (2006) Persistently elevated T cell interferon-gamma responses after treatment for latent tuberculosis infection among health care workers in India: a preliminary report. J Occup Med Toxicol 1: 7. PMID: 16722616
26. Pollock NR, Kashino SS, Napolitano DR, Sloutsky A, Joshi S, et al. (2009) Evaluation of the effect of treatment of latent tuberculosis infection on QuantiFERON-TB gold assay results. Infect Control Hosp Epidemiol 30: 392–395. PMID: 19236281
27. Chee CB, KhinMar KW, Gan SH, Barkham TM, Koh CK, et al. (2010) Tuberculosis treatment effect on T-cell interferon-gamma responses to Mycobacterium tuberculosis-specific antigens. Eur Respir J 36: 355–361. PMID: 19926734
28. Herrera V, Yeh E, Murphy K, Parsonnet J, Banaei N (2010) Immediate incubation reduces indeterminate results for QuantiFERON-TB Gold in-tube assay. J Clin Microbiol 48: 2672–2676. PMID: 20519472
29. Shanaube K, de HP, Schaap A, Moyo M, Kosloff B, et al. (2010) Intra-assay reliability and robustness of QuantiFERON(R)-TB Gold In-Tube test in Zambia. Int J Tuberc Lung Dis 14: 828–833. PMID: 20550764
30. Machingaidze S, Verver S, Mulenga H, Abrahams DA, Hatherill M, et al. (2012) Predictive Value of Recent QuantiFERON Conversion for Tuberculosis Disease in Adolescents. Am J Respir Crit Care Med 186: 1051–1056. PMID: 22955316
31. Dominguez M, Smith A, Luna G, Brady MF, Austin-Breneman J, et al. (2010) The MIT D-lab electricity-free PortaTherm incubator for remote testing with the QuantiFERON(R)-TB Gold In-Tube assay. Int J Tuberc Lung Dis 14: 1468–1474. PMID: 20937189
32. Cellestis Limited (2009 January) QuantiFERON®-TB Gold In-Tube: Package Insert. Carnegie, Victoria, Australia: Cellestis Limited website. Available: http://www.cellestis.com/IRM/Company/ShowPage.aspx?CPID=1370 Accessed 2012.
33. Beckerman H, Roebroeck ME, Lankhorst GJ, Becher JG, Bezemer PD, et al. (2001) Smallest real difference, a link between reproducibility and responsiveness. Qual Life Res 10: 571–578. PMID: 11822790
34. Guyatt GH, Kirshner B, Jaeschke R (1992) Measuring health status: what are the necessary measurement properties? J Clin Epidemiol 45: 1341–1345. PMID: 1460470
35. Hopkins WG (2000) Measures of reliability in sports medicine and science. Sports Med 30: 1–15. PMID: 10907753
36. Atkinson G, Nevill A (2000) Typical error versus limits of agreement. Sports Med 30: 375–377. PMID: 11103850
37. Bland JM, Altman DG (1986) Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1: 307–310. PMID: 2868172
38. Lu L, Shara N (2007) Reliability analysis: Calculate and Compare Intra-class Correlation Coefficients (ICC) in SAS. Northeast SAS User’s Group Website, 2007 Conference Proceedings.
Available: http://www.nesug.org/proceedings/nesug07/sa/sa13.pdf Accessed 2012.Bland JM (2006) How should I calculate a within-subject coefficient of variation? Martin Bland website. Available: http://www-users.york.ac.uk/~mb55/meas/cv.htm Accessed 2012.PaiM, JoshiR, DograS, ZwerlingAA, GajalakshmiD, et al (2009) T-cell assay conversions and reversions among household contacts of tuberculosis patients in rural India. Int J Tuberc Lung Dis13: 849219105884