<!DOCTYPE article
PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD with MathML3 v1.2 20190208//EN" "JATS-archivearticle1-mathml3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" article-type="research-article"><?properties manuscript?><front><journal-meta><journal-id journal-id-type="nlm-journal-id">1302422</journal-id><journal-id journal-id-type="pubmed-jr-id">2993</journal-id><journal-id journal-id-type="nlm-ta">Clin Chim Acta</journal-id><journal-id journal-id-type="iso-abbrev">Clin Chim Acta</journal-id><journal-title-group><journal-title>Clinica chimica acta; international journal of clinical chemistry</journal-title></journal-title-group><issn pub-type="ppub">0009-8981</issn><issn pub-type="epub">1873-3492</issn></journal-meta><article-meta><article-id pub-id-type="pmid">30292631</article-id><article-id pub-id-type="pmc">7982963</article-id><article-id pub-id-type="doi">10.1016/j.cca.2018.10.006</article-id><article-id pub-id-type="manuscript">HHSPA1553481</article-id><article-categories><subj-group subj-group-type="heading"><subject>Article</subject></subj-group></article-categories><title-group><article-title>Quality specifications and their daily application to evaluate the accuracy of reference measurements for serum concentrations of 25-hydroxyvitamin D<sub>3</sub> and 25-hydroxyvitamin D<sub>2</sub></article-title></title-group><contrib-group><contrib contrib-type="author"><name><surname>Mineva</surname><given-names>Ekaterina M.</given-names></name></contrib><contrib contrib-type="author"><name><surname>Sternberg</surname><given-names>Maya R.</given-names></name></contrib><contrib contrib-type="author"><name><surname>Pfeiffer</surname><given-names>Christine M.</given-names></name></contrib><contrib contrib-type="author"><name><surname>Momin</surname><given-names>Shahzad S.</given-names></name></contrib><contrib contrib-type="author"><name><surname>Maw</surname><given-names>Khin L.</given-names></name></contrib><contrib contrib-type="author"><name><surname>Schleicher</surname><given-names>Rosemary L.</given-names></name><xref rid="CR1" ref-type="corresp">*</xref></contrib><aff id="A1">Division of Laboratory Sciences, National Center for Environmental Health, Centers for Disease Control and Prevention, Atlanta, GA, 30341</aff></contrib-group><author-notes><corresp id="CR1"><label>*</label>Corresponding Author: Rosemary L Schleicher. Division of Laboratory Sciences, National Center for Environmental Health, Centers for Disease Control and Prevention. 4770 Buford Hwy, NE, Mail Stop F55, Atlanta, GA 30341. <email>zwa5@cdc.gov</email>. Phone: 770-488-4424. Fax: 770-488-4139</corresp></author-notes><pub-date pub-type="nihms-submitted"><day>12</day><month>3</month><year>2021</year></pub-date><pub-date pub-type="epub"><day>04</day><month>10</month><year>2018</year></pub-date><pub-date pub-type="ppub"><month>12</month><year>2018</year></pub-date><pub-date pub-type="pmc-release"><day>22</day><month>3</month><year>2021</year></pub-date><volume>487</volume><fpage>241</fpage><lpage>249</lpage><!--elocation-id from pubmed: 10.1016/j.cca.2018.10.006--><abstract id="ABS1"><sec id="S1"><title>Background:</title><p id="P1">Reference measurement procedures (RMP) have rigorous accuracy specifications. For total 25-hydroxyvitamin D, 25(OH)D, bias &#x02264;1.7% and CV &#x02264;5% are recommended. These quality specifications are impractical for minor analytes, such as 25(OH)D<sub>2</sub>. Furthermore, documentation on RMP quality performance specifications for the individual 25(OH)D metabolites and their daily application are missing.</p></sec><sec id="S2"><title>Methods:</title><p id="P2">To assess accuracy, we used zeta-scores. Daily, 5&#x02013;10 specimens (duplicate) and 3 reference materials (singleton or duplicate) were measured for 25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub> using JCTLM-accepted LC-MS/MS RMPs. Protocols were repeated on 3&#x02013;4 occasions to generate campaign results. We used separate zeta-score acceptability criteria for daily (&#x02264;|2|) and campaign (&#x02264;|1|) evaluations. Allowable imprecision was determined experimentally.</p></sec><sec id="S3"><title>Results:</title><p id="P3">Across 7 campaigns, unacceptable daily zeta-scores required repeating 2 runs for 25(OH)D<sub>3</sub> and 5 runs for 25(OH)D<sub>2</sub>. Hence, the zeta-scores of acceptable reference material results indicated high accuracy. The allowable imprecision for the RMPs was &#x02264;5% (daily) and &#x02264;3% (campaign) for 25(OH)D<sub>3</sub> and &#x02264;7% (daily) and &#x02264;4% (campaign) for 25(OH)D<sub>2</sub>, respectively.</p></sec><sec id="S4"><title>Conclusions:</title><p id="P4">Using zeta-scores and experimentally derived imprecision, we developed a straightforward approach to assess the acceptability of individual 25(OH)D reference measurements, providing also much-needed practical accuracy specifications for 25(OH)D<sub>2</sub>.</p></sec></abstract><kwd-group><kwd>Vitamin D</kwd><kwd>25(OH)D<sub>3</sub></kwd><kwd>25(OH)D<sub>2</sub></kwd><kwd>quality performance specifications</kwd><kwd>serum</kwd><kwd>reference measurement procedure</kwd></kwd-group></article-meta></front><body><sec id="S5"><label>1.</label><title>Introduction</title><p id="P5">Vitamin D status in humans is currently assessed by measuring the serum concentrations of two liver metabolites, 25-hydroxyvitamin D<sub>3</sub> and 25-hydroxyvitamin D<sub>2</sub> [25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub>], usually expressed as the sum of both, so-called total 25-hydroxyvitamin D [25(OH)D]. Accurate measurement of both vitamin D metabolites is essential for interpreting patient status. To improve the accuracy and reliability of 25(OH)D laboratory testing, reference measurement systems consisting of reference measurement procedures (RMPs) [<xref rid="R1" ref-type="bibr">1</xref>&#x02013;<xref rid="R3" ref-type="bibr">3</xref>] and higher order standard reference materials (SRMs) [<xref rid="R4" ref-type="bibr">4</xref>] have been developed. According to metrological traceability, RMPs are used to assign target values to candidate reference materials (RM), which can be used for trueness assessment of lower-order measurement procedures, and ultimately improve the accuracy of routine measurements. Each assigned target value has measurement uncertainty associated with it. Factors known to have a significant effect on a measurement result are included in the uncertainty calculation, e.g., imprecision, calibrator purity, sample preparation effects of unspecific interferences, weight and density measurements [<xref rid="R3" ref-type="bibr">3</xref>].</p><p id="P6">We developed two liquid chromatography-tandem mass spectrometry (LC-MS/MS) RMPs for quantitation of serum 25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub>, recognized by the Joint Committee for Traceability in Laboratory Medicine (JCTLM) [<xref rid="R5" ref-type="bibr">5</xref>], and currently used to support the CDC Vitamin D Standardization Certification Program [<xref rid="R6" ref-type="bibr">6</xref>]. This program was developed to improve the accuracy of 25(OH)D testing and focuses on the enrollment of manufacturers of test kits to provide wide-reaching benefits to kit users.</p><p id="P7">To provide highly accurate measurements and assure analytical data quality, RMPs must have strict and clearly defined analytical accuracy specifications. Such quality performance goals have been proposed for accuracy evaluation of reference measurements of total 25(OH)D, namely bias &#x02264;1.7%, and imprecision CV &#x02264;5% [<xref rid="R7" ref-type="bibr">7</xref>, <xref rid="R8" ref-type="bibr">8</xref>]. We performed method validation using these pre-defined quality specifications, because they are considered to be a good balance between a state-of-the-art routine LC-MS/MS method and the required accuracy for a fit-for-purpose RMP [<xref rid="R3" ref-type="bibr">3</xref>]. These quality performance goals were developed from different data sources, specifically, from 25(OH)D biological variation and the imprecision of state-of-the art routine methods. It has been suggested that the bias and imprecision goals could be applied to the individual forms, 25(OH)D<sub>3</sub> and 25(OH)D<sub>2,</sub> if they occur in equimolar amounts [<xref rid="R7" ref-type="bibr">7</xref>]. However, in human serum the concentration of 25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub> are typically very different from one another. Vitamin D<sub>3</sub>, the precursor of 25(OH)D<sub>3</sub> is produced by action of sunlight on skin and obtained from diet and over-the-counter supplements. In the U.S., vitamin D<sub>2</sub> is primarily obtained through prescribed supplementation and therefore serum 25(OH)D<sub>2</sub> is typically present at very low concentrations or not present at all, compared to the major metabolite 25(OH)D<sub>3</sub>. Trying to use the suggested quality performance specifications for higher-order measurements for both metabolites poses problems because of the vastly different concentrations in serum.</p><p id="P8">Because the current literature lacks accuracy rules for individual vitamin D metabolites and does not provide clear guidance on how to assess a measured concentration against an established target concentration when multiple measurements are involved, we developed in-house criteria along with procedures that we applied to single and multiple runs to assess analytical accuracy. We developed these criteria based on the years of experience running the reference method in our laboratory. Our approach features accuracy evaluations of multiple RMs analyzed together with candidate RM (unknowns) over multiple days. This paper presents our in-house analytical quality specifications for a single measurement series (run) and for multiple independent reference measurements series (campaign, 3 runs).</p></sec><sec id="S6"><label>2.</label><title>Materials and methods</title><p id="P9">Reference standard for 25(OH)D<sub>3</sub> was purchased from U.S. Pharmacopeia Convention (USP, Rockville, MD). Isotopically labeled 25-hydroxyvitamin D<sub>2</sub>-[<sup>2</sup>H<sub>3</sub>] (<italic>d</italic><sub><italic>3</italic></sub>-25(OH)D<sub>2</sub>) and 25-hydroxyvitamin D<sub>3</sub>-[<sup>2</sup>H<sub>6</sub>] (<italic>d</italic><sub><italic>6</italic></sub>-25(OH)D<sub>3</sub>) were purchased from Medical Isotopes (Pelham, NH). Reference materials, SRM 2972a and SRM 972a, were purchased from the National Instate of Standards and Technology (NIST, Gaithersburg, MD). ACS grade methanol and <italic>n</italic>-hexane were obtained from Burdick &#x00026; Jackson (Muskegon, MI). ASC/USP grade ethanol was purchased from Pharmco-AAPER (Brookfield, CT). Purified water (18 M&#x003a9;) was obtained from a water purification system (Aqua Solutions, Inc., Falmouth, ME). In-house developed reference materials (Ghent RM) were prepared from pooled human serum, obtained from anonymous blood donors (Tennessee Blood Services, Memphis, TN). The analysis of these RMs by the Centers for Disease Control and Prevention (CDC) laboratory was determined not to constitute engagement in human subject research. For these in-house Ghent RMs the concentration of each 25(OH)D metabolite was assigned by JCTLM-accepted reference methods at Ghent University (Belgium). Captiva 96-well filter plates (0.45 &#x003bc;m pore size) were purchased from Agilent (Santa Clara, CA). The two analytical columns, used in the reported analytical methods [<xref rid="R3" ref-type="bibr">3</xref>, <xref rid="R9" ref-type="bibr">9</xref>] were the Ascentis Express F5 (2.1 mm x 150 mm x 2.7 &#x003bc;m), Sigma-Aldrich (St. Louis, MO) and the Acquity HSS PFP (3.0 mm x 150 mm x 1.8 &#x003bc;m), Waters (Milford, MA).</p><sec id="S7"><label>2.1.</label><title>Calibration</title><p id="P10">Calibration of our RMPs was based on a previously described procedure [<xref rid="R3" ref-type="bibr">3</xref>] with modifications described in this section, namely, choice of calibration matrices (serum or solvent) and a narrower mass ratio range. Individual working internal standard solutions (ISTD) were prepared from concentrated internal standard stock solutions (prepared from solid material) with a targeted mass concentration of 25 ng/mL (32 nmol/L) for <italic>d</italic><sub>6</sub>-25(OH)D<sub>3</sub> and 4 ng/mL (10 nmol/L) for <italic>d</italic><sub>3</sub>-25(OH)D<sub>2</sub>; the exact mass concentrations were determined at the time of gravimetric preparation. For solvent-based calibration, 2 independent ethanolic working solutions were gravimetrically prepared from SRM 2972a [<xref rid="R10" ref-type="bibr">10</xref>] using calibrated balances under environmentally controlled conditions. The targeted mass concentration was 40 ng/mL (100 nmol/L) for 25(OH)D<sub>3</sub> and 8 ng/mL (20 nmol/L) for 25(OH)D<sub>2</sub>; the exact mass concentrations were determined at the time of gravimetric preparation. Alternatively, for serum-based calibration, we used certified reference materials, e.g., SRM 972a [<xref rid="R11" ref-type="bibr">11</xref>] or RM prepared in-house from pooled human serum with concentrations provided by the JCTLM-accepted RMPs at Ghent University [<xref rid="R2" ref-type="bibr">2</xref>]. For each analyte, 2 independent calibration curves (3 calibration levels each) were prepared, 1 from each independent ethanolic working solution (solvent-based) or from 2 different RMs (serum-based). To prepare each calibration level, we gravimetrically added individual working ISTD solutions (0.500 mL) to a pre-calculated amount (0.125&#x02013;0.555 mL) of serum- or solvent-based calibration level to obtain a mass ratios of unlabeled to labeled analyte (analyte/ISTD) of approximately 0.7, 1.0, and, 1.3. Solvent-based calibrators (containing the standard and ISTD) were evaporated (in a vacuum centrifuge at 45 &#x000b0;C or at room temperature under N<sub>2</sub>), reconstituted in 75% methanol/water, and injected in the instrument. For serum-based calibration, 1 mL of deionized water was added to each calibrator (containing the serum RM and ISTD). After 1 hour (room temperature, dark) we adjusted the pH to ~10 with 0.1 g/mL Na<sub>2</sub>CO<sub>3</sub> (200 &#x003bc;L) to release the metabolites from vitamin D binding protein. After thorough mixing, the analytes of interest and their ISTDs were extracted twice with hexane, following the previously described procedure for liquid-liquid extraction from serum [<xref rid="R3" ref-type="bibr">3</xref>]. We evaporated the combined extracts (vacuum, 45 &#x000b0;C), followed by reconstitution with 75% methanol/water (0.300 mL). The serum extracts were filtered (0.45 &#x003bc;m filter plate) and injected for LC-MS/MS analysis. To assess the trueness of measurement for each calibration matrix, we analyzed 2 Ghent RMs (RM 001 and RM 003) using solvent- and serum-based calibrations in 3 independent campaigns. The assigned reference mass concentrations for RM 001 and RM 003 were 16.3 (40.7) and 11.8 (29.5) ng/mL (nmol/L) for 25(OH)D<sub>3</sub> and 1.24 (3.0) and 5.43 (13.2) ng/mL (nmol/L) for 25(OH)D<sub>2</sub>, respectively.</p><p id="P11">For the routine LC-MS/MS procedure, we used serum-based calibrators prepared similarly to previously described calibrators in PBS-4% albumin [<xref rid="R9" ref-type="bibr">9</xref>]. We prepared 7 calibration levels by mixing ethanolic stock solutions of 25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub> with low analyte baseline serum. The concentration range of the calibration levels was from 2.9 to 57.7 ng/mL (7.2 to 144 nmol/L) and from 0.9 ng/mL to 24.7 ng/mL (2.2 to 60 nmol/L) for 25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub> respectively. For samples with high concentrations, we used one additional high calibrator per analyte at 117 and 47 ng/mL (292 and 114 nmol/L), respectively. The value assignments for 25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub> calibrators were confirmed by the CDC RMPs. ISTD solutions in 67% ethanol/water were used at 30 ng/mL (75 nmol/L) <italic>d</italic><sub>6</sub>-25(OH)D<sub>3</sub> and 8 ng/mL (19 nmol/L) <italic>d</italic><sub>3</sub>-25(OH)D<sub>2</sub>.</p></sec><sec id="S8"><label>2.2</label><title>Sample preparation</title><p id="P12">All samples for 25(OH)D used in our RMPs were prepared according to a previously described procedure for liquid-liquid extraction from serum, with the modifications described in this section, namely a mass ratio of approximately 1 (analyte/ISTD) [<xref rid="R3" ref-type="bibr">3</xref>]. Each sample was spiked gravimetrically with pre-determined amount of ISTDs to get approximately a 1:1 mass ratio of analyte to ISTD for each analyte; wider mass ratios were used previously. All samples were initially screened for 25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub> using our routine method to obtain orientation values for each analyte. We gravimetrically added pre-calculated amounts of isotopically labeled ISTDs to serum (0.200&#x02013;0.750 g, typically 0.500 g). After 1 h equilibration with the ISTD, the pH was adjusted (0.2 mL of 0.1 g/mL Na<sub>2</sub>CO<sub>3</sub>), the samples were mixed, and the analytes of interest were extracted with hexanes, using a previously described procedure [<xref rid="R3" ref-type="bibr">3</xref>]. Our RMP protocol included in each independent measurement series (run) the analysis of 1 NIST RM in singleton and 2 different Ghent RMs and all CDC candidate RMs (unknowns, typically 10) in duplicate. We carried the analysis of this set of samples typically through 3 runs, which provided the final campaign result, as a mean of all daily measurements for each sample. A diagram of the process is presented in <xref rid="F1" ref-type="fig">Figure 1</xref>.</p><p id="P13">For routine measurements, we mixed 0.100 mL of serum sample with ISTD solution (0.075 mL) and 72% MeOH/water (0.1 mL) followed by liquid-liquid extraction with hexanes (1.5 mL) according to a previously published procedure [<xref rid="R9" ref-type="bibr">9</xref>]. We analyzed all samples used for imprecision studies with the RMP design protocol shown in <xref rid="F1" ref-type="fig">Fig. 1</xref>.</p></sec><sec id="S9"><label>2.3.</label><title>LC-MS/MS methods</title><p id="P14">The RMPs featured isocratic (Ascetis Express HPLC column) [<xref rid="R3" ref-type="bibr">3</xref>] or alternatively gradient chromatographic separation (Acquity HPLC column) with atmospheric pressure chemical ionization (APCI) in positive ion mode and tandem mass-spectrometric detection. The routine method was isotope dilution LC-MS/MS with isocratic separation and positive APCI ionization detection mode [<xref rid="R9" ref-type="bibr">9</xref>].</p></sec><sec id="S10"><label>2.4.</label><title>Precision</title><p id="P15">We used data generated with our RMPs for RMs and CDC candidate RMs to develop quality specifications for imprecision. We selected 20 specimens each for 25(OH)D<sub>3</sub> (no specific criteria regarding concentration) and 25(OH)D<sub>2</sub> (targeting samples with mass concentrations of 25(OH)D<sub>2</sub> above the limit of quantitation, LOQ (0.6 ng/mL, 1.5 nmol/L) and below 15 ng/mL (36 nmol/L)). The mass concentrations in the study samples ranged from 9.0 to 54 ng/mL (from 22.5 to 135 nmol/L) for 25(OH)D<sub>3</sub> and from 0.64 to 13.6 ng/mL (from 1.6 to 33 nmol/L) for 25(OH)D<sub>2</sub>. Mass concentrations (X<sub>1</sub> and X<sub>2</sub>) from daily duplicate preparation were averaged to obtain the daily mass concentration (X<sub>D</sub>); to assess the daily variability we calculated the relative pair difference as a percent [abs(X1-X2)/X<sub>D</sub> x 100] (<xref rid="F1" ref-type="fig">Fig. 1</xref>). Daily mass concentrations over 3 days were used to calculate the campaign mass concentration (X<sub>C</sub>) for each material. We calculated the campaign imprecision CV<sub>C</sub> from the daily mass concentrations (X<sub>D</sub>). We also selected 20 specimens from the routine LC-MS/MS method, using similar concentration requirements for 25(OH)D<sub>2</sub> (above the limit of detection, LOD (0.84 ng/mL, 2.0 nmol/L) to assess imprecision compared to the RMPs. The 20 specimens were not the same across methods or analytes. The mass concentrations in the study samples ranged from 6.8 to 41 ng/mL (17 to 102 nmol/L) for 25(OH)D<sub>3</sub> and from 0.89 to 8.7 ng/mL (2.2 to 21 nmol/L) for 25(OH)D<sub>2</sub>. We calculated the mean and median relative pair differences from all daily relative pair differences (n=60) and campaign CV<sub>C</sub> (n=20) for each analyte from reference and routine measurements.</p><p id="P16">We then applied the quality performance specifications for imprecision to reference data for CDC candidate RMs obtained during 3 recent campaigns, which we will call &#x0201c;training set.&#x0201d; The training set included 31 samples for 25(OH)D<sub>3</sub> (4.8&#x02013;42 ng/mL, 12&#x02013;105 nmol/L); only 9 of these samples had reportable concentrations for 25(OH)D<sub>2</sub> (0.62&#x02013;14.2 ng/mL, 1.5&#x02013;34 nmol/L).</p></sec><sec id="S11"><label>2.5.</label><title>Accuracy</title><p id="P17">Typically we analyze at least 1 NIST RM material, e.g., SRM 972a (prepared in singleton) [<xref rid="R11" ref-type="bibr">11</xref>], and at least 2 Ghent RMs (prepared in duplicate) with each reference run. The detailed data evaluation procedure is outlined in <xref rid="T1" ref-type="table">Table 1</xref>. Typically if one or multiple CDC candidate RM violate the imprecision specification, those reference measurement results are rejected and the samples are re-analyzed in another measurement series (repeat CDC candidate RM). If one Ghent RM or the NIST SRM violate the accuracy or imprecision criteria, the reference run is rejected and all samples are re-analyzed in another measurement series (repeat run). We used data from reference measurements of 2 materials, namely, SRM 972a level 2 and RM 003 from 7 recent campaigns, to demonstrate the accuracy of our RMPs. We selected these 2 RMs because both of them were analyzed in these campaigns and because the target concentrations were assigned by different reference methods, NIST and Ghent University.</p><p id="P18">We opted to use zeta-scores (&#x003b6;) for quantitative assessment of accuracy. This standardized parameter shows how well a measurement matches the assigned target concentration relative to the combined uncertainty of the measurement and the target concentration [<xref rid="R12" ref-type="bibr">12</xref>, <xref rid="R13" ref-type="bibr">13</xref>]. A zeta-score of 0 indicates a perfect match. In proficiency testing programs, a zeta-score of &#x000b1;2 indicates acceptable participant result. The zeta-score is defined as follows:
<disp-formula id="FD1"><label>(equation 1),</label><mml:math display="block" id="M1"><mml:mi>&#x003b6;</mml:mi><mml:mo>=</mml:mo><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mi>y</mml:mi><mml:mo>)</mml:mo><mml:mo>/</mml:mo><mml:msqrt><mml:mrow><mml:mi>u</mml:mi><mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mn>2</mml:mn></mml:msup><mml:mo>+</mml:mo><mml:mi>u</mml:mi><mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>y</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:msqrt></mml:math></disp-formula></p><p id="P19">where <italic>x</italic> is the CDC RMP result and <italic>y</italic> is the NIST or Ghent RMP result; <italic>u(x</italic>) is the standard uncertainty of the CDC RMP [<xref rid="R3" ref-type="bibr">3</xref>] and <italic>u(y)</italic> is the reported standard uncertainty for the Ghent and NIST RMs. For SRM 972a level 2, we used the target concentrations (<italic>y</italic>) listed in the Certificate of Analysis (COA) and the reported standard uncertainties <italic>u(y)</italic> of 1.1% and 3.8% for 25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub>, respectively [<xref rid="R11" ref-type="bibr">11</xref>]. The standard uncertainties were derived by dividing the reported expanded uncertainty [<italic>U=2 u(x</italic>)] from the certificate by the coverage factor k=2. For RM 003, we used the target concentrations (<italic>y</italic>) of 11.8 ng/mL (29.5 nmol/L) for 25(OH)D<sub>3</sub> and 5.43 ng/mL (13.2 nmol/L) for 25(OH)D<sub>2</sub> and the reported standard uncertainties <italic>u(y)</italic> of 1.5% and 1.8% for 25(OH)D<sub>3</sub> and 25(OH)D<sub>2,</sub> respectively, as provided in the report of analysis. The estimated standard uncertainties <italic>u(x)</italic> for CDC RMPs were 1.6% and 1.8% for 25(OH)D<sub>3</sub> and 25(OH)D<sub>2,</sub> respectively [<xref rid="R3" ref-type="bibr">3</xref>].</p><p id="P20">To evaluate the daily accuracy, we calculated the zeta-score<sub>D</sub> as described above, using the certified result and uncertainties from the NIST COA for the SRM material [<xref rid="R11" ref-type="bibr">11</xref>], and the Ghent assigned target value and reported uncertainties for the Gent RM material, listed above. In both cases, we used our daily reference measurement (<italic>x</italic>: X<sub>D</sub>) and the uncertainty <italic>u(x)</italic> for our RMP. To evaluate the campaign accuracy, we calculated the zeta-score<sub>C</sub> from the campaign mass concentration (<italic>x</italic>: X<sub>C</sub>). Similarly, we determined the daily and campaign systematic deviation from the target concentration (bias, %) for each material. We developed a macro in SAS version 9.3 to evaluate daily and campaign reference measurements and to automatically apply the numerical quality performance specifications for accuracy.</p></sec></sec><sec id="S12"><title>Results and discussion</title><sec id="S13"><label>3.1.</label><title>Calibration</title><p id="P21">In our laboratory, we have well-established methods for analysis of serum 25(OH)D metabolites, namely, LC-MS/MS routine and reference measurement procedures [<xref rid="R9" ref-type="bibr">9</xref>, <xref rid="R3" ref-type="bibr">3</xref>]. For the RMPs, we studied 2 types of calibrations, solvent- and serum-based. For solvent-based calibration, ethanolic working solutions were prepared by gravimetric dilution of SRM 2972a with absolute ethanol. This is the most direct way to establish traceability to the highest order of commercially available accuracy-based NIST RM and SI units. All working solutions were stable when stored in tightly closed glass containers at &#x02212;20 &#x000b0;C &#x000b1; 2 and used within 1 month of preparation. To prepare serum-based calibrators, we used SRM 972a level 1 or level 4 for 25(OH)D<sub>3</sub> and SRM 972a level 3 for 25(OH)D<sub>2</sub>. These 3 levels were chosen because they had the highest certified concentrations for each respective analyte, and therefore 1 vial each was sufficient to prepare 1 independent calibration curve. The second calibration curve for each analyte was prepared from Ghent RMs. We used both curves for quantitation. To compare the calibration matrices (solvent vs. serum) with regards to method trueness, 2 RMs (RM 001 and RM 003) were processed independently with each calibration matrix in 3 independent campaigns. We evaluated the calculated mass concentrations from each curve against the target concentrations for accuracy, expressed quantitatively with the zeta-score. Using 2 RMs over 3 campaigns, the mean daily zeta-score<sub>D</sub> (SD) from serum- vs. solvent-based calibrations was &#x02212;0.93 (0.66) vs. 0.01 (0.94) for 25(OH)D<sub>3</sub> and 0.43 (1.15) vs. 0.11 (1.25) for 25(OH)D<sub>2</sub>, respectively. The overall daily bias from the serum-based curve was &#x02212;2% and 1% for 25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub>, respectively. The overall daily bias from the solvent-based curve was &#x02212;0.1% and 0.3% for 25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub>, respectively. Accuracy from solvent-based calibration showed nearly perfect agreement, as expressed by a mean zeta-score of approximately 0 and a mean bias &#x02264;0.3%. Our preferred RMP calibration approach is therefore solvent-based.</p></sec><sec id="S14"><label>3.2.</label><title>Sample preparation</title><p id="P22">For reference measurements, all samples were prepared by accurate weighing of serum and pre-calculated amounts of isotopically labeled ISTD solutions to obtain approximately 1:1 mass ratios of analyte to ISTD. In our earlier studies, the amount of working ISTD was not closely matched to the amount of measurand, resulting in wide mass ratios, 0.25&#x02013;2.50. The impact of the use of the different mass ratios (wide vs narrow) on overall accuracy of all valid results has not been shown to be significantly different in our laboratory (data not shown). We noted improvement with imprecision violations using the narrow mass ratio from 4% and 17% for 25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub> down to 0% and 10%, respectively. Daily zeta-score violations were marginally improved for 25(OH)D<sub>3</sub> (3% to 2%) and greatly improved for 25(OH)D<sub>2</sub> (18% to 7%). Considering the low throughput (10 samples/run) and laborious sample preparation for RMPs, switching to a mass ratio of analyte to ISTD of approximately 1:1 was beneficial.</p></sec><sec id="S15"><label>3.3.</label><title>Analytical quality performance specifications</title><p id="P23">During the development and validation of the RMPs [<xref rid="R3" ref-type="bibr">3</xref>] we used the pre-defined proposed quality specifications for accuracy, i.e., CV&#x0003c;5% and bias &#x0003c;1.7% [<xref rid="R7" ref-type="bibr">7</xref>]. Our RMPs showed similar performance characteristics to those of other established RMPs [<xref rid="R1" ref-type="bibr">1</xref>, <xref rid="R2" ref-type="bibr">2</xref>]. We used our RMPs to analyze the same sample over multiple days (campaign), which involves complex data review with independent daily and campaign data assessment from multiple RM. Because the accuracy specifications for total 25(OH)D are not applicable to the individual metabolites occurring at very different concentrations, we developed our in-house numerical quality performance imprecision and accuracy goals, based on years of experience with the method. We had to consider the complex multiday data analysis protocol of our RMPs (<xref rid="F1" ref-type="fig">Fig. 1</xref>). We evaluated all daily data for imprecision and accuracy. If the run passed our criteria, the exact same analyses were repeated on 2 more days and the same daily data review was conducted. Data from all 3 valid runs comprise the campaign data; the combined data were evaluated for overall campaign imprecision and accuracy. For a summary of our quality performance specifications procedure, acceptability criteria, and actions taken upon data evaluation, see <xref rid="T1" ref-type="table">Table 1</xref>.</p><sec id="S16"><label>3.3.1.</label><title>Daily precision</title><p id="P24">In developing numerical quality performance goals for daily imprecision evaluation, we used data from the CDC RMPs for 20 specimens analyzed in duplicate over 3 days. Even though the 25(OH)D<sub>2</sub> mass concentration for the study samples covered a wide range (0.64&#x02013;13.6 ng/mL, (1.6&#x02013;33 nmol/L), the majority of the samples had a mass concentration &#x0003c;3 ng/mL (7 nmol/L). We assessed the relative pair difference as a measure of daily variance. Plots of the daily relative pair differences as a function of the daily mass concentration (X<sub>D</sub>, ng/mL) from all daily reference measurements (3 days, duplicate preparations, n=60) are presented in <xref rid="F2" ref-type="fig">Figure 2</xref> for each metabolite (panels A and B), where the median relative pair difference and the 95<sup>th</sup> percentile are depicted by solid and dotted lines, respectively. The plots for both analytes showed equally distributed variability independent of the concentration. The distribution of the relative pair difference for the major metabolite was slightly right skewed (not shown) and therefore we calculated the median in addition to the mean and we chose to use the 95<sup>th</sup> percentile of the relative pair differences to set the daily imprecision limit for each metabolite (<xref rid="T2" ref-type="table">Table 2</xref>). The median relative pair difference (n=60) was 2.1% for 25(OH)D<sub>3</sub> and 3.0% for 25(OH)D<sub>2</sub> and our daily imprecision limit was &#x02264;5% (rounded from 5.2%) for 25(OH)D<sub>3</sub> and &#x02264;7% for 25(OH)D<sub>2</sub>.</p><p id="P25">To confirm the daily imprecision goals, we used a common approach that compared the performance of the RMPs to that of a hierarchically lower measurement procedure, namely, a routine method. The expectation is that the overall measured imprecision of the RMPs should be half of that of a routine method LC-MS/MS [<xref rid="R7" ref-type="bibr">7</xref>, <xref rid="R8" ref-type="bibr">8</xref>]. We calculated the relative pair differences for each analyte for the selected study samples from routine measurements and plotted it as a function of the daily mass concentration, X<sub>D</sub> (<xref rid="F2" ref-type="fig">Fig. 2</xref> panels C and D). Plot D in <xref rid="F2" ref-type="fig">Figure 2</xref> shows increased variability at lower mass concentrations for 25(OH)D<sub>2</sub>, however, the majority of measurements with higher variability were at concentrations lower than the LOQ of the assay 2.5 ng/mL (6.1 nmol/L) with the maximum relative pair difference of 24% recorded at 0.96 ng/mL (2.3 nmol/L). The median relative pair difference of the routine method (n=60) depicted with a solid line, was 4% for 25(OH)D<sub>3</sub> and 5% for 25(OH)D<sub>2</sub>. The 95<sup>th</sup> percentile, depicted with a dotted line, was 11% for 25(OH)D<sub>3</sub> and 21% for 25(OH)D<sub>2</sub>. Thus, the median relative pair difference for the RMPs were close to half of that obtained for the routine measurements for both analytes (<xref rid="T2" ref-type="table">Table 2</xref>). Our lower order LC-MS/MS method for serum 25(OH)D measurements [<xref rid="R9" ref-type="bibr">9</xref>] historically featured excellent long-term imprecision (CV of 3% for 25(OH)D<sub>3</sub> and 5% for 25(OH)D<sub>2</sub>) [<xref rid="R14" ref-type="bibr">14</xref>]. In a recent review of 24 LC-MS/MS procedures the inter-assay imprecision was listed for 13 routine LC-MS/MS methods, i.e., mean CV of 7.1% for 25(OH)D<sub>3</sub> and 7.8% for 25(OH)D<sub>2</sub> [<xref rid="R15" ref-type="bibr">15</xref>]. The reported imprecision of our routine method was in line with and even superior to the imprecision performance reported by other routine LS-MS/MS 25(OH)D procedures and thus can be appropriately used to confirm the daily imprecision goals.</p><p id="P26">Had we derived cutoffs for reference imprecision from our routine measurements, the limits would have been 5.5% for 25(OH)D<sub>3</sub> and 10.5% for 25(OH)D<sub>2</sub> (half of the calculated 95<sup>th</sup> percentile). Our RMP imprecision limits for 25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub> (5% and 7%, respectively) derived from reference data more than met these theoretical goals from routine data.</p><p id="P27">We applied our daily quality performance imprecision limits to all CDC candidate reference materials from the &#x0201c;training set&#x0201d; analyzed with CDC RMPs. For all RMs and CDC candidate RMs the daily imprecision goals were met 100% for 25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub>. The mean relative pair difference was 1.5% for 25(OH)D<sub>3</sub> and 3.1% for 25(OH)D<sub>2</sub>. The maximum relative pair difference was 4.4% and 6.1% for 25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub>, respectively. The 95<sup>th</sup> percentile for this recent set of data (n=31) was 4.0% for 25(OH)D<sub>3</sub> and 6.0% for 25(OH)D<sub>2</sub>, which suggests that our daily difference limits may be reduced to 4% and 6%, respectively. The daily mean relative pair differences were less than half of the limits, nearly ensuring that the maximum goals were not exceeded. This indicates that our RMPs consistently met the daily imprecision performance goals, despite the fact that most of the 25(OH)D<sub>2</sub>-containing samples were below 3 ng/mL (7.3 nmol/L).</p></sec><sec id="S17"><label>3.3.2.</label><title>Campaign precision</title><p id="P28">We developed numerical quality performance specifications for campaign imprecision reference measurements using the same study set of 20 specimens, analyzed over 3 days with our RMPs. For each specimen, the mass concentration from independent daily measurement (X<sub>D</sub>) was used to calculate the campaign imprecision, CV<sub>C</sub> as depicted in <xref rid="F1" ref-type="fig">Figure 1</xref>. The distribution of the CV<sub>C</sub> as a function of the mass concentration (X<sub>C</sub>) for each of the 20 serum specimens is illustrated in <xref rid="F3" ref-type="fig">Figure 3</xref> (<bold>panels A and B</bold> for reference measurements and <bold>panels C and D</bold> for routine measurements). Similarly to the daily trends, the campaign plot for 25(OH)D<sub>2</sub> (<xref rid="F3" ref-type="fig">Figure 3D</xref>) showed a marked increase in imprecision at low mass concentrations (&#x0003c;3 ng/mL (7.3 nmol/L) for routine measurements, while the RMP data showed a slight trend towards higher imprecision with lower concentrations (<xref rid="F3" ref-type="fig">Figure 3B</xref>). We used the same concept to establish the campaign imprecision limits based on the calculated 95<sup>th</sup> percentile of the CV<sub>C</sub> distribution from RMPs: &#x02264;3% for 25(OH)D<sub>3</sub> (rounded from 3.1%) and &#x02264;4% for 25(OH)D<sub>2</sub> (rounded from 4.2%) (<xref rid="T2" ref-type="table">Table 2</xref>). The calculated 95<sup>th</sup> percentile CV<sub>C</sub> from routine measurements was at least twice that of reference measurements for each analyte, which demonstrated the validity of the selected cut-offs, confirmed by running samples with the routine method. For all RMs and CDC candidate RM from the &#x0201c;training set&#x0201d; the campaign imprecision goals were met 100% for both analytes. The mean CV<sub>C</sub> was 0.9% for 25(OH)D<sub>3</sub> and 2.0% for 25(OH)D<sub>2</sub>, which were at least half of the campaign cut-offs. Thus, the RMPs consistently met the campaign imprecision quality performance goals, which were stricter than the proposed specifications of &#x02264;5%. Based on all of the above, we can claim with confidence that our numerical imprecision goals for campaign reference measurements fit the purpose of intended use.</p></sec><sec id="S18"><label>3.3.3.</label><title>Daily accuracy</title><p id="P29">The result from multiple serum reference materials, a single preparation of NIST RM (X<sub>D</sub>, ng/g) and the mean result from duplicate preparations of each Ghent RM (X<sub>D</sub>, ng/g), were evaluated against the certified mass fraction concentration of each analyte and RM for equivalence. We looked into approaches for quantitative assessment of the equivalence between 2 measurement results of the same RM, obtained by different laboratories.</p><p id="P30">A panel of experts recommended using the bias approach for total 25(OH)D, which we followed during the method validation process [<xref rid="R7" ref-type="bibr">7</xref>, <xref rid="R8" ref-type="bibr">8</xref>]. However, the minor metabolite, 25(OH)D<sub>2</sub> is generally undetectable or just above the limit of detection in the US population [<xref rid="R14" ref-type="bibr">14</xref>]. Random and systematic effects can easily affect measurements at low concentrations. The factors that are likely to have a significant effect on a measurement result, based on our long experience performing vitamin D measurements, are included in the uncertainty, e.g., type A errors such as imprecision and type B errors such as calibrator purity, sample preparation effect of unspecific interferences, or weight and density measurements. The type A variance is calculated from the variability of repeated measurements, which is more notable at lower concentrations, resulting in higher uncertainties [<xref rid="R3" ref-type="bibr">3</xref>]. In the COA for SRM 972a, NIST lists the certified value (<italic>Y</italic>) for each analyte in the certified matrix as <italic>Y</italic><sub>NIST</sub> &#x000b1; <italic>U</italic><sub>95</sub>(<italic>Y</italic><sub>NIST</sub>); this means that the interval <italic>Y</italic><sub>NIST</sub>-<italic>U</italic><sub>95</sub>(Y<sub>NIST</sub>) to <italic>Y</italic><sub>NIST</sub>+<italic>U</italic><sub>95</sub>(Y<sub>NIST</sub>) is expected to contain the true value of <italic>Y</italic> with a 95% level of confidence. For example, NIST offers 2 materials with certified 25(OH)D<sub>2</sub> mass concentrations for SRM 972a, at 0.81 (2.0) and at 13.3 (32.3) ng/mL (nmol/L), where the expanded uncertainty associated with each target is 7.4% and 2.3%, respectively [<xref rid="R11" ref-type="bibr">11</xref>]. Thus, at the lower mass concentration (0.81 ng/mL, 2.0 nmol/L), the uncertainty is significantly higher. When using the so called &#x0201c;acceptance interval&#x0201d; (target value &#x000b1; <italic>U</italic><sub>95</sub>) approach with SRM 972a [<xref rid="R11" ref-type="bibr">11</xref>], we can assess how well our &#x0201c;reference measurement &#x000b1; <italic>U</italic><sub>95</sub>&#x0201d; (calculated range) complies with the certified target interval.</p><p id="P31">Next, we looked for guidance from procedures used by proficiency testing (PT) programs to assess the difference between the participant&#x02019;s result and the assigned value. In a review article for scoring results in PT programs, Miller <italic>et al.</italic> pointed out that when the acceptance interval is expressed as a percent (e.g., &#x000b1;15%), the concept may be not be reasonable below a certain concentration, because the SD of a measurement procedure becomes a larger fraction of the acceptance interval. To overcome this problem, the authors proposed to use a fixed unit interval concept (e.g., &#x000b1; XX ng/mL (nmol/L)) instead of a percentage [<xref rid="R16" ref-type="bibr">16</xref>]. Both of these assessment approaches, involving &#x0201c;acceptance intervals&#x0201d; could be easily adopted, however, we wanted to find a single measure to assess how well CDC&#x02019;s reference measurement matched a target result.</p><p id="P32">International guidelines and a panel of experts from the Royal Society of Chemistry&#x02019;s Analytical Methods Committee explained in detail how a zeta-score could be used in PT programs as a standardized quantitative indicator to assess accuracy of the participant&#x02019;s result [<xref rid="R12" ref-type="bibr">12</xref>, <xref rid="R13" ref-type="bibr">13</xref>]. The zeta-score calculation incorporates the uncertainties associated with each measurement procedure [participant&#x02019;s method (<italic>x</italic>) and target assignment method (<italic>y</italic>)]. For meaningful scoring, the 2 measurements (<italic>x</italic> and <italic>y</italic>) should have similar and relatively low uncertainties. The standard uncertainties of the CDC RMPs are comparable with the uncertainties reported by Ghent and NIST. In our hands, the zeta-score is a good accuracy indicator at all concentrations. For daily reference measurements, we set a zeta-score<sub>D</sub> of &#x02264;&#x000b1;2 as acceptable (corresponding to a 95% confidence level for 2-sided confidence intervals assuming a normal distribution, or 2SD). We believe that this cut-off point fits the purpose of use and will allow us to make a technically correct decision about the validity of the run.</p><p id="P33">In this report, we demonstrate the daily accuracy quality performance of the CDC RMPs using 2 serum reference materials, namely, SRM 972a level 2 and RM 003 analyzed in 7 campaigns. We selected these 2 RMs because they had notably different target concentration for 25(OH)D<sub>2</sub>, 0.81 and 5.43 ng/mL (1.96 and 13.2 nmol/L), respectively. For the major metabolite, the average zeta-score<sub>D</sub> and bias (%) from all daily measurements were 0.1 and 0.1% in SRM 972a-2 and 0.2 and 0.4% in RM 003, respectively (<xref rid="F4" ref-type="fig">Figure 4</xref>, <bold>panels A and B</bold>). For the minor metabolite, the average daily zeta-score<sub>D</sub> and bias (%) were &#x02212;0.2 and &#x02212;0.9% for SRM 972a and 0.4 and 1.0% for RM 003, respectively (<xref rid="F4" ref-type="fig">Figure 4</xref>, <bold>panels C and D</bold>). In these 7 campaigns, daily negative and positive zeta-scores<sub>D</sub> were in near perfect agreement with the daily bias for both metabolites (r<sup>2</sup>&#x0003e;0.999, data not shown). A zeta-score of 2 corresponds to a bias of 4% for 25(OH)D<sub>3</sub> and between 5 and 8% for 25(OH)D<sub>2</sub>, depending on the analyte concentration. The low certified reference concentration for 25(OH)D<sub>2</sub> and the larger expanded uncertainty respectively in SRM 972a-2 contributed to the overall higher bias of 8% corresponding to a zeta-score of 2. It should be pointed out that to assess measurement accuracy for SRM materials, the zeta-score<sub>D</sub> is calculated from a single preparation; nonetheless, the mean bias from all daily reference measurements in the 7 campaigns was 0.1% for 25(OH)D<sub>3</sub> and &#x02212;0.9% for 25(OH)D<sub>2</sub> in SRM 972a-2. The majority of zeta-score<sub>D</sub> from our RMP measurements were less than half of the upper cut-off point of 2 for both analytes (<xref rid="F4" ref-type="fig">Figure 4A</xref>&#x02013;<xref rid="F4" ref-type="fig">D</xref>), which indicates excellent accuracy quality performance over a period of 1 year. Accuracy violations (unacceptable daily zeta-scores of &#x02265;2) across 7 campaigns required repeating 2 runs for 25(OH)D<sub>3</sub> and 5 runs for 25(OH)D<sub>2</sub>. Overall, the accuracy quality performance goal was met in all valid daily reference measurements.</p></sec><sec id="S19"><label>3.3.4.</label><title>Campaign accuracy</title><p id="P34">The campaign zeta-score<sub>C</sub> for each metabolite in each RM was calculated from the campaign mass concentration (Xc), according the analysis scheme presented in <xref rid="F1" ref-type="fig">Figure 1</xref>, and used as an indicator for accuracy. Data from 7 campaigns for each metabolite illustrated good correspondence between the calculated campaign zeta-score<sub>C</sub> and corresponding bias in each RM (<xref rid="F5" ref-type="fig">Figure 5</xref>). Our cut-off for campaign accuracy was a zeta-score<sub>C</sub> of &#x02264;&#x000b1;1 (corresponding to a 67% confidence level for a 2-sided confidence intervals, or 1SD). The accuracy goal was met for each analyte; the mean zeta-score<sub>C</sub> and bias (%) for 25(OH)D<sub>3</sub> was 0.1 and 0.1% in SRM 972a-2 and 0.2 and 0.4% in RM 003; and for 25(OH)D<sub>2</sub>, it was &#x02212;0.2 and &#x02212;0.9% in SRM 972a-2 and 0.4 and 1.0% in RM 003 (<xref rid="F5" ref-type="fig">Figure 5A</xref>&#x02013;<xref rid="F5" ref-type="fig">5D</xref>). Our cut-point for the campaign zeta-score<sub>C</sub> fits the purpose of intended use based on the current state-of-the-art, assures the quality of reference measurements, and at the same time is consistent with the proposed specification of maximum systematic deviation for total 25(OH)D of 1.7% [<xref rid="R7" ref-type="bibr">7</xref>, <xref rid="R8" ref-type="bibr">8</xref>]. The CDC RMPs consistently achieved the established accuracy goals. The mean of all zeta-score<sub>C</sub> and bias (%) across the 7 campaigns and 2 materials (SRM 972a-2 and RM 003) was 0.1 and 0.3% for 25(OH)D<sub>3</sub> 0.1 and 0.04% for 25(OH)D<sub>2</sub>, indicating no bias and confirming the long-term metrological comparability of our RMPs to other RMPs.</p></sec></sec></sec><sec id="S20"><label>4.</label><title>Conclusions</title><p id="P35">Our in-house analytical accuracy specifications were tailored to each individual metabolite, 25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub>. We used zeta-scores to evaluate accuracy. A maximum zeta-score<sub>C</sub> of &#x000b1;1, our cut-point for campaign accuracy, was consistently met in 7 campaigns, where the mean zeta-score<sub>c</sub> from 2 RMs was nearly 0 for both metabolites, indicating no bias.</p><p id="P36">For assessment of imprecision of campaign measurements, we developed refined numerical goals from RMP data, namely a maximum campaign imprecision of 3% for 25(OH)D<sub>3</sub> and 4% for 25(OH)D<sub>2</sub>, which is stricter than the literature goal of 5% CV for 25(OH)D. We confirmed the selected cutoffs by analyzing samples with our routine LC-MS/MS method. The overall imprecision of the reference method was half of that determined by our state-of-the-art routine measurements, confirming the validity of our numerical quality performance goals. We demonstrated excellent reference method imprecision, where the goals were consistently met in all CDC candidate RM from 3 campaigns, and the mean CV<sub>C</sub> was 0.9% for 25(OH)D<sub>3</sub> and 2.0% for 25(OH)D<sub>2</sub>. The reported daily quality specifications are our in-house limits, developed based on the state-of-the art performance of our RMPs that allow us to systematically check our daily performance and assure that our reference methods essentially achieve the pre-defined quality specifications for total 25(OH)D. Recent data suggests that the daily imprecision limits may be reduced by 1%. The reported concept may be beneficial to others who are developing reference methods and can be adapted or modified depending on the design and the state-of-the-art technology.</p><p id="P37">To our knowledge, we are reporting for the first time accuracy specifications for daily and campaign reference measurements tailored for each vitamin D metabolite. Our approach is unique because it only uses data from the RMs analyzed in the same runs as the candidate RMs to independently determine the accuracy during daily runs and campaign assessments. In combination, the reported analytical quality performance goals and detailed application approach provide a unique internal quality control system that is suitable for the intended use and consistently assures high quality reference measurement results.</p></sec></body><back><ack id="S21"><title>Acknowledgements</title><p id="P38">The authors thank Drs. Susan Tai and Mary Bedner from NIST and Dr. Katleen Van Uytfanghe from Ghent University for sharing their experience in accuracy assessments of vitamin D metabolite reference measurements. We also thank Kevin Powell, Myat Win, and David Scully for providing routine measurements.</p><p id="P39">The findings and conclusions in this manuscript are those of the authors and do not necessarily represent the official view or position of the Centers for Disease Control and Prevention/Agency for Toxic Substances and Disease Registry.</p><p id="P40">Funding:</p><p id="P41">This research was supported by direct appropriations from U.S. Congress. We did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.</p></ack><glossary><title>Abbreviations:</title><def-list><def-item><term>25(OH)D</term><def><p id="P42">25-hydroxyvitamin D</p></def></def-item><def-item><term>RMP</term><def><p id="P43">reference method procedure</p></def></def-item><def-item><term>RM</term><def><p id="P44">reference material</p></def></def-item><def-item><term>SRM</term><def><p id="P45">standard reference material</p></def></def-item><def-item><term>JCTLM</term><def><p id="P46">Joint Committee for Traceability in Laboratory Medicine</p></def></def-item><def-item><term>CDC</term><def><p id="P47">Centers for Disease Control and Prevention</p></def></def-item><def-item><term>VDSCP</term><def><p id="P48">Vitamin D Standardization Certification Program</p></def></def-item><def-item><term>NIST</term><def><p id="P49">National Institute of Standards and Technology</p></def></def-item><def-item><term>ISTD</term><def><p id="P50">internal standard</p></def></def-item><def-item><term>LC-MS/MS</term><def><p id="P51">liquid chromatography tandem mass spectrometry</p></def></def-item><def-item><term>ID</term><def><p id="P52">isotope dilution</p></def></def-item><def-item><term><italic>u</italic></term><def><p id="P53">standard uncertainty</p></def></def-item><def-item><term><italic>U</italic></term><def><p id="P54">expanded uncertainty</p></def></def-item><def-item><term>NHANES</term><def><p id="P55">National Health and Nutrition Examination Survey</p></def></def-item></def-list></glossary><ref-list><title>References</title><ref id="R1"><label>[1]</label><mixed-citation publication-type="journal"><name><surname>Tai</surname><given-names>S-C</given-names></name>, <name><surname>Bedner</surname><given-names>M</given-names></name>, <name><surname>Phinney</surname><given-names>KW</given-names></name>, <article-title>Development of a candidate reference measurement procedure for the determination of 25-hydroxyvitamin D<sub>3</sub> and 25-hydroxyvitamin D<sub>2</sub> in human serum using isotope-dilution liquid chromatography-tandem mass spectrometry</article-title>, <source>Anal. Chem</source>. <volume>82</volume> (<year>2010</year>) <fpage>1942</fpage>&#x02013;<lpage>1948</lpage>.<pub-id pub-id-type="pmid">20136128</pub-id></mixed-citation></ref><ref id="R2"><label>[2]</label><mixed-citation publication-type="journal"><name><surname>M Stepman</surname><given-names>HC</given-names></name>, <name><surname>Vanderroost</surname><given-names>A</given-names></name>, <name><surname>van Uytfanghe</surname><given-names>K</given-names></name>, <name><surname>Thienpont</surname><given-names>LM</given-names></name>, <article-title>Candidate reference measurement procedure for serum 25-hydroxyvitamin D<sub>3</sub> and 25-hydroxyvitamin D<sub>2</sub> by using isotope-dilution liquid chromatography-tandem mass spectrometry</article-title>, <source>Clin. Chem</source>. <volume>53</volume>(<issue>3</issue>) (<year>2011</year>) <fpage>441</fpage>&#x02013;<lpage>448</lpage>.</mixed-citation></ref><ref id="R3"><label>[3]</label><mixed-citation publication-type="journal"><name><surname>Mineva</surname><given-names>EM</given-names></name>, <name><surname>Schleicher</surname><given-names>RL</given-names></name>, <name><surname>Chaudhary-Webb</surname><given-names>M</given-names></name>, <name><surname>Maw</surname><given-names>KL</given-names></name>, <name><surname>Botelho JC</surname><given-names>JC</given-names></name>, <name><surname>Vesper HW</surname><given-names>C</given-names></name><name><surname>Pfeiffer</surname><given-names>M</given-names></name>, <article-title>A candidate reference measurement procedure for quantifying serum concentrations of 25-hydroxyvitamin D<sub>3</sub> and 25-hydroxyvitamin D<sub>2</sub> using isotope-dilution liquid chromatography-tandem mass spectrometry</article-title>, <source>Anal. Bioanl. Chem</source>. <volume>407</volume> (<year>2015</year>) <fpage>5615</fpage>&#x02013;<lpage>5624</lpage>.</mixed-citation></ref><ref id="R4"><label>[4]</label><mixed-citation publication-type="journal"><name><surname>Phinney</surname><given-names>KW</given-names></name>, <name><surname>Bedner</surname><given-names>M</given-names></name>, <name><surname>Tai</surname><given-names>S-C</given-names></name>, <name><surname>Vamathevan</surname><given-names>VV</given-names></name>, <name><surname>Sander</surname><given-names>LC</given-names></name>, <name><surname>Sharpless</surname><given-names>KE</given-names></name>, <name><surname>Wise</surname><given-names>SA</given-names></name>, <name><surname>Yen</surname><given-names>JH</given-names></name>, <name><surname>Schleicher</surname><given-names>RL</given-names></name>, <name><surname>Chaudhary-Webb</surname><given-names>M</given-names></name>, <name><surname>Pfeiffer</surname><given-names>CM</given-names></name>, <name><surname>Betz</surname><given-names>JM</given-names></name>, <name><surname>Coates</surname><given-names>PM</given-names></name>, <name><surname>Picciano</surname><given-names>MF</given-names></name>, <article-title>Development and certification of a standard reference material for vitamin D metabolites in human serum</article-title>, <source>Anal. Chem</source>. <volume>84</volume>(<issue>2</issue>) (<year>2012</year>) <fpage>956</fpage>&#x02013;<lpage>962</lpage>.<pub-id pub-id-type="pmid">22141317</pub-id></mixed-citation></ref><ref id="R5"><label>[5]</label><mixed-citation publication-type="web"><source>Joint Committee for Traceability in Laboratory Medicine (JCTLM)</source>. <comment><ext-link ext-link-type="uri" xlink:href="http://www.bipm.org/en/committees/jc/jctlm/;JCTLMmethodidentifier:C12RMP2andC12RMP3">http://www.bipm.org/en/committees/jc/jctlm/;JCTLMmethodidentifier:C12RMP2andC12RMP3</ext-link></comment> (<date-in-citation>accessed 04 June 2018</date-in-citation>).</mixed-citation></ref><ref id="R6"><label>[6]</label><mixed-citation publication-type="book"><collab>Centers for Disease Control and Prevention (CDC)</collab>. <source>Laboratory Quality Assurance and Standardization Programs: Hormone and Vitamin D Standardization Program</source>. <publisher-loc>Atlanta (GA)</publisher-loc>, <publisher-name>CDC</publisher-name>, <comment><ext-link ext-link-type="uri" xlink:href="http://www.cdc.gov/labstandards/hs.html">http://www.cdc.gov/labstandards/hs.html</ext-link></comment> (<date-in-citation>accessed 05 June 2018</date-in-citation>).</mixed-citation></ref><ref id="R7"><label>[7]</label><mixed-citation publication-type="journal"><name><surname>St&#x000f6;ckl</surname><given-names>D</given-names></name>, <name><surname>Sluss</surname><given-names>PM</given-names></name>, <name><surname>Thienpont</surname><given-names>LM</given-names></name>, <article-title>Specifications for trueness and precision of a reference measurement system for serum/plasma 25-hydroxyvitamin D analysis</article-title>, <source>Clin. Chim. Acta</source>
<volume>408</volume> (<year>2009</year>) <fpage>8</fpage>&#x02013;<lpage>13</lpage>.<pub-id pub-id-type="pmid">19563791</pub-id></mixed-citation></ref><ref id="R8"><label>[8]</label><mixed-citation publication-type="journal"><name><surname>Thienpont</surname><given-names>LM</given-names></name>, <name><surname>Stepman</surname><given-names>HC</given-names></name>, <name><surname>Vesper</surname><given-names>HW</given-names></name>, <article-title>Standardization of measurements of 25 hydroxyvitamin D<sub>3</sub> and D<sub>2</sub></article-title>, <source>Scand. J. Clin. Lab. Invest</source>. <volume>72</volume> (Suppl <issue>243</issue>) (<year>2012</year>) <fpage>41</fpage>&#x02013;<lpage>49</lpage>.</mixed-citation></ref><ref id="R9"><label>[9]</label><mixed-citation publication-type="journal"><name><surname>Schleicher</surname><given-names>RL</given-names></name>, <name><surname>Encisco</surname><given-names>SE</given-names></name>, <name><surname>Chaudhary-Webb</surname><given-names>M</given-names></name>, <name><surname>Paliakov</surname><given-names>EM</given-names></name>, <name><surname>McCoy</surname><given-names>LF</given-names></name>, <name><surname>Pfeiffer</surname><given-names>CM</given-names></name>, <article-title>Isotope dilution ultra performance liquid chromatography-tandem mass spectrometry method for simultaneous measurement of 25-hydroxyvitamin D<sub>2</sub>, 25-hydroxyvitamin D<sub>3</sub> and 3-epi-25-hydroxyvitamin D<sub>3</sub> in human serum</article-title>, <source>Clin. Chim. Acta</source>
<volume>412</volume> (<year>2011</year>) <fpage>1594</fpage>&#x02013;<lpage>1599</lpage>.<pub-id pub-id-type="pmid">21601563</pub-id></mixed-citation></ref><ref id="R10"><label>[10]</label><mixed-citation publication-type="book"><collab>National Institute of Standards and Technology</collab>, <source>Certificate of analysis, standard reference material 2972a: 25-Hydroxyvitamin D calibration solutions</source>, <publisher-name>NIST</publisher-name>, <publisher-loc>Gaithersburg</publisher-loc>, <year>2014</year>.</mixed-citation></ref><ref id="R11"><label>[11]</label><mixed-citation publication-type="book"><collab>National Institute of Standards and Technology</collab>, <source>Certificate of analysis, standard reference material 972a: vitamin D metabolites in frozen human serum</source>, <publisher-name>NIST</publisher-name>, <publisher-loc>Gaithersburg</publisher-loc>, <year>2017</year>.</mixed-citation></ref><ref id="R12"><label>[12]</label><mixed-citation publication-type="journal"><name><surname>Thompson</surname><given-names>M</given-names></name>, <name><surname>Ellison</surname><given-names>SLR</given-names></name>, <name><surname>Wood</surname><given-names>R</given-names></name>, <article-title>International harmonized protocol for proficiency testing of analytical chemistry laboratories (IUPAC technical report)</article-title>, <source>Pure Appl. Chem</source>. <volume>78</volume>(<issue>1</issue>) (<year>2006</year>) <fpage>145</fpage>&#x02013;<lpage>196</lpage>.</mixed-citation></ref><ref id="R13"><label>[13]</label><mixed-citation publication-type="book"><collab>Royal Society of Chemistry</collab>, <source>Analytical Methods Committee, Understanding and acting on scores obtained in proficiency testing schemes</source>, <publisher-name>AMC Technical Brief</publisher-name> No <fpage>11</fpage>, Dec <year>2002</year>.</mixed-citation></ref><ref id="R14"><label>[14]</label><mixed-citation publication-type="journal"><name><surname>Schleicher</surname><given-names>RL</given-names></name>, <name><surname>Sternberg</surname><given-names>MR</given-names></name>, <name><surname>Looker</surname><given-names>AC</given-names></name>, <name><surname>Yetley</surname><given-names>EA</given-names></name>, <name><surname>Lacher</surname><given-names>DA</given-names></name>, <name><surname>Sempos</surname><given-names>CT</given-names></name>, <name><surname>Taylor</surname><given-names>CL</given-names></name>, <name><surname>Durazo-Arvizu</surname><given-names>RA</given-names></name>, <name><surname>Maw</surname><given-names>KL</given-names></name>, <name><surname>Chaudhary-Webb</surname><given-names>M</given-names></name>, <name><surname>Johnson</surname><given-names>CL</given-names></name>, <name><surname>Pfeiffer</surname><given-names>CM</given-names></name>, <article-title>National estimates of serum total 25-hydroxyvitamin D and metabolite concentrations measured by Liquid-chromatography-tandem mass spectrometry in the US population during 2007&#x02013;2010</article-title>, <source>J. Nutr</source>. <volume>146</volume> (<issue>5</issue>) (<year>2016</year>) <fpage>1051</fpage>&#x02013;<lpage>1061</lpage>.<pub-id pub-id-type="pmid">27052537</pub-id></mixed-citation></ref><ref id="R15"><label>[15]</label><mixed-citation publication-type="journal"><name><surname>Le Goff</surname><given-names>C</given-names></name>, <name><surname>Cavalier</surname><given-names>E</given-names></name>, <name><surname>Souberbielle</surname><given-names>J-C</given-names></name>, <name><surname>Gonz&#x000e1;les-Antuna</surname><given-names>A</given-names></name>, <name><surname>Delvin</surname><given-names>E</given-names></name>, <article-title>Measurement of circulating 25-hydroxyvitamin D: A historical review</article-title>, <source>Practical Lab. Med</source>. <volume>2</volume> (<year>2015</year>) <fpage>1</fpage>&#x02013;<lpage>14</lpage>.</mixed-citation></ref><ref id="R16"><label>[16]</label><mixed-citation publication-type="journal"><name><surname>Miller</surname><given-names>WG</given-names></name>, <name><surname>Jones</surname><given-names>GRD</given-names></name>, <name><surname>Horowitz</surname><given-names>GL</given-names></name>, <name><surname>Weykamp</surname><given-names>C</given-names></name>, <article-title>Proficiency testing/external quality assessment: current challenges and future directions</article-title>, <source>Clin. Chem</source>. <volume>57</volume> (<issue>12</issue>) (<year>2011</year>) <fpage>1670</fpage>&#x02013;<lpage>1680</lpage>.<pub-id pub-id-type="pmid">21965556</pub-id></mixed-citation></ref></ref-list></back><floats-group><fig id="F1" orientation="portrait" position="float"><label>Fig. 1</label><caption><p id="P56">Study design of daily and campaign reference measurements. Footnote:<italic>Daily:</italic>RM, reference material prepared in-house and value-assigned at Ghent University&#x02019;s reference laboratoryNIST RM, commercial standard reference materialCDC candidate RM, unknown to be value-assigned using CDC RMPsX<sub>m</sub>, mean measurement (ng/mL) from two independent reference measurements, X1 and X2, on the same dayX, single daily reference measurementRelative pair difference, absolute percent difference between X<sub>1</sub> and X<sub>2</sub>, divided by the X<sub>m</sub>Zeta-score<sub>D</sub>, calculated daily for RM with target values<italic>Campaign:</italic>X<sub>m</sub>, mean measurement (ng/mL) from two independent reference measurements, X1 and X2 per dayX<sub>C</sub>, mean measurement (ng/mL) from 3 independent measurement seriesX, single daily reference measurementCV<sub>C</sub>, mean coefficients of variation for campaign (average from 3 runs)Zeta-score<sub>C</sub>, calculated for campaign (average from 3 runs) for RM with target values</p></caption><graphic xlink:href="nihms-1553481-f0001"/></fig><fig id="F2" orientation="portrait" position="float"><label>Fig. 2</label><caption><p id="P57">Daily 25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub> reference (panel A and B) and routine (panel C and D) imprecision (relative pair difference, %) in 20 serum materials (3 days, n=60). Solid line represents the median relative pair difference (%) and dotted line represents the 95<sup>th</sup> percentile.</p></caption><graphic xlink:href="nihms-1553481-f0002"/></fig><fig id="F3" orientation="portrait" position="float"><label>Fig. 3</label><caption><p id="P58">Campaign 25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub> imprecision of reference (panel A and B) and routine (panel C and D) in 20 serum materials (n=20). Solid line represents the median CV (%) and dotted line represents the 95<sup>th</sup> percentile.</p></caption><graphic xlink:href="nihms-1553481-f0003"/></fig><fig id="F4" orientation="portrait" position="float"><label>Fig. 4</label><caption><p id="P59">Trueness performance (7 campaigns) of CDC&#x02019;s daily reference measurements for 25(OH)D<sub>3</sub> (panels A and B) and 25(OH)D<sub>2</sub> (panels C and D) using 2 reference materials, SRM 972a-2 (panels A and C) and RM 003 (panels B and D).Footnote: zeta-score: dark gray vertical lines; bias: light gray line.</p></caption><graphic xlink:href="nihms-1553481-f0004"/></fig><fig id="F5" orientation="portrait" position="float"><label>Fig. 5</label><caption><p id="P60">Trueness performance (7 campaigns) of CDC&#x02019;s campaign reference measurements for 25(OH)D<sub>3</sub> (panels A and B) and 25(OH)D<sub>2</sub> (panels C and D) using 2 RMs, SRM 972a-2 (panels A and C) and RM 003 (panels B and D).Footnote: zeta-score: dark gray vertical lines, bias: light gray line.</p></caption><graphic xlink:href="nihms-1553481-f0005"/></fig><table-wrap id="T1" position="float" orientation="landscape"><label>Table 1.</label><caption><p id="P61">Quality performance specifications</p></caption><table frame="hsides" rules="none"><colgroup span="1"><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/></colgroup><thead><tr><th align="left" valign="top" rowspan="1" colspan="1">Quality parameter</th><th align="left" valign="top" rowspan="1" colspan="1">Procedure</th><th align="left" valign="top" rowspan="1" colspan="1">Acceptability criteria</th></tr><tr><th colspan="3" align="left" valign="top" rowspan="1"><hr/></th></tr></thead><tbody><tr><td align="left" valign="top" rowspan="1" colspan="1">Daily imprecision</td><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Calculate daily relative pair difference from duplicate mass concentration results (X<sub>1</sub> and X<sub>2</sub>) for each of 2 RMs for each analyte</td><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Experimentally derived as 95<sup>th</sup> percentile of relative pair difference for 20 specimens analyzed in duplicate over 3 days (n=60)</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Relative pair difference for each RM needs to be within acceptance limit</td><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Relative pair difference &#x02264;5% for 25(OH)D<sub>3</sub> and &#x02264;7% for 25(OH)D<sub>2</sub></td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Repeat run if relative pair difference for &#x02265;1 RM is outside acceptance limit</td><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Criteria applied to concentrations above the LOQ</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Repeat CDC candidate RM if relative pair difference is outside acceptance limit</td><td align="left" valign="top" rowspan="1" colspan="1"/></tr><tr><td colspan="3" align="left" valign="top" rowspan="1"><hr/></td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Campaign imprecision</td><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Calculate campaign CV<sub>C</sub> from 3 daily mass concentration results (X<sub>D</sub>) for each of 3 RMs for each analyte</td><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Experimentally derived as 95<sup>th</sup> percentile of CV<sub>C</sub> for 20 specimens analyzed in duplicate over 3 days (n=20)</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; CV<sub>C</sub> for each RM needs to be within acceptance limit</td><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; CV<sub>C</sub> &#x02264;3% for 25(OH)D<sub>3</sub> and &#x02264;4% for 25(OH)D<sub>2</sub></td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Repeat run if CV<sub>C</sub> for &#x02265;1 RMs is outside acceptance limit</td><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Criteria applied to concentrations above the LOQ</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Repeat CDC candidate RM if CV<sub>C</sub> is outside acceptance limit</td><td align="left" valign="top" rowspan="1" colspan="1"/></tr><tr><td colspan="3" align="left" valign="top" rowspan="1"><hr/></td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Daily accuracy</td><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Calculate daily zeta-score<sub>D</sub> from daily mass concentration result (X<sub>D</sub>) for each of 3 RMs for each analyte</td><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Zeta-score<sub>D</sub> &#x02264; |2| for 25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub><break/>&#x02022; Criteria applied to concentrations above the LOQ</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Zeta-score<sub>D</sub> for each RM needs to be within acceptance limit</td><td align="left" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Repeat run if zeta-score<sub>D</sub> for &#x02265;1 RMs is outside acceptance limit</td><td align="left" valign="top" rowspan="1" colspan="1"/></tr><tr><td colspan="3" align="left" valign="top" rowspan="1"><hr/></td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1">Campaign accuracy</td><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Calculate campaign zeta-score<sub>C</sub> from campaign mass concentration result (X<sub>C</sub>) for each of 3 RMs for each analyte</td><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Zeta-score<sub>C</sub> &#x02264; |1| for 25(OH)D<sub>3</sub> and 25(OH)D<sub>2</sub><break/>&#x02022; Criteria applied to concentrations above the LOQ</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Zeta-score<sub>C</sub> for each RM needs to be within acceptance limit</td><td align="left" valign="top" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">&#x02022; Repeat run if zeta-score<sub>C</sub> for &#x02265;1 RMs is outside acceptance limit</td><td align="left" valign="top" rowspan="1" colspan="1"/></tr></tbody></table></table-wrap><table-wrap id="T2" position="float" orientation="portrait"><label>Table 2.</label><caption><p id="P62">Daily and campaign imprecision for reference and routine measurements. Each daily estimate consisted of 20 specimens tested in 3 runs.</p></caption><table frame="hsides" rules="none"><colgroup span="1"><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/></colgroup><thead><tr><th align="left" valign="top" rowspan="1" colspan="1"/><th align="left" valign="top" rowspan="1" colspan="1"/><th colspan="4" align="center" valign="top" rowspan="1">Imprecision (%)</th></tr><tr><th align="left" valign="top" rowspan="1" colspan="1"/><th align="left" valign="top" rowspan="1" colspan="1"/><th colspan="2" align="center" valign="top" rowspan="1">Reference Method</th><th colspan="2" align="center" valign="top" rowspan="1">Routine Method</th></tr><tr><th align="left" valign="top" rowspan="1" colspan="1"/><th align="left" valign="top" rowspan="1" colspan="1"/><th align="center" valign="top" rowspan="1" colspan="1">25(OH)D<sub>3</sub></th><th align="center" valign="top" rowspan="1" colspan="1">25(OH)D<sub>2</sub></th><th align="center" valign="top" rowspan="1" colspan="1">25(OH)D<sub>3</sub></th><th align="center" valign="top" rowspan="1" colspan="1">25(OH)D<sub>2</sub></th></tr><tr><th colspan="6" align="left" valign="top" rowspan="1"><hr/></th></tr></thead><tbody><tr><td align="left" valign="top" rowspan="1" colspan="1"><bold>Daily</bold></td><td align="left" valign="top" rowspan="1" colspan="1">Mean relative pair difference</td><td align="center" valign="top" rowspan="1" colspan="1">2.2</td><td align="center" valign="top" rowspan="1" colspan="1">3.0</td><td align="center" valign="top" rowspan="1" colspan="1">4.8</td><td align="center" valign="top" rowspan="1" colspan="1">6.7</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">Median relative pair difference</td><td align="center" valign="top" rowspan="1" colspan="1">2.1</td><td align="center" valign="top" rowspan="1" colspan="1">3.0</td><td align="center" valign="top" rowspan="1" colspan="1">4.0</td><td align="center" valign="top" rowspan="1" colspan="1">5.3</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">95<sup>th</sup> %ile relative pair difference</td><td align="center" valign="top" rowspan="1" colspan="1"><bold>5.2</bold></td><td align="center" valign="top" rowspan="1" colspan="1"><bold>7.0</bold></td><td align="center" valign="top" rowspan="1" colspan="1">11</td><td align="center" valign="top" rowspan="1" colspan="1">21</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"><bold>Campaign</bold></td><td align="left" valign="top" rowspan="1" colspan="1">Mean CV</td><td align="center" valign="top" rowspan="1" colspan="1">1.6</td><td align="center" valign="top" rowspan="1" colspan="1">2.2</td><td align="center" valign="top" rowspan="1" colspan="1">2.4</td><td align="center" valign="top" rowspan="1" colspan="1">5.4</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">Median CV</td><td align="center" valign="top" rowspan="1" colspan="1">1.7</td><td align="center" valign="top" rowspan="1" colspan="1">2.2</td><td align="center" valign="top" rowspan="1" colspan="1">2.3</td><td align="center" valign="top" rowspan="1" colspan="1">4.9</td></tr><tr><td align="left" valign="top" rowspan="1" colspan="1"/><td align="left" valign="top" rowspan="1" colspan="1">95<sup>th</sup> %ile CV</td><td align="center" valign="top" rowspan="1" colspan="1"><bold>3.1</bold></td><td align="center" valign="top" rowspan="1" colspan="1"><bold>4.2</bold></td><td align="center" valign="top" rowspan="1" colspan="1">6.8</td><td align="center" valign="top" rowspan="1" colspan="1">13</td></tr></tbody></table><table-wrap-foot><fn id="TFN1"><p id="P63">Daily relative pair difference (%) was calculated from the absolute difference of the two daily measurements divided by the mean of the two measurements times 100.</p></fn><fn id="TFN2"><p id="P64">Campaign CV (%) was the mean of 6 independent preparations in 3 runs (duplicates per run)</p></fn></table-wrap-foot></table-wrap></floats-group></article>