<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" article-type="research-article"><?properties manuscript?><front><journal-meta><journal-id journal-id-type="nlm-journal-id">100897395</journal-id><journal-id journal-id-type="pubmed-jr-id">29765</journal-id><journal-id journal-id-type="nlm-ta">Obes Rev</journal-id><journal-id journal-id-type="iso-abbrev">Obes Rev</journal-id><journal-title-group><journal-title>Obesity reviews : an official journal of the International Association for the Study of Obesity</journal-title></journal-title-group><issn pub-type="ppub">1467-7881</issn><issn pub-type="epub">1467-789X</issn></journal-meta><article-meta><article-id pub-id-type="pmid">27071775</article-id><article-id pub-id-type="pmc">5996374</article-id><article-id pub-id-type="doi">10.1111/obr.12412</article-id><article-id pub-id-type="manuscript">NIHMS969921</article-id><article-categories><subj-group subj-group-type="heading"><subject>Article</subject></subj-group></article-categories><title-group><article-title>Methodological quality of behavioural weight loss studies: a systematic review</article-title></title-group><contrib-group><contrib contrib-type="author"><name><surname>Lemon</surname><given-names>S. C.</given-names></name><xref ref-type="aff" rid="A1">1</xref></contrib><contrib contrib-type="author"><name><surname>Wang</surname><given-names>M. L.</given-names></name><xref ref-type="aff" rid="A2">2</xref></contrib><contrib contrib-type="author"><name><surname>Haughton</surname><given-names>C. F.</given-names></name><xref ref-type="aff" rid="A1">1</xref></contrib><contrib contrib-type="author"><name><surname>Estabrook</surname><given-names>D. P.</given-names></name><xref ref-type="aff" rid="A1">1</xref></contrib><contrib contrib-type="author"><name><surname>Frisard</surname><given-names>C. F.</given-names></name><xref ref-type="aff" rid="A1">1</xref></contrib><contrib contrib-type="author"><name><surname>Pagoto</surname><given-names>S. L.</given-names></name><xref ref-type="aff" rid="A1">1</xref></contrib></contrib-group><aff id="A1">
<label>1</label>UMass Worcester Prevention Research Center, Division of Preventive and Behavioral Medicine, University of Massachusetts Medical School, Worcester, MA, USA</aff><aff id="A2">
<label>2</label>Department of Community Health Sciences, Boston University School of Public Health, Boston, MA, USA</aff><author-notes><corresp id="FN1">Address for correspondence: Professor S. C. Lemon, UMass Worcester Prevention Research Center, Division of Preventive and Behavioral Medicine, University of Massachusetts Medical School, 55 Lake Avenue North, Worcester, MA 01655, USA., <email>Stephenie.Lemon@umassmed.edu</email></corresp></author-notes><pub-date pub-type="nihms-submitted"><day>6</day><month>6</month><year>2018</year></pub-date><pub-date pub-type="epub"><day>12</day><month>4</month><year>2016</year></pub-date><pub-date pub-type="ppub"><month>7</month><year>2016</year></pub-date><pub-date pub-type="pmc-release"><day>12</day><month>6</month><year>2018</year></pub-date><volume>17</volume><issue>7</issue><fpage>636</fpage><lpage>644</lpage><!--elocation-id from pubmed: 10.1111/obr.12412--><abstract><title>Summary</title><p id="P1">This systematic review assessed the methodological quality of behavioural weight loss intervention studies conducted among adults and associations between quality and statistically significant weight loss outcome, strength of intervention effectiveness and sample size. Searches for trials published between January, 2009 and December, 2014 were conducted using PUBMED, MEDLINE and PSYCINFO and identified ninety studies. Methodological quality indicators included study design, anthropometric measurement approach, sample size calculations, intent-to-treat (ITT) analysis, loss to follow-up rate, missing data strategy, sampling strategy, report of treatment receipt and report of intervention fidelity (mean = 6.3). Indicators most commonly utilized included randomized design (100%), objectively measured anthropometrics (96.7%), ITT analysis (86.7%) and reporting treatment adherence (76.7%). Most studies (62.2%) had a follow-up rate &#x0003e;75% and reported a loss to follow-up analytic strategy or minimal missing data (69.9%). Describing intervention fidelity (34.4%) and sampling from a known population (41.1%) were least common. Methodological quality was not associated with reporting a statistically significant result, effect size or sample size. This review found the published literature of behavioural weight loss trials to be of high quality for specific indicators, including study design and measurement. Identified for improvement include utilization of more rigorous statistical approaches to loss to follow up and better fidelity reporting.</p></abstract><kwd-group><kwd>Intervention</kwd><kwd>quality</kwd><kwd>review</kwd><kwd>weight loss</kwd></kwd-group></article-meta></front><body><sec sec-type="intro" id="S1"><title>Introduction</title><p id="P2">Scientific articles and popular press publications have revealed broad public skepticism about scientific findings (<xref rid="R1" ref-type="bibr">1</xref>). This attitude persists despite efforts to make research more accessible and immediate. The quality of behavioural research specifically, including its reproducibility (<xref rid="R2" ref-type="bibr">2</xref>), has also been called into question.</p><p id="P3">The efficacy of behavioural weight loss interventions in particular has been questioned. Benchmark studies such as the Diabetes Prevention Program (<xref rid="R3" ref-type="bibr">3</xref>), the Finnish Diabetes Prevention Study (<xref rid="R4" ref-type="bibr">4</xref>), the Da Qing Study (<xref rid="R5" ref-type="bibr">5</xref>) and the Look Ahead study (<xref rid="R6" ref-type="bibr">6</xref>) have found lifestyle interventions to be efficacious and associated with numerous long-term benefits, including sustained weight loss, improvements in metabolic measures such as blood pressure, fitness, glycemic control and cholesterol levels (<xref rid="R6" ref-type="bibr">6</xref>), and decreased incidence of type 2 diabetes. A large number of efficacy and effectiveness studies have attempted to replicate and/or adapt the interventions used in these studies across diverse settings and populations and using a range of different intervention modalities (e.g. individual counselling, group counselling, web-based, multi-method approaches), number of visits and treatment durations (<xref rid="R7" ref-type="bibr">7</xref>&#x02013;<xref rid="R10" ref-type="bibr">10</xref>).</p><p id="P4">Although not all studies have observed statistically significant or clinically meaningful weight loss, a systematic review and meta-analysis systematic review of 80 studies with a minimum 1-year follow-up found that behavioural weight loss interventions aimed at reducing caloric intake and increasing caloric expenditure produce modest weight loss (<xref rid="R7" ref-type="bibr">7</xref>). The US Preventive Services Task Force recommends that clinicians screen all adults for obesity and offer or refer patients with body mass indexes (BMI) of 30 or higher to intensive behavioural treatment (Grade B recommendation), a recommendation that has been adopted by the American Academy of Family Physicians (<xref rid="R11" ref-type="bibr">11</xref>). Despite this evidence, behavioural weight loss interventions have been referred to as ineffective because of the high prevalence of weight regain and lacking methodological rigour in trials (<xref rid="R12" ref-type="bibr">12</xref>&#x02013;<xref rid="R15" ref-type="bibr">15</xref>). In depth examination of the methodological quality of trials in this area might increase the veracity of findings within science and in public opinion.</p><p id="P5">Given the importance of quality in informing practice, policies and public perceptions, evaluation of the body of literature testing behavioural weight loss interventions from a quality perspective is critical to advance the field. The goal of this systematic review was to assess the methodological quality, of published studies testing the effectiveness of behavioural interventions for weight loss among adults and to examine whether quality was associated with report of a statistically significant weight loss outcome, strength of intervention effectiveness and sample size.</p></sec><sec sec-type="methods" id="S2"><title>Methods</title><sec id="S3"><title>Search strategy</title><p id="P6">Literature searches for behavioural weight loss trials published between January 1, 2009 and December 31, 2014 were conducted using PUBMED, MEDLINE and PSYCINFO search engines. Search terms included: &#x02018;weight&#x02019;, &#x02018;loss&#x02019;, &#x02018;overweight&#x02019;, &#x02018;obese&#x02019;, &#x02018;intervention&#x02019; and &#x02018;trial&#x02019;. The reference lists of studies that met inclusion criteria (described below) were searched to identify additional articles.</p></sec><sec id="S4"><title>Inclusion and exclusion criteria</title><p id="P7">Study inclusion criteria included: (i) weight or BMI as a primary or secondary outcome; (ii) intervention that tested a behavioural approach to dietary and/or physical activity change for the purpose of weight loss; (iii) inclusion of an adult (age 18+), non-hospitalized or institutionalized sample, (iv) a minimum sample size of 100 total participants at baseline; (v) a minimum of 6 month follow-up period and (vi) publication in English in a peer-reviewed journal. Exclusion criteria included: (i) study aim was to evaluate efficacy of a pharmacological or surgical intervention; (ii) basic science study; (iii) policy and/or environmental intervention only; (iv) evaluation of an already implemented program; (v) survey development study; (vi) weight maintenance or weight gain prevention was the outcome and (vii) secondary/mediation analysis of a trial. For interventions that had multiple publications meeting inclusion criteria, only the article with the final data assessment point was reviewed.</p></sec><sec id="S5"><title>Initial article review process</title><p id="P8">To determine whether articles met the study inclusion and exclusion criteria, an initial title review of all articles generated from the search was performed by one author. Two authors (SL and CH) then independently reviewed the abstracts of articles that passed initial review. Inter-rater reliability was calculated (93.5%). The authors met to discuss articles with discrepant codings to achieve consensus. Full text articles were pulled for those deemed to meet inclusion criteria based upon abstract review.</p></sec><sec id="S6"><title>Coding</title><p id="P9">Three authors (SL, CH, MW) developed the initial coding criteria and coding form. This form was pilot tested on five articles by four authors (SL, CH, MW, DE). These rating forms were reviewed by the study team item by item. Items that yielded any discrepancies were discussed to clarify meaning and wording. This process was repeated on a new set of five articles until 100% agreement was achieved on all data elements. A protocol document that described coding rules for each item was then created for use by coders. Each article had two reviewers. The reviewers met to compare ratings, code inter-rater agreement (IRA) for each item and discuss inconsistencies to come to consensus.</p></sec><sec id="S7"><title>Coding criteria</title><p id="P10">Study elements abstracted included intervention characteristics, study design characteristics and methodological quality indicators.</p><sec id="S8"><title>Intervention characteristics</title><p id="P11">assessed included the intervention delivery setting (health care, community organization, worksite, church, college, neighbourhood, academic/lab, combination) (IRA = 86.0%), behavioural target(s) of the intervention (diet, exercise or both) (IRA = 96.5%), intervention modality (group or individual; in-person, phone, web, other technology) (IRA = 100%) and number of intervention sessions offered/dose (IRA = 100%).</p></sec><sec id="S9"><title>Study design characteristics</title><p id="P12">Factors related to study methodology included number of study conditions (IRA = 100%), comparison condition (IRA = 98.8%), level of randomization (IRA = 100%), sample size at baseline (IRA = 100%) and final follow-up assessment time period (IRA = 100%), and final follow-up assessment time period in months (IRA = 98.8%). Weight-related outcomes used in the study included change in weight (in both lbs. and kgs.), change in BMI, percent of weight change and achieving a specific weight loss percentage (typically 5% of baseline weight) (IRA = 86.2%).</p></sec><sec id="S10"><title>Study effectiveness</title><p id="P13">was assessed in two ways. First, we abstracted whether or not a statistically significant result (<italic>p</italic> &#x0003c; .05) was reported for any weight-related outcome at the final assessment point (IRA = 98.8%). Second, within each intervention condition, baseline values and results at the final assessment point were abstracted for each weight-related outcome that was reported (IRA = 100%). Types of outcomes included were change in weight (in either pounds or kilogrammes), change in BMI, change in percentage weight loss and the percentage of the sample that achieved a pre-defined weight loss threshold (e.g. 5% of baseline weight). For intervention conditions that included a given outcome, we calculated the median and interquartile range across those results. Then, within each intervention condition that included change in weight or BMI, a Cohen&#x02019;s D statistic was computed in order to standardize estimates across the multiple ways in which weight-related outcomes were assessed. Cohen&#x02019;s D was calculated as the standardized mean change from baseline in weight or BMI. Mean change in weight or BMI at the last follow-up visit was divided by the standard deviation of the baseline measure. The effect size was very similar for studies reporting both change in weight and change in BMI. Because of this, when both were reported, we used effect size for change in weight in our analyses as this was the most commonly reported outcome measure.</p></sec><sec id="S11"><title>Methodological quality</title><p id="P14">We included ten indicators reflecting key components of methodological and/or reporting quality (<xref rid="R16" ref-type="bibr">16</xref>). Quality indicators were an amalgam of known standards for rating intervention studies that were adapted by the study team to be specific for behavioural weight loss interventions (<xref rid="R17" ref-type="bibr">17</xref>&#x02013;<xref rid="R19" ref-type="bibr">19</xref>). Individual indicators, rather than a summary score, were used so that specific elements could be assessed (<xref rid="R17" ref-type="bibr">17</xref>,<xref rid="R18" ref-type="bibr">18</xref>). The quality indicators selected therefore covered these two areas and were as follows:
<list list-type="order" id="L1"><list-item><p id="P15"><italic>Study design</italic> was assessed (randomized controlled trial (RCT), non-randomized design with control group, single arm design). RCTs were considered the gold standard. Level of randomization (i.e. individual participant, provider, site) was also assessed (IRA = 100%).</p></list-item><list-item><p id="P16">We assessed the <italic>measurement approach</italic> used to assess weight and/or height data. Options included: measured by research staff, obtained from medical or other records, self-report or a combination of these (IRA = 100%).</p></list-item><list-item><p id="P17">Whether or not <italic>power/sample size calculations</italic> were reported was assessed (yes/no) (IRA = 95.3%).</p></list-item><list-item><p id="P18">We defined &#x02018;<italic>intent-to-treat</italic>&#x02019; <italic>(ITT)</italic> analysis as whether or not participants&#x02019; data were included in the intervention condition to which they were randomized in the analysis, regardless of whether or not they participated in/attended that intervention/program/ treatment. ITT was considered independently of how missing data at follow-up was handled. Because of discrepancies in how investigators define ITT (<xref rid="R20" ref-type="bibr">20</xref>&#x02013;<xref rid="R23" ref-type="bibr">23</xref>), text was reviewed for a clear definition of ITT consistent with the one used in this study and CONSORT-type diagrams and denominators in tables were reviewed to establish whether the ITT criteria was met. ITT was classified as yes or no (IRA = 95.3%).</p></list-item><list-item><p id="P19"><italic>Loss to follow-up rate</italic> was assessed by determining the percentage of individuals at baseline who completed assessments at the final follow-up point. We used the most conservative approach, not taking into consideration specific study designed exclusion criteria (e.g. some studies excluded participants who became pregnant or began taking weight gain medications), we used this approach to apply a consistent definition across studies (IRA = 93.5%). While there is no universal consensus on what loss follow-up rate results in biassed study results, we considered less than 25% to indicate high quality (<xref rid="R24" ref-type="bibr">24</xref>), as suggested by the Methodological Quality Rating Scale.</p></list-item><list-item><p id="P20"><italic>Missing data strategy</italic> refers to the approach(es) used to handle missing data at follow-up assessments. Classifications were based on standards described in 2010 by the National Research Council (<xref rid="R25" ref-type="bibr">25</xref>) and included: no or minimal (&#x0003e;95% of the baseline sample included in the final follow-up time point) missing data (<xref rid="R26" ref-type="bibr">26</xref>), multiple imputation, generalized estimating equations or other non-parametric model-based approaches, inverse probability weighting, maximum likelihood/parametric models, imputed weight gain carried forward; single imputation (i.e. last or baseline carried forward) and complete case analysis with and without comparisons to baseline sample). We additionally assessed whether sensitivity analyses were performed using more than one method (IRA = 96.5%).</p></list-item><list-item><p id="P21">The <italic>sampling strategy</italic> used to enrol participants was classified as being from a known population (i.e. having a denominator) or not (IRA = 91.9%). Among those studies with a known population, we assessed whether comparisons were made between the included sample and the known population (IRA = 100%).</p></list-item><list-item><p id="P22">We assessed whether data were reported in participant <italic>treatment receipt or adherence</italic> was reported (IRA = 97.7%).</p></list-item><list-item><p id="P23">Last, we assessed whether attempts to ensure <italic>intervention fidelity</italic> were described or reported (IRA = 95.3%). We additionally assessed the reporting/methods used for fidelity which was categorized as none stated, followed a standardized protocol, or quality review by study staff.</p></list-item></list></p></sec></sec><sec id="S12"><title>Analysis</title><p id="P24">The distribution of intervention characteristics, study design characteristics and each of the nine methodological quality indicators were computed among the included studies. To determine whether or not methodological quality was associated with intervention effectiveness, bivariate comparisons were made to assess the associations of the sum of quality indicators with whether or not the study reported a statistically significant outcome at the final assessment point (<italic>t</italic>-test) and the strength of association/Cohen&#x02019;s D statistic (using Pearson&#x02019;s correlation coefficient). The latter was computed only among those studies that included change in weight (lbs. or kgs.) or BMI as an outcome. The bivariate association of the study sample size and the sum of number of quality indicators was also assessed (one-way ANOVA).</p></sec></sec><sec sec-type="results" id="S13"><title>Results</title><sec id="S14"><title>Study selection</title><p id="P25">A total of 5,499 non-duplicate articles were retrieved from the initial search strategy (<xref ref-type="fig" rid="F1">Fig. 1</xref>). Of these, 4,765 were excluded based on title review; 734 abstracts were reviewed. Of these, 543 were excluded and 191 articles were selected for full review. An additional six articles were identified from reference review for full review, resulting in 197 articles for full review. Of these, 107 were deemed ineligible. The most common reasons for ineligibility included: follow-up period of less than 6 months (<italic>n</italic> = 30), results presented in another publication (<italic>n</italic> = 17) and methods paper (<italic>n</italic> = 9). A total of 90 articles were included in this review; 156 unique interventions were tested in these 90 studies. A list of these articles is included in the <xref ref-type="supplementary-material" rid="SD1">online Appendix</xref>.</p></sec><sec id="S15"><title>Study characteristics</title><p id="P26"><xref ref-type="table" rid="T1">Table 1</xref> describes the intervention characteristics of the included studies. Most targeted both diet and physical activity (93.3%). The most common primary intervention delivery settings were health care (38.9%) and academic research labs (36.7%). A range of intervention delivery modalities were used, with many studies reporting multiple modalities within a single intervention condition and/or across intervention conditions. More than half of the interventions (51.3%) offered more than 20 sessions.</p><p id="P27"><xref ref-type="table" rid="T2">Table 2</xref> presents a summary of the design characteristics and outcomes of the included studies. The majority included two treatment conditions (66.7%). A wide variety of comparison conditions were used, with comparative effectiveness (i.e. comparing one intervention approach to other) (48.9%) (either with or without a no intervention control group), usual care (22.2%) and materials only (20.0%) being the most common. Most studies included more than one outcome measure. The majority of the 90 studies included change in weight and/or BMI as outcomes (87.7%). Among those that did not include such a measure, outcomes included proportion of participants who reached a certain threshold of weight loss (typically 5%) (1.1%); percentage of weight loss achieved (8.9%); or both a threshold of weight loss and weight loss percentage (2.2%).</p><p id="P28">One hundred forty of the 156 unique interventions included in the 90 studies presented change in weight as an outcome (in pounds or kilogrammes). Converting each of these to pounds, median weight loss in the 140 interventions was &#x02212;5.13 (interquartile range (IQR) = &#x02212;10.14 to &#x02212;2.55). The median change in BMI, among the 86 interventions that assessed it, was &#x02212;.80 (IQR = &#x02212;1.8 to &#x02212;.30). With respect to change in percentage weight loss (included in 46 interventions), the median was &#x02212;4.7% (IQR = &#x02212;1.68% to &#x02212;6.4%). Among studies that included proportion achieving a specified weight loss threshold, a median of 36.0% of participants did so (IQR = 21% &#x02013; 49%). Cohen&#x02019;s D could be calculated for 76 of the 79 studies that assessed change in weight or BMI; these included 133 unique weight loss interventions. Mean Cohen&#x02019;s D statistic was &#x02212;.26 (sd = .24; median = &#x02212;.21). Almost half (47.4%) demonstrated no effect; 36.1% a small effect; 12.8% a medium effect and 3.8% a large effect.</p></sec><sec id="S16"><title>Methodological quality</title><p id="P29">A summary of the methodological quality indicators is presented in <xref ref-type="table" rid="T3">Table 3</xref>. Quality indicators that were most commonly utilized included a randomized controlled trial design (100% (individual level randomization 95.6%, site level randomization 4.4%), objectively measured weight/height (96.7%) and description of an ITT analytic approach (86.7%). Most studies did not describe fidelity assessments (65.6%), with 15.5% reporting following a standardized protocol and 18.9% reporting quality review by the study team. Sixty-two percent (62.2%) of studies had a follow-up retention rate of more than 25%. The majority of studies reported using a strategy to address loss to follow-up (either primary or sensitivity analysis) or having minimal missing follow-up data (69.9%). A variety of approaches to loss to follow-up were used; the most common were last/baseline value carried forward (35.6%) and multiple imputation (24.5%), either alone or as a sensitivity analysis to a primary analysis that used a complete case approach. Overall, the mean number of quality indicators achieved was 6.3, with a median of 6 (range 2&#x02013;9).</p><p id="P30">No differences in methodological quality were observed between studies that reported a statistically significant reduction in a primary outcome at final assessment point (mean number of quality indicators = 6.3 (SD = 1.5)) and those that did not (mean = 6.0 (SD = 1.4); <italic>p</italic> = .31). We also found no association between methodological quality and effect size (Pearson&#x02019;s correlation coefficient = 0.07) among those studies for which a Cohen&#x02019;s D statistic could be calculated. No differences in quality indicators were observed by study sample size (mean number of quality indicators for sample size 0&#x02013;199 = 6.1 (SD = 1.4); 200&#x02013;399 = 6.3 (SD = 1.5); 400 or greater = 6.2 (SD = 1.4); <italic>p</italic> = 0.88).</p></sec></sec><sec sec-type="discussion" id="S17"><title>Discussion</title><p id="P31">In this study, the methodological quality of contemporary trials testing the efficacy and effectiveness of behavioural weight loss interventions among adults was examined, with an emphasis on key indicators of internal and external validity. A total of 90 articles using a wide range of intervention modalities, comparison conditions and conducted in a variety of settings were included. Overall, the studies were of moderate to high quality (median of six out of nine quality indicators), with considerable variation across indicators.</p><p id="P32">Internal validity indicators that were assessed included key features that have potential to bias the results of weight loss trials. Most studies used a protocol that required weight and height to be objectively measured, a positive finding in light of the recognized bias in the use of self-report or other methods in attaining these measures (<xref rid="R27" ref-type="bibr">27</xref>,<xref rid="R28" ref-type="bibr">28</xref>). Most studies additionally clearly stated that ITT analysis was completed, that is they were included in the analysis in the intervention group to which they were assigned regardless of their participation in or adherence to the intervention. Studies that did not use an ITT approach typically excluded participants who did not meet a specified intervention participation benchmark, which has potential to overestimate intervention effects.</p><p id="P33">While the majority of studies used a strategy to address loss to follow-up, more than one-fifth did not, instead using a complete case analysis approach. This can lead to reduced power and selection bias, particularly if loss to follow-up is differential across conditions or related to weight loss. Traditional clinical trial methodology additionally call for the most conservative approach to addressing loss to follow-up be used in order to decrease the probability of committing a type 1 or 2 error. In this review, we found that single imputation methods, such as last observation carried forward and baseline observation carried forward were most commonly used, which has been true in other areas. Such approaches, however, are typically not recommended as a primary approach for handling missing data because they can introduce bias (<xref rid="R25" ref-type="bibr">25</xref>). This may be particularly true with weight-related outcomes. On average, American adults are estimated to gain 1 to 2 pounds per year (<xref rid="R29" ref-type="bibr">29</xref>). Thus, baseline and last value carried forward approaches, the mostly commonly used in the studies included in this review and the easiest to implement statistically, may not be the most conservative approach. Another potential loss to follow up strategy is to impute an average weight gain to coincide with estimated population weight gain. However, only one study included in this review used this approach. More advanced statistical approaches such as multiple imputation, model-based approaches, while more computationally intensive, are increasingly considered a gold standard. In a meta analysis of pharmaceutical randomized controlled trials published between 2000 and 2006 that assessed weight loss or weight gain, Elobeid and colleagues compared various loss to follow-up analysis approaches on trial results. The authors concluded that multiple imputation and model based approaches are the most appropriate for weight loss trials (<xref rid="R30" ref-type="bibr">30</xref>). Less than 30% of studies included in this review used such a strategy.</p><p id="P34">There have been calls within the field of behavioural medicine and related disciplines to increase reporting of indicators that relate to project implementation and external validity in an effort to better promote the translation of research into practice and enhance the public health impact of research-developed interventions (<xref rid="R31" ref-type="bibr">31</xref>&#x02013;<xref rid="R33" ref-type="bibr">33</xref>). In this review, only 32.7% of studies reported on intervention fidelity, despite these calls to action and the availability of frameworks and standards for fidelity reporting (<xref rid="R34" ref-type="bibr">34</xref>&#x02013;<xref rid="R36" ref-type="bibr">36</xref>). Only 17.3% including objective reviews such as reviewing sessions in person or via audiotape for protocol adherence; other methods reported included describing training of study staff, use of standardized protocols, selecting intervention staff with specialized degrees and supervision of interventionists. It is possible that quality control and fidelity checks were in place, but not reported in many studies, particular given journal restrictions. Intervention adherence rates were reported by 76.7% of studies. We found that only 41.1% of studies samples from a known population, with the majority enrolling convenience samples. Of these, only one study made comparisons between the known population and the included sample. Thus, most studies were not designed to assess reach, and among those that were, only one attempted to describe the success in achieving a representative population. The results of these three indictors (fidelity, participation and reach/representativeness) are similar to those reported in a systematic review of childhood obesity prevention interventions published from 1980 to 2008 (<xref rid="R37" ref-type="bibr">37</xref>).</p><p id="P35">We found that methodological quality was not associated with whether or not a statistically significant weight loss result was reported, effect size, or sample size. This indicates that differences in methodological quality are not driving inconsistencies in translation of efficacy findings to effectiveness studies. Instead, such differences are likely a function of intervention design, intensity and/or delivery and the associated challenges of achieving behaviour change across diverse populations and settings. Mean weight loss in the studies included was 6.79 pounds (sd = 6.70) relative to 14.33 (sd = 10.36) in the Diabetes Prevention Program and 9.23 (sd = 11.24) in the Finnish Diabetes Prevention Program, both of which found significant associated reductions in diabetes risk of 58% (<xref rid="R38" ref-type="bibr">38</xref>). While weight losses in effectiveness studies were 47&#x02013;74% of those of these larger efficacy trials and the degree to which this magnitude of weight loss affects diabetes risk is unknown, it would seem premature to dismiss these interventions as ineffective. Two main criticisms have been launched against behavioural weight loss interventions. The first is that weight regain is extremely common and is an indicator of treatment failure (<xref rid="R13" ref-type="bibr">13</xref>,<xref rid="R14" ref-type="bibr">14</xref>). Although weight regain is common over the long-term, the Diabetes Prevention Program found that 10 years after the lifestyle intervention, in spite of average weight regain of 71%, reduction in incidence of diabetes was 34% relative to control (<xref rid="R39" ref-type="bibr">39</xref>). The Da Qing Diabetes Study found a reduction in diabetes incidence of 43% relative to control at 20 years although did not report weight regain rates (<xref rid="R40" ref-type="bibr">40</xref>). Weight loss may not be the best measure of impact of lifestyle interventions, although it is the most feasible measure of impact in shorter term effectiveness studies, which poses a challenge to this field. It should also be noted that termination of treatment for many conditions results in a return of symptoms. Lifestyle interventions may not be a permanent cure for obesity, but this is an extraordinarily high bar for any preventive intervention (<xref rid="R41" ref-type="bibr">41</xref>). Chronic care models are certainly needed. The second criticism of weight loss interventions is that they lack methodological rigour (<xref rid="R12" ref-type="bibr">12</xref>). Our findings show that recent trials are of moderate to high quality.</p><p id="P36">This study must be viewed in the context of its limitations. Only studies published in English were included. Publication bias is a well-known limiting factor in synthesizing research evidence that could impact the conclusions of this study (<xref rid="R42" ref-type="bibr">42</xref>). Studies with positive or significant results are more likely to be published in the scientific literature, and selected outcomes with null findings are more likely to be intentionally excluded from published reports. We additionally restricted articles to those with at least 100 people and six months or more follow-up time. The former was done to eliminate pilot type studies which typically are not designed with maximal methodological rigour and are not powered to find statistically significant results. The later restriction was improved because we were interested in studies designed to assess weight loss beyond the short-term impact of the intervention. We examined whether studies were statistically significant, rather than using an indicator of whether or not clinically meaningful weight loss was achieved. This was done because vast heterogeneity in how results were reported made it impossible to operationalize clinically meaningful results. Only a select number of quality indicators were included, indicators were limited to those deemed by the study team to be most likely to impact study bias. Lastly, we were only able to assess what was reported in the included studies. It is possible that a given study could have included/addressed a given quality indicator and the investigators chose not to report it for reasons such as journal word count restrictions, response to journal editors and/or reviewers and the investigators&#x02019; simply not perceiving it to be important.</p><p id="P37">In summary, this systematic review of behavioural trials testing weight loss interventions found this body of literature to be of high quality for specific quality indicators, including study design and measurement. However, specific areas in need of improvement were identified. This review identified a need for utilization of more methodologically rigorous statistical approaches to address loss to follow-up in such trials, more representative sampling approaches and better reporting of fidelity, in particular. These are potential areas of improvement for this field of research that are relevant to investigators, journal editors and reviewers.</p></sec><sec sec-type="supplementary-material" id="SM"><title>Supplementary Material</title><supplementary-material content-type="local-data" id="SD1"><label>Appendix</label><media xlink:href="NIHMS969921-supplement-Appendix.docx" orientation="portrait" xlink:type="simple" id="d36e467" position="anchor"/></supplementary-material></sec></body><back><fn-group><fn fn-type="COI-statement" id="FN2"><p id="P38"><bold>Conflict of interest statement</bold></p><p id="P39">The authors have no conflicts of interest to declare</p></fn><fn id="FN3"><p id="P40"><xref ref-type="supplementary-material" rid="SD1">Supporting information</xref></p><p id="P41"><xref ref-type="supplementary-material" rid="SD1">Additional supporting information</xref> may be found in the online version of this article at the publisher&#x02019;s web site.</p></fn></fn-group><ack id="S18"><p id="P42">Drs. Lemon and Pagoto receive funding from Grant # 5 U48 DP005031; Dr. Pagoto receives funding from Grant # K24 HL124366; Dr. Wang receives funding from Grant # K01 DK102447; and Christina Haughton is supported by Grant # T32 HL120823.</p></ack><ref-list><ref id="R1"><label>1</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ioannidis</surname><given-names>JP</given-names></name></person-group><article-title>Why most published research findings are false</article-title><source>PLoS Med</source><year>2005</year><volume>2</volume><fpage>e124</fpage><pub-id pub-id-type="pmid">16060722</pub-id></element-citation></ref><ref id="R2"><label>2</label><element-citation publication-type="journal"><collab>PSYCHOLOGY</collab><article-title>Estimating the reproducibility of psychological science</article-title><source>Science</source><year>2015</year><volume>349</volume><fpage>aac4716</fpage><pub-id pub-id-type="pmid">26315443</pub-id></element-citation></ref><ref id="R3"><label>3</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Knowler</surname><given-names>WC</given-names></name><name><surname>Barrett-Connor</surname><given-names>E</given-names></name><name><surname>Fowler</surname><given-names>SE</given-names></name><etal/></person-group><article-title>Reduction in the incidence of type 2 diabetes with lifestyle intervention or metformin</article-title><source>N Engl J Med</source><year>2002</year><volume>346</volume><fpage>393</fpage><lpage>403</lpage><pub-id pub-id-type="pmid">11832527</pub-id></element-citation></ref><ref id="R4"><label>4</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lindstrom</surname><given-names>J</given-names></name><name><surname>Louheranta</surname><given-names>A</given-names></name><name><surname>Mannelin</surname><given-names>M</given-names></name><etal/></person-group><article-title>The Finnish Diabetes Prevention Study (DPS): lifestyle intervention and 3-year results on diet and physical activity</article-title><source>Diabetes Care</source><year>2003</year><volume>26</volume><fpage>3230</fpage><lpage>3236</lpage><pub-id pub-id-type="pmid">14633807</pub-id></element-citation></ref><ref id="R5"><label>5</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Pan</surname><given-names>XR</given-names></name><name><surname>Li</surname><given-names>GW</given-names></name><name><surname>Hu</surname><given-names>YH</given-names></name><etal/></person-group><article-title>Effects of diet and exercise in preventing NIDDM in people with impaired glucose tolerance. The Da Qing IGT and Diabetes Study</article-title><source>Diabetes Care</source><year>1997</year><volume>20</volume><fpage>537</fpage><lpage>544</lpage><pub-id pub-id-type="pmid">9096977</pub-id></element-citation></ref><ref id="R6"><label>6</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wing</surname><given-names>RR</given-names></name></person-group><article-title>Long-term effects of a lifestyle intervention on weight and cardiovascular risk factors in individuals with type 2 diabetes mellitus: four-year results of the Look AHEAD trial</article-title><source>Arch Intern Med</source><year>2010</year><volume>170</volume><fpage>1566</fpage><lpage>1575</lpage><pub-id pub-id-type="pmid">20876408</pub-id></element-citation></ref><ref id="R7"><label>7</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Franz</surname><given-names>MJ</given-names></name><name><surname>VanWormer</surname><given-names>JJ</given-names></name><name><surname>Crain</surname><given-names>AL</given-names></name><etal/></person-group><article-title>Weight-loss outcomes: a systematic review and meta-analysis of weight-loss clinical trials with a minimum 1-year follow-up</article-title><source>J Am Diet Assoc</source><year>2007</year><volume>107</volume><fpage>1755</fpage><lpage>1767</lpage><pub-id pub-id-type="pmid">17904936</pub-id></element-citation></ref><ref id="R8"><label>8</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Williams</surname><given-names>RL</given-names></name><name><surname>Wood</surname><given-names>LG</given-names></name><name><surname>Collins</surname><given-names>CE</given-names></name><name><surname>Callister</surname><given-names>R</given-names></name></person-group><article-title>Effectiveness of weight loss interventions&#x02014;is there a difference between men and women: a systematic review</article-title><source>Obes Rev</source><year>2015</year><volume>16</volume><fpage>171</fpage><lpage>186</lpage><pub-id pub-id-type="pmid">25494712</pub-id></element-citation></ref><ref id="R9"><label>9</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Franz</surname><given-names>MJ</given-names></name><name><surname>Boucher</surname><given-names>JL</given-names></name><name><surname>Rutten-Ramos</surname><given-names>S</given-names></name><name><surname>VanWormer</surname><given-names>JJ</given-names></name></person-group><article-title>Lifestyle weight-loss intervention outcomes in overweight and obese adults with type 2 diabetes: a systematic review and meta-analysis of randomized clinical trials</article-title><source>J Acad Nutr Diet</source><year>2015</year><volume>115</volume><fpage>1447</fpage><lpage>1463</lpage><pub-id pub-id-type="pmid">25935570</pub-id></element-citation></ref><ref id="R10"><label>10</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Raaijmakers</surname><given-names>LC</given-names></name><name><surname>Pouwels</surname><given-names>S</given-names></name><name><surname>Berghuis</surname><given-names>KA</given-names></name><name><surname>Nienhuijs</surname><given-names>SW</given-names></name></person-group><article-title>Technology-based interventions in the treatment of overweight and obesity: a systematic review</article-title><source>Appetite</source><year>2015</year><volume>95</volume><fpage>138</fpage><lpage>151</lpage><pub-id pub-id-type="pmid">26165415</pub-id></element-citation></ref><ref id="R11"><label>11</label><element-citation publication-type="web"><collab>American Academy of Family Physicians</collab><year>2015</year><source>Clinical Preventive Services Recommendation: Obesity</source><comment>[WWW document]. URL <ext-link ext-link-type="uri" xlink:href="http://aafp.org/patient-care/clinical-recommendations/all/obesity.html">http://aafp.org/patient-care/clinical-recommendations/all/obesity.html</ext-link></comment><date-in-citation>accessed October 27, 2015</date-in-citation></element-citation></ref><ref id="R12"><label>12</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Mann</surname><given-names>T</given-names></name><name><surname>Tomiyama</surname><given-names>AJ</given-names></name><name><surname>Westling</surname><given-names>E</given-names></name><name><surname>Lew</surname><given-names>AM</given-names></name><name><surname>Samuels</surname><given-names>B</given-names></name><name><surname>Chatman</surname><given-names>J</given-names></name></person-group><article-title>Medicare&#x02019;s search for effective obesity treatments: diets are not the answer</article-title><source>American Psychol</source><year>2007</year><volume>62</volume><fpage>220</fpage><lpage>233</lpage></element-citation></ref><ref id="R13"><label>13</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bacon</surname><given-names>L</given-names></name><name><surname>Aphramor</surname><given-names>L</given-names></name></person-group><article-title>Weight science: evaluating the evidence for a paradigm shift</article-title><source>Nutr J</source><year>2011</year><volume>10</volume><fpage>9</fpage><pub-id pub-id-type="pmid">21261939</pub-id></element-citation></ref><ref id="R14"><label>14</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Taubes</surname><given-names>G</given-names></name></person-group><article-title>The science of obesity: what do we really know about what makes us fat? An essay by Gary Taubes</article-title><source>BMJ</source><year>2013</year><volume>346</volume><fpage>f1050</fpage><pub-id pub-id-type="pmid">23596125</pub-id></element-citation></ref><ref id="R15"><label>15</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Aphramor</surname><given-names>L</given-names></name></person-group><article-title>Validity of claims made in weight management research: a narrative review of dietetic articles</article-title><source>Nutr J</source><year>2010</year><volume>9</volume><fpage>30</fpage><pub-id pub-id-type="pmid">20646282</pub-id></element-citation></ref><ref id="R16"><label>16</label><element-citation publication-type="web"><person-group person-group-type="author"><name><surname>Higgins</surname><given-names>JPT</given-names></name><name><surname>Green</surname><given-names>S</given-names></name></person-group><source>Cochrane Handbook for Systematic Reviews of Interventions Version 5.1.0. The Cochrane Collaboration</source><comment>(updated March 2011) [WWW document]. <ext-link ext-link-type="uri" xlink:href="http://www.cochrane.org/resources/handbook">www.cochrane.org/resources/handbook</ext-link></comment><date-in-citation>accessed October 5, 2015</date-in-citation></element-citation></ref><ref id="R17"><label>17</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Miller</surname><given-names>WR</given-names></name><name><surname>Wilbourne</surname><given-names>PL</given-names></name></person-group><article-title>Mesa Grande: a methodological analysis of clinical trials of treatments for alcohol use disorders</article-title><source>Addiction</source><year>2002</year><volume>97</volume><fpage>265</fpage><lpage>277</lpage><pub-id pub-id-type="pmid">11964100</pub-id></element-citation></ref><ref id="R18"><label>18</label><element-citation publication-type="web"><collab>Community Preventive Services Task Force</collab><source>Systematic review methods</source><comment>(last updated June 2014). [WWW document]. URL <ext-link ext-link-type="uri" xlink:href="http://www.thecommunityguide.org/about/methods.html">http://www.thecommunityguide.org/about/methods.html</ext-link></comment><date-in-citation>October 21, 2015</date-in-citation></element-citation></ref><ref id="R19"><label>19</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Moher</surname><given-names>D</given-names></name><name><surname>Liberati</surname><given-names>A</given-names></name><name><surname>Tetzlaff</surname><given-names>J</given-names></name><name><surname>Altman</surname><given-names>DG</given-names></name></person-group><article-title>Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement</article-title><source>Int J Surg</source><year>2010</year><volume>8</volume><fpage>336</fpage><lpage>341</lpage><pub-id pub-id-type="pmid">20171303</pub-id></element-citation></ref><ref id="R20"><label>20</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Abraha</surname><given-names>I</given-names></name><name><surname>Montedori</surname><given-names>A</given-names></name></person-group><article-title>Modified intention to treat reporting in randomised controlled trials: systematic review</article-title><source>BMJ</source><year>2010</year><volume>340</volume><fpage>c2697</fpage><pub-id pub-id-type="pmid">20547685</pub-id></element-citation></ref><ref id="R21"><label>21</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hollis</surname><given-names>S</given-names></name><name><surname>Campbell</surname><given-names>F</given-names></name></person-group><article-title>What is meant by intention to treat analysis? Survey of published randomised controlled trials</article-title><source>BMJ</source><year>1999</year><volume>319</volume><fpage>670</fpage><lpage>674</lpage><pub-id pub-id-type="pmid">10480822</pub-id></element-citation></ref><ref id="R22"><label>22</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Abraha</surname><given-names>I</given-names></name><name><surname>Cherubini</surname><given-names>A</given-names></name><name><surname>Cozzolino</surname><given-names>F</given-names></name><etal/></person-group><article-title>Deviation from intention to treat analysis in randomised trials and treatment effect estimates: meta-epidemiological study</article-title><source>BMJ</source><year>2015</year><volume>350</volume><fpage>h2445</fpage><pub-id pub-id-type="pmid">26016488</pub-id></element-citation></ref><ref id="R23"><label>23</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Alshurafa</surname><given-names>M</given-names></name><name><surname>Briel</surname><given-names>M</given-names></name><name><surname>Akl</surname><given-names>EA</given-names></name><etal/></person-group><article-title>Inconsistent definitions for intention-to-treat in relation to missing outcome data: systematic review of the methods literature</article-title><source>PLoS One</source><year>2012</year><volume>7</volume><fpage>e49163</fpage><pub-id pub-id-type="pmid">23166608</pub-id></element-citation></ref><ref id="R24"><label>24</label><element-citation publication-type="book"><person-group person-group-type="author"><name><surname>Miller</surname><given-names>WR</given-names></name><name><surname>Simpson</surname><given-names>TL</given-names></name></person-group><article-title>What works? A methodological analysis of the alcohol treatment outcome literature</article-title><person-group person-group-type="editor"><name><surname>Hester</surname><given-names>RK</given-names></name></person-group><source>Handbook of Alcoholism Treatment Approaches: Effective Alternatives</source><publisher-name>Allyn &#x00026; Bacon</publisher-name><publisher-loc>Needham Heights, MA</publisher-loc><year>1995</year><fpage>12</fpage><lpage>44</lpage></element-citation></ref><ref id="R25"><label>25</label><element-citation publication-type="book"><collab>National Research Council</collab><source>The Prevention and Treatment of Missing Data in Clinical Trials. Panel on Handling Missing Data in Clinical Trials. Committee on National Statistics. Division of Behavioral and Social Sciences and Education</source><publisher-name>The National Academic Press</publisher-name><publisher-loc>Washington, DC</publisher-loc><year>2010</year></element-citation></ref><ref id="R26"><label>26</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Fewtrell</surname><given-names>MS</given-names></name><name><surname>Kennedy</surname><given-names>K</given-names></name><name><surname>Singhal</surname><given-names>A</given-names></name><etal/></person-group><article-title>How much loss to follow-up is acceptable in long-term randomised trials and prospective studies?</article-title><source>Arch Dis Child</source><year>2008</year><volume>93</volume><fpage>458</fpage><lpage>461</lpage><pub-id pub-id-type="pmid">18495909</pub-id></element-citation></ref><ref id="R27"><label>27</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Merrill</surname><given-names>RM</given-names></name><name><surname>Richardson</surname><given-names>JS</given-names></name></person-group><article-title>Validity of self-reported height, weight, and body mass index: findings from the National Health and Nutrition Examination Survey, 2001&#x02013;2006</article-title><source>Prev Chronic Dis</source><year>2009</year><volume>6</volume><fpage>A121</fpage><pub-id pub-id-type="pmid">19754997</pub-id></element-citation></ref><ref id="R28"><label>28</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Preston</surname><given-names>SH</given-names></name><name><surname>Fishman</surname><given-names>E</given-names></name><name><surname>Stokes</surname><given-names>A</given-names></name></person-group><article-title>Effects of categorization and self-report bias on estimates of the association between obesity and mortality</article-title><source>Ann Epidemiol</source><year>2015</year><volume>25</volume><fpage>907</fpage><lpage>911</lpage><fpage>e2</fpage><pub-id pub-id-type="pmid">26385831</pub-id></element-citation></ref><ref id="R29"><label>29</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hill</surname><given-names>JO</given-names></name><name><surname>Wyatt</surname><given-names>HR</given-names></name><name><surname>Reed</surname><given-names>GW</given-names></name><name><surname>Peters</surname><given-names>JC</given-names></name></person-group><article-title>Obesity and the environment: where do we go from here?</article-title><source>Science</source><year>2003</year><volume>299</volume><fpage>853</fpage><lpage>855</lpage><pub-id pub-id-type="pmid">12574618</pub-id></element-citation></ref><ref id="R30"><label>30</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Elobeid</surname><given-names>MA</given-names></name><name><surname>Padilla</surname><given-names>MA</given-names></name><name><surname>McVie</surname><given-names>T</given-names></name><etal/></person-group><article-title>Missing data in randomized clinical trials for weight loss: scope of the problem, state of the field, and performance of statistical methods</article-title><source>PLoS One</source><year>2009</year><volume>4</volume><fpage>e6624</fpage><pub-id pub-id-type="pmid">19675667</pub-id></element-citation></ref><ref id="R31"><label>31</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Glasgow</surname><given-names>RE</given-names></name><name><surname>Green</surname><given-names>LW</given-names></name><name><surname>Klesges</surname><given-names>LM</given-names></name><etal/></person-group><article-title>External validity: we need to do more</article-title><source>Ann Behav Med</source><year>2006</year><volume>31</volume><fpage>105</fpage><lpage>108</lpage><pub-id pub-id-type="pmid">16542124</pub-id></element-citation></ref><ref id="R32"><label>32</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Green</surname><given-names>LW</given-names></name><name><surname>Glasgow</surname><given-names>RE</given-names></name></person-group><article-title>Evaluating the relevance, generalization, and applicability of research: issues in external validation and translation methodology</article-title><source>Eval Health Prof</source><year>2006</year><volume>29</volume><fpage>126</fpage><lpage>153</lpage><pub-id pub-id-type="pmid">16510882</pub-id></element-citation></ref><ref id="R33"><label>33</label><element-citation publication-type="book"><person-group person-group-type="author"><name><surname>Allen</surname><given-names>JD</given-names></name><name><surname>Linnan</surname><given-names>LA</given-names></name><name><surname>Emmons</surname><given-names>KM</given-names></name></person-group><article-title>Fidelity and its relationship to implementation effectiveness, adaptation and dissemination</article-title><person-group person-group-type="editor"><name><surname>Brownson</surname><given-names>RC</given-names></name><name><surname>Coldtz</surname><given-names>GA</given-names></name></person-group><source>Dissemination and Implementation Research in Health: Translating Science to Practice</source><publisher-name>Oxford University Press</publisher-name><publisher-loc>New York, NY</publisher-loc><year>2012</year></element-citation></ref><ref id="R34"><label>34</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Pinnock</surname><given-names>H</given-names></name><name><surname>Epiphaniou</surname><given-names>E</given-names></name><name><surname>Sheikh</surname><given-names>A</given-names></name><etal/></person-group><article-title>Developing standards for reporting implementation studies of complex interventions (StaRI): a systematic review and e-Delphi</article-title><source>Implement Sci</source><year>2015</year><volume>10</volume><fpage>42</fpage><pub-id pub-id-type="pmid">25888928</pub-id></element-citation></ref><ref id="R35"><label>35</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Glasgow</surname><given-names>RE</given-names></name><name><surname>Vogt</surname><given-names>TM</given-names></name><name><surname>Boles</surname><given-names>SM</given-names></name></person-group><article-title>Evaluating the public health impact of health promotion interventions: the RE-AIM framework</article-title><source>Am J Public Health</source><year>1999</year><volume>89</volume><fpage>1322</fpage><lpage>1327</lpage><pub-id pub-id-type="pmid">10474547</pub-id></element-citation></ref><ref id="R36"><label>36</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tabak</surname><given-names>RG</given-names></name><name><surname>Khoong</surname><given-names>EC</given-names></name><name><surname>Chambers</surname><given-names>DA</given-names></name><name><surname>Brownson</surname><given-names>RC</given-names></name></person-group><article-title>Bridging research and practice: models for dissemination and implementation research</article-title><source>Am J Prev Med</source><year>2012</year><volume>43</volume><fpage>337</fpage><lpage>350</lpage><pub-id pub-id-type="pmid">22898128</pub-id></element-citation></ref><ref id="R37"><label>37</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Klesges</surname><given-names>LM</given-names></name><name><surname>Williams</surname><given-names>NA</given-names></name><name><surname>Davis</surname><given-names>KS</given-names></name><name><surname>Buscemi</surname><given-names>J</given-names></name><name><surname>Kitzmann</surname><given-names>KM</given-names></name></person-group><article-title>External validity reporting in behavioral treatment of childhood obesity: a systematic review</article-title><source>Am J Prev Med</source><year>2012</year><volume>42</volume><fpage>185</fpage><lpage>192</lpage><pub-id pub-id-type="pmid">22261216</pub-id></element-citation></ref><ref id="R38"><label>38</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Lindstrom</surname><given-names>J</given-names></name><name><surname>Eriksson</surname><given-names>JG</given-names></name><name><surname>Valle</surname><given-names>TT</given-names></name><etal/></person-group><article-title>Prevention of diabetes mellitus in subjects with impaired glucose tolerance in the Finnish Diabetes Prevention Study: results from a randomized clinical trial</article-title><source>J Am Soc Nephrol</source><year>2003</year><volume>14</volume><fpage>S108</fpage><lpage>S113</lpage><pub-id pub-id-type="pmid">12819313</pub-id></element-citation></ref><ref id="R39"><label>39</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Knowler</surname><given-names>WC</given-names></name><name><surname>Fowler</surname><given-names>SE</given-names></name><name><surname>Hamman</surname><given-names>RF</given-names></name><etal/></person-group><article-title>10-year follow-up of diabetes incidence and weight loss in the Diabetes Prevention Program Outcomes Study</article-title><source>Lancet</source><year>2009</year><volume>374</volume><fpage>1677</fpage><lpage>1686</lpage><pub-id pub-id-type="pmid">19878986</pub-id></element-citation></ref><ref id="R40"><label>40</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Li</surname><given-names>G</given-names></name><name><surname>Zhang</surname><given-names>P</given-names></name><name><surname>Wang</surname><given-names>J</given-names></name><etal/></person-group><article-title>The long-term effect of lifestyle interventions to prevent diabetes in the China Da Qing Diabetes Prevention Study: a 20-year follow-up study</article-title><source>Lancet</source><year>2008</year><volume>371</volume><fpage>1783</fpage><lpage>1789</lpage><pub-id pub-id-type="pmid">18502303</pub-id></element-citation></ref><ref id="R41"><label>41</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Pagoto</surname><given-names>SL</given-names></name><name><surname>Appelhans</surname><given-names>BM</given-names></name></person-group><article-title>A call for an end to the diet debates</article-title><source>JAMA</source><year>2013</year><volume>310</volume><fpage>687</fpage><lpage>688</lpage><pub-id pub-id-type="pmid">23989081</pub-id></element-citation></ref><ref id="R42"><label>42</label><element-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Dwan</surname><given-names>K</given-names></name><name><surname>Gamble</surname><given-names>C</given-names></name><name><surname>Williamson</surname><given-names>PR</given-names></name><name><surname>Kirkham</surname><given-names>JJ</given-names></name></person-group><article-title>Systematic review of the empirical evidence of study publication bias and outcome reporting bias&#x02014;an updated review</article-title><source>PLoS One</source><year>2013</year><volume>8</volume><fpage>e66844</fpage><pub-id pub-id-type="pmid">23861749</pub-id></element-citation></ref></ref-list></back><floats-group><fig id="F1" orientation="portrait" position="float"><label>Figure 1</label><caption><p id="P43">Diagram of study inclusion.</p></caption><graphic xlink:href="nihms969921f1"/></fig><table-wrap id="T1" position="float" orientation="portrait"><label>Table 1</label><caption><p id="P44">Description of intervention characteristics of included studies (<italic>n</italic> = 90)</p></caption><table frame="hsides" rules="groups"><thead><tr><th align="left" rowspan="1" colspan="1"/><th align="center" rowspan="1" colspan="1"><italic>N</italic> (%)</th></tr></thead><tbody><tr><td align="left" rowspan="1" colspan="1">Behavioural targets</td><td align="center" rowspan="1" colspan="1"/></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Diet only</td><td align="center" rowspan="1" colspan="1">6 (6.7%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Diet and physical activity</td><td align="center" rowspan="1" colspan="1">84 (93.3%)</td></tr><tr><td align="left" rowspan="1" colspan="1">Delivery setting</td><td align="center" rowspan="1" colspan="1"/></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Health care</td><td align="center" rowspan="1" colspan="1">35 (38.9%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Academic/research lab</td><td align="center" rowspan="1" colspan="1">33 (36.7%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Community organization</td><td align="center" rowspan="1" colspan="1">8 (8.9%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Worksite</td><td align="center" rowspan="1" colspan="1">7 (7.8%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Commercial weight loss</td><td align="center" rowspan="1" colspan="1">3 (3.3%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Healthcare and community</td><td align="center" rowspan="1" colspan="1">3 (3.3%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Not described</td><td align="center" rowspan="1" colspan="1">1 (1.1%)</td></tr><tr><td align="left" rowspan="1" colspan="1">Intervention modality<xref ref-type="table-fn" rid="TFN1">*</xref></td><td align="center" rowspan="1" colspan="1"/></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;In-person group</td><td align="center" rowspan="1" colspan="1">47 (30.1%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;In-person individual counselling</td><td align="center" rowspan="1" colspan="1">54 (34.6%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;In-person groups and individual counselling</td><td align="center" rowspan="1" colspan="1">25 (16.0%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Individually-targeted web or other technology-based individual counselling</td><td align="center" rowspan="1" colspan="1">21 (13.5%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Group-targeted web or other technology-based</td><td align="center" rowspan="1" colspan="1">4 (2.6%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Web or other technology-based groups combined with individual counselling</td><td align="center" rowspan="1" colspan="1">5 (3.2%)</td></tr><tr><td align="left" rowspan="1" colspan="1">Maximum number of intervention sessions<xref ref-type="table-fn" rid="TFN1">*</xref></td><td align="center" rowspan="1" colspan="1"/></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;1&#x02013;10</td><td align="center" rowspan="1" colspan="1">28 (18.0%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;11&#x02013;20</td><td align="center" rowspan="1" colspan="1">39 (25.0%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;21&#x02013;30</td><td align="center" rowspan="1" colspan="1">37 (23.7%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;31&#x02013;40</td><td align="center" rowspan="1" colspan="1">11 (7.1%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;41+</td><td align="center" rowspan="1" colspan="1">32 (20.5%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Unknown/web-based continuous access</td><td align="center" rowspan="1" colspan="1">9 (5.8%)</td></tr></tbody></table><table-wrap-foot><fn id="TFN1"><label>*</label><p id="P45">Total equals 156 because of studies testing more than one intervention arm.</p></fn></table-wrap-foot></table-wrap><table-wrap id="T2" position="float" orientation="portrait"><label>Table 2</label><caption><p id="P46">Description of methodological characteristics and results of included studies (<italic>n</italic> = 90)</p></caption><table frame="hsides" rules="groups"><thead><tr><th align="left" rowspan="1" colspan="1"/><th align="center" rowspan="1" colspan="1">N (%)</th></tr></thead><tbody><tr><td align="left" rowspan="1" colspan="1">Last follow-up period</td><td align="center" rowspan="1" colspan="1"/></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;6 months</td><td align="center" rowspan="1" colspan="1">19 (21.1%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;7&#x02013;12 months</td><td align="center" rowspan="1" colspan="1">36 (40.0%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;13&#x02013;24 months</td><td align="center" rowspan="1" colspan="1">31 (34.4%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;&#x0003e;24 months</td><td align="center" rowspan="1" colspan="1">4 (4.4%)</td></tr><tr><td align="left" rowspan="1" colspan="1">Number of treatment arms</td><td align="center" rowspan="1" colspan="1"/></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;2</td><td align="center" rowspan="1" colspan="1">60 (66.7%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;3</td><td align="center" rowspan="1" colspan="1">20 (22.2%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;4+</td><td align="center" rowspan="1" colspan="1">10 (11.1%)</td></tr><tr><td align="left" rowspan="1" colspan="1">Sample size</td><td align="center" rowspan="1" colspan="1"/></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;100&#x02013;199</td><td align="center" rowspan="1" colspan="1">28 (31.1%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;200&#x02013;399</td><td align="center" rowspan="1" colspan="1">40 (44.4%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;400 or greater</td><td align="center" rowspan="1" colspan="1">22 (24.4%)</td></tr><tr><td align="left" rowspan="1" colspan="1">Comparison condition</td><td align="center" rowspan="1" colspan="1"/></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Nothing/usual care</td><td align="center" rowspan="1" colspan="1">20 (22.2%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Attention control</td><td align="center" rowspan="1" colspan="1">1 (1.1%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Materials only</td><td align="center" rowspan="1" colspan="1">18 (20.0%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Wait list</td><td align="center" rowspan="1" colspan="1">6 (6.7%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Fitness club membership</td><td align="center" rowspan="1" colspan="1">1 (1.1%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Other approach/comparative effectiveness/no control</td><td align="center" rowspan="1" colspan="1">27 (30.0%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Comparative effectiveness and other control group</td><td align="center" rowspan="1" colspan="1">17 (18.9%)</td></tr><tr><td align="left" rowspan="1" colspan="1">Outcome measures<xref ref-type="table-fn" rid="TFN2">*</xref></td><td align="center" rowspan="1" colspan="1"/></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Change in weight (lbs. or kg.) and/or BMI only</td><td align="center" rowspan="1" colspan="1">47 (52.2%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Change in weight and/or BMI and percent weight loss</td><td align="center" rowspan="1" colspan="1">10 (11.1%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Change in weight and/or BMI and proportion achieving a threshold</td><td align="center" rowspan="1" colspan="1">12 (13.3%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Change in weight and/or BMI, percent weight loss and proportion achieving a threshold</td><td align="center" rowspan="1" colspan="1">10 (11.1%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Percent weight loss only</td><td align="center" rowspan="1" colspan="1">8 (8.9%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Proportion achieving a threshold only</td><td align="center" rowspan="1" colspan="1">1 (1.1%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Percent weight loss and proportion achieving a threshold</td><td align="center" rowspan="1" colspan="1">2 (2.2%)</td></tr><tr><td align="left" rowspan="1" colspan="1">Intervention effect size<xref ref-type="table-fn" rid="TFN2">*</xref></td><td align="center" rowspan="1" colspan="1"/></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;No effect (Cohen&#x02019;s D &#x0003c; .2)</td><td align="center" rowspan="1" colspan="1">63 (47.4%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Small effect (Cohen&#x02019;s D .2&#x02013;.49)</td><td align="center" rowspan="1" colspan="1">48 (36.1%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Moderate effect (Cohen&#x02019;s D .5&#x02013;.79)</td><td align="center" rowspan="1" colspan="1">17 (12.8%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Large effect (Cohen&#x02019;s D &#x02265; .8)</td><td align="center" rowspan="1" colspan="1">5 (3.8%)</td></tr></tbody></table><table-wrap-foot><fn id="TFN2"><label>*</label><p id="P47">Includes 133 interventions conditions from 76 studies with means and standard deviations for weight loss and/or BMI at baseline and final follow-up assessment.</p></fn></table-wrap-foot></table-wrap><table-wrap id="T3" position="float" orientation="portrait"><label>Table 3</label><caption><p id="P48">Description of methodological quality on included studies (<italic>n</italic> = 90)</p></caption><table frame="hsides" rules="groups"><thead><tr><th align="left" rowspan="1" colspan="1"/><th align="center" rowspan="1" colspan="1"><italic>N</italic> (%)</th></tr></thead><tbody><tr><td align="left" rowspan="1" colspan="1">Study design</td><td align="center" rowspan="1" colspan="1"/></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Randomized controlled trial</td><td align="center" rowspan="1" colspan="1">90 (100%)</td></tr><tr><td align="left" rowspan="1" colspan="1">Objectively measured weight/height</td><td align="center" rowspan="1" colspan="1">8(96.7%)</td></tr><tr><td align="left" rowspan="1" colspan="1">Power/sample size calculations presented</td><td align="center" rowspan="1" colspan="1">59 (65.6%)</td></tr><tr><td align="left" rowspan="1" colspan="1">Intent-to-treat analysis</td><td align="center" rowspan="1" colspan="1">78 (86.7%)</td></tr><tr><td align="left" rowspan="1" colspan="1">Loss to follow-up rate</td><td align="center" rowspan="1" colspan="1"/></tr><tr><td align="left" rowspan="1" colspan="1">&#x0003c;25%</td><td align="center" rowspan="1" colspan="1">56 (62.2%)</td></tr><tr><td align="left" rowspan="1" colspan="1">25&#x02013;49%</td><td align="center" rowspan="1" colspan="1">31 (34.4%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x02265;50%</td><td align="center" rowspan="1" colspan="1">3 (3.3%)</td></tr><tr><td align="left" rowspan="1" colspan="1">Missing data/loss to follow-up strategy</td><td align="center" rowspan="1" colspan="1"/></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;&#x0003c;5% missing data</td><td align="center" rowspan="1" colspan="1">3 (3.3%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Multiple imputation</td><td align="center" rowspan="1" colspan="1">15 (16.7%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Generalized estimating equations or other non-parametric modelling approaches</td><td align="center" rowspan="1" colspan="1">1 (1.1%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Inverse probability weighting or maximum likelihood methods</td><td align="center" rowspan="1" colspan="1">0</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Maximum likelihood/parametric models</td><td align="center" rowspan="1" colspan="1">3 (3.3%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Imputed weight gain carry forward</td><td align="center" rowspan="1" colspan="1">1 (1.1%)</td></tr><tr><td align="left" rowspan="1" colspan="1">Baseline or last value carried forward only</td><td align="center" rowspan="1" colspan="1">18 (20.0%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Complete case analysis with sensitivity analysis-multiple Imputation</td><td align="center" rowspan="1" colspan="1">7 (7.8%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Complete case analysis with sensitivity analysis-last value or baseline carried forward</td><td align="center" rowspan="1" colspan="1">14 (15.6%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Complete case analysis only</td><td align="center" rowspan="1" colspan="1">22 (24.4%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Unknown/not described</td><td align="center" rowspan="1" colspan="1">6 (6.7%)</td></tr><tr><td align="left" rowspan="1" colspan="1">Sampled from known population</td><td align="center" rowspan="1" colspan="1">37 (41.1%)</td></tr><tr><td align="left" rowspan="1" colspan="1">&#x000a0;&#x000a0;Comparisons to target population</td><td align="center" rowspan="1" colspan="1">1 (1.1%)</td></tr><tr><td align="left" rowspan="1" colspan="1">Treatment receipt/adherence rate reported</td><td align="center" rowspan="1" colspan="1">69 (76.7%)</td></tr><tr><td align="left" rowspan="1" colspan="1">Intervention fidelity reported</td><td align="center" rowspan="1" colspan="1">31 (34.4%)</td></tr></tbody></table></table-wrap></floats-group></article>