The members of the Pneumonia Methods Working Group are listed in the Acknowledgments.
Methods for the identification and selection of patients (cases) with severe or very severe pneumonia and controls for the Pneumonia Etiology Research for Child Health (PERCH) project were needed. Issues considered include eligibility criteria and sampling strategies, whether to enroll hospital or community controls, whether to exclude controls with upper respiratory tract infection (URTI) or nonsevere pneumonia, and matching criteria, among others. PERCH ultimately decided to enroll community controls and an additional human immunodeficiency virus (HIV)–infected control group at high HIV-prevalence sites matched on age and enrollment date of cases; controls with symptoms of URTI or nonsevere pneumonia will not be excluded. Systematic sampling of cases (when necessary) and random sampling of controls will be implemented. For each issue, we present the options that were considered, the advantages and disadvantages of each, the rationale for the methods selected for PERCH, and remaining implications and limitations.
The Pneumonia Etiology Research for Child Health (PERCH) study aims to be the largest study of the etiology of severe pneumonia in children in >20 years [
An essential component of PERCH is the case and control selection process, which must be standardized across the study sites located in 7 countries: Dhaka and Matlab, Bangladesh; Basse, The Gambia; Kilifi, Kenya; Bamako, Mali; Soweto, South Africa; Nakhon Phanom and Sa Kaeo, Thailand; and Lusaka, Zambia. Epidemiologic challenges addressed in the design of PERCH included eligibility criteria for case and control enrollment, matching criteria, the sampling strategy for both cases and controls, where to identify controls, and whether to enroll controls with upper respiratory tract infection (URTI) or nonsevere pneumonia. The methods chosen needed to (1) ensure study objectives could be met, (2) ensure adequate sample size, (3) minimize selection bias, and (4) anticipate the need to control for confounding in analyses.
The PERCH Core Team presented options for each issue, usually identifying 1 preferred option, to the Pneumonia Methods Working Group (PMWG) [
This manuscript summarizes decisions made by the PMWG and PERCH Core Team about the principles of case and control selection. It describes the case and control identification and selection methods that were ultimately decided for PERCH, and the options considered along the decision pathway, the implications and tradeoffs for each, and rationale for the final decisions.
We will limit case enrollment to children hospitalized with World Health Organization–defined severe or very severe pneumonia [
An illustration of case and control sampling proportional to the case detection rates. A small time lag is anticipated for controls frequency-matched to the month of enrollment of cases, due to the expected interval from observed cases to the communication of this target number to the field workers who recruit controls.
Risk factors and circulating pathogens are expected to vary within the study populations. To avoid bias, cases and controls must be selected from the same reference population. We therefore need to define the reference population from which cases, detected at study hospitals, are drawn and select controls from the same geographic area. We considered several factors in defining the reference population. In settings with a single referral hospital, cases may originate from great distances because the hospital attracts severely ill children. Patients may come from varied settings, such as urban and rural areas. In large, densely populated cities with several hospitals and highly mobile populations, it may be difficult to define the areas from which cases come because hospital quality and cost may take precedence over distance in determining the choice of facility.
The catchment area is the study-defined geographic area where eligible study participants (both cases and controls; a subset of the reference population) must live. The catchment area will be defined using residence of cases obtained from hospital logs the previous year. Sites will define the catchment area based on where most cases came from the previous year to avoid having to enroll controls over a too expansive an area. This also has the benefit of not overrepresenting controls from very distant areas that rarely use the study hospital, and of not overrepresenting cases that are not representative of the study site population (ie, from great distances or only visiting the study area). However, excluding the farthest cases may risk excluding the most severe cases resulting from delay in presenting for medical care due to travelling farther distances when seeking hospital care.
Power calculations suggested that approximately 6300 cases (and 7000 controls) would be sufficient to evaluate all primary and secondary objectives of PERCH. However, projections from the 7 PERCH sites indicate that nearly twice as many eligible cases (approximately 12 500) might be expected to present at the enrollment hospitals over the course of 2 years. Therefore, several options were considered to limit the number of cases enrolled (see
Any form of sampling increases study complexity and potentially introduces bias, but some forms more so than others. We considered random, systematic (eg, enroll every other case or every other day), convenience (eg, enrollment only between 9
A benefit of sampling is that it affords an opportunity to control the enrollment ratio of severe to very severe pneumonia cases, which are expected to be fewer. Because PERCH’s focus is on identifying pathogens responsible for fatal pneumonia, we sought to increase enrollment of very severe cases. As a result, systematic sampling may be applied only to (or more frequently to) the severe cases; at many sites, all very severe cases will be invited to participate. This is a form of selection bias, albeit one we chose and can control for in the analysis. To optimize analyses stratified by severity, we will aim to balance the ratio. Some sites (Kenya and The Gambia) with projected excess cases will enroll all cases during the first year and sample only during the second year, if necessary. At least 1 site (Thailand) anticipates enrolling all eligible cases and will not apply any sampling criteria.
To minimize selection bias, a dedicated study team (rather than regular hospital staff) will manage the detection, selection, and enrollment of cases at most sites. Key data collected on all admissions, such as age, residence in catchment area, and admission diagnosis from existing hospital sources (eg, admission registers), will help assess the representativeness of the enrolled cases (
An illustration of the patient flowchart. Abbreviations: OPD, outpatient department; ER, emergency room; ARI, acute respiratory infection; PERCH, Pneumonia Etiology Research for Child Health Study.
The previously noted methodological decisions will result in many cases of severe pneumonia that will not be captured by PERCH. An inherent limitation of the focus on hospitalized cases is that we will miss cases who never come to hospital or who die at presentation before they can be enrolled. At the Kenya site, approximately two-thirds of children who die do so in the community [
We aim to enroll controls that are representative of the population from which cases are drawn. Limited funds restricted us to only 1 control group, either from the community or from the study hospital. These 2 alternative groups have advantages and disadvantages in enabling us to understand the roles of risk factors and detection biases (
Therefore, we decided that a community control group would best meet the analytic objectives of the study. They may be more representative of the catchment population than hospitalized children who might have underlying conditions (eg, malnutrition, human immunodeficiency virus [HIV] infection) that are not representative of the general population and that we would want to evaluate as risk factors for pneumonia. They would also not be skewed with respect to circulating pneumonia-causing pathogens, which hospital controls may have been.
Despite their advantages, using community controls does have some disadvantages. There are greater logistical challenges, such as identifying and locating eligible children who are scattered throughout a relatively large geographic area, selecting among them in an unbiased way, and the need for mobile, trained teams to collect their data. However, 3 sites already have extensive, successful experience enrolling controls from the community. In The Gambia, community awareness raised through radio programs helped successfully enroll controls in similar proportions to cases (S. Howie, written personal communication, 2009). In Dhaka, Bangladesh, community control enrollment was >95% in previous studies [
There may also be residual uncertainty about differences in healthcare-seeking behavior and other health-related behaviors between community controls and cases. Healthcare-seeking tendencies may confound both etiology and risk-factor analyses, because cases who seek care may differ from cases who do not seek care regarding characteristics associated with etiology. Such characteristics include vaccination against pneumonia-causing pathogens, residence (because circulating pathogens differ by community), socioeconomic status and ability to afford care, access to early effective treatment that will shift etiology to antibiotic-resistant organisms and viruses, nutrition, crowding, and HIV status, among other risk factors. We are collecting information on all these factors in cases and controls, to control for them in the analysis.
A greater challenge with community controls might be the collection of specimens, particularly blood samples. Collecting blood is often a sensitive issue, the more so among well children. Taking the time to inform the community and each control child’s parents and explaining carefully what we are doing and why is likely to overcome most of the resistance to blood sampling. As a potential benefit for participation, many sites will provide blood test results (HIV, thalassemia, sickle cell disease, or anemia) to the parents, and refer them for care if results are positive.
Community controls will be randomly selected from lists of previously enumerated children residing in Demographic Surveillance System areas (Kenya, The Gambia, Bangladesh), where birth registries exist (South Africa), or from lists of households in areas with existing registries of households (Thailand). In the remaining 2 sites, they will be selected using the Expanded Program on Immunization cluster-sampling method [
To minimize bias in the selection of controls, field workers will revisit selected households ≤3 times if eligible children are not at home on the first visit. When possible, to optimize recruitment of the random sample, field workers will visit the household in the early morning or evening hours or make appointments by partnering with village health volunteers. Widespread community dissemination of information about the study may also help reduce refusals.
Because in most sites the incidence of pneumonia is highly seasonal and the pathogens that cause pneumonia are also seasonal, we debated whether to match controls to the dates of enrollment of cases or to recruit controls at a constant rate throughout the year. The advantage of matching would be to increase the power of season-stratified analyses, whereas constant recruitment would assure adequate measurement of background prevalence of risk factors and etiology in all seasons. To achieve both benefits, we decided on a combination of the 2 approaches, resulting in slightly >1 control enrolled per case. PERCH sites will recruit a minimum of 25 controls per month, and in months with >25 cases enrolled, sites will enroll additional controls for a 1:1 ratio for that month (
We considered whether children with nonsevere pneumonia or URTI characterized by coryza, cough, sneezing, and sore throat should be excluded or enrolled and analyzed as separate control groups. Our understanding of the pathogenesis of pneumonia suggests that most pathogens that infect the lung begin by first infecting the upper respiratory tract and that many of these infections cause URTI symptoms. The concern, therefore, is that URTIs and nonsevere pneumonia could be the early stages of severe pneumonia. However, because the control group should represent the population from which the cases are drawn, because not all URTIs are on the causal pathway to pneumonia, and because some pathogens cause both URTIs and severe pneumonia, an unbiased control population should include children in any state of health, including those with URTI or nonsevere pneumonia, provided that they do not have case-defining severe or very severe pneumonia at the time of enrollment. This decision has 3 significant advantages: (1) It permits an unbiased estimate of the prevalence of exposure variables in the community; (2) it prevents a form of selection bias that could overestimate the role of viral pathogens as the cause of severe pneumonia (otherwise we would have to apply the same rule of excluding URTI patients to cases); (3) it broadens the scope for exploratory analyses regarding the spectrum of illness; and (4) it estimates the prevalence of URTIs and nonsevere pneumonia in the community.
For PERCH, we will exclude as controls children with case-defining severe or very severe pneumonia [
Because we expect the etiologic spectrum of pneumonia to differ markedly by HIV status [
We considered selecting HIV controls from admitted patients at the hospital (see
HIV-infected controls will be enrolled in a 1:1 ratio to HIV-infected cases, frequency-matched by age and month of enrollment; there will be no monthly minimum. Enrollment will be stratified on the duration of antiretroviral treatment (<3 months vs ≥3 months) to adjust for the level of immunosuppression that can influence the presence of pathogens in the naso-oropharynx. Eligibility criteria are the same as for community controls, plus the HIV-infected controls must be confirmed HIV infected and should not have been admitted to a hospital with an acute illness within the preceding 30 days. The latter criterion was added because acute illness can alter the CD4 count, which we will adjust for as an indicator of stage of HIV disease severity.
We decided that a case may be reenrolled as a case if the patient is admitted >30 days after the date of hospital discharge from the previous case episode. This is to ensure that the same pneumonia episode is not enrolled twice.
If a control develops severe or very severe pneumonia within 48 hours of their control enrollment date, this control enrollment will be excluded, but the child would be eligible to be a case. This is because any URTI that the child had at the time of control enrollment might have been the early stage of the pneumonia and is considered the same episode as case-defining pneumonia.
To be consistent in the treatment of cases and controls, a child who was previously a case may be enrolled as a control if the control enrollment date is >30 days after the date of hospital discharge from the previous case episode. There is no time period of exclusion between control enrollments, and a community control may be enrolled again as a control if reselected through the random selection process. However, HIV-infected controls may be selected only once because the smaller pool of eligible children is likely to result in many repeat invitations to participate in some age groups.
Despite their limitations, case-control studies of pneumonia etiology can provide valuable information on the likely causes of severe and fatal pneumonia in children. Although more expensive, complicated, and time-consuming than a study of hospitalized patients only, the case-control study provides valuable information that is often missing from more-limited studies. For example, a control group clarifies the usefulness and interpretation of some results from upper respiratory specimens, which are otherwise complicated and potentially misleading [
Pneumonia etiology information is most helpful when it is collected from representative cases and controls. However, hospital-based studies of cases and selection of representative controls from the population are complicated and potentially open to biases that can alter the conclusions. The PERCH project has gone to great efforts to attenuate these limitations. By employing sampling strategies to reduce bias among cases and controls, taking steps to minimize participant refusal, gathering data on potential confounders to use in the analysis, and enrolling controls along the continuum of respiratory illness, PERCH aims to address concerns about bias by minimizing it, assessing where it may occur, and mitigating it by collection of overlapping information and specimens. The description of these methods and their rationale in this manuscript is designed to help with the interpretation of PERCH results when they are available and with the design of other pneumonia etiology research studies.
We gratefully acknowledge the following individuals associated with the PERCH sites for their evaluation of the proposed options under local conditions: Dr Doli Goswami, MBBS, MPH, Dhaka, Bangladesh; Drs Mamadou Sylla and Boubou Tamboura, Bamako, Mali; Dr Sandra Panchalingam, University of Maryland, Baltimore; Dr Tussanee Amornintapichet, Sa Kaeo Crown Prince Hospital, Thailand; Dr Somchai Chuananont, Nakhon Phanom Provincial Hospital, Thailand; Dr Pasakorn Akarasewi, Bureau of Epidemiology, Ministry of Public Health (MoPH), Thailand; and Dr Julia Rhodes and Melissa Higdon, International Emerging Infections Program, Thailand MoPH–US CDC Collaboration.
Robert E. Black, Zulfiqar A. Bhutta, Harry Campbell, Thomas Cherian, Derrick W. Crook, Menno D. de Jong, Scott F. Dowell, Stephen M. Graham, Keith P. Klugman, Claudio F. Lanata, Shabir A. Madhi, Paul Martin, James P. Nataro, Franco M. Piazza, Shamim A. Qazi, Heather J. Zar, Site Investigators: Henry C. Baggett, W. Abdullah Brooks, James Chipeta, Bernard Ebruke, Hubert P. Endtz, Michelle Groome, Laura L. Hammitt, Stephen R. C. Howie, Karen Kotloff, Shabir A. Madhi, Susan A. Maloney, David Moore, Juliet Otieno, Phil Seidenberg, Samba O. Sow, Milagritos Tapia, Somsak Thamthitiwat, Donald M. Thea, and Khaleque Zaman.
This work was supported by grant 48968 from The Bill & Melinda Gates Foundation to the International Vaccine Access Center, Department of International Health, Johns Hopkins Bloomberg School of Public Health. J. A. G. S. is supported by a clinical fellowship from The Wellcome Trust of Great Britain (081835).
This article was published as part of a supplement entitled “Pneumonia Etiology Research for Child Health,” sponsored by a grant from The Bill & Melinda Gates Foundation to the PERCH Project of Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland.
All authors: No reported conflicts.
All authors have submitted the ICMJE Form for Disclosure of Potential Conflicts of Interest. Conflicts that the editors consider relevant to the content of the manuscript have been disclosed.