Chronic fatigue syndrome (CFS) has no diagnostic clinical signs or diagnostic laboratory abnormalities and it is unclear if it represents a single illness. The CFS research case definition recommends stratifying subjects by co-morbid conditions, fatigue level and duration, or functional impairment. But to date, this analysis approach has not yielded any further insight into CFS pathogenesis. This study used the integration of peripheral blood gene expression results with epidemiologic and clinical data to determine whether CFS is a single or heterogeneous illness.
CFS subjects were grouped by several clinical and epidemiological variables thought to be important in defining the illness. Statistical tests and cluster analysis were used to distinguish CFS subjects and identify differentially expressed genes. These genes were identified only when CFS subjects were grouped according to illness onset and the majority of genes were involved in pathways of purine and pyrimidine metabolism, glycolysis, oxidative phosphorylation, and glucose metabolism.
These results provide a physiologic basis that suggests CFS is a heterogeneous illness. The differentially expressed genes imply fundamental metabolic perturbations that will be further investigated and illustrates the power of microarray technology for furthering our understanding CFS.
Chronic fatigue syndrome (CFS) is defined solely by self-reported symptoms and associated disability. There are no characteristic physical signs or diagnostic laboratory abnormalities. Diagnosis of CFS requires clinical evaluation to rule out other medical or psychiatric conditions that could cause or contribute to the patient's complaints [
CFS has been hypothesized to involve an abnormal response to infection, immunologic dysfunction, dysregulation of the hypothalamic-pituitary-adrenal axis, and dysautonomia, yet no biologic and physiologic perturbations have been reproducibly detected. This may reflect poor specificity of the case definition, patient selection bias, or other study design issues. Clearly, discovery of laboratory markers that improve the specificity of case ascertainment or differentiate groups within the CFS classification would increase the possibility of identifying pathogenic mechanisms.
The international CFS research guideline recommends that cases be stratified before analysis by several variables including co-morbid conditions, current level and total duration of fatigue, current level of functional impairment and type of fatigue onset [
Unique gene expression profiles have been found in cancer [
This study adhered to human experimentation guidelines of the U.S. Department of Health and Human Services. All participants were volunteers who gave informed consent. The Centres for Disease Control and Prevention Human Subjects Committee approved study protocols.
Forty-three CFS subjects were identified in a survey of the Wichita, Kansas's adult population [
Table
Clinical and epidemiological characteristics of 23 CFS women.
| Type of fatigue onset a | |
| Gradual | 12 |
| Sudden | 10 |
| Age, years b | |
| ≤50 | 12 |
| >50 | 11 |
| Duration of illness b | |
| ≤10 | 15 |
| >10 | 8 |
| No. of CFS symptoms b | |
| 4 | 8 |
| 5 | 7 |
| ≥6 | 8 |
| Body mass index c | |
| Normal | 6 |
| Overweight | 7 |
| Obese | 10 |
| Illness group d | |
| 1 | 4 |
| 2 | 19 |
a Onset type defined as sudden (self-reported as developing fatigue in = 1 week) or gradual (developing fatigue over a period = 1 month). One subject described onset as between 1 week and one month and was not classified for this stratification. b Further stratification and analysis using the Kruskal-Wallis nonparametric test did not show different results. c Analysis performed on BMI <25 (normal) compared with >30(obese). Subjects considered overweight were not included in this particular analysis. Further stratification and analysis using the Kruskal-Wallis test showed no significant differences between the groups (results not shown.). d Illness group defined by factor analysis of symptoms followed by cluster analysis [
During the clinical evaluation, a 10 ml blood sample was obtained and PBMC were isolated using LSM® Lymphocyte Separation Media (ICN Biomedicals, Costa Mesa, CA). Cells were washed, counted and stored for viability in liquid nitrogen as described [
Biotinylated cDNA synthesis from 1 μg of total RNA was performed as previously described [
The scanned TIFF images were processed using ArrayVision™ (Imaging Research Inc., Ontario, Canada). Features deemed unsuitable for accurate quantitation because of artefacts, poor morphology, or uneven hybridization were excluded from further analysis. A median background value was calculated around each feature and subtracted from the mean signal to give the net signal for the respective gene. Data was uploaded into the CDC MAdB web-based analysis package where background-adjusted intensity values were scaled and normalized to the 75th percentile. Values were log2 transformed and mean centered to fit the data to a Gaussian distribution.
We initially examined gene expression intensities for all 23 CFS subjects using the one-class analysis component of the Significance Analysis of Microarrays (SAM) program [
To identify distinct gene clusters we performed a two-way hierarchical cluster analysis as described by Eisen et al[
The standard statistical
Application of this method to the 23 CFS subjects identified no genes with expression variance statistically greater than the average that would provide evidence for heterogeneity of the CFS sample.
The 23 CFS subjects were grouped with respect to the variables listed in Table
Identification of differentially expressed genes in CFS subjects by clinical or epidemiologic variables.
| Clinical or epidemiologic variable | SAMa | Wilcoxonb | |
| Type of fatigue onset (gradual or sudden) | 117 | 199 | 159 |
| Illness group (1 or 2) | 0 | 15 | 1 |
| Age (≤50 or >50 years) | 4 | 29 | 22 |
| Body mass index (<25 or >30) | 0 | 4 | 5 |
| No. of symptoms (4 or ≥6) | 0 | 0 | 0 |
| Duration of illness (≤10 or >10) | 6 | 19 | 10 |
a Number of genes for which false discovery rate (FDR) = 5% b Number of genes for which comparison yielded p < 0.01 c The 7 subjects with 5 symptoms (Table
Figure
Figure
It is thought that CFS is a heterogeneous illness since a single cause of CFS has not been identified and it is thought that various kinds of physiologic stressors such as infection, trauma and toxins can trigger the development of CFS in susceptible individuals. A major difficulty in identifying etiologies for CFS is that the case definition requires a minimum duration of six months of illness. In most studies, subjects have been ill many years, making it difficult to detect initial disease triggers, as causal factors may be difficult to detect or are no longer present. In addition, in many diseases, factors associated with disability are distinct from causative factors. Biomarkers have the potential to give clues to disease etiology as well as mode of action.
In an attempt determine whether CFS was a single or heterogeneous illness, we used microarrays to profile the expression of 3,800 genes in 23 women with CFS. We analyzed the array data using three statistical tests: 1) a program specifically designed for the analysis of microarray data (SAM), 2) a parametric
Our findings of differentially expressed metabolic and RNA processing genes make both biologic and physiologic sense relative to CFS. We identified differences in purine and pyrimidine metabolism, glycolysis, oxidative phosphorylation, and glucose metabolism. Oxidative phosphorylation and the ATP generated by this process are the major source of energy for the normal function of most cells in the body. Metabolic changes are known to take place, and in some instances drive the pathophysiology of a number of chronic diseases. Subjects with sudden onset CFS often describe an infectious, viral-like illness as the initiating process. It is well-known that many RNA processing proteins are central to the effective action of the antiviral interferon [
The nature of the specimen determines the view of the disease reviewed by gene expression profiling. In CFS there are no anatomical lesions to sample. Peripheral blood is an accessible source of circulating cells that reflect systemic changes, so it is a good starting point to profile diseases that have no lesions, or lesions that are inaccessible. However, peripheral blood mononuclear cells are themselves very heterogeneous, including B and T lymphocytes, monocytes, and natural killer cells. Changes in gene expression could be due to changes in the cellular composition as well as to differences in cellular activities. However, several groups including our own, [
The study must be interpreted with caution, as the number of subjects is small and the gene profiled represent a fraction of those potentially of importance. However, these data do support the idea that CFS is a heterogeneous illness with a biochemical basis to explain the fatigue. Different gene expression profiles among those who describe a difference in illness onset imply distinct etiological or triggering events, and shows that these differences are maintained well into the disease process. The results in this study demonstrate the utility of gene expression profiling to characterize an illness at the biological and physiological level. This should advance the cause for defining CFS at a molecular resulting in diagnosis and possible identification of causative agents.
Although the full implication and biologic significance of the differentially expressed genes discussed above are not yet completely understood, the genes may serve as a platform to further explore relevant mechanisms of pathogenesis and improve the understanding of the molecular basis of CFS. It will be important to discover how these differential patterns relate to non-CFS subjects and to expand the number of genes examined. Our work shows that microarrays are an important tool in understanding the wide spectrum of genes likely involved in complex diseases such as CFS.
TW contributed to the design of the experimental approach, performed the experimental component and the analysis of the data and drafted the manuscript. RN gave statistical advice, did the non-parametric analysis and contributed to the manuscript preparation. ERU and SDV contributed to the conception and design of this study, assisted in the analysis and assisted in drafting the manuscript. All authors read and approved the final manuscript.
None declared.
The authors sincerely thank Dr. William C. Reeves for his guidance, expert advice and the many discussions had on interpreting the results in a public health context. His constructive input for drafting this manuscript and his skilled leadership are much appreciated.