Movatterモバイル変換


[0]ホーム

URL:


Skip to main content
                                  NCBI home page
Search in PMCSearch
As a library, NLM provides access to scientific literature. Inclusion in an NLM database does not imply endorsement of, or agreement with, the contents by NLM or the National Institutes of Health.
Learn more:PMC Disclaimer | PMC Copyright Notice
The Cochrane Database of Systematic Reviews logo

Exercise therapy for chronic fatigue syndrome

Lillebeth Larun1,,Kjetil G Brurberg1,Jan Odgaard‐Jensen2,Jonathan R Price3
1Norwegian Institute of Public Health, Division for Health Services, Postboks 4404 Nydalen, Oslo, Norway, N‐0403
2Medicinrådet, Dampfærgevej 27‐29, København Ø, Denmark, DK‐2100
3University of Oxford, Department of Psychiatry, The Warneford HospitalHeadington, Oxford, UK, OX3 7JX

Editorial Group: Cochrane Common Mental Disorders Group.

Corresponding author.

Collection date 2017 Apr.

Copyright © 2019 The Cochrane Collaboration. Published by John Wiley& Sons, Ltd.
PMCID: PMC6419524  PMID:28444695
This article has been updated. See "Exercise therapy for chronic fatigue syndrome" in volume 2019, CD003200.
This article is an update of "Exercise therapy for chronic fatigue syndrome" in volume 2016, CD003200.

Abstract

Background

Chronic fatigue syndrome (CFS) is characterised by persistent, medically unexplainedfatigue, as well as symptoms such as musculoskeletal pain, sleep disturbance, headachesand impaired concentration and short‐term memory. CFS presents as a common,debilitating and serious health problem. Treatment may include physical interventions,such as exercise therapy, which was last reviewed in 2004.

Objectives

The objective of this review was to determine the effects of exercise therapy (ET) forpatients with CFS as compared with any other intervention or control.

• Exercise therapy versus 'passive control' (e.g. treatment as usual,waiting‐list control, relaxation, flexibility).

• Exercise therapy versus other active treatment (e.g.cognitive‐behavioural therapy (CBT), cognitive treatment, supportive therapy,pacing, pharmacological therapy such as antidepressants).

• Exercise therapy in combination with other specified treatment strategies versusother specified treatment strategies (e.g. exercise combined with pharmacologicaltreatment vs pharmacological treatment alone).

Search methods

We searched The Cochrane Collaboration Depression, Anxiety and Neurosis Controlled TrialsRegister (CCDANCTR), the Cochrane Central Register of Controlled Trials (CENTRAL) andSPORTDiscus up to May 2014 using a comprehensive list of free‐text terms for CFSand exercise. We located unpublished or ongoing trials through the World HealthOrganization (WHO) International Clinical Trials Registry Platform (to May 2014). Wescreened reference lists of retrieved articles and contacted experts in the field foradditional studies

Selection criteria

Randomised controlled trials involving adults with a primary diagnosis of CFS who wereable to participate in exercise therapy. Studies had to compare exercise therapy withpassive control, psychological therapies, adaptive pacing therapy or pharmacologicaltherapy.

Data collection and analysis

Two review authors independently performed study selection, risk of bias assessments anddata extraction. We combined continuous measures of outcomes using mean differences (MDs)and standardised mean differences (SMDs). We combined serious adverse reactions anddrop‐outs using risk ratios (RRs). We calculated an overall effect size with 95%confidence intervals (CIs) for each outcome.

Main results

We have included eight randomised controlled studies and have reported data from 1518participants in this review. Three studies diagnosed individuals with CFS using the 1994criteria of the Centers for Disease Control and Prevention (CDC); five used the Oxfordcriteria. Exercise therapy lasted from 12 to 26 weeks. Seven studies used variations ofaerobic exercise therapy such as walking, swimming, cycling or dancing provided at mixedlevels in terms of intensity of the aerobic exercise from very low to quite rigorous,whilst one study used anaerobic exercise. Control groups consisted of passive control(eight studies; e.g. treatment as usual, relaxation, flexibility) or CBT (two studies),cognitive therapy (one study), supportive listening (one study), pacing (one study),pharmacological treatment (one study) and combination treatment (one study). Risk of biasvaried across studies, but within each study, little variation was found in the risk ofbias across our primary and secondary outcome measures.

Investigators compared exercise therapy with 'passive' control in eight trials,which enrolled 971 participants. Seven studies consistently showed a reduction in fatiguefollowing exercise therapy at end of treatment, even though the fatigue scales useddifferent scoring systems: an 11‐item scale with a scoring system of 0 to 11 points(MD ‐6.06, 95% CI ‐6.95 to ‐5.17; one study, 148 participants;low‐quality evidence); the same 11‐item scale with a scoring system of 0 to33 points (MD ‐2.82, 95% CI ‐4.07 to ‐1.57; three studies, 540participants; moderate‐quality evidence); and a 14‐item scale with a scoringsystem of 0 to 42 points (MD ‐6.80, 95% CI ‐10.31 to ‐3.28; threestudies, 152 participants; moderate‐quality evidence). Serious adverse reactionswere rare in both groups (RR 0.99, 95% CI 0.14 to 6.97; one study, 319 participants;moderate‐quality evidence), but sparse data made it impossible for review authorsto draw conclusions. Study authors reported a positive effect of exercise therapy at endof treatment with respect to sleep (MD ‐1.49, 95% CI ‐2.95 to ‐0.02;two studies, 323 participants), physical functioning (MD 13.10, 95% CI 1.98 to 24.22; fivestudies, 725 participants) and self‐perceived changes in overall health (RR 1.83,95% CI 1.39 to 2.40; four studies, 489 participants). It was not possible for reviewauthors to draw conclusions regarding the remaining outcomes.

Investigators compared exercise therapy with CBT in two trials (351 participants). Onetrial (298 participants) reported little or no difference in fatigue at end of treatmentbetween the two groups using an 11‐item scale with a scoring system of 0 to 33points (MD 0.20, 95% CI ‐1.49 to 1.89). Both studies measured differences infatigue at follow‐up, but neither found differences between the two groups using an11‐item fatigue scale with a scoring system of 0 to 33 points (MD 0.30, 95% CI‐1.45 to 2.05) and a nine‐item Fatigue Severity Scale with a scoring systemof 1 to 7 points (MD 0.40, 95% CI ‐0.34 to 1.14). Serious adverse reactions wererare in both groups (RR 0.67, 95% CI 0.11 to 3.96). We observed little or no difference inphysical functioning, depression, anxiety and sleep, and we were not able to draw anyconclusions with regard to pain, self‐perceived changes in overall health, use ofhealth service resources and drop‐out rate.

With regard to other comparisons, one study (320 participants) suggested a generalbenefit of exercise over adaptive pacing, and another study (183 participants) a benefitof exercise over supportive listening. The available evidence was too sparse to drawconclusions about the effect of pharmaceutical interventions.

Authors' conclusions

Patients with CFS may generally benefit and feel less fatigued following exercisetherapy, and no evidence suggests that exercise therapy may worsen outcomes. A positiveeffect with respect to sleep, physical function and self‐perceived general healthhas been observed, but no conclusions for the outcomes of pain, quality of life, anxiety,depression, drop‐out rate and health service resources were possible. Theeffectiveness of exercise therapy seems greater than that of pacing but similar to that ofCBT. Randomised trials with low risk of bias are needed to investigate the type, durationand intensity of the most beneficial exercise intervention.

Keywords: Adult; Humans; Cognitive Behavioral Therapy; Depression; Depression/therapy; Exercise; Exercise Therapy; Exercise Therapy/adverse effects; Exercise Therapy/methods; Fatigue Syndrome, Chronic; Fatigue Syndrome, Chronic/psychology; Fatigue Syndrome, Chronic/therapy; Health Status; Patient Dropouts; Patient Dropouts/statistics & numerical data; Quality of Life; Randomized Controlled Trials as Topic; Sleep Wake Disorders; Sleep Wake Disorders/therapy

Exercise as treatment for patients with chronic fatigue syndrome

Who may be interested in this review?

• People with chronic fatigue syndrome and their family and friends.

• Professionals working in specialist chronic fatigue services.

• Professionals working in therapeutic exercise.

• General practitioners.

Why is this review important?

Chronic fatigue syndrome (CFS) is sometimes called myalgic encephalomyelitis (ME).Research estimates that between 2 in 1000 and 2 in 100 adults in the USA are affected byCFS. People with CFS often have long‐lasting fatigue, joint pain, headaches, sleepproblems, and poor concentration and short‐term memory. These symptoms causesignificant disability and distress for people affected by CFS. There is no clear medicalcause for CFS, so people who are affected often deal with misunderstanding of theircondition from family, friends and healthcare professionals. National Institute for Healthand Care Excellence (NICE) guidelines recommend exercise therapy for individuals with CFS,and a previous review of the evidence suggested that exercise therapy was a promisingapproach to the treatment. It is thought that exercise therapy can help management of CFSsymptoms by helping people gradually reintroduce physical activity into their dailylives.

This review is an update of a previous Cochrane review from 2004, which showed thatexercise therapy was a promising treatment for adults with CFS. Since the review,additional studies investigating the effectiveness and safety of exercise therapy forpatients with CFS have been published.

What questions does this review aim to answer?

• Is exercise therapy more effective than ‘passive’treatments (e.g. waiting list, treatment as usual, relaxation, flexibility)?

• Is exercise therapy more effective than other‘active’ therapies (e.g. cognitive‐behaviouraltherapy (CBT), pacing, medication)?

• Is exercise therapy more effective when combined with another treatment thanwhen given alone?

• Is exercise therapy safer than other treatments?

Which studies were included in the review?

We searched databases to find all high‐quality studies of exercise therapy for CFSpublished up to May 2014. To be included in the review, studies had to be randomisedcontrolled trials and include adults over 18 years of age, more than 90% of whom had aclear diagnosis of CFS. We included eight studies with a total of 1518 participants in thereview. Seven studies used aerobic exercise therapy such as walking, swimming, cycling ordancing; the remaining study used non‐aerobic exercise. Most studies askedparticipants to exercise at home, between three and five times per week, with a targetduration of 5 to 15 minutes per session using different means of incrementation.

What does evidence from the review tell us?

Moderate‐quality evidence showed exercise therapy was more effective at reducingfatigue compared to ‘passive’ treatment or no treatment.Exercise therapy had a positive effect on people’s daily physicalfunctioning, sleep and self‐ratings of overall health.

One study suggests that exercise therapy was more effective than pacing strategies forreducing fatigue. However exercise therapy was no more effective than CBT.

Exercise therapy did not worsen symptoms for people with CFS. Serious side effects wererare in all groups, but limited information makes it difficult to draw firm conclusionsabout the safety of exercise therapy.

Evidence was not sufficient to show effects of exercise therapy on pain, use of otherhealthcare services, or to allow assessment of rates of drop‐out from exercisetherapy programmes.

What should happen next?

Researchers suggest that further studies should be carried out to discover what type ofexercise is most beneficial for people affected by CFS, which intensity is best, theoptimal length, as well as the most beneficial delivery method.

Summary of findings

Summary of findings for the main comparison.

Exercise therapy for chronic fatigue syndrome
Patient or population: males and females over 18 years of age withchronic fatigue syndrome
Intervention: exercise therapy
Comparison: standard care, waiting list or relaxation/flexibility
OutcomesIllustrative comparative risks* (95% CI)Relative effect (95% CI)Number of participants (studies)Quality of the evidence (GRADE)Comments
Assumed riskCorresponding risk
ControlExercise
Fatiguea: FS, Fatigue Scale (0 to 11 points)
(end of treatment)
Mean fatigue in the control groups was 10.4 pointsMean fatigue in the intervention groups was6.06 pointslower (6.95 to 5.17 lower)148 (1 study)⊕⊕⊝⊝Lowb,cLower score indicates less fatigue
Fatiguea: FS, Fatigue Scale (0 to 33 points)
(end of treatment)
Mean fatigue ranged across control groups from 15.3 to 26.3 pointsMean fatigue in the intervention groups was2.82 pointslower (4.07 to 1.57 lower)540 (3 studies)⊕⊕⊕⊝ModeratebLower score indicates less fatigue
Fatiguea: FS, Fatigue Scale (0 to 42 points)
(end of treatment)
Mean fatigue ranged across control groups from 24.4 to 31.6 pointsMean fatigue in the intervention groups was6.80 points lower (10.31 to3.28 lower)152 (3 studies)⊕⊕⊕⊝ModeratebLower score indicates less fatigue
Participants with serious adverse reactionsStudy populationRR 0.99 (0.14 to 6.97)319 (1 study)⊕⊕⊕⊝Moderated,e
13 per 100012 per 1000 (2 to 87)
Quality of Life (QOL) Scale (16 to 112 points)
(follow‐up)
Mean QOL score in the control group was 72 pointsMean QOL score in the intervention groups was9.00 points lower (19.00lower to 1.00 higher)44 (1 study)⊕⊝⊝⊝Verylowb,fHigher score indicates improved QOL
Physical functioning: SF‐36 subscale (0 to 100 points)
(end of treatment)
Mean physical functioning score ranged from 31.1 to 55.2 points across controlgroupsMean physical functioning score in the intervention groups was13.10 pointshigher (1.98 to 24.22 higher)725 (5 studies)⊕⊕⊝⊝
Lowb,g
Higher score indicates improved physical function
Depression: HADS depression score (0 to 21 points)
(end of treatment)
Mean depression score ranged across control groups from 5.2 to 11.2 pointsMean depression score in the intervention groups was1.63 points lower(3.50 lower to 0.23 higher)504 (5 studies)⊕⊝⊝⊝Verylowb,g,hLower score indicates fewer depressive symptoms
Sleep: Jenkins Sleep Scale (0 to 20 points)
(end of treatment)
Mean sleep score ranged across control groups from 11.7 to 12.2 pointsMean sleep score in the intervention groups was1.49 pointslower (2.95 to 0.02 lower)323 (2 studies)⊕⊕⊝⊝Lowb,hLower score indicates improved sleep quality
Self‐perceived changes in overall health
(end of treatment)
Study populationRR 1.83 (1.39 to 2.40)489 (4 studies)⊕⊕⊕⊝ModeratebRR higher than 1 means that more participants in exercise groups reportedimprovement
218 per 1000399 per 1000 (303 to 523)
Medium‐risk population
238 per 1000436 per 1000 (331 to 571)
Drop‐out
(end of treatment)
Study populationRR 1.63 (0.77 to 3.43)843
(6 studies)
⊕⊕⊝⊝Lowb,gRR higher than 1 means that more participants in exercise groups dropped outfrom treatment
70 per 1000114 per 1000
(54 to 241)
Medium‐risk population
89 per 1000145 per 1000
(69 to 305)
*The basis for theassumed risk (e.g. median control group riskacross studies) is provided in footnotes. Thecorresponding risk (and its95% confidence interval) is based on the assumed risk in the comparison groupand therelative effect of the intervention (and its 95%CI).CI: Confidence interval;RR: Risk ratio.
GRADE Working Group grades of evidence.High quality:Further research is very unlikely to change our confidence in the estimate ofeffect.Moderate quality: Further research is likely tohave an important impact on our confidence in the estimate of effect and maychange the estimate.Low quality: Further research is verylikely to have an important impact on our confidence in the estimate of effectand is likely to change the estimate.Very low quality: Weare very uncertain about the estimate.

aWe choose to present effect estimates as measured on the original scalesrather than to transform them to standardised units. As 3 different scoring systemsfor fatigue were used, the outcome is presented over 3 rows.

bRisk of bias (‐1): All studies were at risk of performance bias,as they were unblinded.cInconsistency (‐1): showsinconsistencies with other available trials when meta‐analysis based onstandardised mean differences is performed. Subgroup analyses could not explainvariation due to diagnostic criteria, treatment strategy or type ofcontrol.dRisk of bias (0): This outcome is unlikely to havebeen affected by detection or performance bias.eImprecision(‐1): low numbers of events and wide confidenceintervals.fImprecision (‐2): very low numbers ofparticipants and wide confidence intervals, which encompass benefit andharm.gInconsistency (‐1): variation in effect sizeand direction of effect across available studies.hImprecision(‐1): Confidence interval fails to exclude negligible differences in favour ofthe intervention.

Background

Description of the condition

Chronic fatigue syndrome (CFS) is an illness characterised by persistent, medicallyunexplained fatigue. Symptoms include severe, disabling fatigue, as well asmusculoskeletal pain, sleep disturbance, headaches, and impaired concentration andshort‐term memory (Prins 2006).Individuals experience significant disability and distress, which may be exacerbated bylack of understanding from others, including healthcare professionals. The term'myalgic encephalomyelitis (ME)' is often used, but 'CFS' is the termthat has been adopted and clearly defined for research purposes, and it will be used inthis review. The diagnosis can be made only after all alternative diagnoses have beenexcluded (Reeves 2003;Reeves 2007); several sets of criteria arecurrently used to diagnose CFS (Carruthers 2011;Fukuda 1994;NICE 2007;Reeves 2003;Sharpe 1991). The Centers for Disease Control andPrevention (CDC) 1994 diagnostic criteria for CFS (Fukuda 1994) are the most widely cited for research purposes (Fonhus 2011), resulting in prevalence of CFS ofbetween 0.24% (Reyes 2003) and 2.55% (Reeves 2007) among US adults. Practical applicationof diagnostic criteria may help to explain some of the observed variation in prevalenceestimates (Johnston 2013). In practice, mostpatients visit their local general practitioner (GP) for assessment. A minority ofpatients may be referred to specialist clinics (e.g. neurology, infectious diseases,psychiatry, endocrinology or general medicine) for exclusion of alternativeunderlying disorders.

Description of the intervention

Exercise therapy is often included as part of a treatment programme for individuals withCFS. 'Exercise' is defined as "planned structured and repetitive bodilymovement done to improve or maintain one or more components of physical fitness"(ACSM 2001); 'therapy' is defined as"treatment intended to relieve or heal a disorder" (Oxford English Dictionary). We define 'exercisetherapy' as a "regimen or plan of physical activity designed and prescribed[and] intended to relieve or heal a disorder," and 'therapeutic exercise'or 'exercise therapy' can be described as "planned exercise performed toattain a specific physical benefit, such as maintenance of the range of motion,strengthening of weakened muscles, increased joint flexibility, or improved cardiovascularand respiratory function" (Mosby 2009).Aerobic exercise such as walking, jogging, swimming or cycling is included, along withanaerobic exercise such as strength or stabilising exercises. Graded exercise therapy ischaracterised by establishment of a baseline of achievable exercise or physical activity,followed by a negotiated, incremental increase in the duration of time spent physicallyactive followed by an increase in intensity (White2011).

How the intervention might work

Physical activity can improve health and quality of life for patients with chronicdisease (Blair 2009). The causal pathway for CFSis unknown; however several hypotheses have been proposed as to why exercise therapy mightbe a viable treatment. The 'deconditioning model' assumes that the syndrome isperpetuated by reversible physiological changes of deconditioning and avoidance ofactivity; therefore exercise should improve deconditioning and thus the condition ofpatients with CFS (Clark 2005;White 2011). However, mediation studies suggest thatimproved conditioning is not associated with better outcomes (Fulcher 1997;Moss‐Morris 2005). Some graded exercisetherapy (GET) programmes are designed to gradually reintroduce the patient to the avoidedstimulus of physical activity or exercise, which may involve a conditioned responseleading to fatigue (Clark 2005;Fulcher 2000;White 2011). Mediation studies suggest thatreduced symptom focus may mediate outcomes with GET, consistent with this model (Clark 2005;Moss‐Morris 2005). Evidence has also been found for central sensitisationcontributing to hyperresponsiveness of the central nervous system to a variety of visceralinputs (Nijs 2011). The most replicated findingin patients with CFS is an increased sense of effort during exercise, which is consistentwith this model (Fulcher 2000;Paul 2001). Graded exercise therapy may reducethis extra sense of effort, perhaps by reducing central sensitisation (Fulcher 1997).

Further research is needed to verify these hypotheses, but effective treatments may bediscovered without knowledge of the effective pathway or underlying cause.

Why it is important to do this review

The previous Cochrane review (Edmonds 2004)suggested that exercise therapy was a promising treatment but that larger studies wereneeded to address the safety of this therapy (Edmonds2004). Such studies have been completed and their findings published, so that thepresent time is propitious for an updated review. Exercise therapy is often used astreatment for individuals with CFS and is recommended by treatment guidelines (NICE 2007). People with CFS should have theopportunity to make informed decisions about their care and treatment based on robustresearch evidence. This review will examine the effectiveness of exercise therapy,provided as a stand‐alone intervention or as part of a treatment plan. The CochraneCollaboration has reviewed multiple aspects of treatment for patients with CFS. A reviewon CBT was published in 2008 (Price 2008), andone on traditional Chinese herbal medicine in 2009 (Adams 2009); also, a protocol on pharmacological treatments was submitted (Hard 2009).

This review, which is an update of a Cochrane review first published in 2004, will updatethe evidence base that serves as a resource for informed decision making by healthcarepersonnel and patients. A protocol for an accompanying individual patient data review onchronic fatigue syndrome and exercise therapy has been published (Larun 2014).

Objectives

The objective of this review was to determine the effects of exercise therapy (ET) forpatients with chronic fatigue syndrome (CFS) as compared with any other intervention orcontrol.

  • Exercise therapy versus 'passive control' (e.g. treatment as usual,waiting‐list control, relaxation, flexibility).

  • Exercise therapy versus other active treatment (e.g. cognitive‐behaviouraltherapy (CBT), cognitive treatment, supportive therapy, pacing, pharmacologicaltherapy such as antidepressants).

  • Exercise therapy in combination with other specified treatment strategies versusother specified treatment strategies (e.g. exercise combined with pharmacologicaltreatment vs pharmacological treatment alone).

Methods

Criteria for considering studies for this review

Types of studies

We included randomised controlled trials, as well as cluster‐randomised trialsand cross‐over trials.

Types of participants

We included trials of male and female participants over the age of 18, irrespective ofcultures and settings. Investigators currrently have used several sets of criteria todiagnose CFS (Carruthers 2011;Fukuda 1994;NICE2007;Reeves 2003;Sharpe 1991); therefore we decided to includetrials in which participants fulfilled the following diagnostic criteria for CFS orME.

  • Fatigue, or a symptom synonymous with fatigue, was a prominent symptom.

  • Fatigue was medically unexplained (i.e. other diagnoses known to cause fatiguesuch as anorexia nervosa or sleep apnoea could be excluded).

  • Fatigue was sufficiently severe to significantly disable or distress theparticipant.

  • Fatigue persisted for at least six months.

We included trials that included participants with disorders other than CFS providedthat > 90% of participants had been given a primary diagnosis of CFS based on thecriteria discussed above. We included in the analysis of this review trials in whichless than 90% of participants had a primary diagnosis of CFS only if data on CFS werereported separately.

Co‐morbidity

Studies involving participants with co‐morbid physical or common mentaldisorders were eligible for inclusion only if the co‐morbidity did not providean alternative explanation for fatigue.

Types of interventions

Experimental intervention

Both aerobic and anaerobic interventions aimed at exercising big muscle groups, forexample, walking, swimming, jogging and strength or stabilising exercises, could beincluded. Both individual and group treatment modalities were eligible, butinterventions had to be clearly described and supported by appropriate references.

'Exercise therapy' is an umbrella term for the different types of exerciseprovided; it is based on the American College of Sports Medicine definition (ACSM 2001). We categorised exercise therapies inthis review in accordance with descriptions of the interventions provided byindividual studies. We prepared a table of Interventions with detailed information onexercise therapy reported by the included studies, as definitions vary across time andcontext. As a point of reference, we used the following empirical definitions, asderived from descriptions of the interventions.

  • Graded exercise therapy (GET): exercise in which the incremental increase inexercise was mutually set.

  • Exercise with pacing: exercise in which the incremental increase in exercisewas personally set.

  • Anaerobic exercise: exercise that requires a high level of exertion, in a briefspurt or short‐term in duration by the participant that can be graduallyincreased over time with practice

We did not impose restrictions with regard to the duration of each treatment session,the number of sessions or the time between sessions. Trials presenting data from oneof the following comparisons were eligible for inclusion.

Comparator interventions
  • ‘Passive control’: treatment asusual/waiting‐list control/relaxation/flexibility.

    • 'Treatment as usual' comprises medical assessments and advicegiven on a naturalistic basis. 'Relaxation' consists of techniquesthat aim to increase muscle relaxation (e.g. autogenic training, listeningto a relaxation tape). 'Flexibility' includes stretches performedaccording to selected exercises given.

  • Psychological therapies: cognitive‐behavioural therapy (CBT)/cognitivetreatment/supportive therapy/behavioural therapies/psychodynamic therapies.

  • Adaptive pacing therapy.

  • Pharmacological therapy (e.g. antidepressants).

Types of outcome measures

Primary outcomes

1. Fatigue: measured using any validated scale (e.g. Fatigue Scale (FS) (Chalder 1993), Fatigue Severity Scale (FSS)(Krupp 1989)).

2. Adverse outcomes: measured using any reporting system (e.g. serious adversereactions (SARs) (European Union Clinical TrialsDirective 2001)).

Secondary outcomes

3. Pain: measured using any validated scale (e.g. Brief Pain Inventory (Cleeland 1994)).

4. Physical functioning: measured using any validated scale (e.g. Short Form(SF)‐36, physical functioning subscale (Ware1992)).

5. Quality of life (QOL): measured using any validated scale (e.g. Quality of LifeScale (Burckhardt 2003)).

6. Mood disorders: measured using validated instruments (e.g. Hospital Anxiety andDepression Scale (Zigmond 1983)).

7. Sleep duration and quality: measured by self‐report on a validated scale,or objectively by polysomnography (e.g. Pittsburgh Sleep Quality Index (Buysse 1989)).

8. Self‐perceived changes in overall health: measured by self‐report ona validated scale (e.g. Global Impression Scale (Guy1976)).

9. Health service resource use (e.g. primary care consultation rate, secondary carereferral rate, use of alternative practitioners).

10. Drop‐outs (any reason).

Timing of outcome assessment

We extracted from all studies data on each outcome for end of treatment and end offollow‐up.

Search methods for identification of studies

Electronic searches

The Cochrane Collaboration's Depression, Anxiety and Neurosis (CCDAN) ReviewGroup's Trials Search Coordinator (TSC) searched their Group's SpecializedRegister (CCDANCTR‐Studies and CCDANCTR‐References) (all years to 9 May2014). This register is created from routine generic searches of MEDLINE (1950‐), EMBASE (1974‐ ) and PsycINFO (1967‐ ). Details of CCDAN's generic search strategies, used to inform he CCDANCTR can be foundon the Group‘s web site.

The CCDANCTR‐Studies Register was searched using the followingterms: Diagnosis = ("Chronic Fatigue Syndrome" or fatigue) andFree Text = (exercise or sport* or relaxation or "multi convergent" or"tai chi")

The CCDANCTR‐References Register was searched using a more sensitive list offree‐text search terms to identify additional untagged/uncoded references, e.g.fatigue*, myalgic encephalomyelitis*, exercise, physical active* and taiji.Full search strategy listed in Appendix 1.

A complementary search of the following bibliographic databases and international trialregisters were also conducted to 9 May 2014 (see Appendix 2):

  • SPORTSDiscus (1985 ‐ );

  • The Cochrane Central Register of Controlled Trials (CENTRAL, all years ‐);and

  • WHO International Clinical Trials Portal.

Searching other resources

We contacted the authors of included studies and screened reference lists to identifyadditional published or unpublished data. We conducted citation searches using theInstitute for Scientific Information (ISI) Science Citation Index on the Web ofScience.

Data collection and analysis

Selection of studies

Two of three review authors (LL, JO‐J, KGB) inspected identified studies, usingeligibility criteria to select relevant studies. In cases of disagreement, theyconsulted a third review author (JRP).

Data extraction and management

Melissa Edmonds and Jonatahan R Price independently extracted data from includedstudies for the 2004 version of this review, and LL and JO‐J did so for thisreview update, using a standardised extraction sheet. They extracted mean scores atendpoint, the standard deviation (SD) or standard error (SE) of these values and thenumber of participants included in these analyses. When only the SE was reported, reviewauthors converted it to the SD. For dichotomous outcomes, such as drop‐outs, weextracted the number of events. We sought clarification from trial authors whennecessary from investigators involved in the following trials:Fulcher 1997,Moss‐Morris 2005,Wallman2004,Wearden 2009,Wearden 2010 andWhite 2011. We resolved disagreement between review authors by discussion.

Main comparisons
  • Exercise therapy versus 'passive control'.

  • Exercise therapy versus psychological treatment.

  • Exercise therapy versus adaptive pacing therapy.

  • Exercise therapy versus pharmacological therapy (e.g. antidepressants).

  • Exercise therapy as an adjunct to other treatment versus other treatmentalone.

Assessment of risk of bias in included studies

Working independently, LL and JO‐J, KGB or Jane Dennis (JD) assessed risk ofbias using The Cochrane Collaboration risk of bias tool which was published in the mostrecent version of theCochrane Handbook for Systematic Reviews of Interventions(Higgins 2011). This tool encouragesconsideration of how the sequence was generated, how allocation was concealed, theintegrity of blinding at outcome, the completeness of outcome data, selective reportingand other potential sources of bias. We classified all items in the risk of biasassessment as low risk, high risk or unclear risk by the extent to which bias wasprevented.

Measures of treatment effect

Continuous data

For continuous outcomes, we calculated the mean difference (MD) when the same scalewas used in a similar manner across studies. When results for continuous outcomes werepresented using different scales or different versions of the same scale, we used thestandardised mean difference (SMD).

Dichotomous data

For dichotomous outcomes, we expressed effect size in terms of risk ratio (RR).

Unit of analysis issues

Studies with multiple treatment groups

We extracted data from relevant arms of the included studies, and we compared theexperimental condition (exercise therapy) versus each individual comparatorintervention: ‘Passive control’ (treatment asusual/waiting‐list control/relaxation/flexibility); 'Psychologicaltreatment' (cognitive‐behavioural therapy (CBT)/cognitivetreatment/supportive therapy/behavioural therapies/psychodynamic therapies);'Adaptive pacing therapy; and Pharmacological therapy (e.g. antidepressants).This meant that data from the exercise arm could be included in a separate univariateanalysis for more than one comparison. We described underDifferences between protocol and review plannedmethods that were found redundant, as we did not include studies requiring theiruse.

Dealing with missing data

When possible, we calculated missing standard deviations from reported standard errors,P values or confidence limits using the methods described in Chapter 7 (Sections 7.7.3.2and 7.7.3.3) of theCochrane Handbook for Systematic Reviews of Interventions(Higgins 2011). We approached trialinvestigators to obtain other types of missing data.

Assessment of heterogeneity

For this update, we assessed heterogeneity in keeping with the recommendations of theCochrane Handbook for Systematic Reviews of Interventions (I2 valuesof 0 to 40%: might not be important; 30% to 60%: may represent moderate heterogeneity;50% to 90%: may represent substantial heterogeneity; 75% to 100%: show considerableheterogeneity;Higgins 2011). In addition tothe I2 value (Higgins 2003),we present the P value of the Chi2 test, and we considered the direction andmagnitude of treatment effects when making judgements about statistical heterogeneity.We deemed that no analyses were inappropriate as a result of the presence of statisticalheterogeneity, as the measures and statistics used have low power and are unstable whenbased on few and small studies. A P value < 0.1 from the Chi2 test wasused as an indicator of statistically significant heterogeneity because of the low powerof provided measures.

Assessment of reporting biases

We planned at the protocol stage to construct funnel plots when sufficient numbers oftrials allowed a meaningful presentation, to establish whether other potential biasescould be present. Asymmetry of these plots may indicate publication bias, although italso may represent a true relationship between trial size and effect size. We identifiedan insufficient number of studies to use this approach in the present version of thereview (Egger 1997). We considered clinicaldiversity of the studies as a possible explanation for some of the heterogeneityapparent between studies.

Data synthesis

As the result of expected clinical heterogeneity (slightly different interventions,populations and comparators) among studies, we chose the random‐effects model asthe default method of analysis because the alternative fixed‐effect model assumesthat the true treatment effect in each trial is the same, and that observed differencesare due to chance.

We performed analyses using Review Manager 5.0.

Subgroup analysis and investigation of heterogeneity

We planned no subgroup analyses a priori. To explore possible differences betweenstudies that used different strategies (e.g. exercise therapy), control conditions anddiagnostic criteria, we performed post hoc subgroup analyses. We describe results ofthese subgroup analyses in the text of the review.

Sensitivity analysis

We planned no sensitivity analyses a priori. To explore the possible impact of ourpooling strategy (e.g. the impact of using SMD vs MD), we performed post hoc sensitivityanalyses. In addition, we performed sensitivity analyses when studies with outlyingresults where excluded. We describe results of these sensitivity analyses in the text ofthe review.

Results

Description of studies

Results of the search

Our searches identified 908 unique records. Of these, we retrieved 50 records and readthe full text. Along with the five included studies from the 2004 version of this review(Fulcher 1997;Moss‐Morris 2005;Powell 2001;Wallman 2004;Wearden 1998), we haveincluded three additional studies in this update (Jason 2007;Wearden 2010;White 2011; seeFigure 1).

Figure 1.

Figure 1

PRISMA flow diagram.

Included studies

A total of eight studies (Fulcher 1997;Jason 2007;Moss‐Morris 2005;Powell 2001;Wallman 2004;Wearden 1998;Wearden 2010;White 2011) met our inclusion criteria for thisreview (23 reports in all). All included studies were written in English and werepublished in peer‐reviewed journals.

Design

All included studies were described as randomised controlled trials.

Three studies included two arms (Fulcher1997;Moss‐Morris 2005;Wallman 2004) comparing exercise versusrelaxation/flexibility, waiting list or standard care, respectively.

Four studies had four arms. ForPowell 2001,we combined the three intervention arms and used these as comparators versus treatmentas usual. We considered two arms (exercise + drug placebo vs exercise placebo + drugplacebo) inWearden 1998 as relevant for thisreview. ForJason 2007 andWhite 2011, all four arms were used, as werethree arms inWearden 2010.

The eight studies randomly assigned a total of 1518 participants. Samples included inthis review ranged from 49 (Moss‐Morris2005) to 641 participants (White2011).

Setting

Two studies took place in primary care settings: one in the United Kingdom (Wearden 2010) and one in Australia (Wallman 2004). Two studies were performed insecondary care facilities: one in the United Kingdom (Fulcher 1997) and one in New Zealand (Moss‐Morris 2005). One study recruitedfrom a variety of sources but took place at a hospital in the USA (Jason 2007). Three studies were conducted atsecondary/tertiary care settings in the United Kingdom (Powell 2001;Wearden 1998;White 2011).

Participants

Three studies used the Centers for Disease Control and Prevention (CDC) 1994 criteria(Fukuda 1994) as inclusion criteria (Jason 2007;Moss‐Morris 2005;Wallman2004), and five (Fulcher 1997;Powell 2001;Wearden 1998;Wearden 2010;White 2011) used the Oxford criteria (Sharpe 1991).Wearden 2010 andWhite 2011 showedan overlap between Oxford criteria (Sharpe1991) and London ME criteria (The NationalTask Force on CFS) of 31% and 51%, respectively. More female than maleparticipants were included (range 71% to 84% when all arms were included), and meanages across studies were between 33 and 44.6 years (confirmation of age data wasrequested from a trial investigator in one case (Wallman 2009)). The studies reported median illness duration of between 2.3and 7 years. All but one study (Wallman 2004)reported depression, which ranged from 18% (Wearden2010) of those with a depression diagnosis to 39% among participants with acurrent Axis I disorder (Jason 2007). Threestudies did not report work and employment information (Wallman 2004;Wearden 2010;White 2011).Fulcher 1997 andJason 2007 reported that 39% and 46% ofparticipants were working or studying on at least a part‐time basis, 22% ofparticipants inMoss‐Morris 2005 wereunemployed and were unable to work because of disability and 42% of participants inPowell 2001 were receiving disabilitypensions (Table 7).

Table 1.

Study demographics

Study IDNGenderDuration of illnessDepression co‐morbidityUse of antidepressants (ADs)Work and employment status
Fulcher 19976649F/17M
65% female
2.7 years20 (30%) possible cases of depression (HADS)30 (45%) on full‐dose AD (n = 20) or low‐dose AD (n = 10)26 (39%) working or studying at least part time
Jason 200711495F/19M
83% female
> 5.0 years44 (39%) with a current Axis I disorder
(depression and anxiety most common)
Not stated52 (46%) working or studying at least part time, 24% unemployed, 6% retired,25% on disability
Moss‐Morris 20054934F/15M
69% female
3.1 years14 (29%) possible or probable cases of depression (HADS)Not stated11 (22%) were unemployed and were unable to work because of disability
Powell 2001148116F/32M
78% female
4.3 years58 (39%) possible or probable cases of depression (HADS)27 (18%) used AD50 (34%) were working, 64 (43%) were on disability
Wallman 20046147F/14M
77% female
Not statedNot stated16 (26%) used ADNot stated
Wearden 199813697F/39M
71% female
2.3 years46 (34%) with depressive disorder according to DSM‐III‐RcriteriaNot stated114 (84%) had recently changed occupation
Wearden 2010296230F/66M
78% female
7.0 years53 (18%) had a depression diagnosis160 (54%) were prescribed AD in the past 6 monthsNot stated
White 2011641495F/146M
77% female
2.7 years219 (34%) with any depressive disorder260 (41%) used ADNot stated
Intervention characteristics

The exercise therapy regimen lasted between 12 and 26 weeks. Seven studies usedvariations of aerobic exercise therapy such as walking, swimming, cycling or dancingat mixed levels in terms of intensity of the aerobic activity ranging from very low toquite rigorous; the remaining study used anaerobic exercise (Jason 2007). Scheduled therapist meetings couldbe conducted face‐to‐face or by telephone and varied from every secondweek to weekly; some sessions involved talking, and some exercise. Most of theincluded studies asked participants to exercise at home, most often between three andfive times per week, with a target duration of 5 to 15 minutes per session usingdifferent means of incrementation (Fulcher1997;Moss‐Morris 2005;Powell 2001;Wallman 2004;Wearden 1998;Wearden 2010;White 2011). Participants were asked to perform self‐monitoring byusing such tools as heart monitors, the Borg Scale or a diary including an exerciselog to measure adherence to treatment (Table 8). Controlinterventions included treatment as usual, relaxation plus flexibility and awaiting‐list control group.

Table 2.

Characteristics of exercise interventions

Study IDDeliverer of interventionExplanation and materialsType of exerciseSchedule therapistSchedule homeDuration of activityInitial exercise levelIncrement stepsParticipant self‐monitoringCriteria for (non)‐increment
Fulcher 1997Exercise physiologistVerbal explanation of deconditioning and reconditioningWalking (encouraged to take other modes such as cycling and swimming)Weekly
(1 hour), talking only
5 days/wk5 to 15 minutes increasing to 30 minutes/d5 to 15 minutes at 40% of peak O2 consumption
(target HR of resting + 50% of HRR)
Duration increased 1 to 2 minutes per week up to 30 minutes; then intensityincreasedAmbulatory heart rate monitorsIf increased fatigue, continue at the same level for an extra week
Wearden 1998Physiotherapist,
fitness focus
Minimal explanation; no written materialsPreferred activity
(walking/jogging, some did cycling, swimming)
At week 0, 1, 2, 4, 8, 12*, 20, 26*,
talking only
(*evaluation visits)
3 days/wk20 minutes75% of VO2max from bike testIntensity increasedBorg Exertion Scale chart, before and after HRIncrease if: 10 beats/min drop post exercise and 2‐pointdrop in Borg Scale score
Powell 2001Senior clinical therapistExplanations for GET, circadian dysrhythmia, deconditioning, sleep
"educational information pack"
Aerobic exercise;
own choice but mostly exercise bike
9 face‐to‐face
(1.5 hours each)
TailoredTailored to functional abilitiesTailored to functional abilities: “a level which you arecapable of doing on a BAD DAY”Varying daily increase (e.g. "5 second increase each day for the rest ofthe second week"
to 30 minutes twice/d
Duration of exerciseDiscouraged, but restart at lower level and rapidly reincrease
Wallman 2004Single physical therapistSmall laminated Borg Scale and heart rate monitorWalking/jogging, swimming or cyclingPhone contact every 2 weeksEvery second dayFrom 5 to 15 minutes, increasing to 30 minutesInitial exercise duration was between 5 and 15 minutes, and intensity was basedon the mean HR value achieved midpoint during submaximal exercisetests Duration increased by 2 to 5 minutes/2 wkHeart rate monitoring,
Borg Exertion Scale
Keep Borg within 11 to 14. Adjust every 2 weeks. Average peak HR whenexercising comfortably at a typical day represents patient’starget heart rate (± 3 bpm) for future sessions
Moss‐Morris 2005Health psychology MSc student, researcherFocused on the "downward spiral of activity reduction,deconditioning"Walking (but could also do other preferred exercise, e.g. jogging,swimming)Weekly for 12 weeks, talking only4 to 5 days/wkSet collaboratively approx 5 to 15 minutesHR at 40% of VO2maxDuration 3 to 5 minutes/wk
Intensity increased after 6 weeks 5 bpm/wk
Ambulatory heart rate monitorsIf increased fatigue, continue at the same level for an extra week
Jason 2007Registered nurses supervised by exercise physiologist"Behavioral goals explained, energy system education, redefiningexercise""individualized, constructive and pleasurable activities"Every 2 weeks
(45 minutes),
13 sessions
3 per weekTailoredFlexibility tests
Strength test (hand grip)
"Gradually increasing anaerobic activity levels"Self‐monitoring daily exercise diaryNew targets only after habituation, or if goals achieved for 2 weeks
Wearden 2010Nurses with 16 half‐days of training and supervisionExplanation of physiological symptoms and training in first sessionWide choice: walking, stairs, bicycle, dance, jog10 sessions over 18 weeksSeveral times per dayFirst 90 minutes, then alternating 60 and 30 minutesDetermined collaboratively with the participant"Increased very gradually," examples show 50% increase per dayDiary of progress on exercise programme, with note of daily activitiesOn "bad days," try to do same as day before
White 2011Exercise therapist/physiotherapist
(8 to 10 days training + ongoing supervision)
142‐page manual:
benefits of exercise
and "how to" of GET; some got pedometers
Wide choice: walking, cycling, swimming, Tai Chi.
Aim to build into daily activities
Weekly × 4, then
fortnightly;
total of 15 sessions
5 to 6 days/wkNegotiated, goal to get to 30 minutes per sessionTest of fitness (step test. and 6‐minute walking test),
perceived physical exertion, actigraphy data
"20% increases" per fortnight; increase duration to 30 minutes, thenincrease intensityExercise diary + Borg scale +
“Use non‐symptoms to monitor”and
heart rate monitor
(for intensity increases)
Do not increase if global increase in symptoms
© 9. March 2012, Paul Glasziou, Bond University, Australia
Outcomes

The main outcomes were symptom levels measured by rating scales at end of treatment(12 to 26 weeks) and at follow‐up (52 to 70 weeks). Fatigue was measured by theFatigue Scale (FS) (Chalder 1993) in sevenstudies (Fulcher 1997;Moss‐Morris 2005;Powell 2001;Wallman 2004;Wearden 1998;Wearden 2010;White 2011) and by the Fatigue Severity Scale (FSS) (Krupp 1989) in one study (Jason 2007). Another study (White 2011) reported adverse outcomes accordingto SAR categories (European Union Clinical TrialsDirective 2001).

TheJason 2007 study measured pain using theBrief Pain Inventory (Cleeland 1994).Physical functioning was measured by the SF‐36 physical functioning subscale(Ware 1992) in seven studies (Fulcher 1997;Jason 2007;Moss‐Morris2005;Powell 2001;Wearden 1998;Wearden 2010;White 2011). Qualityof life was measured by the Quality of Life Scale (QOLS) (Burckhardt 2003) in another study (Jason 2007).

Seven studies (Fulcher 1997;Jason 2007;Moss‐Morris 2005;Powell2001;Wallman 2004;Wearden 2010;White 2011) reported self‐perceived changes in overall health usingthe Global Impression Scale (Guy 1976).

Of the seven studies that reported mood disorder, six (Fulcher 1997;Powell 2001;Wallman 2004;Wearden 1998;Wearden 2010;White 2011) used theHospital Anxiety and Depression Scale (HADS) (Zigmond 1983), and one (Jason 2007)used the Beck Depression Inventory (BDI‐II) (Beck 1996) and the Beck Anxiety Inventory (BAI) (Hewitt 1993). Three studies (Powell 2001;Wearden 2010;White 2011) measuredsleep problems by using a questionnaire (Jenkins1988), two (Fulcher 1997;Powell 2001) by using the Pittburgh Sleep QualityIndex (PSQI) (Buysse 1989).

One study reported health service resource use (White 2011).

Drop‐out was calculated by the review authors.

Included studies reported several outcomes in addition to those reported in thisreview, such as work capacity by oxygen consumption (VO2), thesix‐minute walking test and illness beliefs. SeeCharacteristics of included studies for moredetailed information.

Ethics approval

Ethics approval was obtained for all listed studies and sponsoring or fundinglisted.

Excluded studies

Two studies were excluded in 2004, as the diagnoses used were Gulf War veterans'illness (Guarino 2001) and subclinical chronicfatigue (Ridsdale 2004). The study awaitingassessment from 2004 was also excluded (Stevens1999), as exercise therapy was a minor part of a combination treatment.

The current version excluded 14 studies (Evering2008;Gordon 2010;Guarino 2001;Nunez 2011;Ridsdale 2004;Ridsdale 2012;Russel 2001;Stevens 1999;Taylor 2004;Taylor 2006;Thomas 2008;Tummers 2012;Viner 2004;Wright 2005). In additionto the two studies excluded from the 2004 version because of the population included(Guarino 2001;Ridsdale 2004), another with the diagnosis ofchronic fatigue was excluded (Ridsdale 2012),as were two in which participants were younger than 18 years (Viner 2004;Wright 2005). Along with the one study excluded in 2004 (Stevens 1999), another five studies (Evering 2008;Nunez 2011;Russel 2001;Taylor 2004;Tummers 2012) were excluded in this review update because exercise therapy wasa minor part of the intervention. One study was excluded because investigators comparedtwo exercise interventions (Gordon 2010). Twostudies were excluded because they were not RCTs (Taylor 2006;Thomas 2008).

Ongoing studies

We identified five ongoing studies in trial registers (Broadbent 2012;Kos 2012;Marques 2012;Vos‐Vromans 2008;White 2012).

Studies awaiting classification

Studies identified from searches run to 9 May 2014 were assessed for eligibility andwere classified accordingly. Three studies identified in the search are waitingassessment for possible inclusion, as the available information is too sparse forconclusions about eligibility. One abstract seems to refer to an unpublished study(Hatcher 1998), but we have not been ableto contact the study authors for clarification. Additionally, two citations refer tostudies that are available only in Chinese (Liu2010;Zhuo 2007). Again, we have notbeen able to contact the study authors to clarify their relevance, and we have not hadthe resources to perform translation.

New studies found at this update

Three new studies have been added in this updated review (Jason 2007;Wearden 2010;White 2011).

Risk of bias in included studies

Summaries of the risk of bias assessments are presented inFigure 2 andFigure 3.

Figure 2.

Figure 2

Risk of bias summary: review authors' judgements about each risk of bias item foreach included study.

Figure 3.

Figure 3

Risk of bias graph: review authors' judgements about each risk of bias itempresented as percentages across all included studies.

Allocation

All but one of the studies had adequate sequence generation (Wallman 2004). We judged five reported methods ofallocation concealment as 'adequate' and found that methods described by theremaining three were unclear (Jason 2007;Powell 2001;Wallman 2004).

Blinding

As the intervention did not allow for blinding of participants or personnel deliveringthe exercise‐based interventions, and as all measures were performed byself‐report, blinding was impossible. This inevitably puts the review at somerisk of bias, and all of the included studies were rated as having high risk ofbias.

Incomplete outcome data

Risk of bias due to incomplete outcomes was low in five of the eight included studies,reflecting the fact that loss to follow‐up was low, and that participants whowere lost to follow‐up were evenly distributed between intervention and controlgroups (Fulcher 1997;Moss‐Morris 2005;Powell 2001;Wallman 2004;White 2011). One trialwas associated with unclear risk of attrition bias (Wearden 2010). The drop‐out rate in the intervention groups in thistrial was relatively high, but most of the participants who dropped out from treatmentwere still available for follow‐up assessments and were analysed within thegroups to which they were randomly assigned (Wearden2010). Two trials were associated with high risk of attrition bias (Jason 2007;Wearden 1998).Wearden 1998 reportedlarge drop‐out rates in all intervention groups as compared with control groups,and many participants were lost to follow‐up. InJason 2007, the conservatively defineddrop‐out rate (i.e. "attending four or fewer sessions or stopping therapyprior to satisfactory completion of therapy") on average was 25%. Study authorsused the best linear unbiased predictor to avoid taking missing data into account, butas loss to follow‐up for various intervention groups was not reported, weassessed the risk of attrition bias as high for this trial.

Selective reporting

Two studies (Wearden 2010;White 2011) referenced published protocols, andwhen we checked these against the published results, we found that reporting wasadequate. In one study (Wearden 1998), trialinvestigators reported numerical data for only one subscale (health perception) of theMedical Outcomes Survey (MOS) scale (Ware1992), for which data favour the intervention group; no numerical data were givenfor the five other subscales, nor for another scale (anxiety), as data were"similar in trial completers." It was not possible to check the other studiesfor selective reporting bias; therefore their risk of bias is considered unclear.

Other potential sources of bias

Seven of the eight studies seem to be free of other sources of bias, and one showed abaseline difference across groups for several variables (Jason 2007). These were not discussed when resultswere presented in the paper. In addition this study had 25 outcome measures; because ofthis large number, one significant measure would be expected to occur by chance (Jason 2007).Wallman 2004 showed differences between groups for anxiety and mental fatigueat baseline, and this might have influenced the results.

Effects of interventions

See:Table 1

Exercise therapy versus control

Comparison 1. Exercise therapy versus treatment as usual, relaxation orflexibility

All included studies (Fulcher 1997;Jason 2007;Moss‐Morris 2005;Powell2001;Wallman 2004;Wearden 1998;Wearden 2010;White 2011)contributed data for this comparison.

1.1 Fatigue

Powell 2001 (148 participants) assessedfatigue by dichotomised scoring of an 11‐item Fatigue Scale (FS, 0 to 11points) (Chalder 1993) and reported resultsclearly in favour of exercise therapy (mean difference (MD) ‐6.06, 95%confidence interval (CI) ‐6.95 to ‐5.17;Analysis 1.1). Three studies (Wallman 2004;Wearden 2010;White 2011)measured fatigue among a total of 540 participants using the same 11‐item FSwith a different scoring system (0 to 33 points) (Chalder 1993) (Analysis 1.1). Thepooled estimate suggests that exercise therapy was significantly more effective thantreatment as usual (MD ‐2.82, 95% CI ‐4.07 to ‐1.57) – aresult that was not associated with heterogeneity (I² = 0%, P value 0.54).Three studies (Fulcher 1997;Moss‐Morris 2005;Wearden 1998) with a total of 152 participantsmeasured fatigue using a 14‐item FS (0 to 42 points) (Chalder 1993). Pooling shows a significantdecrease in fatigue in the exercise group when compared with treatment as usual (MD‐6.80 points, 95% CI ‐10.31 to ‐3.28), and the analysis wasassociated with low heterogeneity (I² = 20%, P value 0.29).

Analysis 1.1.

Analysis 1.1

Comparison 1 Exercise therapy versus treatment as usual, relaxation or flexibility,Outcome 1 Fatigue (end of treatment).

At follow‐up, small strengthening of the effect was observed on the11‐point FS (Chalder 1993) asreported byPowell 2001 (MD ‐7.13,95% CI ‐7.97 to ‐6.29; 148 participants;Analysis 1.2). Pooling of the two studies(Wearden 2010;White 2011) that measured fatigue on the33‐point scale resulted in almost the same effect estimate atfollow‐up as at end of treatment (MD ‐2.87, 95% CI ‐4.18 to‐1.55; 472 participants;Analysis1.2). The latter analysis was not associated with any unexplainedheterogeneity (I² = 0%, P value 0.46).Jason2007 (50 participants) did not report results at end of treatment butshowed little or no difference in fatigue between anaerobic exercise and treatmentas usual at follow‐up, as measured on the Fatigue Severity Scale (FSS) (Krupp 1989) (MD 0.15, 95% CI ‐0.55 to0.85;Analysis 1.2).

Analysis 1.2.

Analysis 1.2

Comparison 1 Exercise therapy versus treatment as usual, relaxation or flexibility,Outcome 2 Fatigue (follow‐up).

Sensitivity analysis

Investigating heterogeneity

At end of treatment, fatigue was measured and reported on different scales, andwe performed a sensitivity analysis in which all available studies were pooledusing an SMD method. This strategy led to a pooled random‐effects estimateof ‐0.68 (95% CI ‐1.02 to ‐0.35), but the analysis sufferedfrom considerable heterogeneity (I² = 78%, P value < 0.0001;Analysis 1.19). The observed heterogeneitywas caused mainly by the deviating results presented inPowell 2001. Exclusion ofPowell 2001 gave rise to a pooled SMD of‐0.46 (95% CI ‐0.63 to ‐0.29) – an estimate that wasnot associated with heterogeneity (I² = 13%, P value 0.33).

Analysis 1.19.

Analysis 1.19

Comparison 1 Exercise therapy versus treatment as usual, relaxation or flexibility,Outcome 19 Subgroup analysis for fatigue.

At follow‐up, the four available studies (Jason 2007;Powell 2001;Wearden 2010;White 2011) measured and reportedfatigue on different scales, and we performed a sensitivity analysis in which allavailable studies were pooled using an SMD method. The pooled SMD estimate is‐0.63 (95% CI ‐1.32 to 0.06), but heterogeneity was extensive(I² = 93%, P value < 0.00001). Exclusion ofPowell 2001 gave rise to a new pooled SMD of‐0.29 (95% CI ‐0.55 to ‐0.03) and reduced heterogeneity(I² = 46%, P value 0.16).

Subgroup analysis

To explore the possible impact of our pooling strategy (e.g. the impact ofpooling studies adhering to different exercise strategies and control conditions),we performed post hoc subgroup analyses withinAnalysis 1.1 andAnalysis1.2.

Type of exercise

Post hoc subgroup analysis based on treatment strategy could not establishdifferences (I² = 0%, P value 0.60) between studies of graded exercisetherapy (Fulcher 1997;Moss‐Morris 2005;Powell 2001;Wearden 1998;Wearden 2010;White 2011) and studies testing exercise withself‐pacing (Wallman 2004) (SMD‐0.71, 95% CI ‐1.09 to ‐0.32; I² = 82% vs SMD‐0.54, 95% CI ‐1.05 to ‐0.02, respectively) (Analysis 1.19).

At follow‐up, post hoc subgroup analysis resulted in statisticallysignificant subgroup differences (I² = 73.7%, P value 0.05) between the threestudies (Powell 2001;Wearden 2010;White 2011) comparing graded exercise versustreatment as usual (SMD ‐0.86, 95% CI ‐1.67 to ‐0.05; I²= 95%) andJason 2007, in which anaerobicactivity was compared with relaxation (SMD 0.12, 95% CI ‐0.44 to 0.67).

Type of control

We cannot establish a subgroup difference (I² = 0%, P value 0.88) betweenthe five studies with treatment as usual as control (Moss‐Morris 2005;Powell 2001;Wearden 1998;Wearden 2010;White 2011) and the two studies prescribingrelaxation or flexibility to participants in the control arm (Fulcher 1997;Wallman 2004) (SMD ‐0.70, 95% CI‐1.14 to ‐0.25 vs SMD ‐0.65, 95% CI ‐1.02 to‐0.28).

Diagnostic criteria

As the use of various diagnostic criteria is often emphasised as particularlyimportant with regard to treatment response, we also performed subgroup analysesbased on diagnostic criteria. Comparison of the two studies using 1994 CDCcriteria (Moss‐Morris 2005;Wallman 2004) and the five studies using theOxford criteria (Fulcher 1997;Powell 2001;Wearden 1998;Wearden 2010;White 2011) revealed no differences betweensubgroups (I² = 0%, P value 0.84) (SMD ‐0.73, 95% CI ‐1.17 to‐0.28 vs SMD ‐0.66, 95% CI ‐1.09 to ‐0.24).

1.2 Adverse effects

White 2011 reported two serious adversereactions (SARs) (European Union Clinical TrialsDirective 2001) possibly related to treatment among the 160 participants(i.e. deterioration in mobility and self‐care and worse CFS symptoms andfunction) in the exercise group and two SARs among the 159 participants in thecontrol group (i.e. worse CFS symptoms and function and increased depression andincapacity) (odds ratio (OR) 0.99, 95% CI 0.14 to 7.1;Analysis 1.3). Participants in theWearden 2010 trial reported no SARs totherapy.

Analysis 1.3.

Analysis 1.3

Comparison 1 Exercise therapy versus treatment as usual, relaxation or flexibility,Outcome 3 Participants with serious adverse reactions.

1.3 Pain

Wearden 1998 reported that all treatedgroups scored similarly on the pain subscale of SF‐36 (Ware 1992), but measured values were notreported.

One trial,Jason 2007 (43 participants),assessed pain using the Brief Pain Inventory (Cleeland 1994) at follow‐up (Analysis 1.4) and observed an MD of ‐0.97 (95% CI ‐2.44 to0.50) on pain severity and ‐0.69 on the pain interference subscale (95% CI‐2.48 to 1.10). The wide confidence interval implies that the results wereinconclusive.

Analysis 1.4.

Analysis 1.4

Comparison 1 Exercise therapy versus treatment as usual, relaxation or flexibility,Outcome 4 Pain (follow‐up).

1.4 Physical functioning

Five trials (Fulcher 1997;Moss‐Morris 2005;Powell 2001;Wearden 2010;White 2011) with atotal of 725 participants assessed physical functioning according to the physicalfunctioning subscale of SF‐36 (Ware1992) at end of treatment. The pooled estimate for these studies (Analysis 1.5) suggests that mean improvementfor participants randomly assigned to exercise therapy was 13.10 points higher (95%CI 1.98 to 24.22) than for the treatment as usual group, but heterogeneity wasconsiderable (I² = 89%, P value < 0.00001).

Analysis 1.5.

Analysis 1.5

Comparison 1 Exercise therapy versus treatment as usual, relaxation or flexibility,Outcome 5 Physical functioning (end of treatment).

Four trials (669 participants) contributed data for evaluation of physicalfunctioning at follow‐up (Jason2007;Powell 2001;Wearden 2010;White 2011).Jason 2007 observedbetter results among participants in the relaxation group (MD 21.48, 95% CI 5.81 to37.15). However, results were distorted by large baseline differences in physicalfunctioning between the exercise and relaxation groups (39/100 vs 54/100); thereforewe decided not to include these results in the meta‐analysis. Pooling of thethree remaining trials (621 participants) showed a mean improvement on theSF‐36 physical functioning subscale that was 16.33 points higher for exercisethan for treatment as usual (95% CI ‐4.08 to 36.74;Analysis 1.6), but heterogeneity was excessive(I² = 96%, P value < 0.00001); therefore little or no difference cannot beruled out.

Analysis 1.6.

Analysis 1.6

Comparison 1 Exercise therapy versus treatment as usual, relaxation or flexibility,Outcome 6 Physical functioning (follow‐up).

Sensitivity analysis

Investigating heterogeneity

Extensive heterogeneity inAnalysis 1.5was largely driven by the remarkably positive effect of exercise therapy reportedbyPowell 2001. Heterogeneity (I²)dropped to 52% (P value 0.10) following exclusion ofPowell 2001, and the pooled mean differencestill showed better improvement for participants in the exercise group (MD 7.37,95% CI 1.23 to 13.51). The remaining heterogeneity may reflect the large variationin baseline physical functioning observed across studies, ranging from 29.8 (Wearden 2010) to 53.1 (Moss‐Morris 2005), but the number ofavailable studies was low; it is therefore difficult to explore this associationfurther.

Also at follow‐up, observed heterogeneity was driven by remarkablypositive results in favour of exercise as reported byPowell 2001. IfPowell 2001 was excluded, heterogeneitydropped to 0% (P value 0.50), and the two remaining trials (Wearden 2010;White 2011) reported a smaller butstatistically significant difference in favour of exercise therapy (MD‐5.79, 95% CI ‐10.53 to ‐1.06).

Subgroup analysis

To explore the possible impact of varying exercise strategies and controlconditions, we performed post hoc subgroup analyses withinAnalysis 1.5 andAnalysis 1.6.

Type of exercise

All studies included inAnalysis 1.5 andAnalysis 1.6 offered graded exercisetherapy.Jason 2007 observed betterresults among participants in the relaxation group than among those in theanaerobic exercise group (MD 21.48, 95% CI 5.81 to 37.15) at follow‐up. Asstated above, these results were distorted by large baseline differences inphysical functioning between exercise and relaxation groups (39 of 100 vs 54 of100) and were not included inAnalysis1.6.

Type of control

At end of treatment, post hoc subgroup analysis did not establish a subgroupdifference (I² = 0%, P value 0.92) between the four studies (Moss‐Morris 2005;Powell 2001;Wearden 2010;White 2011) using treatment as usual ascontrol (MD ‐12.96, 95% CI ‐26.63 to 0.72; I² = 92%) andFulcher 1997, in which relaxation orflexibility was used as a control (MD ‐13.87, 95% CI ‐24.31 to‐3.43). All studies available for analysis at follow‐up adhered tothe treatment as usual control condition, hence no sensitivity analyses wereperformed withinAnalysis 1.6.

Diagnostic criteria

We found no evidence of subgroup differences (I² = 0%, P value 0.91) betweenone study diagnosing participants according to the 1994 CDC criteria (MD‐14.05, 95% CI ‐27.48 to ‐0.62;Moss‐Morris 2005) and four studiesdiagnosing participants according to the Oxford criteria (MD ‐12.92, 95% CI‐25.99 to 0.14). All studies available for analysis at follow‐uprecruited participants in keeping with the Oxford criteria, thus no sensitivityanalyses were performed withinAnalysis1.6.

1.5 Quality of life

None of the included studies reported quality of life at end of treatment. Atfollow‐up, an estimate of effect suggested improvement towards better qualityof life (Burckhardt 2003) amongparticipants in the control group (MD 9.00, 95% CI ‐1.00 to 19.00; P value0.08) compared with those given exercise therapy (Jason 2007;Analysis 1.7; 44participants), but little or no effect cannot be ruled out. This estimate is biasedin favour of the control arm because of baseline differences between groups.

Analysis 1.7.

Analysis 1.7

Comparison 1 Exercise therapy versus treatment as usual, relaxation or flexibility,Outcome 7 Quality of life (follow‐up).

1.6.1 Depression

Five studies (Fulcher 1997;Powell 2001;Wallman 2004;Wearden 1998;Wearden 2010) with a total of 504 participantscontributed information on depression at end of treatment (12 to 26 weeks), allutilising the depression subscale of the Hospital Anxiety and Depression Scale(HADS) (Zigmond 1983). Pooling studyresults yielded an estimate of effect that suggested improvement in depressionscores among participants allocated to exercise therapy compared with controls (MD1.6 points, 95% CI ‐0.23 to 3.5;Analysis1.8), but the results were highly heterogeneous (I² = 84%, P value< 0.0001), and little or no difference cannot be ruled out.

Analysis 1.8.

Analysis 1.8

Comparison 1 Exercise therapy versus treatment as usual, relaxation or flexibility,Outcome 8 Depression (end of treatment).

At follow‐up (Analysis 1.9),Jason 2007 (45 participants) assesseddepression using the Beck Depression Inventory (BDI‐II) (Beck 1996) and observed no difference indepression scores (MD 3.44, 95% CI ‐3.00 to 9.88)—an estimate thatfavours controls because of baseline differences between groups. Three trialsreported HADS depression subscale values (Zigmond1983) at follow‐up (Powell2001;Wearden 2010;White 2011; 609 participants). The pooledestimate of effect suggests that exercise therapy improved depression more thantreatment as usual (MD ‐2.26, 95% CI ‐5.09 to 0.56), but heterogeneitywas considerable (I² = 92%, P value < 0.00001), and little or no differencecannot be ruled out.

Analysis 1.9.

Analysis 1.9

Comparison 1 Exercise therapy versus treatment as usual, relaxation or flexibility,Outcome 9 Depression (follow‐up).

Sensitivity analysis

Investigating heterogeneity

At end of treatment,Powell 2001 againreported very positive results and contributed greatly to the total heterogeneity.Exclusion ofPowell 2001 led to areduction in observed effect size (MD 0.80, 95% CI ‐0.21 to 1.82), butheterogeneity was also greatly reduced (I² = 36%, P value 0.20).

Also at follow‐up,Powell 2001reported a substantial benefit of exercise therapy compared with results describedby the other trials. Exclusion ofPowell2001 from the meta‐analysis was associated with a great reductionin heterogeneity, as I² dropped from 92% to 9% (P value 0.30). Exclusion ofPowell 2001 was also associated with achange in the observed effect estimate (MD ‐0.77, 95% CI ‐1.64 to0.09). Hence, we still see an effect estimate suggesting modest benefit associatedwith exercise therapy, but little or no difference cannot be ruled out.

Standardised mean difference (SMD)

At longer‐term follow‐up, depression was measured and reported ondifferent measurement scales; therefore we performed a sensitivity analysis inwhich all available studies were pooled using an SMD method. The four availablestudies (Jason 2007;Powell 2001;Wearden 2010;White 2011) yielded a pooled standardisedestimate of SMD ‐0.35 (95% CI ‐0.93 to 0.23) in an analysis that wasassociated with considerable heterogeneity (I² = 91%, P value <0.00001).

Subgroup analysis

To explore the possible impact of varying exercise strategies and controlconditions, we performed post hoc subgroup analyses withinAnalysis 1.8 andAnalysis 1.9.

Type of exercise

No statistical subgroup differences (I² = 0%, P value 0.75) were observedbetween the four studies offering graded exercise therapy (Fulcher 1997;Powell 2001;Wearden 1998;Wearden 2010) andWallman 2004, which offered exercise withpersonal pacing.

At longer‐term follow‐up, four available studies (Jason 2007;Powell 2001;Wearden 2010;White 2011) provided a pooledstandardised estimate of SMD ‐0.35 (95% CI ‐0.93 to 0.23) in ananalysis that was associated with considerable heterogeneity (I² = 91%, Pvalue < 0.00001). Post hoc subgroup analysis resulted in a statisticallysignificant subgroup difference (I² = 71.2%, P value 0.06) between the threestudies (Powell 2001;Wearden 2010;White 2011) comparing graded exercise therapyversus treatment as usual (SMD ‐0.53, 95% CI ‐1.20 to 0.13) andJason 2007, which compared anaerobicactivity versus relaxation (SMD 0.31, 95% CI ‐0.28 to 0.90).

Type of control

At end of treatment, the post hoc subgroup analysis did not establish a subgroupdifference (I² = 0%, P value 0.61) between the three studies (Powell 2001;Wearden 1998;Wearden 2010) using treatment as usual as thecontrol (MD ‐2.01, 95% CI ‐5.12 to 1.10; I² = 91%) and the twostudies (Fulcher 1997;Wallman 2004) using relaxation or flexibilityas the control (MD ‐1.05, 95% CI ‐2.95 to 0.84; I² = 59%).

1.6.2 Anxiety

Five trials (Fulcher 1997;Powell 2001;Wallman 2004;Wearden 1998;Wearden 2010) assessed anxiety at end oftreatment using the anxiety subscale of the HADS (Zigmond 1983). Three studies (387participants) reported data in a way that facilitated comparison in ameta‐analysis (Powell 2001;Wallman 2004;Wearden 2010), resulting in a pooled MD of‐1.48 points (95% CI ‐3.58 to 0.61;Analysis 1.10). The meta‐analysis wasassociated with heterogeneity (I² = 79%, P value 0.008), but some of thisheterogeneity can be explained by uncorrected baseline differences in HADS anxietyscore in included trials.Wearden 1998(68 participants) stated that no significant changes were observed on the HADSanxiety score at end of treatment.Fulcher1997 (58 participants) did not observe changes in median HADS anxietyscore in the exercise group, whereas an increase in median HADS anxiety score from4 to 7 was observed in the control group. However, the difference between exerciseand control groups did not reach statistical significance in non‐parametricstatistical analysis.

Analysis 1.10.

Analysis 1.10

Comparison 1 Exercise therapy versus treatment as usual, relaxation or flexibility,Outcome 10 Anxiety (end of treatment).

Four trials assessed anxiety at longer‐term follow‐up (52 to 70weeks;Analysis 1.11).Jason 2007 (45 participants) reported a meandifference on the Beck Anxiety Inventory (BAI) (Beck 1996) of 0.70 points (95% CI ‐4.52 to 5.92), and the wideconfidence interval implies inconclusive results. Three trials (607 participants)assessed follow‐up changes in anxiety using the HADS anxiety subscale(Powell 2001;Wearden 2010;White 2011). The pooled MD suggests greaterimprovement in HADS anxiety score in the exercise group compared with the groupgiven treatment as usual (MD 1.01, 95% CI ‐0.74 to 2.75), but heterogeneitywas considerable (I² = 78%, P value 0.01), and little or no difference cannotbe ruled out.

Analysis 1.11.

Analysis 1.11

Comparison 1 Exercise therapy versus treatment as usual, relaxation or flexibility,Outcome 11 Anxiety (follow‐up).

Sensitivity analysis

Investigating heterogeneity

At follow‐up,Powell 2001reported very positive results and contributed to increased heterogeneity.Exclusion ofPowell 2001 reducedheterogeneity to 63% (P value 0.10), and the pooled MD forWhite 2011 andWearden 2010 was reduced to 0.24 (95% CI‐1.27 to 1.74).

Standardised mean difference (SMD)

At longer‐term follow‐up, anxiety was measured and reported ondifferent measurement scales; therefore we performed a sensitivity analysis inwhich all available studies were pooled using an SMD method. Four availablestudies (Jason 2007;Powell 2001;Wearden 2010;White 2011) yielded a pooled standardisedestimate of SMD ‐0.17 (95% CI ‐0.50 to 0.15), but the analysis wasassociated with heterogeneity (I² = 71%, P value 0.02).

Subgroup analysis

To explore the possible impact of varying exercise strategies and controlconditions, we performed post hoc subgroup analyses withinAnalysis 1.10 andAnalysis 1.11.

Type of exercise and control

At end of treatment, post hoc subgroup analysis did not establish a subgroupdifference (I² = 0%, P value 0.64) between the two studies (Powell 2001;Wearden 2010) comparing graded exercisetherapy versus treatment as usual (MD ‐1.22, 95% CI 0.‐4.51 to 2.07;I² = 88%) andWallman 2004, whichcompared exercise with personal pacing versus flexibility and relaxation (MD‐2.10, 95% CI ‐3.86 to ‐0.34).

At follow‐up, four available studies (Jason 2007;Powell 2001;Wearden 2010;White 2011) yielded a pooled standardisedestimate of SMD ‐0.17 (95% CI ‐0.50 to 0.15), but the analysis wasassociated with heterogeneity (I² = 71%, P value 0.02). We could notestablish a statistically significant subgroup difference (I² = 0%, P value0.40) between the three studies (Powell2001;Wearden 2010;White 2011) comparing graded exercise therapyversus treatment as usual (SMD ‐0.23, 95% CI ‐0.61 to 0.16) andJason 2007, which compared anaerobicactivity versus relaxation (SMD 0.08, 95% CI ‐0.51 to 0.66).

1.7 Sleep

Two trials (Powell 2001;Wearden 2010), with a total of 323participants, suggested that sleep assessed by the Jenkins Sleep Scale (Jenkins 1988) had improved more amongparticipants in the exercise group at end of treatment (MD ‐1.49 points, 95%CI ‐2.95 to ‐0.02; P value 0.05;Analysis 1.12).Fulcher 1997,with 59 participants at end of treatment, observed a reduction in median sleepscore, as assessed by the Pittsburgh Sleep Quality Index, from 7 to 5 in theexercise group, whereas median sleep score remained 6 in the control group; thisgroup difference did not reach statistical significance in non‐parametricstatistical analysis.

Analysis 1.12.

Analysis 1.12

Comparison 1 Exercise therapy versus treatment as usual, relaxation or flexibility,Outcome 12 Sleep (end of treatment).

At follow‐up, three included trials (Powell 2001;Wearden 2010;White 2011) (610 participants) showed effectsin favour of exercise therapy when they were pooled (MD ‐2.04 points, 95% CI‐3.48 to ‐0.23; P value 0.03;Analysis 1.13), but the three studies showed heterogeneous results: alarge positive effect inPowell 2001 (MD‐4.05, 95% CI ‐6.08 to ‐2.02) and a moderate effect inWhite 2011 (MD ‐2.00, 95% CI‐3.84 to ‐0.23), withWearden2010 reporting no observed statistically significant differences betweenthe two groups (MD ‐0.31, 95% CI ‐1.97 to 1.35).

Analysis 1.13.

Analysis 1.13

Comparison 1 Exercise therapy versus treatment as usual, relaxation or flexibility,Outcome 13 Sleep (follow‐up).

Subgroup analysis

All available studies compared graded exercise therapy versus treatment as usual.All studies recruited participants according to the Oxford criteria, thus nosubgroup analyses were performed withinAnalysis1.12 andAnalysis 1.13.

1.8 Self‐perceived changes in overall health

Seven trials assessed changes in overall health at end of treatment or atfollow‐up by using a self‐rated Global Impression Change Scale withscores ranging from 1 (very much better) to 7 (very much worse). We performedanalysis of the numbers of participants reporting improvement. Four trials (523participants) reported changes in overall health after end of treatment (Fulcher 1997;Moss‐Morris 2005;Wallman2004;Wearden 2010) andconsistently showed a larger number of participants with some degree of improvementin the exercise group (RR 1.83, 95% CI 1.39 to 2.40;Analysis 1.14).

Analysis 1.14.

Analysis 1.14

Comparison 1 Exercise therapy versus treatment as usual, relaxation or flexibility,Outcome 14 Self‐perceived changes in overall health (end of treatment).

Three trials (518 participants) reporting self‐perceived changes in overallhealth at follow‐up were more inconsistent (Jason 2007;Powell 2001;White 2011). The point estimate for the riskratio favoured exercise therapy (RR 1.88, 95% CI 0.76 to 4.64;Analysis 1.15), but the confidence intervalimplies inconclusive results, and heterogeneity was substantial (I² = 85%).Jason 2007 showed no significantdifferences between exercise and relaxation (RR 0.83, 95% CI 0.44 to 1.56) andWhite 2011 suggested a positive effect ofexercise therapy compared with treatment as usual (RR 1.63, 95% CI 1.16 to 2.29),whereasPowell 2001 indicated a largepositive effect for exercise (RR 5.96, 95% CI 2.36 to 15.09).

Analysis 1.15.

Analysis 1.15

Comparison 1 Exercise therapy versus treatment as usual, relaxation or flexibility,Outcome 15 Self‐perceived changes in overall health (follow‐up).

Subgroup analysis

To explore the potential impact of varying exercise strategies and controlconditions, we performed a post hoc subgroup analysis withinAnalysis 1.14 andAnalysis 1.15.

Type of control

At end of treatment, the pooled RR for all available studies was 1.83 (95% CI1.39 to 2.40; I² = 0%) compared with 1.99 (95% CI 1.38 to 2.86; I² = 0%)in the treatment as usual subgroup (Moss‐Morris 2005;White2011) and 1.64 (95% CI 1.09 to 2.48; I² = 0%) in therelaxation/flexibility subgroup (Fulcher1997;Wallman 2004). Tests forsubgroup differences did not establish differences between the two groups (I²= 0%, P value 0.50).

Type of exercise

Three studies offering graded exercise therapy (Fulcher 1997;Moss‐Morris2005;White 2011) tended towardsa greater chance of improvement (RR 2.01, 95% CI 1.46 to 2.77) than the studyoffering exercise with personal pacing (RR 1.43, 95% CI 0.85 to 2.41;Wallman 2004), but statistical tests did notestablish a subgroup difference (I² = 13.6%, P value 0.28).

At follow‐up, the pooled RR for the three available studies was 1.88 (95%CI 0.76 to 4.64) in an analysis associated with extensive heterogeneity (I² =85%, P value 0.001). The post hoc subgroup analysis did not firmly establish asubgroup difference (I² = 63%, P value 0.10) between the two studies (Powell 2001;White 2011) comparing graded exercise therapyversus treatment as usual (RR 2.92, 95% CI 0.75 to 11.35; I² = 87%) andJason 2007, which compared anaerobic activityversus relaxation (RR 0.83, 95% CI 0.44 to 1.56).

1.9 Health service resources

Data on health service resources are available for one of the included studies witha total of 320 participants (White 2011).During the 12‐month post‐randomisation period, participants in thetreatment as usual group had a higher mean number of specialist medical carecontacts than those allocated to exercise therapy (MD ‐1.40, 95% CI‐1.87 to ‐0.93;Analysis1.16). Use of primary care resources (i.e. general practitioner or practicenurse), other doctor contacts (i.e. neurologist, psychiatrist or other specialists),accident and emergency contacts, medication (i.e. hypnotics, anxiolytics,antidepressants or analgesics), contacts with other healthcare professionals (i.e.dentist, optician, pharmacist, psychologist, physiotherapist, community mentalhealth nurse or occupational therapist), inpatient contacts and other contacts withhealthcare/social services (e.g. social worker, support worker, nutritionist,magnetic resonance imaging (MRI), computed tomography (CT), electroencephalography(EEG)) did not differ significantly between the two groups (Analysis 1.16;Analysis 1.17)

Analysis 1.16.

Analysis 1.16

Comparison 1 Exercise therapy versus treatment as usual, relaxation or flexibility,Outcome 16 Health resource use (follow‐up) [Mean no. of contacts].

Analysis 1.17.

Analysis 1.17

Comparison 1 Exercise therapy versus treatment as usual, relaxation or flexibility,Outcome 17 Health resource use (follow‐up) [No. of users].

1.10 Drop‐out

Six studies (Fulcher 1997;Moss‐Morris 2005;Powell 2001;Wearden 1998;Wearden 2010;White 2011), with a total of 843 participants,reported drop‐out rates (Analysis1.18). The pooled RR for drop‐out was 1.63 (95% CI 0.77 to 3.43).The confidence interval implies that these results were inconclusive, andheterogeneity was moderate (I² = 50%).

Analysis 1.18.

Analysis 1.18

Comparison 1 Exercise therapy versus treatment as usual, relaxation or flexibility,Outcome 18 Drop‐out.

Subgroup analysis

The main analysis pooled studies using treatment as usual (Moss‐Morris 2005;Powell 2001;Wearden 1998;Wearden 2010) and studies using flexibility(Fulcher 1997) into the samecomparison. The pooled RR for all available studies was 1.63 (95% CI 0.77 to 3.43;I² = 50%) compared with 1.77 (95% CI 0.71 to 4.38; I² = 61%) in thetreatment as usual subgroup and 1.33 (95% CI 0.32 to 5.50) in the flexibilitysubgroup (Fulcher 1997). Tests forsubgroup differences did not establish differences between the two groups (I²= 0%, P value 0.74).

Exercise therapy versus other treatments

Comparison 2. Exercise therapy versus psychological treatment

Three trials (Jason 2007;White 2011;Wearden 2010) contributed data to this comparison, which includedcognitive‐behavioural therapy (CBT) (Jason2007;White 2011), cognitive therapytreatment (COG) (Jason 2007) and supportivelistening (Wearden 2010). We decided not topool the results in meta‐analyses because of clinical and contextualheterogeneity.

2.1 Fatigue
End of treatment

White 2011 (298 participants) showedlittle or no difference in fatigue between exercise therapy and CBT (MD 0.20, 95%CI ‐1.49 to 1.89;Analysis2.1).

Analysis 2.1.

Analysis 2.1

Comparison 2 Exercise therapy versus psychological treatment, Outcome 1 Fatigue at endof treatment (FS; 11 items/0 to 33 points).

Compared with 97 participants randomly assigned to supportive listening (Wearden 2010), 85 participants in the gradedexercise therapy group experienced greater improvement in fatigue (MD‐4.03, 95% CI ‐6.24 to ‐1.82; P value < 0.001;Analysis 2.1).

Follow‐up

Jason 2007 assessed fatigue using a7‐point Fatigue Severity Scale (Krupp1989) and showed an MD of ‐0.10 (95% CI ‐0.79 to 0.59) foranaerobic exercise versus COG (49 participants;Analysis 2.2). The wide confidence interval implies imprecise andinconclusive results.

Analysis 2.2.

Analysis 2.2

Comparison 2 Exercise therapy versus psychological treatment, Outcome 2 Fatigue atfollow‐up (FSS; 1 to 7 points).

Wide confidence intervals and imprecise results also apply to the comparison ofanaerobic exercise versus CBT as reported byJason 2007 (49 participants) with an MD of 0.40 (95% CI ‐0.34 to1.14;Analysis 2.2).White 2011 compared graded exercise therapyversus CBT (302 participants) by assessing fatigue on a 33‐point FatigueScale (Chalder 1993) and observed littleor no difference between the two groups (MD 0.30, 95% CI ‐1.45 to 2.05;Analysis 2.3).

Analysis 2.3.

Analysis 2.3

Comparison 2 Exercise therapy versus psychological treatment, Outcome 3 Fatigue atfollow‐up (FS; 11 items/0 to 33 points).

Wearden 2010 (182 participants) assessedfatigue on a 33‐point Fatigue Scale (Chalder 1993) and reported differences between rehabilitation andsupportive listening that favoured graded exercise therapy (MD ‐2.72, 95%CI ‐5.14 to ‐0.30; P value 0.03;Analysis 2.3).

Sensitivity analysis

At follow‐up, the available studies (Jason 2007;White 2011)measured and reported fatigue on different scales, and we performed a sensitivityanalysis in which the two studies were pooled using an SMD method. The resultingpooled SMD estimate is 0.07 (95% CI ‐0.13 to 0.28) with no unexplainedheterogeneity (I² = 0%, P value 0.40).

Subgroup analysis

Post hoc subgroup analysis did not establish a subgroup difference (I² = 0%,P value 0.40) betweenWhite 2011, whichcompared graded exercise therapy versus CBT (SMD 0.04, 95% CI ‐0.19 to0.26), andJason 2007, which comparedanaerobic activity versus CBT (SMD 0.30, 95% CI ‐0.26 to 0.86).

2.2 Adverse effects

White 2011 reported the number of seriousadverse reactions (SARs) (European Union ClinicalTrials Directive 2001) observed in each treatment group (Analysis 2.4). Two adverse reactions possiblyrelated to treatment were observed among the 160 participants in the exercise group(one participant with deterioration in mobility and self‐care, and one withworse CFS symptoms and function), and three participants reporting a total of fourSARs were described among 161 participants in the CBT group (one incident ofself‐harm, one incident of low mood with an episode of self‐harm, oneepisode of worsened mood and CFS symptoms and one incident of threatenedself‐harm). Thus, the observed RR was 0.67 (95% CI 0.11 to 3.96), implyingthat these results were inconclusive.

Analysis 2.4.

Analysis 2.4

Comparison 2 Exercise therapy versus psychological treatment, Outcome 4 Participantswith serious adverse reactions.

Wearden 2010 stated that no participantsin the rehabilitation or supportive listening group demonstrated SARs with aprobable relation to therapy (Analysis2.4).

2.3 Pain

Jason 2007 (43 participants) reporteddifferences in pain at follow‐up (52 weeks), as assessed by the Brief PainInventory (Cleeland 1994). When anaerobicexercise was compared with CBT, results were imprecise for pain severity (MD 0.07,95% CI ‐1.52 to 1.66;Analysis 2.5)and for pain interference (MD ‐0.35, 95% CI ‐2.29 to 1.59;Analysis 2.6). As the result of baselinedifferences between groups, these estimates, to some extent, are biased in favour ofexercise.

Analysis 2.5.

Analysis 2.5

Comparison 2 Exercise therapy versus psychological treatment, Outcome 5 Pain atfollow‐up (BPI, pain severity subscale; 0 to 10 points).

Analysis 2.6.

Analysis 2.6

Comparison 2 Exercise therapy versus psychological treatment, Outcome 6 Pain atfollow‐up (BPI, pain interference subscale; 0 to 10 points).

Jason 2007 also compared anaerobicexercise versus COG (44 participants). Here, inconclusive results were observed inpain severity (MD 0.51, 95% CI ‐0.92 to 1.94;Analysis 2.5) and pain interference (MD 0.39,95% CI ‐1.37 to 2.15;Analysis2.6).

2.4 Physical functioning
End of treatment

White 2011 (298 participants) reportedchanges in physical functioning between participants randomly assigned to exerciseand CBT at end of treatment by using the SF‐36 physical functioningsubscale (Ware 1992). Scores on thisscale range from 0 to 100, and study authors observed little or no difference inphysical function between the two groups (MD ‐1.20, 95% CI ‐6.30 to3.90;Analysis 2.7).

Analysis 2.7.

Analysis 2.7

Comparison 2 Exercise therapy versus psychological treatment, Outcome 7 Physicalfunctioning at end of treatment (SF‐36, physical functioning subscale; 0 to 100points).

Wearden 2010 (181 participants)suggested greater improvement in physical function among participants in thegraded exercise therapy group than in the supportive listening group (MD‐6.66 point, 95% CI ‐13.7 to 0.40; P value 0.06;Analysis 2.7), but little or no differencecannot be ruled out.

Follow‐up

BothJason 2007 andWhite 2011 reported physical function at52‐week follow‐up. WhereasWhite2011 (302 participants) observed little or no difference between gradedexercise therapy and CBT (MD 0.50, 95% CI ‐4.89 to 5.89;Analysis 2.8),Jason 2007 (46 participants) reported asignificant difference favouring CBT (MD 18.92, 95% CI 2.12 to 35.72;Analysis 2.8) when compared with anaerobicexercise. However, results of the latter study are skewed because of uncorrectedbaseline differences in physical function between the two groups (39 vs 46points), and this explains some of the observed heterogeneity.

Analysis 2.8.

Analysis 2.8

Comparison 2 Exercise therapy versus psychological treatment, Outcome 8 Physicalfunctioning at follow‐up (SF‐36, physical functioning subscale; 0 to 100points).

Jason 2007 (47 participants) alsocompared anaerobic exercise versus COG, suggesting a large difference in favour ofCOG (MD 21.37, 95% CI 6.61 to 36.13;Analysis2.8). It should be noted, however, that the latter estimate is probablybiased in favour of COG because of uncorrected baseline differences in physicalfunction between the two groups (39 vs 46 points).

Wearden 2010 (171 participants)suggested greater improvement in physical function among participants in thegraded exercise therapy than in the supportive listening group (MD ‐7.55point, 95% CI ‐15.57 to 0.47;Analysis2.8), but little or no difference cannot be ruled out.

2.5 Quality of life

Study authors provided no data.

2.6.1 Depression
End of treatment

InWearden 2010 (182 participants),graded exercise therapy was associated with greater improvement on the HADSdepression subscale (Zigmond 1983) thanwas seen with supportive listening (MD ‐1.57, 95% CI ‐2.74 to‐0.40; P value 0.008;Analysis2.9). We did not identify trials reporting depression for exercise versusCBT or for exercise versus COG at end of treatment.

Analysis 2.9.

Analysis 2.9

Comparison 2 Exercise therapy versus psychological treatment, Outcome 9 Depression atend of treatment (HADS depression score; 7 items/21 points).

Follow‐up

Jason 2007 assessed depression using theBeck Depression Inventory (BDI‐II) (Beck1996). When comparing anaerobic exercise versus COG (45 participants),study authors saw a trend towards greater improvement among participants in theCOG group (MD 5.08, 95% CI ‐0.77 to 10.93;Analysis 2.10), but little or no differencecannot be ruled out.

Analysis 2.10.

Analysis 2.10

Comparison 2 Exercise therapy versus psychological treatment, Outcome 10 Depression atfollow‐up (BDI; 0 to 63 points).

Two trials compared exercise therapy versus CBT (Jason 2007;White 2011), with neither showing statistically significant differencesbetween the two groups.Jason 2007 (44participants) assessed depression using the BDI‐II (Beck 1996) and reported imprecise results (MD2.99, 95% CI ‐4.37 to 10.35;Analysis2.10); interpretation of these results is further complicated by baselinedifferences between groups. On the other hand,White 2011 (287 participants) assessed depression using the HADSdepression subscale (Zigmond 1983) andfound little or no difference between graded exercise therapy and CBT (MD‐0.10, 95% CI ‐1.00 to 0.80;Analysis 2.11).

Analysis 2.11.

Analysis 2.11

Comparison 2 Exercise therapy versus psychological treatment, Outcome 11 Depression atfollow‐up (HADS depression score; 7 items/21 points).

Wearden 2010 compared graded exercisetherapy and supportive listening. At end of treatment, results favoured exercise,but this effect was not sustained at 70 weeks' follow‐up (171participants; MD ‐0.79, 95%CI ‐2.31 to 0.55;Analysis 2.11).

Sensitivity analysis

As depression was measured and reported on two different scales inJason 2007 andWhite 2011, we performed a sensitivityanalysis in which the two studies were pooled using an SMD method. The resultingpooled SMD estimate is 0.01 (95% CI ‐0.21 to 0.22) with no unexplainedheterogeneity (I² = 0%, P value 0.42).

Subgroup analysis

Post hoc subgroup analysis did not establish a subgroup difference (I² = 0%,P value 0.42) betweenWhite 2011, whichcompared graded exercise therapy versus CBT (SMD ‐0.03, 95% CI ‐0.26to 0.21) andJason 2007, which comparedanaerobic exercise versus CBT (SMD 0.23, 95% CI ‐0.36 to 0.83).

2.6.2 Anxiety
End of treatment

Wearden 2010 (182 participants) foundlittle or no difference on the HADS anxiety subscale (Zigmond 1983) between graded exercise therapyand supportive listening (MD ‐0.48, 95% CI ‐1.85 to 0.89;Analysis 2.12). We did not identify trialsreporting anxiety for exercise therapy versus CBT or for exercise therapy versusCOG at end of treatment.

Analysis 2.12.

Analysis 2.12

Comparison 2 Exercise therapy versus psychological treatment, Outcome 12 Anxiety at endof treatment (HADS anxiety; 7 items/21 points).

Follow‐up

Jason 2007 (45 participants) assessedanxiety using the Beck Anxiety Inventory (BAI) (Beck 1996). When comparing anaerobic exercise versus COG, study authorsdid not observe statistically significant differences between groups, but resultswere imprecise (MD 3.15, 95% CI ‐1.17 to 7.47;Analysis 2.13).

Analysis 2.13.

Analysis 2.13

Comparison 2 Exercise therapy versus psychological treatment, Outcome 13 Anxiety atfollow‐up (BAI; 0 to 63 points).

Two trials compared exercise therapy versus CBT (Jason 2007;White 2011), with neither showing statistically significant differencesbetween the two groups.Jason 2007 (44participants) assessed anxiety using the BAI (Beck 1996), with imprecise and statistically insignificant results (MD0.66, 95% CI ‐4.68 to 6.00;Analysis2.13).White 2011 (287participants) found little or no difference between graded exercise therapy andCBT using the HADS anxiety subscale (MD 0.30, 95% CI ‐0.71 to 1.31;Analysis 2.14).

Analysis 2.14.

Analysis 2.14

Comparison 2 Exercise therapy versus psychological treatment, Outcome 14 Anxiety atfollow‐up (HADS anxiety; 7 items/21 points).

Wearden 2010 (171 participants) did notobserve statistically significant differences on the HADS anxiety subscale betweengraded exercise therapy and supportive listening at 70 weeks (MD ‐0.08,95%CI ‐1.52 to 1.36;Analysis2.14).

Sensitivity analysis

As depression was measured and reported on two different scales inJason 2007 andWhite 2011, we performed a sensitivityanalysis in which the two studies were pooled using an SMD method. The resultingpooled SMD estimate is 0.07 (95% CI ‐0.15 to 0.28) with no unexplainedheterogeneity (I² = 0%, P value 0.99).

Subgroup analysis

Post hoc subgroup analysis did not establish a subgroup difference (I² = 0%,P value 0.99) betweenWhite 2011, whichcompared graded exercise therapy versus CBT (SMD 0.07, 95% CI ‐0.16 to0.30) andJason 2007, which comparedanaerobic activity versus CBT (SMD 0.07, 95% CI ‐0.52 to 0.66).

2.7 Sleep
End of treatment

Wearden 2010 observed that the 83participants in the graded exercise therapy group experienced greater improvementon the 20‐point Jenkins Sleep Scale (Jenkins 1988) as compared with the 97 participants in the supportivelistening group (MD ‐2.46 points, 95% CI ‐4.01 to ‐0.91; Pvalue 0.002;Analysis 2.15). We did notidentify trials reporting sleep for exercise therapy versus CBT or for exercisetherapy versus COG at end of treatment.

Analysis 2.15.

Analysis 2.15

Comparison 2 Exercise therapy versus psychological treatment, Outcome 15 Sleep at endof treatment (Jenkins Sleep Scale; 0 to 20 points).

Follow‐up

White 2011 (287 participants) assessedsleep using the Jenkins Sleep Scale (Jenkins1988) and found little or no difference between graded exercise therapyand CBT (MD ‐0.90, 95%CI ‐2.07 to 0.27;Analysis 2.16).Wearden 2010 (171 participant) also used theJenkins Sleep Scale and found little or no difference between graded exercisetherapy and supportive listening (MD ‐0.86, 95% CI ‐2.56 to 0.84;Analysis 2.16).

Analysis 2.16.

Analysis 2.16

Comparison 2 Exercise therapy versus psychological treatment, Outcome 16 Sleep atfollow‐up (Jenkins Sleep Scale; 0 to 20 points).

2.8 Self‐perceived changes in overall health

Two trials (Jason 2007;White 2011) assessed changes in overall healthby using a self‐rated Global Impression Change Scale with scores ranging from1 (very much better) to 7 (very much worse) (Guy1976). We performed analysis of the numbers of participants reportingimprovement.

End of treatment

White 2011 (320 participants) reportedchanges in overall health following graded exercise therapy versus CBT, butresults were inconclusive (RR 0.96, 95% CI 0.71 to 1.31;Analysis 2.17).

Analysis 2.17.

Analysis 2.17

Comparison 2 Exercise therapy versus psychological treatment, Outcome 17Self‐perceived changes in overall health at end of treatment.

Follow‐up

At follow‐up, self‐perceived changes in overall health werereported byJason 2007 andWhite 2011.

For the comparison of COG versus anaerobic exercise,Jason 2007 (50 participants) showed that moreparticipants in the CBT group than in the exercise group tended to reportimprovement, but little or no difference between CBT and exercise therapy cannotbe ruled out (RR 0.63, 95% CI 0.36 to 1.10;Analysis 2.18).

Analysis 2.18.

Analysis 2.18

Comparison 2 Exercise therapy versus psychological treatment, Outcome 18Self‐perceived changes in overall health at follow‐up.

BothJason 2007 (47 participants) andWhite 2011 (321 participants) comparedexercise therapy versus CBT. Pooling resulted in an RR of 0.71 (95% CI 0.33 to1.54;Analysis 2.18), implying impreciseand inconclusive results. The meta‐analysis was associated withconsiderable heterogeneity (I2 = 86%) as the result of inconsistencybetween effect estimates reported byJason2007, which compared anaerobic exercise versus CBT (RR 0.46, 95% CI 0.28to 0.77), andWhite 2011, which comparedgraded exercise therapy versus CBT (RR 1.02, 95% CI 0.77 to 1.35).

2.9 Health service resources

Data on health service resources were provided by one of the included studies witha total of 321 participants (White 2011).During the 12‐month post‐randomisation period, participants in the CBTgroup showed lower mean numbers of contacts with neurologist, psychiatrist or otherspecialists (MD 0.60, 95% CI 0.05 to 1.15;Analysis 2.19) and lower mean numbers of inpatient days (MD 0.80, 95% CI0.41 to 1.19;Analysis 2.19) when comparedwith participants in the exercise group. However, these group differences were notseen when data were analysed at a dichotomous level (Analysis 2.20).

Analysis 2.19.

Analysis 2.19

Comparison 2 Exercise therapy versus psychological treatment, Outcome 19 Healthresource use (follow‐up) [Mean no. of contacts].

Analysis 2.20.

Analysis 2.20

Comparison 2 Exercise therapy versus psychological treatment, Outcome 20 Healthresource use (follow‐up) [No. of users].

2.10 Drop‐out

White 2011 (321 participant) reporteddrop‐out from treatment. Drop‐out rates were not significantlydifferent between graded exercise therapy and CBT (RR 0.59, 95% CI 0.28 to 1.25;Analysis 2.21), but these results wereimprecise and inconclusive because few events were reported.

Analysis 2.21.

Analysis 2.21

Comparison 2 Exercise therapy versus psychological treatment, Outcome 21Drop‐out.

Wearden 2010 reported that moreparticipants discontinued graded exercise therapy (12 of 92 participants) thansupportive listening (7 of 91 participants) (RR 1.70, 95% CI 0.70 to 4.11;Analysis 2.21), but the confidence intervalimplies that these results were imprecise and inconclusive.

Comparison 3. Exercise therapy versus adaptive pacing therapy

One trial contributed data on 319 participants for this comparison (White 2011).

3.1 Fatigue

Fatigue assessed by a 33‐point Fatigue Scale (Chalder 1993) improved more among participantsallocated to graded exercise therapy than adaptive pacing (MD ‐2.00, 95% CI‐3.57 to ‐0.43; P value 0.01) when measured at end of treatment (24weeks; 305 participants). This positive effect was sustained at 52 weeks'follow‐up (307 participants; MD ‐2.50, 95% CI ‐4.16 to‐0.84; P value 0.003;Analysis3.1).

Analysis 3.1.

Analysis 3.1

Comparison 3 Exercise therapy versus adaptive pacing, Outcome 1 Fatigue.

3.2 Adverse effects

White 2011 reported the number of SARs(European Union Clinical Trials Directive2001) observed in each treatment group (Analysis 3.2). Two SARs possibly related to treatment were observed amongthe 160 participants in the graded exercise therapy group (one incident ofdeterioration in mobility and self‐care, and one episode of worse CFSsymptoms and function) compared with two in the adaptive pacing group (159participants) (one incident of suicidal thoughts, and one episode of worseneddepression). Thus, results were inconclusive, with an RR of 0.99 (95% CI 0.14 to6.97).

Analysis 3.2.

Analysis 3.2

Comparison 3 Exercise therapy versus adaptive pacing, Outcome 2 Participants withserious adverse reactions.

3.3 Pain

No data were provided.

3.4 Physical functioning

The graded exercise therapy group (150 participants) experienced significantimprovement in physical functioning compared with the adaptive pacing group (155participants) (Analysis 3.3). At end oftreatment, participants in the graded exercise therapy group scored a mean of 12.2points better (95% CI ‐17.23 to ‐7.17) on the SF‐36 physicalfunctioning subscale (Ware 1992) than thosein the adaptive pacing group—a difference that was sustained at 52 weeks'follow‐up (307 participants; MD ‐11.8, 95% CI ‐17.5 to‐6.05).

Analysis 3.3.

Analysis 3.3

Comparison 3 Exercise therapy versus adaptive pacing, Outcome 3 Physicalfunctioning.

3.5 Quality of life

No data were provided.

3.6.1 Depression

The change on the HADS depression subscale (Zigmond 1983) at end of treatment was not reported (White 2011). At follow‐up, participantsin the graded exercise therapy group (144 participants) had improved by a mean of1.10 points (95% CI ‐2.09 to ‐0.11) on the HADS depression subscalewhen compared with the 149 participants in the pacing group (Analysis 3.4).

Analysis 3.4.

Analysis 3.4

Comparison 3 Exercise therapy versus adaptive pacing, Outcome 4 Depression.

3.6.2 Anxiety

White 2011 did not report the change onthe HADS anxiety subscale (Zigmond 1983) atend of treatment, and they observed little or no difference between the two groups(293 participants) at 52 weeks (MD ‐0.40, 95% CI ‐1.40 to 0.60;Analysis 3.5).

Analysis 3.5.

Analysis 3.5

Comparison 3 Exercise therapy versus adaptive pacing, Outcome 5 Anxiety.

3.7 Sleep

White 2011 did not report change in sleepat end of treatment as assessed by the 20‐point Jenkins Sleep Scale (Jenkins 1988). At follow‐up,participants in the graded exercise therapy group (144 participants) had improved bya mean of 1.60 points (95% CI ‐2.70 to ‐0.50) when compared with the150 participants in the adaptive pacing group (Analysis 3.6).

Analysis 3.6.

Analysis 3.6

Comparison 3 Exercise therapy versus adaptive pacing, Outcome 6 Sleep.

3.8 Self‐perceived changes in overall health

White 2011 assessed changes in overallhealth by using a self‐rated Global Impression Change Scale with scoresranging from 1 (very much better) to 7 (very much worse) (Guy 1976). Comparisons of the numbers ofparticipants reporting improvement showed that a larger fraction of participants inthe graded exercise therapy group experienced improvement at end of treatment (319participants; RR 1.45, 95% CI 1.02 to 2.07;Analysis 3.7). At follow‐up, an estimate of effect that suggestedimprovement favouring graded exercise therapy was still observed, but little or noeffect cannot be ruled out (319 participants; RR 1.31, 95% CI 0.96 to 1.79).

Analysis 3.7.

Analysis 3.7

Comparison 3 Exercise therapy versus adaptive pacing, Outcome 7 Self‐perceivedchanges in overall health.

3.9 Health service resources

One of the included studies with a total of 319 participants provided data onhealth service resources (White 2011).During the 12‐month post‐randomisation period, participants in thepacing group showed lower mean numbers of contacts with complementary healthcareresources (MD 3.80, 95% CI 1.42 to 6.18;Analysis3.8), lower mean numbers of contacts with other doctors (neurologist,psychiatrist and other specialists) (MD 0.70, 95% CI 0.14 to 1.26;Analysis 3.8), lower mean numbers of accidentsand emergencies (MD 0.50, 95% CI 0.31 to 0.69;Analysis 3.8) and higher mean numbers of inpatient days (MD 1.00, 95% CI0.46 to 1.54;Analysis 3.8) than were seenamong participants in the exercise group. However, these group differences were notseen when data were analysed at a dichotomous level (Analysis 3.9).

Analysis 3.8.

Analysis 3.8

Comparison 3 Exercise therapy versus adaptive pacing, Outcome 8 Health resource use(follow‐up) [Mean no. of contacts].

Analysis 3.9.

Analysis 3.9

Comparison 3 Exercise therapy versus adaptive pacing, Outcome 9 Health resource use(follow‐up) [No. of users].

3.10 Drop‐out

In the PACE trial (White 2011), 10 of the160 participants in the graded exercise therapy group and 11 of the 160 participantsin the adaptive pacing group withdrew, thus the results were inconclusive (RR 0.91,95% CI 0.40 to 2.08;Analysis 3.10).

Analysis 3.10.

Analysis 3.10

Comparison 3 Exercise therapy versus adaptive pacing, Outcome 10 Drop‐out.

Comparison 4. Exercise therapy versus antidepressants

One trial contributed data on a total of 69 participants to this comparison (Wearden 1998). In this trial, investigatorscombined graded exercise therapy with antidepressant placebo, and the antidepressantused was fluoxetine.

4.1 Fatigue

Investigators assessed fatigue on a 42‐point Fatigue Scale (Chalder 1993; 48 participants) at end oftreatment, but the results were inconclusive (MD ‐1.99, 95% CI ‐8.28to 4.30;Analysis 4.1).

Analysis 4.1.

Analysis 4.1

Comparison 4 Exercise therapy + antidepressant placebo versus antidepressant + exerciseplacebo, Outcome 1 Fatigue.

4.2 Adverse effects

Study authors provided no data.

4.3 Pain

Study authors provided no data.

4.4 Physical functioning

Study authors provided no data.

4.5 Quality of life

Study authors provided no data.

4.6.1 Depression

Researchers assessed depression among 48 participants at end of treatment using theHADS depression subscale (Zigmond 1983),but they found little or no difference between the exercise and fluoxetine groups(MD 0.15, 95% CI ‐2.11 to 2.41;Analysis4.2).

Analysis 4.2.

Analysis 4.2

Comparison 4 Exercise therapy + antidepressant placebo versus antidepressant + exerciseplacebo, Outcome 2 Depression.

4.6.2 Anxiety

Study authors provided no data.

4.7 Sleep

Study authors provided no data.

4.8 Self‐perceived changes in overall health

Study authors provided no data.

4.9 Health service resources

Study authors provided no data.

4.10 Drop‐out

Wearden 1998 observed similardrop‐out rates in both groups, with 11 drop‐outs reported among the 34participants in the exercise group and 10 drop‐outs among the 35 participantsin the antidepressant group (RR 1.13, 95% CI 0.55 to 2.31;Analysis 4.3), implying that the results wereinconclusive.

Analysis 4.3.

Analysis 4.3

Comparison 4 Exercise therapy + antidepressant placebo versus antidepressant + exerciseplacebo, Outcome 3 Drop‐out.

Exercise therapy adjunctive to other treatment versus the other treatmentalone

Comparison 5. Exercise therapy versus antidepressants plus exercisetherapy

One trial contributed data for a total of 68 participants to this comparison (Wearden 1998). In this trial, investigatorscombined graded exercise therapy with use of the antidepressant fluoxetine.

5.1 Fatigue

Researchers assessed fatigue on a 42‐point Fatigue Scale (Chalder 1993; 43 participants) at end oftreatment, but the results were inconclusive (MD ‐3.66, 95% CI ‐10.41to 3.09;Analysis 5.1).

Analysis 5.1.

Analysis 5.1

Comparison 5 Exercise therapy + antidepressant versus antidepressant + exerciseplacebo, Outcome 1 Fatigue.

5.2 Adverse effects

Study authors provided no data.

5.3 Pain

Study authors provided no data.

5.4 Physical functioning

Study authors provided no data.

5.5 Quality of life

Study authors provided no data.

5.6.1 Depression

Researchers assessed depression at end of treatment among 43 participants using theHADS depression subscale (Zigmond 1983),but the results were inconclusive (MD ‐0.52, 95% CI ‐2.68 to 2.14;Analysis 5.2).

Analysis 5.2.

Analysis 5.2

Comparison 5 Exercise therapy + antidepressant versus antidepressant + exerciseplacebo, Outcome 2 Depression.

5.6.2 Anxiety

Study authors provided no data.

5.7 Sleep

Study authors provided no data.

5.8 Self‐perceived changes in overall health

Study authors provided no data.

5.9 Health service resources

Study authors provided no data.

5.10 Drop‐out

Wearden 1998 observed similardrop‐out rates in both groups, with 14 drop‐outs reported among the 33participants in the exercise plus antidepressant group, and 10 drop‐outsamong the 35 participants in the antidepressant group (RR 1.48, 95% CI 0.77 to 2.87;Analysis 5.3). The confidence intervalimplies that the results were inconclusive.

Analysis 5.3.

Analysis 5.3

Comparison 5 Exercise therapy + antidepressant versus antidepressant + exerciseplacebo, Outcome 3 Drop‐out.

Discussion

Summary of main results

We have included eight studies including 1518 participants in this review.

When exercise therapy was compared with 'passive control,' fatigue wassignificantly reduced at end of treatment (Analysis1.1). Data on serious adverse reactions (SARs) were available from only onetrial, and SARs were rare, but too few events were reported to allow any conclusions to bedrawn (Analysis 1.3). A positive effect ofexercise therapy was observed both at end of treatment and at follow‐up withrespect to sleep (Analysis 1.12;Analysis 1.13), physical functioning (Analysis 1.5;Analysis 1.6) and self‐perceived changes in overall health (Analysis 1.14;Analysis 1.15). For the remaining outcomes, wewere not able to draw any conclusions.

When exercise therapy was compared with cognitive‐behavioural therapy (CBT),little or no difference in fatigue was noted between the two groups (Analysis 2.1;Analysis 2.2). Serious adverse reactions were rare and were reported at similarrates in the two groups. Events were few; therefore results were too imprecise to allowany conclusions to be drawn (Analysis 2.4).Little or no difference was observed between exercise therapy and CBT for physicalfunctioning (Analysis 2.7;Analysis 2.8), depression (Analysis 2.10;Analysis 2.11), anxiety (Analysis 2.13;Analysis 2.14) and sleep (Analysis 2.16). It was not possible to draw anyconclusions regarding pain (Analysis 2.5;Analysis 2.6), self‐perceived changes inoverall health (Analysis 2.17;Analysis 2.18) or drop‐out (Analysis 2.21).

When exercise therapy was compared with pacing, fatigue (Analysis 3.1), physical functioning (Analysis 3.3), depression (Analysis 3.4), sleep (Analysis 3.6) and self‐perceived changes inoverall health at end of treatment (Analysis 3.7)were significantly better. Data on SARs were available from only one trial, and SARs wererare, but events were too few to allow any conclusions to be drawn (Analysis 3.2). For anxiety, little or no differencebetween groups was reported (Analysis 3.5).

Overall completeness and applicability of evidence

The evidence base was limited to patients able to participate in exercise therapy, andall studies were conducted in developed countries (Australia, New Zealand, North Americaand the United Kingdom). Settings varied from primary to tertiary care, which suggestseasy generalisation. Most of the outcomes investigated were reported in the includedstudies, apart from health service resources. Most studies used aerobic exercise, but itwould be preferable if we had found studies that used different types of exercise therapy,as this would reflect clinical practice.

Quality of the evidence

Risk of bias across studies was relatively low. We were able to identifypre‐published protocols for only two studies (Wearden 2010;White 2011) and haveidentified a risk of unpublished outcomes.

One limitation is that formal blinding of participants and clinicians to treatment arm isnot inherently possible in trials of exercise therapy. This increases risk of bias, asinstructors' and participants' knowledge of group assignation might haveinfluenced the true effect. In addition, outcomes were measured subjectively (e.g.questionnaires, visual analogue scales), leading to risk that this might increase theoutcome estimate. Against this, many patient charities are opposed to exercise therapy forchronic fatigue syndrome (CFS), and this may in contrast reduce the effect. Six of theseven studies reported that investigators used intention‐to‐treat analysis,but this was done in different ways, which might have influenced the effect estimate. Onestudy (Jason 2007) reported baseline differences,used a best linear unbiased predictor to avoid taking missing data into account anddescribed 25 outcomes, with none stated as primary.

Several methodological challenges have become evident during the review process. Anobvious topic of discussion is the between‐study variation observed with regard totype of exercise, intensity of exercise and incremental procedures used (Table 8). We acknowledge that an effect of exercise therapy islikely to depend on how training is conducted, thus inclusion of trials using differentexercise regimens is likely to introduce some heterogeneity into the analysis. Possiblyequally important, the treatment provided to participants in the control group was notuniform across included trials. Whereas the difference between waiting list, relaxationand treatment as usual is rather obvious, it is important to recognise that the actualingredients of ‘treatment as usual’ differ widely among theincluded trials, and this may contribute to variation in observed effect estimates. Withregard to participants and their health status, it is important to realise thatsubstantial differences in baseline illness severity were noted, as illustrated by thewide range in baseline physical functioning, depression co‐morbidity and illnessduration shown inTable 7. Some trials applied narrow selectioncriteria, whereas others seem to have included more heterogeneous sample populations;these differences might cause variation in the observed effect estimate. Our finding ofsimilar outcomes with different definitions of CFS mitigates this risk.

All potential sources of heterogeneity mentioned above could have contributed tovariation in results derived from the aggregate analysis presented in the present reviewand might have reduced our ability to draw firm conclusions. It is easy to imagine apotential correlation between observed treatment effect and factors such as exercisecharacteristics, control conditions, participant recruitment strategies, participantcharacteristics and baseline differences. We aimed to explore these associations insubgroup analyses. However, the number of potential heterogeneity factors is high and thenumber of available trials is low; therefore we were limited in our ability to exploreheterogeneity in a sensible way at the aggregate level.

Potential biases in the review process

The strength of this review lies in its rigorous methods, which include thoroughsearching for evidence, systematic appraisal of study quality and systematic andwell‐defined data synthesis. Even though we tried to search as extensively aspossible, we may have missed out on eligible trials, such as trials reported only indissertations or in non‐indexed journals.

The table of interventions (Table 8) includes published andunpublished information regarding types of interventions, but not effect estimates. Forthis updated review, we have not collected unpublished data for our outcomes but have useddata from the 2004 review (Edmonds 2004) and frompublished versions of included articles.

The authors of this review had to make a cutoff regarding what kind of exercise should beincluded. We decided to exclude traditional Chinese exercise such as Tai Chi and Qigong,but to include pragmatic rehabilitation for which the type of exercise is described aswalking, walking stairs, bicycling, dancing or jogging. The cutoff might be contentious,and discussion regarding what type of exercise should be included should be ongoing.

One of the included studies (Powell 2001) is anoutlier, reporting very positive results in favour of exercise therapy; we decided posthoc to perform a sensitivity analysis from whichPowell2001 was excluded to learn what the results would be if this study was notincluded.

Review authors noted potential bias regarding how the comparators in this review werecategorised and pooled. We decided to report diverse comparators such ascognitive‐behavioural therapy (CBT), cognitive therapy treatment (COG) andsupportive therapy together as a single comparator called 'psychologicaltreatments' (however, because of clinical and contextual heterogeneity, we decidednot to pool the results in meta‐analyses). These different psychological treatmentsdo have similar elements, for example, both CBT and COG use cognitive approaches and goalsetting; however they differ in certain respects (e.g. CBT tries to change unhelpfulthoughts, while COG aims to accept them (Jason2007)). Our approach of combining these comparators might be consideredcontentious, and discussion about what should be lumped together and what should be splitinto different comparators should be ongoing.

Meta‐analysis of individual patient data (IPD) constitutes an alternative approachto meta‐analysis of aggregate data. Analysis based on individual patient data ingeneral will enable us to use a wider range of statistical and analytical approaches(Higgins 2011). In particular, by utilisingIPD, it is possible to explore the relative importance of the various heterogeneityfactors mentioned above more thoroughly, and to ensure that missing data and baselinedifferences are dealt with in standardised ways. With access to IPD, it is also possibleto perform subgroup analyses that have not been previously reported. A project aimed atundertaking IPD analyses of the trials included in the present review has been initiated,and when the IPD analyses are presented, they are likely to shed some new light on theaggregate level analyses presented in the current systematic review.

Agreements and disagreements with other studies or reviews

This review is an updated version of a review that was originally published in 2004(Edmonds 2004); the revised version offersmajor additions and changes. According to recent updates provided in theCochraneHandbook for Systematic Reviews of Interventions, we have implemented severalmethodological improvements, including a thorough risk of bias assessment for all includedstudies (Higgins 2011). Also, the updated searchfor literature led to the inclusion of three new trials with a total of 1051 participants(Jason 2007;Wearden 2010;White 2011), thus thenumber of included participants has more than tripled since the 2004 version. Theinclusion of new trials has important implications. First, statistical power has beenincreased by the addition of new data. Second, the most recent trials offered longerfollow‐up times; therefore we can provide more clear conclusions aboutfollow‐up treatment effects in this update than were provided in the originalreview. Third, the most recent trials involve comparisons beyond exercise therapy versustreatment as usual, for example, comparisons of exercise therapy versus other activetreatment strategies such as CBT and adaptive pacing therapy.

This update provides valuable additional information when compared with the originalreview, and results reported in the original review are largely confirmed in this update.Moreover, the results reported here correspond well with those of other systematic reviews(Bagnall 2002;Larun 2011;Prins2006) and with existing guidelines (NICE2007). One meta‐analysis of CBT and GET suggests that the two treatmentsare equally efficacious, especially for patients with co‐morbid anxiety ordepressive symptoms (Castell 2011).

A recent randomised trial comparing quality of life among participants randomly assignedto group CBT plus graded exercise therapy plus conventional pharmacological treatment orexercise counselling plus conventional pharmacological treatment found no differencesbetween the two groups at 12 months' follow‐up (Nunez 2011). This trial did not meet our a prioriinclusion criteria and was excluded from our review. As the comparison used inNunez 2011 differs from the comparisons reported inour review, it is difficult to compare the results directly; this comparison wascomplicated further by the fact thatNunez 2011did not measure outcomes viewed as primary outcomes in our review. Consequently, our viewis that the conclusions presented in our review correspond well with those of otherrelevant studies and reviews, but further research is needed to explore the considerableheterogeneity observed across available trials.

Authors' conclusions

Encouraging evidence suggests that exercise therapy can contribute to alleviation ofsome symptoms of CFS, especially fatigue. Exercise therapy seems to perform betterthan no intervention or pacing and seems to lead to results similar to those seen withcognitive behavioural therapy. Reported results were obtained from patients who wereable to participate (not from those too disabled to attend clinics); these resultswere inconclusive as to type of exercise therapy and showed heterogeneity. Few seriousadverse reactions were reported. We think the evidence suggests that exercise therapymight be an effective and safe intervention for patients able to attend clinics asoutpatients.

Further randomised controlled studies are needed to clarify the most effective type,intensity and duration of exercise therapy. These studies should report contextualcharacteristics of the exercise therapy provided, such as deliverer of theintervention, schedule, explanation and materials, supervision and monitoring. It isimportant that these trials measure health service use alongside the primary outcomesof fatigue and adverse effects, as well as alongside relevant secondary outcomes.Researchers should take care to describe which set of diagnostic criteria they haveused and how they operationalised the diagnostic process.

Feedback

Feedback

Summary

The two reviews about chronic fatigue syndrome (CFS) (on exercise and CBT) areimportant documents in a controversial field. However, they seem to be listed on thewebsite as mental health topics, alongside depression, etc. CFS is not a form of mentalillness, although of course individual cases may have a psychological component that canbe addressed during treatment. May I suggest that you place them elsewhere, as it ismisleading and confusing to include them under the mental health umbrella?

Reply

Many thanks for your comment on the two Cochrane CFS reviews. Apologies for the delayin responding, I have been on annual leave. We appreciate your observations about theplacement of these reviews inThe Cochrane Library. Feedback on reviews isnormally dealt with by the relevant review author, but in this case I am responding, asyour query relates more to an organisational issue. These reviews are listed as topicsunder a mental health heading because, as a result of the psychological component towhich you refer, both reviews are supported by a mental health Cochrane group. Similararrangements are in place for reviews of treatments for other disorders involving avariety of component problems and that as a result do not easily fit within the scope ofone Cochrane group. These reviews however can be accessed in a number of different ways,for example, by searching for the specific topic (CFS and associated terminology,exercise and associated terminology, CBT and associated terminology); by searching forthe study authors; by looking under subject headings, etc. The subject headings are notreally intended as a comment on/guide to the aetiology of an illness, but they sometimesreflect the services involved in management of the condition. I have copied thisresponse to the review authors in case they wish to comment further. Many thanks foryour feedback.

Contributors

Cathy Stillman‐Lowe (occupation freelance editor/sciencewriter) cathy.stillman‐lowe@care4free.net Submitteragrees with default conflict of interest statement: I certify that I have noaffiliations with or involvement in any organisation or entity with a financial interestin the subject matter of my feedback.

Types of evidence included, 3 June 2013

Summary

Unfortunately, this review ignores the large body of patient testimony suggesting thatmany persons with severe myalgic encephalomyelitis have been harmed by graded exercisetherapy.

Since it was prepared, the International Consensus Primer and Guidelines for MedicalPractitioners have been published.

Current thinking is to stay within your energy envelope. People with ME tend tooverdo not underdo what they are capable of....

Care must be taken to NOT encourage them to do too much.

Further many definitions are used for CFS, and this muddies the waters.

I agree with the conflict of interest statement below:

I certify that I have no affiliations with or involvement in any organisation or entitywith a financial interest in the subject matter of my feedback.

Reply

Thank you for your comments on this Cochrane Review.

In conducting this review, our aim was to gather and synthesise a specific type ofevidence—that reported by randomised controlled trials. We fully accept thatpatient testimony, particularly that gathered and synthesised by high‐qualityqualitative research, is invaluable in any clinical area, particularly in an area aschallenging for patients and healthcare professionals as CFS‐ME. However, thisproject was not designed to incorporate such evidence.

We do consider the possibility of harm arising from graded exercise therapy byconsidering reported adverse events. Clearly this is an important issue to considerwith any therapeutic intervention. Moreover, in the usual course of any illness,the condition of some patients improves (with or without treatment) and the condition ofothers worsens (with or without treatment). It is only through the use of randomisedcontrolled trials that the effects (whether beneficial or adverse) of putativetreatments can be disentangled reliably from the natural history of illness.

You raise the important point that (some) 'people with ME tend to overdo notunderdo what they are capable of.' The critical point is the extent to whichpatients should be 'encouraged to do more' and the way in which they should beencouraged to do so. These are important research questions. As you know, new randomisedevidence is available from the PACE trial, published in 2011 inLancet. Whilstthis is a controversial trial, it is an important randomised comparison of gradedexercise therapy and 'adaptive pacing.' We look forward to further randomisedevidence in due course.

We also look forward to continuing to work in this clinical area, in the hope that wecan advance our understanding of the impact of this treatment approach.

Contributors

Submitter: Adrienne.

Response prepared by Jonathan Price.

Comment 1 of 2, 9 September 2015

Summary

I would first like to thank those involved for their work in preparing this document.Even for those of us who have read the individual Chronic Fatigue Syndrome (CFS) papersit is useful to have the results collated, as well as details regarding theinterventions. Also it is interesting to see the results of sensitivity analyses,subgroup analyses, standardised mean differences, etc.

I would like to make a few comments. I’m splitting them into twosubmissions as the piece had become very long. I’ve added some looseheadings to hopefully make it more readable.

Objective measures

The review assessed the studies as having a high risk of bias regarding blinding, sinceneither participants nor assessors were blinded. Evidence suggests that subjectiveoutcomes are more prone to bias than objective outcomes when there is no blinding (1).It is thus unfortunate that the review concentrated almost exclusively on subjectivemeasures, failing to include results from nearly all the objective outcome measures thathave been published with trials. (The exception was health resource use for which youpresented follow‐up data from one trial).

I hope objective outcome data can be included in a future revision or edition of thisreview.

Examples of objective outcomes include: exercise testing (work capacity by oxygenconsumption); fitness test/step test; the six minute walking test; employment status;and disability payments.

Adding in these results would allow a more rigorous assessment of the effectiveness andrelevance of the therapies, their causal mechanisms, therapeutic compliance, andsafety.

On exercise testing, for example, in the PACE Trial (the largest trial in the review)there was no improvement in fitness levels as measured by a step test (2). The fitnessdata contrasts sharply with the many positive results from subjective self‐reportmeasures in the trial, so one is left wondering how much the subjective measures reflectreality.

On another exercise test used in the PACE Trial, the 6 minute walk test, there was asmall (mean) increase from 312 metres at baseline to 379 metres at 12 months: this was35.3 metres more than the "passive" control group when adjustments were made.However, the final result of 379 metres remains very poor compared to the more than 600metres one would expect from healthy people of a similar age and gender make‐up(3,4). By comparison, a group with Class III heart failure walked an average of 402metres (5). A score of less than 400 metres has been suggested as the level at whichsomebody should be put on a lung transplant list (6). Such information from objectivemeasures helps to add important context to the subjective measures and restraint to theconclusions that can be drawn from them.

Objective data is also needed to check compliance with a therapy. If patientsdiligently exercised for 12 months one would expect much better results on fitness andexercise testing than the aforementioned results in the PACE Trial. This is importantwhen considering adverse events and safety: such trials may not give us good informationon the safety of complying with such interventions if patients haven't actuallycomplied.

Employment and receipt of disability payments are practical objective measures ofgeneral functional capacity so data on them would help establish whether patients canactually do more overall or whether they may just be doing, for example, a little moreexercise but have substituted that for doing less in other areas (7,8). Also, CFSpatients are sometimes pressured by insurance companies into doing graded exercisetherapy (GET) programs so it would be useful to have data collated on employmentoutcomes to see whether pressure can in any way be justified (9,10). In the PACE Trial,there was no significant improvement in employment measures and receipt of disabilitypayments in the GET group (11). Outside the realm of clinical trials, the quantitativeand qualitative data in a major (UK) ME Association survey also found that GETdidn't lead to higher levels of employment and lower levels of receipt ofdisability payments on average (9). Also, extensive external audits were performed ofBelgian CFS rehabilitation clinics that treated using cognitive behavioural therapy(CBT) and GET. The main reports are in French and Dutch (12,13), with an English summaryavailable (14) that says, "Employment status decreased at the end of the therapy,from an average of 18.3% of a 38h working week, to 14.9% [...] The percentage ofpatients living from a sickness allowance increased slightly from 54 to 57%." Thiscontrasts with the average improvements reported in the audit for some symptoms likefatigue.

While data on (self‐reported) symptoms like fatigue (one of your two primaryoutcomes) is interesting, arguably more important to patients is improving their overalllevel of functioning (and again, objective measures are needed here). Being able towork, for example, despite experiencing a certain level of fatigue would likely be moreimportant for many than being unable to work but having slightly lower levels offatigue.

An example of how reductions in the reported levels of fatigue may not lead toimprovements in functioning can be seen in an analysis of three gradedactivity‐oriented CBT therapy interventions for CFS (15). The analysis showed,compared to controls, there were no improvements in overall activity levels as measuredby actometers despite improvements in self‐reported fatigue (15). Activity inthese trials was assessed using actometers. Another study that exemplifies the problemof focusing too much on fatigue scores after behavioural interventions is a study of CBTin multiple sclerosis (MS) patients with “MSfatigue”(16). The study found that following the intervention,patients with MS reported significantly lower (i.e. better) scores on the ChalderFatigue Scale (0‐33 scoring) than those in a healthy, nonfatigued comparisongroup! This significant difference was maintained at 3 and 6 months’follow‐up. It is difficult to believe that patients with MS fatigue (at baseline)truly subsequently had less fatigue than healthy nonfatigued controls: a much morelikely scenario is that undertaking the intervention had led to response biases.

You mention that "many patient charities are opposed to exercise therapy forchronic fatigue syndrome (CFS)". One reason for concern about the way in whichexercise programmes are promoted to patients is that they are often based upon modelswhich assume that there is no abnormal physiological response to exercise in thecondition, and make unsupported claims to patients. For example, in the FINE trial(Wearden et al., 2010) patient booklet (17), it is boldly asserted that: "Activityor exercise cannot harm you" (p. 49). However, a large number of studies have foundabnormal responses to exercise, and the possibility of harm being done simply cannot beexcluded on the basis of current evidence (discussed in 4, 18‐20)."

Compliance

The review doesn't include any information on compliance. I'm not sure thatthere is much published information on this but I know there was a measure based onattendance at therapy sessions (which could be conducted over the phone) given for thePACE Trial (3). Ideally, it would be interesting if you could obtain some unpublisheddata from activity logs, records from heart‐rate monitors, and other records tohelp build up a picture of what exercise was actually performed and the level ofcompliance. Information on adherence and what exercise was actually done is important interms of helping clinicians, and indeed patients, to interpret and use the data. Imention patients because patients' own decisions about their behaviour is likely tobe affected by the medical information available to them, both within and outside of asupervised programme of graded exercise; unlike with an intervention like a drug,patients can undertake exercise without professional supervision.

"Selective reporting (outcome bias)" and White et al. (2011)

I don't believe that White et al. (2011) (the PACE Trial) (3) should be classed ashaving a low risk of bias under "Selective reporting (outcome bias)" (Figure2, page 15). According to the Cochrane Collaboration's tool for assessing risk ofbias (21), the category of low risk of bias is for: "The study protocol isavailable and all of the study’s pre‐specified (primary andsecondary) outcomes that are of interest in the review have been reported in thepre‐specified way". This is not the case in the PACE Trial. The threeprimary efficacy outcomes can be seen in the published protocol (22). None have beenreported in the pre‐specified way. The Cochrane Collaboration's tool forassessing risk of bias states that a “highrisk” of bias applies if any one of several criteria are met,including that “not all of the study’spre‐specified primary outcomes have been reported” or“one or more primary outcomes is reported using measurements,analysis methods or subsets of the data (e.g. subscales) that were notpre‐specified”. In the PACE Trial, the third primaryoutcome measure (the number of "overall improvers") was never published. Also,the other two primary outcome measures were reported using analysis methods that werenot pre‐specified (including switching from the bimodal to the Likert scoringmethod for The Chalder Fatigue Scale, one of the primary outcomes in your review). Thesefacts mean that the “high risk of bias”category should apply.

Thank you for taking the time to read my comments.

Tom Kindlon

Conflict of Interest statement: I am a committee member of the Irish ME/CFS Associationand do a variety of unpaid work for the Association.

References:

1. Turner L, Boutron I, Hróbjartsson A, Altman DG, Moher D: The evolution ofassessing bias in Cochrane systematic reviews of interventions: celebratingmethodological contributions of the Cochrane Collaboration. Syst Rev 2013, 2:79.

2. Chalder T, Goldsmith KA, White PD, Sharpe M, Pickles AR. Rehabilitative therapiesfor chronic fatigue syndrome: a secondary mediation analysis of the PACE trial. LancetPsychiatry. 2015;2:141‐152.

3. White PD, Goldsmith KA, Johnson AL, Potts L, Walwyn R, DeCesare JC, et al.Comparison of adaptive pacing therapy, cognitive behaviour therapy, graded exercisetherapy, and specialist medical care for chronic fatigue syndrome (PACE): a randomisedtrial. The Lancet 2011;377:823‐36.

4. Kindlon T. Reporting of Harms Associated with Graded Exercise Therapy and CognitiveBehavioural Therapy in Myalgic Encephalomyelitis/Chronic Fatigue Syndrome. Bull IACFSME. 2011;19:59‐111.http://iacfsme.org/ME‐CFS‐Primer‐Education/Bulletins/BulletinRelatedPages5/Reporting‐of‐Harms‐Associated‐with‐Graded‐Exercise.aspx

5. Lipkin DP, Scriven AJ, Crake T, Poole‐Wilson PA (1986). Six minute walkingtest for assessing exercise capacity in chronic heart failure. British Medical Journal292, 653‐5.

6. Kadikar A, Maurer J, Kesten S. The six‐minute walk test: a guide toassessment for lung transplantation. J Heart Lung Transplant. 1997Mar;16(3):313‐9.

7. Friedberg F, Sohl S. Cognitive‐behavior therapy in chronic fatigue syndrome:is improvement related to increased physical activity? J Clin Psychol. 2009 Feb 11.

8. Friedberg F. Does graded activity increase activity? A case study of chronic fatiguesyndrome. Journal of Behavior Therapy and Experimental Psychiatry, 2002, 33, 3‐4,203‐21

9. Results and In‐depth Analysis of the 2012 ME Association Patient SurveyExamining the Acceptability, Efficacy and Safety of Cognitive Behavioural Therapy,Graded Exercise Therapy and Pacing, as Interventions used as Management Strategies forME/CFS. Gawcott, England.http://www.meassociation.org.uk/2015/05/23959/ Accessed: September 3, 2015

10. Critical Illness ‐ A Dreadful Experience with Scottish Provident.http://forums.moneysavingexpert.com/showthread.php?t=2356683 Accessed:September 4, 2015

11. McCrone P, Sharpe M, Chalder T, Knapp M, Johnson AL, Goldsmith KA, White PD.Adaptive pacing, cognitive behaviour therapy, graded exercise, and specialist medicalcare for chronic fatigue syndrome: a cost‐effectiveness analysis. PLoS One.2012;7(8):e40808.

12. Rapport d’évaluation (2002‐2004) portant surl’exécution des conventions de rééducation entre leComité de l’assurance soins de santé (INAMI) et lesCentres de référence pour le Syndrome de fatigue chronique (SFC). 2006.http://health.belgium.be/internet2Prd/groups/public/@public/@shc/documents/ie2divers/14926531_fr.pdf(Starts on page 223.) Accessed September 4, 2015 (French language edition)

13. Evaluatierapport (2002‐2004) met betrekking tot de uitvoering van derevalidatieovereenkomsten tussen het Comité van de verzekering voor geneeskundigeverzorging (ingesteld bij het Rijksinstituut voor Ziekte‐ eninvaliditeitsverzekering) en de Referentiecentra voor het Chronischvermoeidheidssyndroom (CVS). 2006.http://health.belgium.be/internet2Prd/groups/public/@public/@shc/documents/ie2divers/14926531.pdf(Starts on page 227.) Accessed September 4, 2015 (Dutch language version)

14. Stordeur S, Thiry N, Eyssen M. Chronisch Vermoeidheidssyndroom: diagnose,behandeling en zorgorganisatie. Health Services Research (HSR). Brussel: FederaalKenniscentrum voor de Gezondheidszorg (KCE); 2008. KCE reports 88A (D/2008/10.273/58)https://kce.fgov.be/sites/default/files/page_documents/d20081027358.pdfAccessed September 4, 2015

15. Wiborg JF, Knoop H, Stulemeijer M, Prins JB, Bleijenberg G. How does cognitivebehaviour therapy reduce fatigue in patients with chronic fatigue syndrome? The role ofphysical activity. Psychol Med. 2010; 40:1281‐1287.

16. Van Kessel K, Moss‐Morris R, Willoughby, Chalder T, Johnson MH, Robinson E,A randomized controlled trial of cognitive behavior therapy for multiple sclerosisfatigue, Psychosom. Med. 2008; 70:205–213.

17. Powell P. FINE Trial Patient Booklethttp://www.fine‐trial.net/downloads/Patient%20PR%20Manual%20ver9%20Apr05.pdfAccessed September 7, 2015

18. Twisk FNM, Maes M. A review on Cognitive Behavorial Therapy (CBT) and GradedExercise Therapy (GET) in Myalgic Encephalomyelitis (ME)/Chronic Fatigue Syndrome (CFS):CBT/GET is not only ineffective and not evidence‐based, but also potentiallyharmful for many patients with ME/CFS. Neuro Endocrinol Lett.2009;30:284‐299.

19. Carruthers BM et al. Myalgic Encephalomyelitis – Adult & Paediatric:International Consensus Primer for Medical Practitioners. ISBN978‐0‐9739335‐3‐6http://www.investinme.org/Documents/Guidelines/Myalgic%20Encephalomyelitis%20International%20Consensus%20Primer%20‐2012‐11‐26.pdfAccessed September 5, 2015

20. Twisk FN. Objective Evidence of Post‐exertional“Malaise” in Myalgic Encephalomyelitis andChronic Fatigue Syndrome. J Sports Med Doping Stud 2015. 5:159. doi:10.4172/2161‐0673.100015

21. Higgins JPT, Green S: Cochrane Handbook for Systematic Reviews of InterventionsVersion 5.1.0 [updated March 2011]. Table 8.5.d. The Cochrane Collaboration; 2011.http://handbook.cochrane.org/chapte...a_for_judging_risk_of_bias_in_the_risk_of.htmAccessed: September 5, 2015

22. White PD, Sharpe MC, Chalder T, DeCesare JC, Walwyn R; on behalf of the PACE trialgroup. Protocol for the PACE trial: A randomised controlled trial of adaptive pacing,cognitive behaviour therapy, and graded exercise as supplements to standardisedspecialist medical care versus standardised specialist medical care alone for patientswith the chronic fatigue syndrome/myalgic encephalomyelitis or encephalopathy. BMCNeurology 2007, 7:6http://www.biomedcentral.com/1471‐2377/7/6 Accessed: September 5,2015

Reply

Thank you for reading the review so carefully and for your comments. I have split theanswers according to the headings you have used.

Objective measures and compliance

The protocol for this review did not include objective measurements or compliance asoutcomes, hence are not included. You make a strong case and including objectivemeasures and compliance should be carefully considered in an update.

Selective reporting (outcome bias)

The Cochrane Risk of Bias tool enables the review authors to be transparent about theirjudgments, but due to the subjective nature of the process it does not guarantee anindisputable consensus. You particularly mention the risk of bias in the PACE trialregarding not providing pre‐specified outcomes however the trial didpre‐specify the analysis of outcomes. The primary outcomes were the same as inthe original protocol, although the scoring method of one was changed and the analysisof assessing efficacy also changed from the original protocol. These changes were madeas part of the detailed statistical analysis plan (itself published in full), which hadbeen promised in the original protocol. These changes were drawn up before the analysiscommenced and before examining any outcome data. In other words they werepre‐specified, so it is hard to understand how the changes contributed to anypotential bias. The relevant paper also alerted readers to all these changes and gavethe reasons for them. Overall, we don’t think that the issues you raisewith regard to the risk of selective outcome bias are such as to suspect high risk ofbias, but recognize that you may reach different conclusions than us.

Kind Regards,

Lillebeth Larun

Contributors

Feedback submitted by: Tom Kindlon

Response submitted by: Lillebeth Larun

Comment 2 of 2, 9 September 2015

Summary

Variation in interventions

It would have been useful to have some more information on the“exercise with pacing” intervention tested inthe Wallman et al. (2004) trial and how it was distinct from some other exerciseinterventions tested. The authors say (1): “On days when symptomsare worse, patients should either shorten the session to a time they consider manageableor, if feeling particularly unwell, abandon the sessionaltogether” (p. 143). I don't believe the description givenin the review conveys this. In the review, this approach is described as "Exercisewith pacing: exercise in which the incremental increase in exercise was personallyset." But Wallman et al.’s approach allows patients to decrease aswell as increase how much exercise they do on the day. This approach also contrasts withhow White (an investigator in two of the trials) has described graded exercise therapy:"if [after increasing the intensity or duration of exercise] there has been anincrease in symptoms, or any other adverse effects, they should stay at their currentlevel of exercise for a further week or two, until the symptoms are back to theirprevious levels" (2). In the PACE Trial manual White co‐wrote (3), the GETintervention was guided by the principle that “planned physicalactivity and not symptoms are used to determine what the participantdoes” (p. 21); similarly, “it is theirplanned physical activity, and not their symptoms, that determine what they are asked todo” (p. 20). Compliance data would help us examine which approachpatients are actually using: I suspect many patients are in fact doing exercise withpacing even in trials such as the PACE Trial (i.e. when they have increased symptoms,often reducing levels of exercise and sometimes doing no exercise activities at all onthat day).

Bimodal versus Likert scoring in Wearden et al. (2010)

I find it odd that the fatigue scores for the Wearden et al. (2010) trial (4) are givenin the 0‐33 format rather than the 0‐11 scoring method. The 0‐11scoring system is what is mentioned as a primary outcome measure in the protocol and iswhat is reported in the main paper reporting the results (4, 5). It is even what yourown report says on p. 44 is the scoring method (“Fatigue Scale, FS;11 items; each item was scored dichotomously on a 4‐point scale [0, 0, 1 or1]”). This is important because using the scoring method for whichyou don't report data (0‐11), there is no statistically significantdifference at the primary outcome point of 70 weeks (5).

Diagnostic criteria

One problem with using these trials as an evidence base, which I don't believe wasmentioned, is that all the trials used the Oxford and Fukuda diagnostic criteria (6, 7).Neither of these criteria require patients to have post‐exertional malaise (orsomething similar). Many consider this to be a core symptom of ME/CFS and it ismandatory in most of the other major criteria (8‐11). [Aside: The London criteriawere assessed in the PACE Trial (12) but they seem to have been operationalised in anunusual way. Ninety seven per cent of the participants who satisfied the (broad) Oxfordcriteria who didn't have a psychiatric disorder satisfied the definition of M.E.used (13). Ellen Goudsmit, one of the authors of the London criteria, has rejected theway they were used in the PACE Trial (14)]. So this lack of requirement for patients tohave post‐exertional malaise (or a similar description) means we cannot be surethat the evidence can be generalised to such patients. An independent NationalInstitutes of Health committee this year concluded "continuing to use the Oxforddefinition may impair progress and cause harm. Therefore, for progress to occur, werecommend that this definition be retired" (15). An Agency for Healthcare Researchand Quality review of diagnostic methods this year reached a similar conclusion:"Consensus groups and researchers should consider retiring the Oxford casedefinition because it differs from the other case definitions and is the leastrestrictive, probably including individuals with other overlappingconditions” (16). An Agency for Healthcare Research and Qualityreview of ME/CFS treatments said: "The Oxford CFS case definition is the leastrestrictive, and its use as entry criteria could have resulted in selection ofparticipants with other fatiguing illnesses or illnesses that resolve spontaneously withtime" (17).

Exclusion of some data from analyses due to baseline differences

It seems unfortunate that some data cannot be used due to baseline differences e.g."Four trials (669 participants) contributed data for evaluation of physicalfunctioning at follow‐up (Jason 2007; Powell 2001; Wearden 2010; White 2011).Jason 2007 observed better results among participants in the relaxation group (MD 21.48,95% CI 5.81 to 37.15). However, results were distorted by large baseline differences inphysical functioning between the exercise and relaxation groups (39/100 vs 54/100);therefore we decided not to include these results in the meta‐analysis". Itwould be good if other methods could be investigated (e.g. using baseline levels ascovariates) to analyse such data.

Thank you for taking the time to read my comments.

Tom Kindlon

I am a committee member of the Irish ME/CFS Association and do a variety of unpaid workfor the Association.

1. Wallman KE, Morton AR, Goodman C, Grove R. Exercise prescription for individualswith chronic fatigue syndrome. Med J Aust. 2005;183:142‐3.

2. White P. How exercise can help chronic fatigue syndrome. Pulse: 1998. June20:86‐87.

3. Bavinton J, Darbishire L, White PD ‐on behalf of the PACE trial managementgroup. Graded Exercise Therapy for CFS/ME (Therapist Manual)http://www.pacetrial.org/docs/get‐therapist‐manual.pdf

4. Wearden AJ, Riste L, Dowrick C, Chew‐Graham C, Bentall RP, Morriss RK, PetersS, Dunn G, Richardson G, Lovell K, Powell P. Fatigue Intervention by NursesEvaluation‐‐the FINE Trial. A randomised controlled trial of nurse ledself‐help treatment for patients in primary care with chronic fatigue syndrome:study protocol. [ISRCTN74156610]. BMC Med. 2006 Apr 7;4:9.

5. Wearden AJ, Dowrick C, Chew‐Graham C, Bentall RP, Morriss RK, Peters S, RisteL, Richardson G, Lovell K, Dunn G; Fatigue Intervention by Nurses Evaluation (FINE)trial writing group and the FINE trial group. Nurse led, home based self help treatmentfor patients in primary care with chronic fatigue syndrome: randomised controlled trial.BMJ. 2010 Apr 23;340:c1777. doi: 10.1136/bmj.c1777.

6. Sharpe M, Archard L, Banatvala J, Borysiewicz LK, Clare AW, David A, et al. Chronicfatigue syndrome: guidelines for research. Journal of the Royal Society of Medicine1991;84 (2):118–21.

7. Fukuda K, Straus SE, Hickie I, et al. The chronic fatigue syndrome: A comprehensiveapproach to its definition and study. Ann Intern Med. 1994; 121: 953‑959.

8. Carruthers BM, Jain AK, De Meirleir KL, et al. Myalgic Encephalomyelitis/chronicfatigue syndrome: Clinical working case definition, diagnostic and treatments protocols.Journal of Chronic Fatigue Syndrome. 2003; 11: 7‐115.

9. Carruthers BM, van de Sande MI, De Meirleir KL, et al. Myalgic encephalomyelitis:International Consensus Criteria. J Intern Med. 2011; 270: 327‐338.

10. IOM (Institute of Medicine). Beyond myalgic encephalomyelitis/chronic fatiguesyndrome: Redefining an illness. Washington, DC: The National Academies; 2015.

11. National Institute for Health and Clinical Excellence. Chronic fatiguesyndrome/myalgic encephalomyelitis (or encephalopathy): diagnosis and management ofCFS/ME in adults and children, 2007.http://www.nice.org.uk/guidance/CG53 Accessed September 6, 2015. London:National Institute for Health and Clinical Excellence.

12. White PD, Goldsmith KA, Johnson AL, Potts L, Walwyn R, DeCesare JC, et al.Comparison of adaptive pacing therapy, cognitive behaviour therapy, graded exercisetherapy, and specialist medical care for chronic fatigue syndrome (PACE): a randomisedtrial. The Lancet 2011;377:823‐36.

13. Kindlon T. PACE Trial ‐ 97% of the participants who didn't have apsychiatric disorder satisfied the definition of M.E. used.https://listserv.nodak.edu/cgi‐bin/wa.exe?A2=ind1106A&L=CO‐CURE&P=R2764Accessed: September 6, 2015

14. Ellen Goudsmit on PubMed Commons:http://www.ncbi.nlm.nih.gov/myncbi/ellen m.goudsmit.1/comments/

15. Green CR, Cowan P, Elk R, O'Neil KM, Rasmussen AL. National Institutes ofHealth Pathways to Prevention Workshop: advancing the research on myalgicencephalomyelitis/chronic fatigue syndrome. Ann Intern Med. 2015; 162:860‐5.

16. Haney E, Smith MEB, McDonagh M, Pappas M, Daeges M, Wasson N, et al. Diagnosticmethods for myalgic encephalomyelitis/chronic fatigue syndrome: a systematic review fora National Institutes of Health Pathways to Prevention Workshop. Ann Intern Med. 2015;162:834‐40.

17. Smith MEB, Haney E, McDonagh M, Pappas M, Daeges M, Wasson N, et al. Treatment ofmyalgic encephalomyelitis/chronic fatigue syndrome: a systematic review for a NationalInstitutes of Health Pathways to Prevention Workshop. Ann Intern Med. 2015;162:841‐50.

Reply

Variation in interventions

There is ongoing work to improve descriptions of interventions both in primary studiesand in systematic reviews (Scroter 2012, Glasziou 2008). We tried to describe theexercise programs, and differences between them in great detail. We did this both in thetables of study characteristics, and in the Characteristics of exercise interventiontable (table 2). We also contacted trial authors to check that the information wascorrect. We recognize the need for more research to explore which parts of an exercisetreatment program that are most essential or most closely correlated to an successfuloutcome, i.e. the active ingredient.

Bimodal versus Likert scoring in Wearden et al. 2010

To enable pooling of as many studies as possible in a mean differencemeta‐analyses, we used the 33‐scale results reported by Wearden. Yousuggest that the decision to use the 33‐point fatigue scores in our analysis maybias the results because there is no statistically significant difference at the11‐point data at 70 weeks. This statement suggests that there is a statisticallysignificant difference when using the 33‐point data, but if you look intoanalysis 1.2 that is not the case. At 70 week we report MD ‐2.12 (95% CI‐4.49 to 0.25) for the FINE trial, i.e. not statistically significant.

Review authors response: Diagnostic criteria As the use of variousdiagnostic criteria is often emphasised as particularly important with regard totreatment response, we performed subgroup analyses based on diagnostic criteria. Theavailability of relevant trials limits which subgroup analyses are possible to carry outin a systematic review, and hence, we were only able to contrast CDC versus Oxfordcriteria and found no evidence for a difference. We realize that the role of diagnosticcriteria as a possible moderator for the efficacy of exercise receives a lot ofattention, and would welcome trials to investigate these matters more thoroughly.

Exclusion of some data from analyses due to baseline differences

In meta‐analysis based on aggregated data the authors have to act based on theinformation that is available from original publications or additional informationobtained from the original investigators. As you state, these restrictions may besuboptimal. It is possible to adjust for baseline differences in meta‐regressiontype analyses, but this requires adjustment for dependency between the intervention andcontrol group results from the same trial. As a consequence, three variables(intervention vs control, baseline level, and trial) would have to be accounted for inthe analyses. This implies that at least 30 data points will be needed to gain somewhatstable and trustworthy estimates adjusted for baseline levels. Systematic reviews basedon individual patient data (IPD) allows for more appropriate processing andstandardization of data. We are happy to inform you that we have now received individualpatient data from most of the studies included in this review, and that the preparationof an IPD review is in progress.

Scroter S, Glasziou P, Heneghan C. Quality of descriptions of treatments: a review ofpublished randomized trials. BMJ Open 2012:2e001978doi:10.1136/bmjopen‐2012‐001978

Glasziou P, Meats M, Heneghan C, Sheppers S. What is missing in descriptions oftreatment in trials and reviews? BMJ 2008;336:1472 doi: /10.1136/bmj.39590.732037.47

Contributors

Feedback submitted by: Tom Kindlon

Response submitted by: Lillebeth Larun

Feedback submitted, 16 April 2016

Summary

Query regarding use of post‐hoc unpublished outcome data: Scoring system forthe Chalder fatigue scale, Wearden, 2010

I would like to highlight what appears to be a discrepancy within the Cochrane review[1] with respect to the analysis of data from Wearden 2010 [2,3]. Throughoutthe Cochrane review (please see details below), the impression is given that onlyprotocol defined and published data or outcomes were used for the Cochrane analysis ofthe Wearden 2010 study.

However, this does not appear to be the case and, to the best of my knowledge, insteadof using protocol defined or published data, the Cochrane analyses of fatigue for theWearden 2010 study, appears to have used an alternative unpublished set of data.

The relevant analyses of fatigue in the Cochrane review are: Analyses: 1.1, 1.2, 2.1and 2.3. Each of these analyses states that the“0,1,2,3” scoring system was used for theChalder fatigue questionnaire. This scoring system is known as the Likert scoring systemand uses a fatigue scale of 033 points.

However, to the best of my knowledge, data or analyses using this scoring system werenot proposed in the Wearden 2010 trial protocol [3], and were not included in Wearden2010 [2], and have not previously been formally (i.e. via peer review) published byWearden et al. A posthoc informal analysis using this data has been informally releasedby Wearden et al. as a BMJ Rapid Response comment [4].

In the Cochrane review, the analyses using the 0, 1, 2, 3 scoring system contradicttext within the section “Characteristics OfStudies”, in relation to Wearden 2010: Under“Outcomes”, it is stated that Chalder fatiguewas measured using the 0,0,1,1 scoring system using a scale from 011 points:“Fatigue (Fatigue Scale, FS; 11 items; each item wasscored dichotomously on a 4 point scale (0, 0, 1 or 1)”.

Wearden 2010 prespecified Chalder fatigue questionnaire scores as a primary outcome at70 weeks, and as a secondary outcome immediately after treatment at 20 weeks. Thescoring, in both cases, used the 0,0,1,1 system, with a scale of 011. This scoringsystem was described both in the trial protocol [3] and the main results paper publishedin 2010 [2].

The Likert (0, 1, 2, 3) scoring system was neither proposed in the trial protocol, norformally published, and so the Likert scores should be considered posthoc. Even if it isargued that the Chalder fatigue questionnaire (irrespective of the scoring system) waspredefined as a primary outcome measure, data using the Likert scoring system wasneither proposed nor published and so the data itself must surely be considered to beposthoc. The outcome analyses using the Likert data must be considered posthoc.

Simply changing a scoring system may, at first glance, appear not to be a significantor major adjustment, however, we do not know what difference it made because asensitivity analysis has not been published.

I cannot find any explanation within the Cochrane review that explains why the Cochranereview has replaced predefined published data with an unpublished and posthoc set ofdata.

Is it normal practice for a Cochrane metaanalysis to selectively ignore the predefinedprimary outcome data for a trial, and to selectively include and analyse posthoc data? Iwonder if some clarity could be shed on this situation?

I suggest that the posthoc data are replaced with the original published data.Otherwise, the posthoc data should be clearly labelled as such and the risk of biasanalysis amended accordingly; and an explanation should be included in the reviewexplaining why an apparently adequate predefined set of data has been replaced with anapparent novel set of posthoc data.

Also, I suggest that any discrepancies that I will outline below, should be correctedwhere necessary; Either the analyses (1.1, 1.2, 2.1 and 2.3) should be amended orthe description of the data should be amended so it is not incorrectly labelled asprotocol defined and published data with a “lowrisk” of bias.

Discrepancies within the text of the Cochrane Analysis

Please note that all page numbers used below are pertinent to the current version(version 4) of the Cochrane review in PDF format.

1. On page 28 of the Cochrane review [1], in section “Potentialbiases in the review process”, under the heading“Potential bias in the review process”, inrelation to the review in general, it is stated that: "For this updated review, wehave not collected unpublished data for our outcomes ..." However, as explainedabove, this is not the case for the Wearden 2010 fatigue data for which unpublished datahas been used in the Cochrane analysis.

2. On page 45 of the review, in section “Characteristics OfStudies”, specifically in relation to Wearden 2010 [2,3], it isstated that only protocol defined outcomes were used: "all relevantoutcomes are reported in accordance with the protocol". "Selective reporting(reporting bias)" is rated as "low risk". However, as explained above,this is not the case, because the Wearden 2010 fatigue data (used in the Cochraneanalysis) was not proposed in the protocol. If the data is posthoc, then the“low risk” category will need to berevised.

3. On page 44 of the review, in section “Characteristics OfStudies”, in relation to Wearden 2010 [2,3], under“Outcomes”, it is stated that Chalder fatiguewas measured using the 0,0,1,1 scoring system using a scale from 011 points:“Fatigue (Fatigue Scale, FS; 11 items; each item wasscored dichotomously on a 4 point scale (0, 0, 1 or 1)”. Wearden2010 did indeed use the 0,0,1,1 scoring system for the Chalder fatigue scale: Thisscoring system was proposed in the trial protocol and published with the main outcomedata in Wearden 2010. However, as explained above, this scoring system has not been usedin the Cochrane analysis.

4. If figures 2 and 3 also contain discrepancies, after any amendments to the review,then they should be amended accordingly.

There may be other related discrepancies and inaccuracies in the text that Ihaven’t noticed. I thank the Cochrane team in advance for giving thissubmission careful consideration, and for making amendments to the analysis, andproviding explanations, where appropriate. I hope you will agree that clarity,transparency and accuracy in relation to the analysis is paramount.

References: 1. Larun L, Brurberg KG, Odgaard Jensen J, Price JR.Exercise therapy for chronic fatigue syndrome. Cochrane Database Syst Rev. 2016;CD003200.

2. Wearden AJ, Dowrick C, ChewGraham C, et al. Nurse led, home based self helptreatment for patients in primary care with chronic fatigue syndrome: randomisedcontrolled trial. BMJ. 2010; 340:c1777.

3. Wearden AJ, Riste L, Dowrick C, et al. Fatigue Intervention by Nurses Evaluation– The FINE Trial. A randomised controlled trial of nurse ledselfhelp treatment for patients in primary care with chronic fatiguesyndrome: study protocol. BMC Med. 2006; 4:9.

4. Wearden AJ, Dowrick C, ChewGraham C, et al. Fatigue scale. BMJ Rapid Response. 2010.http://www.bmj.com/rapidresponse/2011/11/02/fatiguescale0 (accessed April16, 2016).

Reply

Dear Robert Courtney

Thank you for your detailed comments on the Cochrane review 'Exercise Therapy forChronic Fatigue Syndrome'. We have the greatest respect for your right to commenton and disagree with our work. We take our work as researchers extremely seriously andpublish reports that have been subject to rigorous internal and external peer review. Inthe spirit of openness, transparency and mutual respect we must politely agree todisagree.

The Chalder Fatigue Scale was used to measure fatigue. The results from the Wearden2010 trial show a statistically significant difference in favour of pragmaticrehabilitation at 20 weeks, regardless whether the results were scored bi‐modallyor on a scale from 0‐3. The effect estimate for the 70 week comparison with thescale scored bi‐modally was ‐1.00 (CI‐2.10 to +0.11; p =.076) and‐2.55 (‐4.99 to ‐0.11; p=.040) for 0123 scoring. The FINE datameasured on the 33‐point scale was published in an online rapid response after areader requested it. We therefore knew that the data existed, and requested clarifyingdetails from the authors to be able to use the estimates in our meta‐analysis. Inour unadjusted analysis the results were similar for the scale scored bi‐modallyand the scale scored from 0 to 3, i.e. a statistically significant difference in favourof rehabilitation at 20 weeks and a trend that does not reach statistical significancein favour of pragmatic rehabilitation at 70 weeks. The decision to use the 0123 scoringdid does not affect the conclusion of the review.

Regards,

Lillebeth Larun

Contributors

Feedback submitted by: Robert Courtney

Response submitted by: Lillebeth Larun

Feedback submitted, 1 May 2016

Summary

Comment: Assessment of Selective Reporting Bias in White 2011

With reference to the current Cochrane review of exercise therapy for chronic fatiguesyndrome [1], I would like to follow‐up the discussion between Tom Kindlon andLillebeth Larun that has been published in the latest version of the full reviewpublished in 2016. Kindlon submitted two comments, dated 9 September 2016, and Larunissued a response to each.

Kindlon raised the issue of the study referred to as "White 2011" in theCochrane review, commonly known as the PACE trial [2]; specifically whether or not therisk of bias for selective reporting of outcomes for the trial has been assessed andcategorised appropriately, in terms of Cochrane's guidelines and policies.

In this submission I will make reference to the current "Cochrane Handbook forSystematic Reviews of Interventions" [3], including Table 8.5.d ("Criteria forjudging risk of bias in the ‘Risk of bias’ assessmenttool"), which I will refer to as the "Cochrane guidelines".

In his submission, Kindlon said: "I don't believe that White et al. (2011)(the PACE Trial) [...] should be classed as having a low risk of bias under"Selective reporting (outcome bias)"."

In a considered response, Larun concluded: "Overall, we don’tthink that the issues you raise with regard to the risk of selective outcome bias aresuch as to suspect high risk of bias, but recognize that you may reach differentconclusions than us."

Larun's response to the concerns raised by Kindlon has left me unsure aboutwhether Cochrane's guidelines have been applied appropriately in this case, so Iwould like to discuss some of the finer details.

Pre‐Planned Analysis

I note that the PACE trial's protocol was submitted for publication in 2006 andpublished in 2007 [4], which was after the trial had commenced in 2005 [2]. This raisesthe question of whether the protocol itself can be defined as a pre‐trial report.Cochrane's glossary of terms defines a "pre‐specified" analysis asa "Statistical analyses specified in the trial protocol; that is, planned inadvance of data collection." [5] So the Cochrane glossary states that apre‐specified analysis plan, or protocol, must be completed before datacollection has commenced.

Other sources, such as the Wiley Encyclopedia of Clinical Trials, also define apre‐planned analysis as that which has been defined before data collection hascommenced: "A primary efficacy endpoint needs to be specified before the start ofthe clinical trial." [6]

To be certain that the PACE trial's analyses were defined before data collectionhad commenced then we would need an earlier publication such as a trial register [7] ortrial identifier, both of which were created and which included definitions of primaryendpoints which were different to the trial protocol (see appendix, below, for detaileddescriptions). To my knowledge, the Cochrane review, does not discuss these issues.

Nevertheless, the Cochrane guidelines (section 8.14.2) advise using a trial protocol asa guide to determine which trial outcomes were pre‐determined: "If theprotocol is available, then outcomes in the protocol and published report can becompared".

As the protocol was published after the trial had commenced, it seems certain that anysubsequent (i.e. after the protocol had been published) changes to methodology were madeafter data collection had commenced and were therefore not pre‐specified.

Statistical Analysis Plan

Larun states that various changes from the protocol were "made as part of thedetailed statistical analysis plan (itself published in full), which had been promisedin the original protocol." The protocol did indeed refer to a statistical analysisplan, but the protocol wording suggests to me that no changes from the protocol wereplanned, but that the statistical analysis plan would simply flesh out the protocol:"A full Analysis Strategy will be developed, independently of looking at the trialdatabase, and before undertaking any analysis. This paper [i.e. the protocol] summarisesthe analysis plan." There was no suggestion that there would be wholesale changesto primary, secondary or recovery outcomes. But, in any case, even if theinvestigators' initial intentions had been to make changes after data collectionhad started, the result would still not be a pre‐specified analysis according tothe Cochrane glossary of terms [5].

The statistical analysis plan was submitted for publication in 2012 and published in2013 [8] after the main trial results had been published in 2011 [2], and long after thetrial had commenced in 2005, so the statistical analysis plan cannot reasonably beconsidered to be a priori. Indeed the statistical analysis report itself confirms thatthe analysis was finalised or approved towards the end of data collection in 2010:"These planned analyses were written with a view to publication and are reproducedalmost as they were approved by the Trial Steering Committee (Version 1.2 dated 2 May2010) prior to database lock."

Larun states that the "changes [to the trial] were drawn up before the analysiscommenced and before examining any outcome data. In other words they werepre‐specified [...]" However, the latter assertion is not consistent withCochrane's glossary, which states that pre‐specified changes are thosedefined before data collection has commenced [5].

Investigators of an open‐label trial can potentially gain insights into a trialbefore formal analysis has been carried out. If changes to a planned methodology aremade after a trial has started and/or after data collection has commenced (whether ornot the data has been formally analysed) then it is generally accepted that this failsthe definition of a "pre‐specified" study, which is confirmed by theCochrane glossary and other sources. Otherwise trial registries and protocols could bedrawn up after all data had been collected but before the formal analysis has commenced,and still be described as pre‐planned. This would be particularly problematic inopen‐label trials such as the PACE trial.

Pre‐Planned and Unplanned Primary Endpoints

The PACE trial's protocol had proposed three primary efficacy analyses which allhad binary outcomes (i.e. a positive or negative outcome for each patient), whereas thefinal primary analyses were entirely different; they were continuous measures focused onthe differences in mean improvements between intervention groups at 52 weeks. So thechanges to the protocol were substantial. (See appendix, below, for detaileddescriptions.) The PACE trial's three a priori primary efficacy analyses were notincluded in the final published results, and have never been published and, to myknowledge, no sensitivity analysis has been published for the final published primaryanalyses.

The PACE trial's published results paper [2] confirmed the unplanned outcomeswitching, as follows: "We used continuous scores for primary outcomes to allow amore straightforward interpretation of the individual outcomes, instead of theoriginally planned composite measures (50% change or meeting a thresholdscore)."

This entirely contradicts Larun's claims that: "the trial didpre‐specify the analysis of outcomes" and: "The primary outcomes werethe same as in the original protocol".

The Cochrane guidelines give guidance that is specific to the issue of changing apre‐planned analysis for the same set of data and they describe such an action as"selective reporting of analyses using the same data". The guidelinescouldn't be more specific that changing a method for analysing the same set of datashould be considered selective reporting.

The Cochrane guidelines (8.14.1) state: "Selective reporting of analyses using thesame data: There are often several different ways in which an outcome can be analysed.For example, continuous outcomes such as blood pressure reduction might be analysed as acontinuous or dichotomous variable, with the further possibility of selecting frommultiple cut‐points."

Scoring System for Chalder Fatigue

A change from the protocol, that will have had a direct impact on the Cochraneanalysis, was the scoring system used for the Chalder fatigue scale. The PACE trialprotocol proposed two self‐report questionnaires as tools to use for the primaryendpoint analyses: one was the Chalder fatigue questionnaire, and the other was theShort Form 36 (SF‐36) physical function subscale. The scoring system for theChalder fatigue questionnaire was pre‐defined as a bimodal scoring system (i.e. ascore of 0 or 1 for each response to the 11 questions, giving a fatigue scale of0‐11). However, after data collection had commenced, the decision was to made tochange to a continuous scoring system, known as the Likert system (i.e. a score of 0 ,1,2, or 3 for each response to the 11 questions, giving a fatigue scale of 0‐33).This change was made after the PACE trial's nominal 'sister trial', knownas the FINE trial, had completed its analysis of very similar type of data using boththe bimodal and Likert scoring systems. The FINE trial investigators had found nosignificant effect for their primary endpoint when using the bimodal scoring system forChalder fatigue [9] but determined a significant effect using the Likert system in aninformal post‐hoc analysis [10]. The FINE trial has published its raw data, aspart of the PLoS One data sharing commitment, and an informal analysis has shown thatthere may potentially be other significant differences in some outcomes, when changingfrom bimodal to Likert scoring [11].

With regards to risk of selective reporting bias specifically in relation to the PACEtrial, the Cochrane review states: “Our primary interest is theprimary outcome reported in accordance with the protocol, so we do not believe thatselective reporting is a problem."

However, the Likert scoring system for the Chalder fatigue questionnaire is clearlylabelled as a secondary outcome in the PACE trial protocol, and not a primary outcome.The protocol specifically lists "Chalder Fatigue Questionnaire Likert scoring(0,1,2,3)" as a "secondary outcome" only. This contradicts the abovestatement in the review (i.e. "our primary interest is the primary outcome reportedin accordance with the protocol"), and it contradicts Larun's impliedassertion that the Likert scoring system was "pre‐specified" for use asa primary outcome measure.

So, to reiterate, the Likert scoring system, that the Cochrane review has described asa primary outcome measure, is specifically described as only a secondary measure in thePACE trial's protocol. It could not be more specific.

The change in questionnaire scoring methods is more than just a technicality, and mayhave made a significant difference to the trial's outcomes [11]. The rationale forthe change (i.e. "to more sensitively test our hypotheses of effectiveness")may or may not be justified, and the change may or may not be beneficial in terms ofbetter understanding treatment effects, but the fact remains that it was not part of apre‐specified trial plan.

Considering the issues discussed above, the analysis of the Chalder fatigue scores inthe Cochrane review should undoubtedly, in my opinion, be considered an unplannedanalysis and labelled as such.

Sensitivity Analysis

The Cochrane review focused on the mean differences between intervention groups, andwhether there was a statistically significant effect, which is the same analysis as thePACE trial's final published outcomes.

Analyses of the PACE trial data using the pre‐planned methods have not beenpublished and, to my knowledge, a sensitivity analysis for the (unplanned) finaloutcomes has neither been published in the PACE trial literature nor the Cochranereview, so it is impossible for the reader to have insight into the impact of thechanges.

In terms of what should be done when only post‐hoc data is available theCochrane guidelines (section 8.14.2) advise that a sensitivity analysis should bepublished: "It is not generally recommended to try to ‘adjustfor’ reporting bias in the main meta‐analysis. Sensitivityanalysis is a better approach to investigate the possible impact of selective outcomereporting (Hutton 2000, Williamson 2005a)."

It would be helpful if this guideline was adhered to.

Assessment of Risk of Reporting Bias

As well as those outlined above, various other important outcomes in the trial werechanged dramatically, such as the recovery analysis, which was reported in a separatepublication [12]. Also, the pre‐defined 'clinically importantdifference' was dropped, and was replaced with a 'clinically usefuldifference' which had an entirely different definition. There were too manydeviations from the protocol in the final analyses to list them all in detail here.

The Cochrane guidelines (8.14.2) state: "The assessment of risk of bias due toselective reporting of outcomes should be made for the study as a whole, rather than foreach outcome. Although it may be clear for a particular study that some specificoutcomes are subject to selective reporting while others are not, we recommend thestudy‐level approach because it is not practical to list all fully reportedoutcomes in the ‘Risk of bias’ table."

The Cochrane review currently designates the risk of reporting bias for the PACE trialas "low risk": Under the subheading "Characteristics of includedstudies" and under: "Selective reporting (reporting bias)", White 2011 isdesignated as "Low risk". This designation is repeated elsewhere in thereview, such as the "Risk of bias summary" in Figure 2.

Kindlon pointed out that the Cochrane guidelines (Table 8.5.d) set out the criteria forthe judgement of high risk of reporting bias as follows:

"Any one of the following:

  • Not all of the study’s pre‐specified primary outcomes havebeen reported;

  • One or more primary outcomes is reported using measurements, analysis methods orsubsets of the data (e.g. subscales) that were not pre‐specified;

  • One or more reported primary outcomes were not pre‐specified (unless clearjustification for their reporting is provided, such as an unexpected adverseeffect);

  • One or more outcomes of interest in the review are reported incompletely so thatthey cannot be entered in a meta‐analysis;

  • The study report fails to include results for a key outcome that would beexpected to have been reported for such a study."

I consider the trial to meet at least the first three of these requirements for a highrisk of bias. However, in the response to Kindlon, Larun says: "Overall, wedon’t think that the issues you raise with regard to the risk ofselective outcome bias are such as to suspect high risk of bias, but recognize that youmay reach different conclusions than us."

I find this claim impossible to square with the Cochrane risk of bias tool, for which,in my opinion, the PACE trial unambiguously meets at least three high risk criteria whenonly one is required to label the study as high risk.

The Cochrane guidelines advise that the bias risk of a study should be assessed bytaking into account the study as a whole; and as all of the PACE trial's mainpublished analyses, including the primary analyses and the recovery analysis, were notpre‐planned, this suggests that the Cochrane report should have labelled thetrial as having a high risk of reporting bias, according to my interpretation of theCochrane guidelines.

I request that a revaluation is carried out, with reference to the Cochraneguidelines.

Definition of a Primary Endpoint

In his submission to Cochrane, Kindlon explained that the three pre‐plannedprimary endpoint analyses were abandoned in favour of novel analyses in the final trialanalysis: "The three primary efficacy outcomes can be seen in the publishedprotocol" and "None have been reported in the pre‐specified way.

I find Larun's response to Kindlon to be confusing and unsatisfactory. As far asmy understanding goes, the response does not seem to take account of the Cochraneguidelines. Larun acknowledges that the scoring system for one of the primary outcomeassessment tools, was changed: "the scoring method of one was changed", andshe acknowledges that the trial's primary endpoint analyses were changed: "theanalysis of assessing efficacy also changed from the original protocol."

However, Larun seems to contradict this by saying: "The [final] primary outcomeswere the same as in the original protocol [...]"

Larun's latter statement contradicts the published results which state that the"originally planned" "primary outcomes" were switched: "We usedcontinuous scores for primary outcomes [...] instead of the originally planned compositemeasures (50% change or meeting a threshold score)." [2]

The primary endpoints (i.e. criteria to judge a successful outcome) were defined inprecise detail rather than simply being described as 'fatigue' and'physical function'. Instead, a specific primary endpoint efficacy analysiswas defined which included a required threshold for a positive outcome in fatigue andfunction at 52 weeks. Also, the questionnaire and scoring method was defined for eachprimary endpoint.

As the primary endpoint analyses were changed then I would argue that the primaryoutcomes were substantially changed.

A "primary efficacy endpoint" has been described as "a clinical orlaboratory outcome measured in an individual after randomization that allows one to testthe primary hypothesis and provides the means of assessing whether a therapy iseffective compared with its control." [6]

An example of "Completely defined pre‐specified primary and secondaryoutcome measures, including how and when they were assessed" is given in theConsort guidelines: "Example—“The primary endpoint withrespect to efficacy in psoriasis was the proportion of patients achieving a 75%improvement in psoriasis activity from baseline to 12 weeks as measured by the PASI[psoriasis area and severity index] Additional analyses were done on the percentagechange in PASI scores and improvement in target psoriasis lesions" [13]

In the PACE trial the primary objectives were to compare CBT and GET against SMC. Toeffectively achieve this comparison, a specific primary analysis was provided, as theprimary endpoint, to determine a successful outcome. The results for thepre‐planned primary endpoints have not been released.

Conclusion

Larun says: "The Cochrane Risk of Bias tool enables the review authors to betransparent about their judgments, but due to the subjective nature of the process itdoes not guarantee an indisputable consensus."

I accept that assessment of bias can be subjective but, as I have outlined above, theissues relating to the PACE trial seem clear‐cut, according to the Cochraneguidelines, which give very specific advice in relation to the type of the changes thatwe see here. I do not accept Larun's suggestion that this is a nuanced orsubjective evaluation. PACE seems to fail at least the first three criteria in the'high risk' category of the Cochrane risk tool for reporting bias.

The changes to the PACE trial's primary outcomes had the effect of lowering thethreshold for a positive outcome and therefore portraying the interventions in a morepositive light. A major purpose of a trial protocol is to avoid bias that canpotentially arise through selective reporting. Avoidable bias does a disservice for themedical and patient communities and I would expect Cochrane to be rigorous in pointingout potential bias, and discussing the implications of bias, labelling bias correctlyand including unbiased data where possible or including a sensitively analysis wherepossible. Indeed, this is what the Cochrane guidelines advise, and it is what the publicexpect of Cochrane. I feel that these issues have been neglected in this specificinstance, and a reader of the Cochrane review in isolation would be unaware of any ofthe issues discussed above, relating to the PACE trial.

I ask for a reassessment and revaluation of this review in relation to the PACE trialand risk of bias.

Many thanks, in advance, for your careful consideration of these issues.

‐‐‐

Appendix

PACE trial: protocol‐defined primary endpoints ‐ trial protocol [4].

Three Primary Endpoints.

"Primary outcome measures – Primary efficacy measures"

1. “The 11 item Chalder Fatigue Questionnaire measures theseverity of symptomatic fatigue, [27] and has been the most frequently used measure offatigue in most previous trials of these interventions. We will use the 0,0,1,1 itemscores to allow a possible score of between 0 and 11. A positive outcome will be a 50%reduction in fatigue score, or a score of 3 or less, this threshold having beenpreviously shown to indicate normal fatigue. [27]"

2. “The SF‐36 physical function sub‐scale [29]measures physical function, and has often been used as a primary outcome measure intrials of CBT and GET. We will count a score of 75 (out of a maximum of 100) or more, ora 50 % increase from baseline in SF‐36 sub‐scale score as a positiveoutcome. A score of 70 is about one standard deviation below the mean score (about 85,depending on the study) for the UK adult population. [51, 52]”

3. "Those participants who improve in both primary outcome measures will beregarded as overall improvers."

PACE trial: post‐hoc primary endpoints ‐ main results paper [2].

The difference between mean changes in fatigue and physical function acrossintervention groups at 52 weeks, using an effect size to assess the efficacy ofinterventions.

PACE trial: pre‐specified primary endpoints ‐Trial Registry [7]

"Endpoints/primary outcome(s)

1. The 11 item Chalder fatigue questionnaire, using categorical item scores to allow acategorical threshold measure of “abnormal”fatigue with a score of 4 having been previously shown to indicate abnormal fatigue.

2. The SF‐36 physical function sub‐scale, counting a score of 75 (out ofa maximum of 100) or more as indicating normal function."

‐‐‐

References:

1. Larun L, Brurberg KG, Odgaard‐Jensen J, Price JR. Exercise therapy forchronic fatigue syndrome. Cochrane Database Syst Rev. 2016; CD003200.

2. White PD, Goldsmith KA, Johnson AL, et al. Comparison of adaptive pacing therapy,cognitive behaviour therapy, graded exercise therapy, and specialist medical care forchronic fatigue syndrome (PACE): a randomised trial. Lancet 2011; 377:823‐36.

3. Higgins JPT, Green S (editors). Cochrane Handbook for Systematic Reviews ofInterventions. Version 5.1.0 [updated March 2011]. The Cochrane Collaboration, 2011.http://handbook.cochrane.org/front_page.htm (accessed 19 April 2016).

4. White PD, Sharpe MC, Chalder T, et al. Protocol for the PACE trial: a randomisedcontrolled trial of adaptive pacing, cognitive behaviour therapy, and graded exercise,as supplements to standardised specialist medical care versus standardised specialistmedical care alone for patients with the chronic fatigue syndrome/myalgicencephalomyelitis or encephalopathy. BMC Neurol. 2007; 7:6.

5. Glossary. Cochrane Community Archive.https://community‐archive.cochrane.org/glossary/5#term82 (accessed 20April 2016)

6. Follmann DA. Primary Efficacy Endpoint. Wiley Encyclopedia of Clinical Trials.2007.

7. Trial Registry. BioMed Central. Internet Archive.http://web.archive.org/web/20050524130106/http://www.controlled‐trials.com/mrct/trial/CHRONICFATIGUE SYNDROME/1042/40645.html (accessed 29 April 2016)

8. Walwyn R, Potts L, McCrone P, et al. A randomised trial of adaptive pacing therapy,cognitive behaviour therapy, graded exercise, and specialist medical care for chronicfatigue syndrome (PACE): statistical analysis plan. Trials 2013; 14:386.

9. Wearden AJ, Dowrick C, Chew‐Graham C, et al. Nurse led, home based self helptreatment for patients in primary care with chronic fatigue syndrome: randomisedcontrolled trial. BMJ. 2010; 340:c1777.

10. Wearden AJ, Dowrick C, Chew‐Graham C, et al. Fatigue scale. BMJ RapidResponse. 2010.http://www.bmj.com/rapid‐response/2011/11/02/fatigue‐scale‐0(accessed 21 Feb 2016).

11. Carter S. Exploring changes to PACE trial outcome measures using anonymised datafrom the FINE trial. PubMed Commons 2016.http://www.ncbi.nlm.nih.gov/pubmed/23363640#cm23363640_14248 (accessed 20 Feb2016).

12. White PD, Goldsmith K, Johnson AL, Chalder T, Sharpe M. Recovery from chronicfatigue syndrome after treatments given in the PACE trial. Psychol Med. 2013;43:2227‐35.

13. Moher D, Hopewell S, Schulz KF, et al. CONSORT 2010 explanation and elaboration:updated guidelines for reporting parallel group randomised trials. BMJ. 2010;340:c869.

‐‐‐‐‐ ‐‐‐‐‐‐‐‐‐‐

I do not have any affiliation with or involvement in any organisation with a financialinterest in the subject matter of my comment

Reply

Dear Robert Courtney

Thank you for your detailed comments on the Cochrane review 'Exercise Therapy forChronic Fatigue Syndrome'. We have the greatest respect for your right to commenton and disagree with our work. We take our work as researchers extremely seriously andpublish reports that have been subject to rigorous internal and external peer review. Inthe spirit of openness, transparency and mutual respect we must politely agree todisagree.

Cochrane reviews aim to report the review process in a transparent way, for example,are reasons for the risk of bias stated. We do not agree that Risk of Bias for the Pacetrial (White 2011) should be changed, but have presented it in a way so it is possibleto see our reasoning. We find that we have been quite careful in stating the effectestimates and the certainty of the documentation. We note that you read thisdifferently.

Regards,

Lillebeth

Contributors

Feedback submitted by: Robert Courtney

Response submitted by: Lillebeth Larun

Feedback submitted, 3 June 2016

Summary

Comment: concerns regarding the use of unplanned primary outcomes in the Cochranereview

Summary

In this submission, I will discuss the details and implications of unplanned revisionsto the Cochrane review's protocol, specifically changes to the primary outcomes. Iwill raise concerns about the clarity with which the changes to the protocol have beenexplained in the review and I will question the justification given for switching theprimary outcomes. I will compare the details of the pre‐specified primaryoutcomes with the unplanned (revised) primary outcomes. I will explore how the protocolrevisions have impacted the overall conclusions of the review, and how some reviewoutcomes have been misrepresented in the main discussions. I will also briefly discusspotential biases involved in reviewing open‐label studies that useself‐report outcomes, and how such biases may potentially have affected thereview's outcomes. Finally, I will discuss what I believe is: a lack of clarity inhow the review has discussed and portrayed outcomes, and; a lack of depth in howpotential biases have been considered and explored.

I will conclude by asking the reviewers to reassess the review, including the decisionto switch the primary outcomes, with a view to improving clarity, rigour and accuracy. Ispecifically ask the reviewers to:

1. Amend the review as per the Cochrane guidelines (i.e. "every effort should bemade to adhere to a predetermined protocol"), and revert to the pre‐plannedprimary analyses; and

2. Clearly and unambiguously explain that all but one health indicator (i.e. fatigue,physical function, overall health, pain, quality of life, depression, and anxiety, butnot sleep) demonstrated a non‐significant outcome for pooled treatment effects atfollow‐up for exercise therapy versus passive control; and

3. Include a rigorous assessment of how the potential for bias may have affectedoutcomes.

Introduction

After detailed scrutiny of the current version of the Cochrane review of exercisetherapy for chronic fatigue syndrome (version 4, dated 7 February 2016) [1], I havenoticed that the primary outcomes of the review have not been reported as per thepre‐specified review plan, but that unplanned (revised) primary analyses havebeen published in the place of the pre‐specified analyses. (By'unplanned', I refer to revisions to the methodology that were notpre‐specified in the review's protocol.) The switching of primary outcomes(from pre‐specified to unplanned analyses) is not mentioned in the maindiscussions, conclusion, or abstract, and is not explicitly explained anywhere in thereview. I had to carry out a detailed inspection of the review to understand exactlywhat had been changed.

At the very end of the full version of the review, a section titled "[d]ifferencesbetween protocol and review" explains the deviations from the protocol:

"[...] in the protocol it is stated, "where results for continuous outcomeswere presented using different scales or different versions of the same scale, we usedstandardised mean differences (SMDs)." We realise that the standardised meandifference (SMD) is much more difficult to conceptualise and interpret than the normalmean difference (MD); therefore we decided to report both MDs and SMDs in the Resultssection. In general, MDs are reported in the main Results section, whereas SMDs aresupplied under the "Sensitivity and subgroup analysis" subheading."

Although the above quote isn't explicit in referring to the primary outcomes, itexplains the nature of, and rationale for, the unplanned changes to the review'sprimary outcomes. The only reason given for changing the pre‐specified outcomeswas that "the standardised mean difference (SMD) is much more difficult toconceptualise and interpret than the normal mean difference". No evidence isprovided to support this assertion, and it appears to be an assumption about thereaders' ability to interpret outcomes.

The outcomes of the review's pre‐specified primary analyses are outlined inthe analysis section, but are only mentioned briefly (i.e. only one or two sentences areused to explain each outcome), and the pre‐specified outcomes are not discussedin the review's main discussions, abstract or conclusions. The pre‐specifiedanalyses have been relegated to the status of "sensitivity analyses", and itis not explicitly explained that these sensitivity analyses are the pre‐specifiedprimary analyses. It is easy for a reader to overlook these important outcomes and tomisunderstand their significance. I am concerned that most readers will be unaware ofthese changes to the primary outcomes and of the significance of the changes to theprotocol.

I consider the changes to have significantly altered the fundamental design, the mainoutcomes, and overall interpretation of the review.

Primary Outcomes

I would like to take this opportunity to explain the details of the changes to theprimary outcomes to the reader, to the best of my understanding. The review comparesexercise therapy with a passive control (e.g. treatment as usual), which is the focus ofthis submission. Outcomes for exercise therapy compared with other interventions (e.g.cognitive‐behavioural therapy, supportive therapy, and pacing) are also includedin the review, but are not central to the concerns of this submission and will not bediscussed further. The review uses two primary outcome measures (fatigue and adverseoutcomes) but adverse outcomes are not relevant to this submission. A primary analysisat both end of treatment (12 to 26 weeks) and at follow‐up (52 to 70 weeks) iscarried out. This submission focuses on primary analyses in relation to fatigue only,for exercise therapy versus passive control only.

The protocol defined two pre‐specified primary analyses (one at end of treatmentand one at follow‐up) that were to determine the pooled treatment effects of alleligible studies on fatigue. The two analyses were to determine a standardised meandifference (SMD) for the pooled studies.

An unplanned decision was later made to relegate these pre‐specified primaryanalyses to the status of sensitivity analyses and to replace them with two unplannedanalyses which assessed the same studies but by a different statistical method. Theunplanned analyses (1.1 and 1.2) do not provide an overall (pooled) treatment effect butprovide mean differences in a number of sub‐analyses of studies grouped togetherbased on the specific tool or scoring method used to measure fatigue.

The two pre‐specified primary analyses are published as sensitivity analysis1.19 (fatigue at end of treatment) and another analysis (fatigue at follow‐up)which has not been designated a numerical identifier. To reiterate; these two analysesprovide the pooled standardised mean difference for fatigue for all eligible studies.Analysis 1.19 was included within the comprehensive set of tables published in thereview, however, the follow‐up analysis (which demonstrated anon‐significant outcome) was (uniquely for primary outcomes) omitted from the setof tables (i.e. it was not published as a table) but was only briefly outlined under thesubheadings: "Sensitivity analysis" > "Investigatingheterogeneity" (see appendix, below, for quote). As this analysis is not mentionedelsewhere in the review, and is only mentioned in one sentence, it is easy to miss.

To clarify; the unplanned analyses assess the same studies as the pre‐specifiedanalyses, but only the pre‐specified analyses indicate the overall treatmenteffect for all eligible studies pooled together.

The outcomes of the two pre‐specified analyses, using a pooled standardised meandifference (SMD) for all eligible studies, were that exercise therapy (versus passivecontrol) at end of treatment (i.e. analysis 1.19) had a significant positive treatmenteffect (SMD: ‐0.68; 95% CI ‐1.02 to ‐0.35), whereas atfollow‐up the treatment effect was not significant (SMD: ‐0.63; 95% CI‐1.32 to 0.06).

The unplanned primary analysis 1.1 (fatigue at end of treatment) includes threeseparate sub‐analyses which all demonstrate a positive treatment effect, whereasunplanned analysis 1.2 (fatigue at follow‐up) had mixed outcomes with two out ofthree sub‐analyses demonstrating a significant treatment effect.

So, to reiterate, the pre‐specified primary analyses demonstrate that exercisetherapy (versus passive control) had a significant pooled treatment effect on fatigue atend of treatment, but no significant effect at follow‐up. Whereas the unplanned(revised) analyses demonstrate significant treatment effects at end of treatment butmixed outcomes at follow‐up.

The fact that unplanned analysis 1.2 (fatigue at follow‐up) did not consistentlydemonstrate significant treatment effects is not explained with clarity in the maindiscussions of the review. For example, the outcomes are described as follows:"Moderate‐quality evidence showed exercise therapy was more effective atreducing fatigue compared to ‘passive’ treatment or notreatment." (See the appendix, below, for more quoted examples.)

The main discussions in the review also fail to inform the reader that the pooledtreatment effect on fatigue (compared to a passive control), for all eligible studies atfollow‐up, demonstrated a lack of significance, as per the pre‐specifiedprimary analysis.

All Outcomes at Follow‐Up

Despite the limitations associated with self‐report measures [2], physicalfunction (a secondary outcome in the review) is widely considered a useful measure fordemonstrating severity of illness and functional changes in outcomes for chronic fatiguesyndrome [3,4,5]. It may be an especially helpful measure when assessing exercisetherapy because exercise therapy is designed specifically to address physical functionor tolerance to exercise, or both [6]. It seems reasonable to expect physical functionto improve after a course of exercise therapy in chronic fatigue syndrome patients, ifthe therapy is clinically beneficial. The review reports that exercise therapy (whencompared to passive control) has a positive effect on self‐report physicalfunction at end of treatment (analysis 1.5), but this effect is not sustained and therewas no significant treatment effect at follow‐up (see analysis 1.6).

There was also no significant effect on self‐perceived overall health atfollow‐up (see analysis 1.15). Indeed, if we consider all of thehealth‐related pre‐specified (primary and secondary) outcomes for thereview, for exercise therapy versus passive control, then with the exception only ofsleep, all the indicators of health (i.e. fatigue, physical function, overall health,pain, quality of life, depression, and anxiety), showed no significant treatment effectsat follow‐up. (The remaining measures were: serious adverse reactions totreatment; drop‐outs; and 'health resource use' for which a pooledeffect size was not provided but which demonstrated non‐significant differencesbetween intervention arms in all but one of the sub‐analyses.) This means thatonly sleep had a significant positive treatment outcome, at follow‐up, as per thepre‐specified health indicators, for exercise therapy versus passive control.

Put simply, apart from sleep, all the pooled analyses demonstrate that there were nosignificant health benefits from exercise therapy at follow‐up.

These outcomes present a significantly different picture to the impression given by thereview authors in their main discussions, abstract, conclusions and summaries wherein,for example, outcomes in general, including fatigue, physical function and overallhealth, are described as being broadly positive (e.g. it is stated that "patientswith CFS may generally benefit and feel less fatigued following exercise therapy"and: "Exercise therapy had a positive effect on people’s dailyphysical functioning, sleep and self‐ratings of overall health.")Furthermore, some specific erroneous information has been included in the main text tosupport the review authors' interpretation; i.e. the main discussion erroneouslydescribes both physical function and self‐rated overall health as indicating apositive treatment effect at follow‐up, when in fact the outcomes (i.e. analyses1.6 and 1.15) were not significant. The reviewers erroneously assert that: "Apositive effect of exercise therapy was observed both at end of treatment and atfollow‐up with respect to [...] physical functioning (Analysis 1.5; Analysis 1.6)and self‐perceived changes in overall health (Analysis 1.14; Analysis1.15)." (See appendix, below, for full quote.)

The non‐significant outcomes seen in all but one of the pre‐specifiedhealth indicators at follow‐up (exercise vs passive control) were not discussedor explored in the discussions of the Cochrane review. I find this omissiondisappointing because the information would help to inform patients and clinicians ofthe ongoing treatment effects that they might realistically expect from behaviouraltherapies such as exercise therapy. I believe that the review would be more robust andhelpful if it accurately highlighted and adequately explored these issues in the maindiscussions.

The health outcomes at follow‐up would currently be completely lost on a readerwho did not scrutinise the individual analyses of the review but relied upon theabstract or main discussions.

Bias Inherent in Open‐Label Studies

Another issue that I believe is not explored with careful consideration is the possibleimplications relating to a review of purely open‐label studies; i.e. thepossibility that any initial positive treatment effects broadly seen in this review atend of treatment, may entirely, or to some degree, reflect biases inherent in trialmethodologies that are unable to blind patients, therapists or trial investigators tothe treatment arm. The review itself explains that formal blinding "is notinherently possible in trials of exercise therapy" and that this "increasesrisk of bias, as instructors' and participants' knowledge of group assignationmight have influenced the true effect." The trend in this review towardsnon‐significant effects, after treatment has ended, may lend strength to aconcern that the initial self‐report treatment effects are transient and may bethe result of various inherent methodological biases in open‐label trials thatuse self‐report outcome measures [2,7,8]. Potential methodological biases inopen‐label trials using self‐report outcomes may be, for example:inadequate control conditions; self‐reporting bias; therapist allegiance; and/orunplanned changes to trial methodology [7,9].

Readers might be interested to note that, for White 2011, which was the largest trialincluded in the review, the follow‐up data used in the review was at 52 weeks[10] but further follow‐up data has also been published, at a median of 2.5 yearsafter randomisation, which demonstrated no significant differences between interventionarms for the primary outcomes [11].

Summary of Outcomes

In summary, the pre‐specified primary analyses for fatigue were to assess thepooled standardised mean differences. However, the reviewers then made a post‐hocdecision to replace these analyses for which the only rationale provided was anassumption that a standardised mean difference is supposedly "more difficult toconceptualise and interpret". When all of the eligible studies are pooled, as perthe pre‐specified plan, the pooled treatment effect at follow‐up is notsignificant. However, the promotion of the unplanned analyses has allowed the lack of asignificant pooled treatment effect at follow‐up to be overlooked and dismissedin the main analyses and discussions, to the point where the main discussions could beinterpreted to indicate that the treatment effects for fatigue were entirely positive(see appendix, below, for quotes).

I question whether this is an appropriate level of clarity compared to what is expectedfrom a Cochrane review. Cochrane reviews have a reputation of providing transparent,uncomplicated, straightforward and reliable explanations of complex and rigorousanalyses, whereas this review has: used unplanned primary outcomes without a robust orevidence‐based reason for switching outcomes; provided just one sentence toexplain the changes to the pre‐specified primary outcomes; omitted a crucialsensitivity analysis from the tables section; has not reflected the entire range ofoutcomes in the abstract, conclusions or main discussions; and has inaccuratelydescribed outcomes at follow‐up for physical function and overall health.

Justification for Switching Primary Outcomes

The reason given for switching the primary outcomes in the review is: "We realisethat the standardised mean difference (SMD) is much more difficult to conceptualise andinterpret than the normal mean difference (MD) [...]".

However, it is questionable whether the reason given for switching the primary outcomesjustifies such an unplanned fundamental change in the methodology of the review; nojustification is given as to why the reviewers believe that readers would find it easierto interpret the mean scores of a range of disparate fatigue questionnaires, in a seriesof sub‐analyses, rather than a single standardised mean difference for a pooledanalysis of eligible studies. It is not clear to me why it is assumed that a variety ofseparate fatigue scales should be easier to understand and interpret than a singlestandardised mean difference. As the changes to the protocol have had the effect ofchanging the primary outcomes at follow‐up, this means it would be desirable toprovide a well‐reasoned case to deviate from the protocol and switch the primaryoutcomes.

The claim with regards to interpretability raises the question of why standardised meandifferences are adequate for other Cochrane studies, but not this particular study.Cochrane has not adopted a policy of avoiding using standardised mean differences;instead the Cochrane guidelines (section 12.6) encourage their use [12]. So thisdecision appears to be a novel post‐hoc decision specific for this study.

The Cochrane guidelines (section 12.6.1) actually suggest that ordinary meandifferences can be difficult to interpret: "The units of such outcomes [i.e. meandifferences] may be difficult to interpret, particularly when they relate to ratingscales." [12] The guidelines (section 12.6.1) acknowledge that there may bedifficulties in interpreting standardised mean differences: "Without guidance,clinicians and patients may have little idea how to interpret results presented asSMDs." The guidelines do not favour one method over another in general, butdescribe how each may be used for specific purposes; if one wishes to provide an overalltreatment effect for studies that use different measures to measure the same construct,then the standardised mean difference is a standard tool which is used widely inCochrane reviews and other research. The guidelines suggest that "[t]here areseveral possibilities for re‐expressing [standardised means differences] in morehelpful ways".

Implications Related to Changing Trial Protocols and Outcome Switching

The unplanned changes to the review make it vulnerable to potential bias or accusationsof bias; conscious or unconscious personal or professional preferences have thepotential to affect post‐hoc decisions with respect to methodology. Even ifinvestigators are scrupulous in the rigour of their decision making, unexpected biaseshave the potential to creep into unplanned decisions, which is an issue that factorsinto the reasons why pre‐trial plans (e.g. trial registers and protocols) havebecome widespread [7], and are used for Cochrane reviews [12].

The Cochrane Guidebook for reviewers (Section: 2.1; "Rationale forprotocols") [12] explains: "Post hoc decisions made when the impact on theresults of the research is known, such as excluding selected studies from a systematicreview, are highly susceptible to bias and should be avoided."

In the same paragraph, the guidelines also state: "While every effort should bemade to adhere to a predetermined protocol, this is not always possible orappropriate."

However, it seems that, in this case, every effort was not made to adhere to theprotocol because the unplanned changes seem to be based on preference rather thennecessity, and the pre‐planned analyses have not been shown to be inferior,inadequate or inappropriate.

Conclusion

I find the changes to the protocol to be of particular concern for the followingreasons:

1. The final primary analyses are unplanned and have replaced adequate, and arguablymore helpful, pre‐specified analyses;

2. The rationale provided for the changes was neither robust nor evidence‐basedbut was based upon an assumption;

3. The changes have significantly altered the main outcomes and affected theinterpretation of the review (i.e. changed one of the two main outcomes from aninsignificant treatment effect to an inconsistent but broadly positive effect); and

4. The pre‐planned analysis for fatigue at follow‐up, has been omittedfrom the tables section of the review which, as far as I understand, is a uniqueomission for the primary outcomes.

For the sake of simplicity, rigour, and transparency, I ask the review team to reassessthe review, including the decision to switch the primary outcomes, and to:

1. Amend the review as per the guidelines in the Cochrane Guidebook quoted above (i.e."every effort should be made to adhere to a predetermined protocol"), and torevert to the pre‐planned primary analyses; and

2. Clearly and unambiguously explain that all but one health indicator (i.e. fatigue,physical function, overall health, pain, quality of life, depression, and anxiety, butnot sleep) demonstrated a non‐significant outcome for pooled treatment effects atfollow‐up for exercise therapy versus passive control; and

3. Include a rigorous assessment of how the potential for bias may have affectedoutcomes of the open‐label studies in this review, with consideration of the useof self‐report measures in open‐label studies.

‐‐‐ ‐‐‐

Appendix

Relevant Quotes from the Review

Differences between protocol and review

"[...] in the protocol it is stated, "where results for continuous outcomeswere presented using different scales or different versions of the same scale, we usedstandardised mean differences (SMDs)." We realise that the standardised meandifference (SMD) is much more difficult to conceptualise and interpret than the normalmean difference (MD); therefore we decided to report both MDs and SMDs in the Resultssection. In general, MDs are reported in the main Results section, whereas SMDs aresupplied under the "Sensitivity and subgroup analysis" subheading."

Quotes detailing the outcomes of the pre‐specified analyses:

Effects of interventions > Exercise therapy versus treatment as usual, relaxation orflexibility > Sensitivity analysis

Fatigue, End of Treatment

"At end of treatment, fatigue was measured and reported on different scales, andwe performed a sensitivity analysis in which all available studies were pooled using anSMD method. This strategy led to a pooled random‐effects estimate of ‐0.68(95% CI ‐1.02 to ‐0.35), but the analysis suffered from considerableheterogeneity (I² = 78%, P value < 0.0001; Analysis 1.19). The observedheterogeneity was caused mainly by the deviating results presented in Powell 2001.Exclusion of Powell 2001 gave rise to a pooled SMD of ‐0.46 (95% CI ‐0.63to ‐0.29) – an estimate that was not associated with heterogeneity (I²= 13%, P value 0.33)."

Fatigue, Follow‐up

"At follow‐up, the four available studies (Jason 2007; Powell 2001; Wearden2010; White 2011) measured and reported fatigue on different scales, and we performed asensitivity analysis in which all available studies were pooled using an SMD method. Thepooled SMD estimate is ‐0.63 (95% CI ‐1.32 to 0.06), but heterogeneity wasextensive (I² = 93%, P value < 0.00001)."

Quotes from main discussion sections of review re effectiveness of graded exercise(compared with passive control) on fatigue

Abstract > Authors' conclusions

"Patients with CFS may generally benefit and feel less fatigued following exercisetherapy, and no evidence suggests that exercise therapy may worsen outcomes."

Plain language summary > What does evidence from the review tell us?

"Moderate‐quality evidence showed exercise therapy was more effective atreducing fatigue compared to ‘passive’ treatment or notreatment."

 Discussion > Summary of main results

"When exercise therapy was compared with 'passive control,' fatigue wassignificantly reduced at end of treatment (Analysis 1.1)."

Quotes selectively reporting secondary outcomes

 Abstract > Authors' conclusions

"A positive effect with respect to sleep, physical function andself‐perceived general health has been observed, but no conclusions for theoutcomes of pain, quality of life, anxiety, depression, drop‐out rate and healthservice resources were possible."

Plain language summary > What does evidence from the review tell us?

"Exercise therapy had a positive effect on people’s daily physicalfunctioning, sleep and self‐ratings of overall health."

Erroneous reporting of outcomes for physical function and overall health atfollow‐up

Discussion > Summary of main results

"A positive effect of exercise therapy was observed both at end of treatment andat follow‐up with respect to sleep (Analysis 1.12; Analysis 1.13), physicalfunctioning (Analysis 1.5; Analysis 1.6) and self‐perceived changes in overallhealth (Analysis 1.14; Analysis 1.15)."

‐‐‐ ‐‐‐

References

1. Larun L, Brurberg KG, Odgaard‐Jensen J, Price JR. Exercise therapy forchronic fatigue syndrome. Cochrane Database Syst Rev. 2016; CD003200.

2. Kindlon TP. Objective measures found a lack of improvement for CBT & GET in thePACE Trial: subjective improvements may simply represent response biases or placeboeffects in this non‐blinded trial. BMJ Rapid Response 2015.http://www.bmj.com/content/350/bmj.h227/rr‐10 (accessed May 18,2016).

3. Buchwald D, Pearlman T, Umali J, Schmaling K, Katon W. Functional status in patientswith chronic fatigue syndrome, other fatiguing illnesses, and healthy individuals. Am JMed. 1996;101:364‐70.

4. Cook DB, Lange G, DeLuca J, Natelson BH. Relationship of brain MRI abnormalities andphysical functional status in chronic fatigue syndrome. Int J Neurosci.2001;107:1‐6.

5. Crawley E, Sterne JA. Association between school absence and physical function inpaediatric chronic fatigue syndrome/myalgic encephalopathy. Arch Dis Child.2009;94:752‐6.

6. Bavinton J, Darbishire L, White PD. PACE manual for therapists; graded exercisetherapy for CFS/ME. 2004. Internet.http://www.wolfson.qmul.ac.uk/images/pdfs/5.get‐therapist‐manual.pdf(accessed May 18, 2016).

7. Moher D, Hopewell S, Schulz KF, et al. CONSORT 2010 explanation and elaboration:updated guidelines for reporting parallel group randomised trials. BMJ. 2010;340:c869.

8. Wilshire CE. Re: Tackling fears about exercise is important for ME treatment,analysis indicates. BMJ Rapid Response 2015.http://www.bmj.com/content/350/bmj.h227/rr‐7 (accessed May 18,2016).

9. Van de Mortel TF. Faking it: social desirability response bias in self‐reportresearch. Australian Journal of Advanced Nursing, The. 2008;25:40.

10. White PD, Goldsmith KA, Johnson AL, et al. Comparison of adaptive pacing therapy,cognitive behaviour therapy, graded exercise therapy, and specialist medical care forchronic fatigue syndrome (PACE): a randomised trial. Lancet 2011; 377:823‐36.

11. Sharpe M, Goldsmith KA, Johnson AL, et al. Rehabilitative treatments for chronicfatigue syndrome: long‐term follow‐up from the PACE trial. LancetPsychiatry 2015; 2:1067–74.

12. Higgins JPT, Green S (editors). Cochrane Handbook for Systematic Reviews ofInterventions. Version 5.1.0 [updated March 2011]. The Cochrane Collaboration, 2011.Internet.http://handbook.cochrane.org (accessed May 12, 2016).

‐‐‐ ‐‐‐

I do not have any affiliation with or involvement in any organisation with a financialinterest in the subject matter of my comment.

Reply

Dear Robert Courtney

Thank you for your ongoing and detailed scrutiny of our review. We have the greatestrespect for your right to comment on and disagree with our work, but in the spirit ofopenness, transparency and mutual respect we must politely agree to disagree.

Presenting health statistics in a way that makes sense to the reader is a challenge.Statistical illiteracy is – according to Girgerenzer and co‐workers –common in patients, journalists, and physicians (1). With this in mind we have presentedthe results as mean difference (MD) related to the relevant measurement scales, forexample Chalder Fatigue Scale, as well as standardised mean difference (SMD). The use ofMD enables the reader to transfer the results to the relevant measurement scale directlyand judge the effect in relation to the scale. We disagree that presenting MD and SMDrather than SMD and MD is an important change, and we disagree with the claim that theanalysis based on MD and SMD are inconsistent. This has been discussed as part of thepeer‐review process. Confidence intervals are probably a better way to interpretdata that P values when borderline results are found (2). Interpreting the confidenceintervals, we find it likely that exercise with its SMD on ‐0.63 (95% CI‐1.32 to 0.06) is associated with a positive effect. Moreover, one should alsokeep in mind that the confidence interval of the SMD analysis are inflated by theinclusion of two studies that we recognize as outliers throughout our review. Absence ofstatistical significance does not directly imply that no difference exists.

All the included studies reported results after the intervention period and this is themain results. The results at different follow‐up times are presented in the text,but we have only included data available at the last search date, 9 may 2014. When thereview is updated, a new search will be conducted to find new, relevant follow up dataand new studies. As a general comment, it is often challenging to analysefollow‐up data gathered after the formal end of a trial period. There is always achance that participants may receive other treatments following the end of the trialperiod, a behaviour that will lead to contamination of the original treatment arms andchallenge the analysis.

Cochrane reviews aim to report the review process in a transparent way, which enablesthe reader to agree or disagree with the choices made. We do not agree that thepresentation of the results should be changed. We note that you read thisdifferently.

Regards,

Lillebeth Larun

1. Girgerenzer G, Gaissmaier W, Kurtz‐Milcke E, Schwartz LM, Woloshin S. HelpingDoctors and Patients Make Sense of Health Statistics. Pyschological Science in thePublic Interest, 2008;8:(2):53‐96.http://www.psychologicalscience.org/journals/pspi/pspi_8_2_article.pdf.

2. Hackshaw A and Kirkwood A. Interpreting and reporting clinical trials with resultsof borderline significance.BMJ 2011;343:d3340 doi: 10.1136/bmj.d3340

Contributors

Feedback submitted by: Robert Courtney

Response submitted by: Lillebeth Larun

Feedback submitted, 12 May 2016

Summary

Comment: A query regarding the way outcomes for physical function and overall healthhave been described in the abstract, conclusion and discussions of the review.

I would like to query the way that the outcomes for both physical function and overallhealth have been reported in the abstract, conclusion and in the main discussion sectionof the current version (version 4) of the Cochrane review by Larun et al., dated 7February 2016 [1].

The abstract, conclusion and main discussion section unambiguously indicate that therewas a positive treatment effect on both physical function and overall health, inrelation to exercise therapy compared to a passive control.

For example, with respect to exercise therapy versus passive control, the"authors' conclusions" in the abstract state without qualification that:"A positive effect with respect to sleep, physical function andself‐perceived general health has been observed[...]". Another section ofthe review ("What does evidence from the review tell us?") asserts that:"Exercise therapy had a positive effect on people’s daily physicalfunctioning, sleep and self‐ratings of overall health." The "summary ofmain results" unequivocally states that: "A positive effect of exercisetherapy was observed both at end of treatment and at follow‐up with respect tosleep (Analysis 1.12; Analysis 1.13), physical functioning (Analysis 1.5; Analysis 1.6)and self‐perceived changes in overall health (Analysis 1.14; Analysis1.15)." (Please see the appendix, below, to read these quotes in full.)

However, upon careful consideration of the relevant analyses, it seems that there werenot consistent positive treatment effects for either physical function or overall healthin relation to exercise therapy versus passive control. Instead, for both of thesevariables, there was a significant treatment effect only at end of treatment, but not atfollow‐up.

The relevant analyses are 1.5 (end of treatment) and 1.6 (follow‐up) forself‐report physical function, and 1.14 (end of treatment) and 1.15(follow‐up) for self‐report overall health.

Analysis 1.5 assessed the pooled treatment effect on physical function at end oftreatment for all eligible studies, and demonstrates a significant effect. Analysis 1.6used the same criteria but at follow‐up, and demonstrates that there was not asignificant effect for physical function at follow‐up.

Analysis 1.14 assessed the pooled treatment effect on overall health at end oftreatment for all eligible studies, and demonstrates a significant effect. Analysis 1.15used the same criteria but at follow‐up, and demonstrates that there was not asignificant effect for overall health at follow‐up.

The lack of a significant treatment effect at follow‐up is clearly illustratedby analyses 1.6 and 1.15.

These outcomes are also confirmed in the analysis section of the review where, inrelation to the difference between exercise therapy versus passive control, for physicalfunction at follow‐up, it is confirmed that: "[...] little or no differencecannot be ruled out." And for overall health at follow‐up, it is confirmedthat "the confidence interval implies inconclusive results".

I believe that these outcome are not reflected accurately in the abstract, the maindiscussions or the conclusions of the review; specifically the extracts that are quotedabove and in the appendix below. For example, the "summary of main results"specifically claims that positive treatment effects are demonstrated by analyses 1.6 and1.15, but these analyses actually demonstrate an absence of significant treatmenteffects. The discussion claims: "A positive effect of exercise therapy was observed[...] at follow‐up with respect to [...] physical functioning ([...]Analysis 1.6)and self‐perceived changes in overall health ([...]Analysis 1.15)."

It is generally understood that a "positive" treatment effect equates to asignificant effect, and I believe that the Cochrane text should reflect this, or atleast clarify that the term "positive effect" is being used to indicate a lackof significance.

It is likely that many readers will not read the full report or scrutinise eachindividual analysis but will read only the abstract, main discussions or conclusions, soI believe it is important for the discussions to carefully and accurately reflect theoutcomes of the analyses.

Cochrane has a reputation for upholding the highest standards including with respect toexplaining outcomes in accurate and straightforward language. With this in mind, Irequest that the Cochrane review team kindly review the apparent disparities describedabove and amend the text of the discussions and conclusions where appropriate, in orderto reflect the lack of a significant treatment effect for physical function and overallhealth at follow‐up with respect to exercise therapy versus passive control.

‐‐‐

Appendix

Quotes from the review:

Abstract > Authors' conclusions

"Patients with CFS may generally benefit and feel less fatigued following exercisetherapy, and no evidence suggests that exercise therapy may worsen outcomes. A positiveeffect with respect to sleep, physical function and self‐perceived general healthhas been observed, but no conclusions for the outcomes of pain, quality of life,anxiety, depression, drop‐out rate and health service resources werepossible."

What does evidence from the review tell us?

"Moderate‐quality evidence showed exercise therapy was more effective atreducing fatigue compared to ‘passive’ treatment or notreatment. Exercise therapy had a positive effect on people’s dailyphysical functioning, sleep and self‐ratings of overall health."

Summary of main results

"[...] A positive effect of exercise therapy was observed both at end of treatmentand at follow‐up with respect to sleep (Analysis 1.12; Analysis 1.13), physicalfunctioning (Analysis 1.5; Analysis 1.6) and self‐perceived changes in overallhealth (Analysis 1.14; Analysis 1.15)."

‐‐‐

Reference

1. Larun L, Brurberg KG, Odgaard‐Jensen J, Price JR. Exercise therapy forchronic fatigue syndrome. Cochrane Database Syst Rev. 2016; CD003200.

‐‐‐

I do not have any affiliation with or involvement in any organisation with a financialinterest in the subject matter of my comment.

Reply

Thank you for your ongoing and detailed scrutiny of our review. We have the greatestrespect for your right to comment on and disagree with our work, but in the spirit ofopenness, transparency and mutual respect we must (again) politely agree todisagree.

All the included studies reported results after the intervention period and this is themain result. The results at different follow‐up times are presented in the text.It can be noted that the quality of the evidence is higher for theend‐of‐treatment time point because more trials are included, and hence,we do not agree that it is wrong to give higher weight to these results in the abstract.Additionally, it is often challenging to analyse follow‐up data gathered afterthe formal end of a trial period. There is always a chance that participants may receiveother treatments following the end of the trial period, a behaviour that will lead tocontamination of the original treatment arms and challenge the analysis.

Cochrane reviews aim to report the review process in a transparent way, which enablesthe reader to agree or disagree with the choices made. We do not agree that thepresentation of the results should be changed. We note that you read thisdifferently.

Contributors

Feedback submitted by: Robert Courtney

Response submitted by: Lillebeth Larun

Feedback submitted, 16 June 2017

Summary

Comment: I'm concerned regarding your conclusion that no evidence suggests thatexercise therapy may worsen outcome, as you have stated that no conclusions werepossible for the drop‐out rate.

Whilst I appreciate that you are unable to draw conclusions about drop‐out ratesdue to insufficient data, is it perhaps potentially misleading or ambiguous to summarisethat in general patients may benefit from GET with there being no evidence for symptomsworsening, when there are a researchers that support the claim that CBT/GET isdetrimental to the long term prognosis of patients with ME/CFS. Without assessment ofdata concerning those whom have dropped out (those most likely to experience worseningsymptoms) the conclusions you have stated could prove harmful if taken as encouragementfor GPs to place their patients on GET regimes.

I do not question your analysis of the data, but rather I am concerned with the way inwhich you have expressed your findings.

Reply

Thank you for your interest in the review and your comment.

In our systematic review, we aim to summarise the effect estimates associated with theuse of exercise therapy for patients diagnosed with chronic fatigue syndrome CFS/ME. Wedecided to rely on data from randomised controlled trials (RCT), as RCTs provide muchmore robust data than for example anecdotal evidence. We held serious adverse reactions(SAR) and serious adverse events (SAE) as our primary outcome, whereas thedrop‐out rate was added as a secondary outcome.

Systematic reviews based on aggregated data dependent on the data reported in theincluded trials. One trial reported that SARs and SAEs were rare in both groups,suggesting that the difference between the groups is small when measured in absoluteterms. Analysis of drop‐out rates did not reveal statistical differences betweenthe groups, and we cannot conclude that exercise is associated with higherdrop‐out rates. Even if we had seen differences between the groups, however,drop‐out rates must be interpreted with caution. It is important to be aware thatdrop‐out is not a direct measure of harm. There might be several reasons patientsdrop out, and some of these reasons are not expected to distribute equally between thegroups. Harm is one possible reason for drop‐out, but patients may also withdrawbecause they are unhappy with the randomisation (preconceptions), because they feelbetter or because they don’t experience the expected level ofimprovement etc.

Systematic reviews aim to bring the best evidence to the clinical encounter, but shareddecision making includes patient preferences and clinical expertise when a treatmentplan is decided upon.

Contributors

Feedback submitted by: Richard Gardner

Response submitted by: Lillebeth Larun

Feedback submitted, 18 October 2018

Summary

It has been raised by others that the Cochrane review erroneously places ME/CFS in itsmental health category. The response provided by Cochrane when this issue was raiseddoes not inspire confidence in their knowledge/advice on the disease. ME/CFS was thesubject of a comprehensive literature review carried out by the USA National Academy,published in 2015 this report categorically determined that the disease ME/CFS is not amental health disorder. The large volume of biomedical research findings of a wide rangeof organic abnormalities is also at odds with a mental health disorder. Further more theWorld Health Organisation has categorised ME/CFS as a neurological disease. The NationalCentre for Neuroimmune and Emerging Diseases has patented a blood test for the diseaseand is in the early stages of validating it. It would be much appreciated if Cochranewould categorise ME/CFS in the appropriate group i.e. along with other neurologicaldiseases such as Parkinson's, Huntington's, multiple sclerosis etc. Delightedto see that the latest review has been suspended and look forward to its replacementwith a review that bases its findings on OBJECTIVE outcome data.

Reply

Many thanks for your comment and for noting recent categorisations of Chronic fatiguesyndrome (CFS)/myalgic encephalomyelitis (or encephalopathy) (ME). Feedback on reviewsis normally dealt with by the relevant review author, but in this case as your queryrelates more to an organisational management issue, we are responding on behalf of theCochrane Common Mental Disorders (CMD) Review Group and the Cochrane Editor inChief.

We value your observations about the placement of CFS/ME reviews in The CochraneLibrary. We want our evidence to properly support those with lived experience of CFS/MEand to ensure that the CFS/ME community have confidence in our portfolio of reviews. Weare also aware that the hosting of this topic by the Cochrane CMD Review Group has beenantagonistic to some in the CFS/ME community.

Cochrane has recently created eight new Networks of Cochrane Review Groups (CRGs). Theformation of these networks provides a timely opportunity to review the scope of allCRGs and to consider changes where appropriate. In response to concerns raised bymembers of the CFS/ME community, Cochrane has been considering repositioning theeditorial oversight of CFS/ME reviews. The Cochrane CMD Review Group currently sitswithin the Brain, Nerves and Mind (BNM) Network. In the future, reviews on this topicmight sit with another Cochrane Review Group within the BNM Network, or they mighttransfer to another Network altogether, such as the Long Term Conditions and Ageing 2Network. Please be reassured that this is currently under consideration and a decisionis anticipated before the end of 2018.

We would also like to refer you to the recent published note for the latest informationabout the status of this particular review ‘This review is subject to anongoing process of review and revision following the submission of a formal complaint tothe Editor in Chief. Cochrane considers all feedback and complaints carefully, andrevises or updates reviews when it is appropriate. The review author team have advisedus that a resubmission of this review is imminent. A decision on the status of thisreview will be made once this resubmission has been through editorial process, which weanticipate will be towards the end of November 2018’.

Contributors

Feedback submitted by: Adrienne Wooding

Response: Peter Coventry and Jessica Hendon

Peter Coventry is the Feedback Editor of the CMD Review Group and Jessica Hendon is theManaging Editor of the CMD Review Group. No other conflicts of interest declared.

Feedback submitted, 5 November 2018

Summary

A few questions about where the disease ME/CFS will be placed by Cochrane in thefuture. If Cochrane moves myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS)into the Long term conditions and Aging Network, will it be moved into the Metabolic andEndocrine Disorders review group within this network? I'm making this assumption,based on the metabolic abnormalities, found in people with ME/CFS, when objectivemetabolic exercise tests, are carried out, as per "Cardiopulmonary Exercise TestMethodology for Assessing Exertion Intolerance in Myalgic Encephalomyelitis/ChronicFatigue Syndrome" ‐https://www.ncbi.nlm.nih.gov/pubmed/30234078? If, in the alternative, Cochranedecides that ME/CFS is to remain in the Brain, Nerves and Mind (BNM) Network, will it bemoved into a separate group of its own? As whilst the disease fits within the BNMNetwork, it doesn't fit into any of the listed Cochrane Review Groups. The closestfit is probably the Multiple Sclerosis and rare diseases of the CNS? However, given thatME/CFS is thought to be more prevalent that multiple sclerosis and is not rare, itdoesn't really fit into this group. Will a new Cochrane Review Group, be made forME/CFS, that is in line with the published biomedical and physiological findings?

Reply

Many thanks for your follow‐on comments related to Cochrane’sdecision to consider repositioning its chronic fatigue syndrome (CFS)/myalgicencephalomyelitis (or encephalopathy) (ME) reviews. The repositioning of the editorialoversight of CFS/ME reviews is ongoing. Your feedback has been forwarded to the CochraneEditor in Chief so that it can be considered as part of this process.

Contributors

Feedback submitted by: Adrienne Wooding

Response: Jessica Hendon (Managing Editor of the Cochrane Common Mental DisordersReview Group)

Feedback submitted, 2 December 2018

Summary

Recently I have published a reanalysis of this Cochrane review. Unfortunately there aremany problems with the review and the trials in it. For example, P‐Hacking,extensive endpoint changes, overlap in entry/recovery criteria, selecting patients whodon't have the disease, ignoring null effects, relying on subjective outcomes inunblinded trials and ignoring the absence of objective improvement. The reanalyses whichlooked at the objective outcomes showed that graded exercise therapy is not an effectivetreatment for ME/CFS. The studies in the review do not provide any evidence that gradedexercise therapy is safe, on the other hand, patient evidence and the literature showthat it is not safe.

The open access reanalysis can be read here:https://journals.sagepub.com/doi/full/10.1177/2055102918805187

Reply

Many thanks for your feedback on this review. Cochrane recognises the importance of thereview and is committed to providing a high quality review that reflects the bestcurrent evidence to inform decisions. The Editor in Chief is currently holdingdiscussions with colleagues and the author team to determine a series of steps that willlead to a full update of this review. Your feedback will be considered as part of thisprocess so that it can inform future versions of the review. These discussions will beconcluded as soon as possible.

Contributors

Feedback submitted by: Mark Vink

Response: Jessica Hendon (Managing Editor of the Cochrane Common Mental DisordersReview Group)

Acknowledgements

We would like to thank Peter White and Paul Glasziou for advice and additional informationprovided. We would also like to thank Kathy Fulcher, Richard Bentall, Alison Wearden, KarenWallman and Rona Moss‐Morris for providing additional information from trials inwhich they were involved, as well as the CCDAN editorial base for providing support andadvice and Sarah Dawson for conducting the searches. In addition, we would like to thankJane Dennis, Ingvild Kirkehei, Hugh McGuire and Melissa Edmonds for their valuablecontributions, and Elisabet Hafstad for assistance with the search.

Appendices

Appendix 1. Search strategy—CCDANCTR‐References

CCDANCTR‐References Register

(fatigue* or asthenia or “musculardisorder*” or neurasthenia* or“infectious mononucleos*” or“myalgic encephalomyelit*” or“royal free disease*” or lassitude or“muscular weakness*” or“akureyri disease” or“atypical poliomyelitis” or CFIDS or CFS or(chronic and mononucleos*) or “epidemicneuromyasthenia” or “icelanddisease” or “post infectiousencephalomyelitis” or PVFS or tiredness or adynamia or legastheniaor (perspective and asthenia) or neurataxia or (“musclestrength” and loss) or “muscle*weak*” or “weak*muscle*” or (muscular and insufficiency) or (neuromuscular andfatigue))

and

exercise or “physical fitness” or"physical education” or “physicalcondition*” or “physicaltrain*” or “physicalmobility” or “physicalactiv*” or “physicalexertion” or “physicaleffort*” or (breathing and (therap* or exercise*)) or(respiration and therap*) or “gi gong” orgigong or *kung or tai or thai or taiji or taijiquan or taichi or walking or yoga orrelaxation* or gymnastics or calisthenics or aerobic or danc* or jumping orhopping or running or jogging or ambulat* or “musclestrengthening” or (muscular and (strength or resistance)) or((weight or weights) and lifting) or weightlifting or “powerlifting” or “weighttrain*” or pilates or stretching or plyometric* or“cardiopulmonary conditioning” or“motion therap*” or“neuromuscular facilitation*” or“movement therap*” or ((recreation oractivity) and therap*) or “isometrictraining” or climbing or cycling or bicycle* or“lifting effort*” or swim* or (trainingand (technical or course or program*)) or writing or kinesi* or gardening ormulticonvergent)

Appendix 2. Other search strategies

SPORTDiscus (EBSCOHost)

1. exp Exercise/

2. exp Exercise Therapy/

3. exp Exercise Movement Techniques/

4. Physical Fitness/

5. exp "Physical Education and Training"/

6. (exercise$ or exercising).tw.

7. ((breathing or respiration) adj (therap$ or exercise$)).tw.

8. (gi gong or gigong).tw.

9. relaxation$.tw.

10. ((tai adj ji) or ((tai or thai) adj chi) or taiji or taijiquan or taichi).tw.

11. walking.tw.

12. yoga.tw.

13. (physical adj (fitness or condition$ or education or training or mobility oractivit$ or exertion or effort)).tw.

14. gymnastics.tw.

15. calisthenics.tw.

16. aerobic danc$.tw.

17. danc$.tw.

18. (jumping or hopping).tw.

19. (running or jogging).tw.

20. ambulat$.tw.

21. muscle strengthening.tw.

22. (muscular adj (strength or resistance) adj training).tw.

23. ((weight$1 adj2 lifting) or weightlifting or power lifting or weighttraining).tw.

24. pilates.tw.

25. stretching.tw.

26. plyometric$.tw.

27. cardiopulmonary conditioning.tw.

28. motion therap$.tw.

29. neuromuscular facilitation$.tw.

30. movement therap$.tw.

31. ((recreation or activity) adj therap$).tw.

32. gymnastic therap$.tw.

33. isometric training.tw.

34. climbing.tw.

35. cycling.tw.

36. lifting effort$.tw.

37. swimming.tw.

38. writing.tw.

39. technical training.tw.

40. (training adj (course$ or program$)).tw.

41. (training adj (course$ or program$)).tw.

42. kinesi?therap$.tw.

43. gardening.tw.

44. multiconvergent.tw.

45. exp Sports/

46. or/1‐45

47. Fatigue Syndrome, Chronic/

48. exp Fatigue/

49. Asthenia/

50. Neurasthenia/

51. chronic fatigue$.tw.

52. fatigue syndrom$.tw.

53. infectious mononucleos$.tw.

54. postviral fatigue syndrome$.tw.

55. chronic fatigue‐fibromyalgia syndrome$.tw.

56. myalgic encephalomyelit$.tw.

57. royal free disease$.tw.

58. neurasthenic neuroses.tw.

59. akureyri disease.tw.

60. atypical poliomyelitis.tw.

61. benign myalgic encephalomyelitis.tw.

62. (CFIDS or CFS).tw.

63. (chronic adj4 mononucleos$).tw.

64. epidemic neuromyasthenia.tw.

65. iceland disease.tw.

66. post infectious encephalomyelitis.tw.

67. PVFS.tw.

68. (perspective adj4 asthenia).tw.

69. neurasthenic syndrome$.tw.

70. neurataxia.tw.

71. neuroasthenia.tw.

72. (neuromuscular adj6 fatigue).tw.

73. or/47‐72

74. randomized controlled trial.pt.

75. controlled clinical trial.pt.

76. randomi#ed.ab.

77. placebo$.ab.

78. randomly.ab.

79. trial.ab.

80. (clinic$ adj3 (trial$ or study or studies$)).ti,ab.

81. (control$ or prospectiv$ or volunteer$).ti,ab.

82. ((singl$ or doubl$ or tripl$) adj (blind$ or mask$or dummy)).ti,ab.

83. or/74‐82

84. (animals not (humans and animals)).sh.

85. 83 not 84

95. 46 and 73 and 85

Cochrane Central Register of Controlled Trials (CENTRAL)

#1 MeSH descriptor Exercise

#2 MeSH descriptor Exercise Therapy

#3 MeSH descriptor Exercise Movement Techniques

#4 MeSH descriptor Physical Fitness

#5 MeSH descriptor Physical Education and Training

#6 exercis*

#7 breathing NEAR/2 (therap* or exercis*)

#8 respiration NEAR/2 (therap* or exercis*)

#9 (gi gong or gigong)

#10 relaxation*

#11 tai or thai or taiji or taijiquan or taichi

#12 walking

#13 yoga

#14 (physical NEAR/2 (fitness or condition* or education or training or mobility oractivit* or exertion or effort))

#15 gymnastics

#16 calisthenics

#17 aerobic*

#18 danc*

#19 jumping or hopping

#20 ambulat*

#21 muscle strengthening

#22 (muscular NEAR/2 (strength or resistance))

#23 (weight or weights) NEAR/2 lift*

#24 weightlifting or power lifting or weight training

#25 (Pilates or stretching or plyometric* or cardiopulmonary conditioning or motiontherap* or neuromuscular facilitation* or movement therap* or gymnastictherap* or isometric training or climbing or cycling or lifting effort* orswimming or writing) #26 ((recreation or activity) NEAR/2 therap*)

#27 technical training

#28 (training NEAR/2 (course* or program*))

#29 (training adj (course* or program*))

#30 kinesi*

#31 gardening

#32 multiconvergent

#33 MeSH descriptor Sports explode all trees

#34 (#1 OR #2 OR #3 OR #4 OR #5 OR #6 OR #7 OR #8 OR #9 OR #10 OR #11 OR #12 OR #13 OR#14 OR #15 OR #16 OR #17 OR #18 OR #19 OR #20 OR #21 OR #22 OR #23 OR #24 OR #25 OR #26 OR#27 OR #28 OR #29 OR #30 OR #31 OR #32 OR #33)

#35 MeSH descriptor Fatigue Syndrome, Chronic

#36 MeSH descriptor Fatigue

#37 MeSH descriptor Asthenia

#38 MeSH descriptor Neurasthenia

#39 chronic fatigue*

#40 fatigue syndrom*

#41 infectious mononucleos*

#42 postviral fatigue syndrome*

#43 chronic fatigue‐fibromyalgia syndrome*

#44 myalgic encephalomyelit*

#45 royal free disease*

#46 neurasthenic neuroses

#47 akureyri disease

#48 atypical poliomyelitis

#49 benign myalgic encephalomyelitis

#50 CFIDS or CFS

#51 chronic NEAR/5 mononucleos*

#52 epidemic neuromyasthenia

#53 iceland disease

#54 post infectious encephalomyelitis

#55 PVFS

#56 perspective NEAR/5 asthenia

#57 neurasthenic syndrome*

#58 neurataxia

#59 neuroasthenia

#60 neuromuscular NEAR/6 fatigue

#61 (#35 OR #36 OR #37 OR #38 OR #39 OR #40 OR #41 OR #42 OR #43 OR #44 OR #45 OR #46 OR#47 OR #48 OR #49 OR #50 OR #51 OR #52 OR #53 OR #54 OR #55 OR #56 OR #57 OR #58 OR #59 OR#60)

#62 (#34 AND #61)

International Trial Registers

World Health Organization International Clinical Trials Portal available athttp://apps.who.int/trialsearch/,incorporating the following International trials registers/registries.

  • Australian New Zealand Clinical Trials Registry

  • ClinicalTrials.gov

  • EU Clinical Trials Register (EU‐CTR)

  • International Standard Randomised Controlled Trial Number (ISRCTN)

  • Brazilian Clinical Trials Registry (ReBec)

  • Chinese Clinical Trial Registry

  • Clinical Trials Registry—India

  • Clinical Research Information Service—Republic of Korea

  • Cuban Public Registry of Clinical Trials

  • German Clinical Trials Register

  • Iranian Registry of Clinical Trials

  • Japan Primary Registries Network

  • Pan African Clinical Trial Registry

  • Sri Lanka Clinical Trials Registry

  • The Netherlands National Trial Register

  • Thai Clinical Trials Register (TCTR)

Data and analyses

Comparison 1.

Exercise therapy versus treatment as usual, relaxation or flexibility

Outcome or subgroup titleNo. of studiesNo. of participantsStatistical methodEffect size
1 Fatigue (end of treatment)7Mean Difference (IV, Random, 95% CI)Subtotals only
1.1 Fatigue Scale, FS (11 items/0 to 11 points)1148Mean Difference (IV, Random, 95% CI)‐6.06 [‐6.95, ‐5.17]
1.2 Fatigue Scale, FS (11 items/0 to 33 points)3540Mean Difference (IV, Random, 95% CI)‐2.82 [‐4.07, ‐1.57]
1.3 Fatigue Scale, FS (14 items/0 to 42 points)3152Mean Difference (IV, Random, 95% CI)‐6.80 [‐10.31, ‐3.28]
2 Fatigue (follow‐up)4Mean Difference (IV, Random, 95% CI)Subtotals only
2.1 Fatigue Scale, FS (11 items/0 to 11 points)1148Mean Difference (IV, Random, 95% CI)‐7.13 [‐7.97, ‐6.29]
2.2 Fatigue Scale, FS (11 items/0 to 33 points)2472Mean Difference (IV, Random, 95% CI)‐2.87 [‐4.18, ‐1.55]
2.3 Fatigue Severity Scale, FSS (9 items/1 to 7 points)150Mean Difference (IV, Random, 95% CI)0.15 [‐0.55, 0.85]
3 Participants with serious adversereactions1Risk Ratio (M‐H, Random, 95% CI)Totals not selected
4 Pain (follow‐up)1Mean Difference (IV, Random, 95% CI)Totals not selected
4.1 Brief Pain Inventory, pain severity subscale (0 to 10 points)1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
4.2 Brief Pain Inventory, pain interference subscale (0 to 10 points)1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
5 Physical functioning (end oftreatment)5Mean Difference (IV, Random, 95% CI)Subtotals only
5.1 SF‐36, physical functioning subscale (0 to 100 points)5725Mean Difference (IV, Random, 95% CI)‐13.10 [‐24.22, ‐1.98]
6 Physical functioning(follow‐up)3Mean Difference (IV, Random, 95% CI)Subtotals only
6.1 SF‐36, physical functioning subscale (0 to 100 points)3621Mean Difference (IV, Random, 95% CI)‐16.33 [‐36.74, 4.08]
7 Quality of life (follow‐up)1Mean Difference (IV, Random, 95% CI)Totals not selected
7.1 Quality of Life Scale (16 to 112 points)1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
8 Depression (end of treatment)5Mean Difference (IV, Random, 95% CI)Subtotals only
8.1 HADS, depression score (7 items/21 points)5504Mean Difference (IV, Random, 95% CI)‐1.63 [‐3.50, 0.23]
9 Depression (follow‐up)4Mean Difference (IV, Random, 95% CI)Subtotals only
9.1 Beck Depression Inventory (0 to 63 points)145Mean Difference (IV, Random, 95% CI)3.44 [‐1.00, 9.88]
9.2 HADS, depression subscale (0 to 21 points)3609Mean Difference (IV, Random, 95% CI)‐2.26 [‐5.09, 0.56]
10 Anxiety (end of treatment)3Mean Difference (IV, Random, 95% CI)Subtotals only
10.1 HADS, anxiety score (0 to 21 points)3387Mean Difference (IV, Random, 95% CI)‐1.48 [‐3.58, 0.61]
11 Anxiety (follow‐up)4Mean Difference (IV, Random, 95% CI)Subtotals only
11.1 Beck Anxiety Inventory (0 to 63 points)145Mean Difference (IV, Random, 95% CI)0.70 [‐4.52, 5.92]
11.2 HADS, anxiety score (0 to 21 points)3607Mean Difference (IV, Random, 95% CI)‐1.01 [‐2.75, 0.74]
12 Sleep (end of treatment)2Mean Difference (IV, Random, 95% CI)Subtotals only
12.1 Jenkins Sleep Scale (0 to 20 points)2323Mean Difference (IV, Random, 95% CI)‐1.49 [‐2.95, ‐0.02]
13 Sleep (follow‐up)3Mean Difference (IV, Random, 95% CI)Subtotals only
13.1 Jenkins Sleep Scale (0 to 20 points)3610Mean Difference (IV, Random, 95% CI)‐2.04 [‐3.84, ‐0.23]
14 Self‐perceived changes in overallhealth (end of treatment)4489Risk Ratio (M‐H, Random, 95% CI)1.83 [1.39, 2.40]
15 Self‐perceived changes in overallhealth (follow‐up)3518Risk Ratio (M‐H, Random, 95% CI)1.88 [0.76, 4.64]
16 Health resource use (follow‐up)[Mean no. of contacts]1Mean Difference (IV, Random, 95% CI)Totals not selected
16.1 Primary care1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
16.2 Other doctor1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
16.3 Healthcare professional1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
16.4 Inpatient1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
16.5 Accident and emergency1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
16.6 Other health/social services1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
16.7 Complementary health care1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
16.8 Standardised medical care1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
17 Health resource use (follow‐up) [No.of users]1Risk Ratio (M‐H, Random, 95% CI)Totals not selected
17.1 Primary care1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
17.2 Other doctor1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
17.3 Healthcare professional1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
17.4 Inpatient1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
17.5 Accident and emergency1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
17.6 Medication1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
17.7 Complementary health care1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
17.8 Other health/social services1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
17.9 Standardised medical care1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
18 Drop‐out6843Risk Ratio (M‐H, Random, 95% CI)1.63 [0.77, 3.43]
19 Subgroup analysis for fatigue7840Std. Mean Difference (IV, Random, 95% CI)‐0.68 [‐1.02, ‐0.35]
19.1 Graded exercise therapy6779Std. Mean Difference (IV, Random, 95% CI)‐0.71 [‐1.09, ‐0.32]
19.2 Exercise with self‐pacing161Std. Mean Difference (IV, Random, 95% CI)‐0.54 [‐1.05, ‐0.02]

Comparison 2.

Exercise therapy versus psychological treatment

Outcome or subgroup titleNo. of studiesNo. of participantsStatistical methodEffect size
1 Fatigue at end of treatment (FS; 11 items/0to 33 points)2Mean Difference (IV, Random, 95% CI)Totals not selected
1.1 CBT1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
1.2 Supportive listening1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
2 Fatigue at follow‐up (FSS; 1 to 7points)1Mean Difference (IV, Random, 95% CI)Totals not selected
2.1 CT1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
2.2 CBT1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
3 Fatigue at follow‐up (FS; 11 items/0to 33 points)2Mean Difference (IV, Random, 95% CI)Totals not selected
3.1 CBT1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
3.2 Supportive listening1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
4 Participants with serious adversereactions2Risk Ratio (M‐H, Random, 95% CI)Totals not selected
4.1 CBT1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
4.2 Suportive listening1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
5 Pain at follow‐up (BPI, pain severitysubscale; 0 to 10 points)1Mean Difference (IV, Random, 95% CI)Totals not selected
5.1 CBT1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
5.2 CT1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
6 Pain at follow‐up (BPI, paininterference subscale; 0 to 10 points)1Mean Difference (IV, Random, 95% CI)Totals not selected
6.1 CBT1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
6.2 CT1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
7 Physical functioning at end of treatment(SF‐36, physical functioning subscale; 0 to 100 points)2Mean Difference (IV, Random, 95% CI)Totals not selected
7.1 CBT1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
7.2 Supportive listening1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
8 Physical functioning at follow‐up(SF‐36, physical functioning subscale; 0 to 100 points)3Mean Difference (IV, Random, 95% CI)Subtotals only
8.1 CBT2348Mean Difference (IV, Random, 95% CI)7.92 [‐9.79, 25.63]
8.2 CT147Mean Difference (IV, Random, 95% CI)21.37 [6.61, 36.13]
8.3 Supportive listening1171Mean Difference (IV, Random, 95% CI)‐7.55 [‐15.57, 0.47]
9 Depression at end of treatment (HADSdepression score; 7 items/21 points)1Mean Difference (IV, Random, 95% CI)Totals not selected
9.1 Supportive listening1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
10 Depression at follow‐up (BDI; 0 to63 points)1Mean Difference (IV, Random, 95% CI)Totals not selected
10.1 CT1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
10.2 CBT1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
11 Depression at follow‐up (HADSdepression score; 7 items/21 points)2Mean Difference (IV, Random, 95% CI)Totals not selected
11.1 CBT1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
11.2 Supportive listening1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
12 Anxiety at end of treatment (HADS anxiety;7 items/21 points)1Mean Difference (IV, Random, 95% CI)Totals not selected
12.1 Supportive listening1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
13 Anxiety at follow‐up (BAI; 0 to 63points)1Mean Difference (IV, Random, 95% CI)Totals not selected
13.1 CT1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
13.2 CBT1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
14 Anxiety at follow‐up (HADS anxiety;7 items/21 points)2Mean Difference (IV, Random, 95% CI)Totals not selected
14.1 CBT1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
14.2 Supportive listening1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
15 Sleep at end of treatment (Jenkins SleepScale; 0 to 20 points)1Mean Difference (IV, Random, 95% CI)Totals not selected
15.1 Supportive listening1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
16 Sleep at follow‐up (Jenkins SleepScale; 0 to 20 points)2Mean Difference (IV, Random, 95% CI)Totals not selected
16.1 CBT1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
16.2 Supportive listening1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
17 Self‐perceived changes in overallhealth at end of treatment1Risk Ratio (M‐H, Random, 95% CI)Totals not selected
17.1 CBT1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
18 Self‐perceived changes in overallhealth at follow‐up2Risk Ratio (M‐H, Random, 95% CI)Subtotals only
18.1 CT150Risk Ratio (M‐H, Random, 95% CI)0.63 [0.36, 1.10]
18.2 CBT2368Risk Ratio (M‐H, Random, 95% CI)0.71 [0.33, 1.54]
19 Health resource use (follow‐up)[Mean no. of contacts]1Mean Difference (IV, Random, 95% CI)Totals not selected
19.1 Primary care1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
19.2 Other doctor1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
19.3 Healthcare professional1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
19.4 Inpatient1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
19.5 Accident and emergency1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
19.6 Other health/social services1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
19.7 Complementary health care1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
19.8 Standardised medical care1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
20 Health resource use (follow‐up) [No.of users]1Risk Ratio (M‐H, Random, 95% CI)Totals not selected
20.1 Primary care1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
20.2 Other doctor1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
20.3 Healthcare professional1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
20.4 Inpatient1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
20.5 Accident and emergency1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
20.6 Medication1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
20.7 Complementary health care1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
20.8 Other health/social services1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
20.9 Standardised medical care1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
21 Drop‐out2Risk Ratio (M‐H, Random, 95% CI)Totals not selected
21.1 CBT1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
21.2 Supportive listening1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]

Comparison 3.

Exercise therapy versus adaptive pacing

Outcome or subgroup titleNo. of studiesNo. of participantsStatistical methodEffect size
1 Fatigue1Mean Difference (IV, Random, 95% CI)Totals not selected
1.1 Fatigue Scale, FS (11 items/33 points)—end of treatment1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
1.2 Fatigue Scale, FS (11 items/33 points)—follow‐up1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
2 Participants with serious adversereactions1Risk Ratio (M‐H, Random, 95% CI)Totals not selected
3 Physical functioning1Mean Difference (IV, Random, 95% CI)Totals not selected
3.1 SF‐36, physical functioning subscale (0 to 100)—end oftreatment1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
3.2 SF‐36, physical functioning subscale (0 to100)—follow‐up1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
4 Depression1Mean Difference (IV, Random, 95% CI)Totals not selected
4.1 HADS, depression score (7 items/21 points)—follow‐up1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
5 Anxiety1Mean Difference (IV, Random, 95% CI)Totals not selected
5.1 HADS, anxiety score (0 to 21 points)—follow‐up1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
6 Sleep1Mean Difference (IV, Random, 95% CI)Totals not selected
6.1 Jenkins Sleep Scale (0 to 20 points)—follow‐up1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
7 Self‐perceived changes in overallhealth1Risk Ratio (M‐H, Random, 95% CI)Totals not selected
7.1 End of treatment1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
7.2 Follow‐up1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
8 Health resource use (follow‐up) [Meanno. of contacts]1Mean Difference (IV, Random, 95% CI)Totals not selected
8.1 Primary care1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
8.2 Other doctor1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
8.3 Healthcare professional1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
8.4 Inpatient1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
8.5 Accident and emergency1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
8.6 Other health/social services1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
8.7 Complementary health care1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
8.8 Standardised medical care1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
9 Health resource use (follow‐up) [No.of users]1Risk Ratio (M‐H, Random, 95% CI)Totals not selected
9.1 Primary care1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
9.2 Other doctor1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
9.3 Healthcare professional1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
9.4 Inpatient1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
9.5 Accident and emergency1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
9.6 Medication1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
9.7 Complementary health care1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
9.8 Other health/social services1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
9.9 Standardised medical care1Risk Ratio (M‐H, Random, 95% CI)0.0 [0.0, 0.0]
10 Drop‐out1Risk Ratio (M‐H, Random, 95% CI)Totals not selected

Comparison 4.

Exercise therapy + antidepressant placebo versus antidepressant + exerciseplacebo

Outcome or subgroup titleNo. of studiesNo. of participantsStatistical methodEffect size
1 Fatigue1Mean Difference (IV, Random, 95% CI)Totals not selected
1.1 Fatigue Scale, FS (14 items/0 to 42 points)—end of treatment1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
2 Depression1Mean Difference (IV, Random, 95% CI)Totals not selected
2.1 HADS, depression score (7 items/21 points)—end of treatment1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
3 Drop‐out1Risk Ratio (M‐H, Random, 95% CI)Totals not selected

Comparison 5.

Exercise therapy + antidepressant versus antidepressant + exercise placebo

Outcome or subgroup titleNo. of studiesNo. of participantsStatistical methodEffect size
1 Fatigue1Mean Difference (IV, Random, 95% CI)Totals not selected
1.1 Fatigue Scale, FS (14 items/0 to 42 points)—end of treatment1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
2 Depression1Mean Difference (IV, Random, 95% CI)Totals not selected
2.1 HADS, depression score (7 items/21 points)—end of treatment1Mean Difference (IV, Random, 95% CI)0.0 [0.0, 0.0]
3 Drop‐out1Risk Ratio (M‐H, Random, 95% CI)Totals not selected

What's new

DateEventDescription
8 March 2019AmendedAddition of new published note 'Cochrane’s editors andthe review author team have jointly agreed that there will be a further periodup to the end of May 2019, in which time the author team will amend the reviewto address changes aimed at improving the quality of reporting of the reviewand ensuring that the conclusions are fully defensible and valid to informhealth care decision making. The changes will also address concerns raised infeedback since the Robert Courtney complaint. The amendment will not include afull update, but a decision about this will made subsequently.'

History

Protocol first published: Issue 3, 2001 Review first published: Issue 3,2004

DateEventDescription
5 December 2018Feedback has been incorporatedFeedback has been added, along with a response from the Cochrane CommonMental Disorders (CMD) Review Group
30 November 2018AmendedAddition of new published note 'The author team has re‐submitteda revised version of this review following the complaint by Robert Courtney.The Editor in Chief and colleagues recognise that the author team has soughtto address the criticisms made by Mr Courtney but judge that further work isneeded to ensure that the review meets the quality standards required, and asa result have not approved publication of the re‐submission. The reviewis also substantially out of date and in need of updating.
Cochrane recognises the importance of this review and is committed toproviding a high quality review that reflects the best current evidence toinform decisions.
The Editor in Chief is currently holding discussions with colleagues and theauthor team to determine a series of steps that will lead to a full update ofthis review. These discussions will be concluded as soon aspossible'.
9 November 2018Feedback has been incorporatedFeedback has been added, along with a response from the Cochrane CommonMental Disorders (CMD) Review Group
2 November 2018Feedback has been incorporatedFeedback has been added, along with a response from the Cochrane CommonMental Disorders (CMD) Review Group
25 October 2018AmendedAddition of new published note 'This review is subject to an ongoingprocess of review and revision following the submission of a formal complaintto the Editor in Chief. Cochrane considers all feedback and complaintscarefully, and revises or updates reviews when it is appropriate. The reviewauthor team have advised us that a resubmission of this review is imminent. Adecision on the status of this review will be made once this resubmission hasbeen through editorial process, which we anticipate will be towards the end ofNovember 2018'.
5 October 2017Feedback has been incorporatedFeedback has been added, along with the authors' response
5 May 2017Feedback has been incorporatedFeedback has been added, along with the authors' response.
21 June 2016Feedback has been incorporatedFeedback has been added, along with the authors' response.
1 February 2016Feedback has been incorporatedFeedback has been added along with the authors' response.
20 November 2014New citation required but conclusions have not changedFour new studies have been added in this update, and the conclusionstrengthens results reported in the 2004 version of the review.
2 October 2014New search has been performedThis review has been updated with newer methodology, and new studies havebeen incorporated.
1 November 2008AmendedThis review has been converted to the new review format
25 May 2004New search has been performedThe protocol for this review has undergone post hoc alteration based onfeedback from referees. The following sections have been altered: Types ofinterventions; Search strategy; Methods of the review
8 May 2004New citation required and conclusions have changedSubstantive amendments have been made

Differences between protocol and review

Changes made to the original review are stated below.

Objectives have been changed from '(1) To systematically review all randomisedcontrolled trials of exercise therapy for adults with CFS, and (2) To investigate therelative effectiveness of exercise therapy alone or as part of a treatment plan' in the2004 version to 'The objective of this review was to determine the effects of exercisetherapy (ET) for patients with chronic fatigue syndrome (CFS) as compared with any otherintervention or control' in this update.

Comparisons have been changed from: '(1) Exercise therapy versus treatment as usual orrelaxation plus flexibility, (2) Exercise therapy versus pharmacotherapy (fluoxetine), (3)Exercise therapy alone versus exercise therapy plus pharmacotherapy (fluoxetine) and (4)Exercise therapy alone versus exercise therapy plus patient education' in the 2004version to the following in this update.

  • '"Passive control": treatment as usual/waiting‐listcontrol/relaxation/flexibility.

    • "Treatment as usual" comprises medical assessments and advice given ona naturalistic basis. "Relaxation" consists of techniques that aim toincrease muscle relaxation (e.g. autogenic training, listening to a relaxationtape). "Flexibility" includes stretches performed according to selectedexercises given.

  • Psychological therapies: cognitive‐behavioural therapy (CBT)/cognitivetreatment/supportive therapy/behavioural therapies/psychodynamic therapies.

  • Adaptive pacing therapy.

  • Pharmacological therapy (e.g. antidepressants).'

We have revised and reordered the list of secondary outcomes for clarity and have addedself‐reported changes in overall health as a new outcome, while moving adverseeffects from a secondary outcome to a primary outcome.

We have updated the methods according to recommendations provided in the 2011 version oftheCochrane Handbook for Systematic Reviews of Interventions. For the first versionof this review (2004), assessment of methodological quality was conducted according tocontemporary criteria of the handbook of The Cochrane Collaboration (Alderson 2004).The adequacy of allocation concealmentwas rated as adequate (A), unclear (B) or inadequate (C) or as not used (D), and the CCDANQuality Rating System (Moncrieff 2001) was applied.For this update, we reextracted data on risk of bias to comply with current recommendations,and we used concealment of allocation as the main quality criterion for includedstudies.

To explore possible differences between studies using different treatment strategies,control conditions and diagnostic criteria, we decided to perform post hoc subgroup analyseswhen applicable. We also performed post hoc subgroup analyses excludingPowell 2001, as the results reported in this trial seemto have introduced considerable heterogeneity into the analysis. Moreover, in the protocolit is stated, "where results for continuous outcomes were presented using differentscales or different versions of the same scale, we used standardised mean differences(SMDs)." We realise that the standardised mean difference (SMD) is much more difficultto conceptualise and interpret than the normal mean difference (MD); therefore we decided toreport both MDs and SMDs in the Results section. In general, MDs are reported in the mainResults section, whereas SMDs are supplied under the "Sensitivity and subgroupanalysis" subheading.

Planned methods not used in this review

Cluster trials

Studies often employ 'cluster randomisation' (such as randomisation byclinician or practice), but analysis and pooling of clustered data pose problems. First,study authors often fail to account for intraclass correlation in clustered studies,leading to a 'unit of analysis' error (Bland1997) whereby P values are spuriously low, confidence intervals unduly narrowand statistical significance overestimated. This causes type I errors (Bland 1997;Gulliford 1999).

No cluster RCTs were identified in this version of the review. Should such studies beidentified in future updates, we will use the following methodological approach. Whenclustering has not been accounted for in primary studies, we will present data in atable, with a (*) symbol to indicate the presence of a probable unit of analysiserror. We will seek to contact first authors of studies to obtain intraclass correlationco‐efficients for their clustered data and to adjust for this by using acceptedmethods (Gulliford 1999). When clustering isincorporated, we will present the data as if from a parallel‐group randomisedstudy, but adjusted for the clustering effect. We will additionally exclude such studiesin a sensitivity analysis.

If cluster studies are appropriately analysed by taking into account intraclasscorrelation co‐efficients and relevant data documented in the report, synthesiswith other studies will be possible using the generic inverse variance technique.

Cross‐over trials

A major concern of cross‐over trials is the potential for carry‐overeffect. This occurs when an effect (e.g. pharmacological, physiological, psychological)of treatment in the first phase is carried over to the second phase. As a consequence ofentry to the second phase, participants can differ systematically from their initialstate despite a wash‐out phase. For the same reason, cross‐over trials arenot appropriate when the condition of interest is unstable (Elbourne 2002). As both effects are very likely inCFS/ME, randomised cross‐over studies were eligible but only when data up to thepoint of first cross‐over were used. Data from the subsequent (second) period ofthe cross‐over trial were not considered for analysis.

Studies with multiple treatment groups

Multiple dose groups

Some studies may address the effects of different levels of supervision andfollow‐up with regards to the exercise intervention and the comparator (e.g.sessions for designing exercise therapy, sessions for designing exercise therapy andplanned telephone contacts, sessions for designing exercise therapy and sevenface‐to‐face treatment sessions, usual care). Should we identify trialsthat take this approach in future updates, we will adopt the following approach. Fordichotomous outcomes, we will sum up the sample sizes and the numbers of people withevents across all intervention groups. For continuous outcomes, means and standarddeviations will be combined using the methods described in Chapter 7 (Section 7.7.3.8)of theCochrane Handbook for Systematic Reviews of Interventions (Higgins 2011).

Multiple medications

Some studies may combine several interventions with one comparison group. Should weidentify trials of this nature in future updates, we will analyse the effects of eachintervention group versus placebo separately, but we will divide up the total numberof participants in the placebo group. In the case of continuous outcomes, the totalnumber of participants in the placebo group again will be divided up, but means andstandard deviations will be left unchanged (see Chapter 16, Section 16.5.4, inHiggins 2011).

Methods intended for future reviews

If future updates identify a number of studies that enable reporting at different timepoints, this should be done for example at end of treatment, at short‐termfollow‐up (zero to six months), at medium‐term follow‐up (seven to 12months) and at long‐term follow‐up (over 12 months).

Characteristics of studies

Characteristics of included studies [ordered by studyID]

MethodsRCT, 2 parallel arms
ParticipantsDiagnostic criteria: Oxford
Number of participants: N = 66
Gender: 49 (65%) female Age, mean (SD): 37.2 (10.7) years
Earlier treatment: NS
Co‐morbidity: 20 (30%) possible cases of depression (HADS): 30 (45%)on full‐dose antidepressant (n = 20) or low‐dose tricyclicantidepressants as hypnotics (n = 10)
Average illness duration: 2.7 (0.6 to 19) years
Work and employment status: 26 (395) working or studying at least parttime
Setting: secondary care (chronic fatigue clinic in a general hospital ofpsychiatry)
Country: UK
InterventionsGroup 1: exercise therapy (12 sessions) with 1 weekly supervised sessionand 5 home sessions a week, initially lasting between 5 and 15 minutes (n =33) Group 2: flexibility and relaxation (12 sessions) with 5home sessions prescribed per week (n = 33)
Outcomes
  • Changes in overall health (Global Impression Scale, score between 1and 7, where 1 = very much better, 4 = no change)

  • Anxiety and depression (Hospital Anxiety and Depression Scale,HADS)

  • Fatigue (Fatigue Scale, FS; 14‐item questionnaire)

  • Sleep (Pittsburgh Sleep Quality Index, PSQI)

  • Physical functioning (Short Form (SF)‐36)

  • Physiological assessments (maximal voluntary contraction ofquadriceps, peak oxygen consumption, lactate, heart rate)

  • Perceived exertion (Borg Scale)


Outcomes were assessed at end of treatment (12 weeks)
NotesNo long‐term follow‐up, as participants who completed theflexibility programme were invited to cross over to the exercise programmeafterwards
Risk of bias
BiasAuthors' judgementSupport for judgement
Random sequence generation (selection bias)Low riskQuote: "determined by random number tables"
Allocation concealment (selection bias)Low riskQuote: "Randomisation was achieved blindly to the psychiatrist andindependently of the exercise physiologist by placing the letter E or F in66 separate blank envelopes. These were then arranged in random orderdetermined by random number tables and opened by an independentadministrator after baseline tests as each new patient entered thestudy"
Blinding (performance bias and detection bias) of participantsand personnel?High riskNot possible to blind participants or personnel (supervisors) to treatmentallocation
Blinding (performance bias and detection bias) of outcomeassessors?High riskBlinding not possible for self‐reported measurements (e.g. FS,SF‐36)
Incomplete outcome data (attrition bias) All outcomesLow riskQuote: "We completed follow up assessments on four of the sevenpatients who dropped out of treatment and included these data in theintention to treat analysis. Patients with missing data were counted asnon­improvers"
Selective reporting (reporting bias)Unclear riskAll primary outcomes stated under Methods were reported; however, as thetrial protocol is not available, we cannot categorically state that thereview is free of selective outcome reporting
Other biasLow riskWe do not suspect other bias
MethodsRCT, 4 parallel arms
ParticipantsDiagnostic criteria: CDC 1994
Number of participants: N = 114
Gender: 95 (83.3%) female
Age: 43.8 years
Earlier treatment: NS
Co‐morbidity: 44 (39%) with a current Axis I disorder (depressionand anxiety most common). Use of antidepressant not stated
Illness duration: > 5 years
Work and employment status: 52 (46%) working or studying at least parttime, 24% unemployed, 6% retired, 25% on disability
Setting: secondary care, but recruitment from different sources
Country: USA
Interventions13 sessions every 2 weeks lasting 45 minutes
Group 1: cognitive‐behavioural therapy (CBT) aimed at showingparticipants that activity could be done without exacerbating symptoms (n =29)
Group 2: anaerobic activity therapy (ACT) focused on developingindividualised and pleasurable activities accompanied by reinforcement ofprogress (n = 29)
Group 3: cognitive therapy treatment(COG) focused on developing strategiesto better tolerance, reduce stress and symptoms and lessenself‐criticism (n = 28)
Group 4: relaxation treatment (RELAX) introducing several types ofrelaxation techniques along with expectations of skill practice (n = 28)
OutcomesSeveral outcomes are reported (˜25), among others.
  • Physical functioning (SF‐36)

  • Fatigue (Fatigue Severity Scale, FSS)

  • Depression (Back Depression Inventory, BDI‐II)

  • Anxiety (Beck Anxiety Inventory, BAI)

  • Self‐efficacy (self‐efficacy questionnaire)

  • Stress (Perceived Stress Scale, PSS)

  • Pain (Brief Pain Inventory)

  • Quality of life (Quality of Life Scale)

  • 6‐Minute walking test


Outcomes assessed at 12 months' follow‐up
NotesFidelity ratings and drop‐out reported across study arms
Risk of bias
BiasAuthors' judgementSupport for judgement
Random sequence generation (selection bias)Low riskQuote: "Random assignment was done using a random number generator instatistical software (SPSS version 12)"
Allocation concealment (selection bias)Unclear riskNot stated
Blinding (performance bias and detection bias) of participantsand personnel?High riskNot possible to blind participants or personnel (supervisors) to treatmentallocation
Blinding (performance bias and detection bias) of outcomeassessors?High riskBlinding not possible for self‐reported measurements (e.g. FSS,BPI)
Incomplete outcome data (attrition bias) All outcomesHigh riskQuote: "The average dropout rate was 25%, but it was not significantlydifferent per condition." The statistical analysis used, the bestlinear unbiased predictor, is a way to avoid taking missing data intoaccount
Selective reporting (reporting bias)Unclear riskAll primary outcomes stated under Methods were reported; however, as thetrial protocol is not available, we cannot categorically state that thereview is free of selective outcome reporting
Other biasHigh riskBaseline data differences across groups for several important parameters(e.g. physical functioning: ACT group 39.17 (15.65) and RELAX group 53.77(26.66))
MethodsRCT, 2 parallel arms
ParticipantsDiagnostic criteria: CDC 1994
Number of participants: N = 49
Gender: 34 (69%) female Age, mean (SD): 40.9 years: 36.7 (11.8)in treatment group and 45.5 (10.5) in control group
Earlier treatment: NS
Co‐morbidity, mean (SD): 14 (29%) possible or probable cases ofdepression (HADS). HADSAnxiety 6.72(3.44) in treatment group and7.17 (3.43) in control group. HADSDepression 5.70 (2.69) intreatment group and 6.70 (0.67) in control group. Use of antidepressant notstated
Illness duration, median (range): 3.1 years, 2.67 (0.6 to 20) in treatmentgroup and 5 (0.5 to 45) in control group
Work and employment status: 11 (22%) unemployed and unable to work becauseof disability Setting: specialist CFS general practice
Country: New Zealand
InterventionsGroup 1: graded exercise therapy (12 weeks), met weekly, final goal 30minutes for 5 days a week, 70% of VO2max (n =25) Group 2: standard medical care provided by a CFS specialistphysician (n = 24)
Outcomes
  • Changes in overall health (Global Impression Scale, score between 1and 7, where 1 = very much better, 4 = no change)

  • Physical function (SF‐36 physical function subscale score)

  • Fatigue (Fatigue Scale, FS)

  • Activity levels

  • Cognitive function

  • Physiological assessments (e.g. maximum aerobic capacity, HR)

  • Acceptability


Outcomes assessed at end of treatment (12 weeks). A self‐reportquestionnaire was distributed at 6 months' follow‐up and wasreturned by 16 exercise participants and 17 control participants
NotesThe exact components involved in 'treatment as usual' are notexplained
Risk of bias
BiasAuthors' judgementSupport for judgement
Random sequence generation (selection bias)Low riskQuote: "...randomised into either treatment or control conditions bymeans of a sequence of computer generated numbers placed in sealed opaqueenvelopes by an independent administrator"
Allocation concealment (selection bias)Low riskQuote: "placed in sealed opaque envelopes by an independentadministrator"
Blinding (performance bias and detection bias) of participantsand personnel?High riskNot possible to blind participants or personnel (supervisors) to treatmentallocation
Blinding (performance bias and detection bias) of outcomeassessors?High riskBlinding not possible for self‐reported measurements (e.g. FS,SF‐36)
Incomplete outcome data (attrition bias) All outcomesLow risk3 of 25 participants (12%) dropped out from exercise treatment. Reasons fordrop‐out: 1 had to return to the USA, 1 had an injured calf and 1 wasnot reached at follow‐up. 3 of 24 patients (12.5%) in control groupdid not return follow‐up questionnaire at 12 weeks. To determinewhether drop‐out affected the calculated treatment effect, studyauthors completed intention‐to‐treat analysis
Selective reporting (reporting bias)Unclear riskAll primary outcomes stated under Methods were reported; however, as thetrial protocol is not available, we cannot categorically state that thereview is free of selective outcome reporting
Other biasLow riskWe do not suspect other bias
MethodsRCT, 4 parallel arms
ParticipantsDiagnostic criteria: Oxford Number of participants: N = 148
Gender: 116 (78%) female
Age, mean: 33 years
Earlier treatment: NS
Co‐morbidity: 58 (39%) possible cases of depression (HADS), 27 (18%)used antidepressants
Illness duration: 4.3 years
Work and employment status: 50 (34%) working, 64 (43%) ondisability Setting: secondary/tertiary care
Country: UK
InterventionsGroup 1: treatment as usual (n = 34)
Group 2: exercise therapy + 2 sessions (total 3 hours, n =37) Group 3: exercise therapy + 7 telephone sessions (total 3.5hours, n = 39) Group 4: exercise therapy + 7 sessions (total 7hours, n = 38)
Sessions, whether telephone or face‐to‐face, were used toreiterate the treatment rationale and to discuss problems associated withgraded exercise
Outcomes
  • Physical functioning (SF‐36, subscale physical functioning).Clinical improvement at 1 year predetermined as a score ≥25 or an increase from baseline of ≥ 10 on the physicalfunctioning scale (score range, 10 to 30)

  • Fatigue (Fatigue Scale, FS; 11 items; scores > 3 indicateexcessive fatigue)

  • Anxiety and depression (Hospital Anxiety and Depression Scale, HADS;score range from 0 to 21 worst)

  • Sleep (Jenkins Sleep Scale, 4 items; lower scores indicate betteroutcomes; score range 0 to 20 worst)

  • Changes in overall health (Global Impression Scale; score between 1and 7, where 1 = very much better, 4 = no change)

  • Illness beliefs and experience of treatment (simplequestionnaire)


Outcomes assessed at 3 (end treatment), 6 and 12 months
NotesTreatment as usual comprised a medical assessment, advice and aninformation booklet that encouraged graded activity and positive thinkingbut gave no explanations for symptoms.
SF‐36 physical functioning subscale is reported on a 10 to 30 scale.We transformed scores from the 10 to 30 scale to the more common 0 to 100scale by using the following formula: meannew =(meanold ‐ 10) * 5 and SDnew = 5 *SDold
Risk of bias
BiasAuthors' judgementSupport for judgement
Random sequence generation (selection bias)Low riskQuote: "Randomised into four groups by means of a sequence of computergenerated random numbers...simple randomisation with stratification forscores on the hospital anxiety and depression scale, 15, using a cut off of11 to indicate clinical depression"
Allocation concealment (selection bias)Unclear riskQuote: "...in sealed numbered envelopes"
Blinding (performance bias and detection bias) of participantsand personnel?High riskNot possible for this intervention
Blinding (performance bias and detection bias) of outcomeassessors?High riskBlinding not possible for self‐reported measurements (e.g. FS,SF‐36)
Incomplete outcome data (attrition bias) All outcomesLow riskQuote: "We used an intention to treat analysis. For patients whodropped out of treatment, the last values obtained were carried forward.Complete data were obtained for all patients who completed treatment exceptfor three: two did not complete the questionnaire at three months and onedid not complete the questionnaire at one year"
Selective reporting (reporting bias)Unclear riskAll primary outcomes stated under Methods were reported; however, as thetrial protocol is not available, we cannot categorically state that thereview is free of selective outcome reporting
Other biasLow riskWe do not suspect other bias
MethodsRCT, 2 parallel arms
ParticipantsDiagnostic criteria: CDC 1994
Number of participants: N = 68
Gender: 47 (77%) female Age: 16 to 74 years (average 43.3(12.7) in the exercise group and 45.7 (12.5) in the control group)
Earlier treatment: NS
Co‐morbidity: possible depression not stated, 16 (26%) usedantidepressants
Illness duration: no initial difference between groups
Work and employment status: not stated Setting: primarycare
Country: Western Australia
InterventionsGroup 1: prescribed exercise therapy, 12 weeks (n = 32) Group2: flexibility and relaxation, 12 weeks (n = 29)
Outcomes
  • Physiological assessments (heart rate, blood pressure at rest andduring exercise, lactate and oxygen consumption)

  • Perceived exertion (Borg Scale, rating of perceived exertion(RPE))

  • Energy expenditure (Older Adult Exercise Status Inventory)

  • Fatigue (Fatigue Scale, FS; 11 items)

  • Anxiety and depression (Hospital Anxiety and Depression Scale,HADS)

  • Cognitive function (computerised version of the modified Stroop ColorWord Test)

  • Changes in overall health (Global Impression Scale, score between 1and 7, where 1 = very much better, 4 = no change)


Outcomes assessed at 12 weeks (end of treatment)
NotesSupplementary HADS data obtained from study authors for first version ofthis review
Risk of bias
BiasAuthors' judgementSupport for judgement
Random sequence generation (selection bias)Unclear riskQuote: "...patients were randomised (by an independentinvestigator)"
Allocation concealment (selection bias)Unclear riskNot adequately described
Blinding (performance bias and detection bias) of participantsand personnel?High riskNot possible to blind participants or personnel (supervisors) to treatmentallocation
Blinding (performance bias and detection bias) of outcomeassessors?High riskBlinding not possible for self‐reported measurements (e.g. FS,SF‐36)
Incomplete outcome data (attrition bias) All outcomesLow risk2 of 34 (6%) participants in the ET group withdrew: "...for reasonsnot associated with the study"
5 of 34 (15%) participants in control group withdrew: "for reasons notassociated with the study, and a further subject was excluded because herbody mass index (44 kg/m2) prevented her form participating inthe exercise test"
Selective reporting (reporting bias)Unclear riskAll primary outcomes stated under Methods were reported; however, as thetrial protocol is not available, we cannot categorically state that thereview is free of selective outcome reporting
Other biasUnclear riskBaseline data differences between groups for anxiety (7.3 in exercise groupvs 8.7 in control group) and mental fatigue (6.3 vs 5.6)
MethodsRCT, 4 parallel arms
ParticipantsDiagnostic criteria: Oxford
Number of participants: N = 136
Gender: 97 (71%) female Age, mean (SD): 38.7 (10.8) years
Earlier treatment: NS
Co‐morbidity: 46 (34%) with depressive disorder according toDSM‐III‐R criteria, use of antidepressant not stated
Illness duration: duration of fatigue, median (IQR) 28.0 (39.5) months
Work and employment status: 114 (84%) had recently changed occupation
Setting: secondary/tertiary care
Country: UK
InterventionsGroup 1: graded exercise + fluoxetine (n = 33) Group 2: gradedexercise + drug placebo, 26 weeks, preferred aerobic exercise 20 minutes atleast 3 times per week, up to 75% of participants' functional maximum(n = 34) Group 3: exercise placebo + fluoxetine (n =35) Group 4: exercise placebo + drug placebo, 26 weeks, offeredno specific advice but participants told to do what they felt capable of andto rest when the felt they needed to (n = 34)
Outcomes
  • Fatigue (Fatigue Scale, FS; 14 items; 4 or more were used as cutoffto designate caseness)

  • General health status (Medical Outcome Survey Short‐FormScales, MOS SF‐36); measure of general health status on thefollowing 6 scales (cutoff score for poor function in parentheses):physical function (< 83.3), role or occupational function (≤50), social function (≤ 40), pain (≤ 50), health perception(≤ 70) and mental health (≤ 67)

  • Anxiety or depression (Hospital Anxiety and Depression Scale, HADS;cutoff of 11 or more designated cases)

  • Psychiatric diagnoses (Clinical Interview Schedule + supplementaryquestions by psychologist)

  • Physiological assessments (grip strength and functional workcapacity)


Outcomes assessed at weeks 12 and 26 (end of treatment)
NotesGroup 4 was used as treatment as usual, as participants were given nospecific advice on exercise but were advised to exercise when they feltcapable. Supplementary HADS data were obtained from study authors for thefirst version of this review
Risk of bias
BiasAuthors' judgementSupport for judgement
Random sequence generation (selection bias)Low riskQuote: "...randomised into a treatment group by computer generatednumbers, with groups of 10 to obtain roughly equal numbers"
Allocation concealment (selection bias)Low riskQuote: "A list of subject numbers marked with the exercise group foreach number was held by the physiotherapist. Pharmacy staff dispensedmedication in accordance with the subject number assigned to eachsubject." The initial assessment was done independently: "Allpatients were medically assessed by a doctor...under the supervision of aconsultant physician"
Blinding (performance bias and detection bias) of participantsand personnel?High riskQuote: "The drug treatment was double blind. The placebo to fluoxetinewas a capsule of similar taste and appearance. The placebo to the exerciseprogramme was a review of activity diaries by the physiotherapists"
Blinding (performance bias and detection bias) of outcomeassessors?High riskBlinding not possible for self‐reported measurements (e.g. FS,SF‐36)
Incomplete outcome data (attrition bias) All outcomesHigh riskQuote: "Analysis was carried out on an intention to treat basis. Whenthere were missing data at 12 and 26 weeks, scores on the previousassessment were substituted. No data were available on 17 patients for theweek 12 assessment, functional work capacity assessments at week 0, seven atweek 12 and seven at week 26"
Large drop‐out rates in all intervention groups
Selective reporting (reporting bias)High riskIt is clear (p 488) that investigators collected data for all six subscalesof the MOS that they used (as well as measures for fatigue, depression andanxiety). Data from fatigue and depression (primary outcomes) are reportednumerically. Data from the anxiety scale are said to show 'nosignificant changes' and are not reported numerically. This is also thecase for 5 of the 6 subscales of the MOS, with the exception of healthperceptions, which is significant and favours the intervention group.
NB: Data for forced work capacity (fwc) were collected by investigators butare not reported in this review
Other biasLow riskWe do not suspect other bias
MethodsRCT, 3 parallel arms
ParticipantsDiagnostic criteria: Oxford (31% fulfilled London ME criteria)
Number of participants: N = 296
Gender: 230 (78%) female Age, mean (SD): 44.6 (11.4) years
Earlier treatment: 264 (89%) reported medication during the past 6 monthswith antidepressant (n = 160) or analgesic (n = 79)
Co‐morbidity, N (%): 53 (18) had a depression diagnosis, 160 (54)were prescribed antidepressants the last 6 months
Illness duration (M): 7 (range from 0.5 to 51.7) years
Work and employment status: not stated
Setting: primary care
Country: UK
InterventionsGroup 1: pragmatic rehabilitation, 10 sessions over an 18‐weekperiod; graded return to activity designed collaboratively by theparticipant and the therapist, also focusing on sleep patterns andrelaxation exercises to address somatic symptoms of anxiety (n = 95)
Group 2: supportive listening, 10 sessions over an 18‐week period;listening therapy in which the therapist aims to provide an empathic andvalidating environment in which patients can freely discuss theirprioritised concerns (n = 101)
Group 3: general practitioner treatment as usual; GPs were asked to managetheir cases as they saw fit, but to not refer participants for systematicpsychological therapies for CFS/ME during the 18‐week treatmentperiod (n = 100)
Outcomes
  • Physical functioning (SF‐36 physical functioning subscale,percentage score in which higher scores indicate better outcomes)

  • Fatigue (Fatigue Scale, FS; 11 items; each item was scoreddichotomously on a 4‐point scale (0, 0, 1 or 1); total scoresof 4 or more designated significant levels of fatigue. Lower scoresindicated better outcomes)

  • Anxiety and depression (Hospital Anxiety and Depression Scale (HADS),depression and anxiety scale; lower scores indicate betteroutcomes)

  • Sleep (Jenkins Sleep Scale; 4 items; lower scores indicate betteroutcomes)


Outcomes assessed at 20 weeks (end of treatment) and at 70 weeks(follow‐up)
NotesEconomic evaluation of the relative cost‐effectiveness of pragmaticrehabilitation and supportive listening when compared with treatment asusual, results of which will be reported separately
Risk of bias
BiasAuthors' judgementSupport for judgement
Random sequence generation (selection bias)Low riskQuote: "Individual patients were randomly allocated to one of thethree treatment arms using computer generated randomised permuted blocks(with randomly varying block sizes of 9, 12, 15, and 18), afterstratification on the basis of whether the patient was non‐ambulatory(used a mobility aid on most days) and whether the patient fulfilled LondonME criteria"
Allocation concealment (selection bias)Low riskQuote: "The random allocation was emailed to the trial manager, whoassigned each patient a unique study number and notified the designatednurse therapist if the patient had been allocated to a therapy arm"
Blinding (performance bias and detection bias) of participantsand personnel?High riskNot possible to blind participants or personnel (supervisors) to treatmentallocation
Blinding (performance bias and detection bias) of outcomeassessors?High riskBlinding not possible for self‐reported measurements (e.g. FS,SF‐36)
Incomplete outcome data (attrition bias) All outcomesUnclear riskNumber of drop‐outs (did not complete treatment): 18/95 (group 1),17/101 (group 2). Reasons for drop‐out: unhappy with randomisation (n= 8), lost contact (n = 8), too busy (n = 7), not benefiting or feelingworse (n = 5), nurse therapist safety concern (n = 2), misdiagnosis (n = 1),received different treatment (n = 1)
Loss to follow‐up at 20 weeks: 10/95 (group 1), 4/101 (group 2),8/100 (group 3)
Loss to follow‐up at 70 weeks: 14/95 (group 1), 11/101 (group 2),14/100 (group 3)
Selective reporting (reporting bias)Low riskAll relevant outcomes are reported in accordance with the protocol
Other biasLow riskWe do not suspect other types of bias
MethodsRCT, multi‐centre, 4 parallel arms
ParticipantsDiagnostic criteria: Oxford (56% satisfied London ME criteria)
Number of participants: N = 641
Gender: 495 (77%) female Age, mean (SD): 38 (12) years
Earlier treatment: NS
Co‐morbidity: 219 (34%) with any depressive disorder, 260 (41%) usedantidepressants
Illness duration: median 32 (IQR 16 to 68) months (GET 35 (18 to 67) andSMC 25 (15 to 57) months)
Work and employment status: mean baseline score at the work and socialadjustment scale, 27.4
Setting: secondary/tertiary care
Country: UK
InterventionsGroup 1, specialist medical care (SMC): provided by doctors with specialistexperience in CFS. All participants were given a leaflet explaining theillness and the nature of this treatment. Treatment consisted of anexplanation of chronic fatigue syndrome, generic advice such as to avoidextremes of activity and rest, specific advice on self‐help accordingto the particular approach chosen by the participant (if receiving SMCalone) and symptomatic pharmacotherapy (especially for insomnia, pain andmood, n = 160)
Group 2, adaptive pacing therapy (APT): based on the envelope theory aimedat optimum adaptation to the illness by helping the participant to plan andpace activity to reduce or avoid fatigue, achieve prioritised activities andprovide the best conditions for natural recovery. Therapeutic strategiesconsisted of identifying links between activity and fatigue by using a dailydiary, with corresponding encouragement to plan activity to avoidexacerbations, developing awareness of early warnings of exacerbation,limiting demands and stress, regularly planning rest and relaxation andalternating different types of activities, with advice not to undertakeactivities that demanded more than 70% of participants’perceived energy envelopes. Increased activities were encouraged ifparticipants felt able, and as long as they did not exacerbate symptoms (n =160)
Group 3, cognitive‐behavioural therapy (CBT): done on the basis ofthe fear avoidance theory of CFS. The aim of treatment was to change thebehavioural and cognitive factors assumed to be responsible for perpetuationof participants’ symptoms and disability. Therapeuticstrategies guided participants to address unhelpful cognitions, includingfears about symptoms or activities, by testing them through behaviouralexperiments. These experiments consisted of establishing a baseline ofactivity and rest and a regular sleep pattern, then making collaborativelyplanned gradual increases in both physical and mental activity. Participantswere helped to address social and emotional obstacles to improvement throughproblem solving (n = 161)
Group 4, graded exercise therapy (GET): done on the basis of deconditioningand exercise intolerance theories of chronic fatigue syndrome. The aim oftreatment was to help participants gradually return to appropriate physicalactivities and reverse deconditioning, thereby reducing fatigue anddisability. Therapeutic strategies consisted of establishment of a baselineof achievable exercise or physical activity, followed by a negotiated,incremental increase in the duration of time spent being physically active.Target heart rate ranges were set when necessary to avoid overexertion,which eventually aimed at 30 minutes of light exercise 5 times a week. Whenthis rate was achieved, the intensity and aerobic nature of the exercise(usually walking) were gradually increased in response to participantfeedback and with mutual planning (n = 160)
OutcomesPrimary outcomes
  • Fatigue (Fatigue Scale, FS; Likert scoring 0, 1, 2, 3; range 0 to 33;lowest score is least fatigue)

  • Physical function (Short Form‐36 (SF‐36) physicalfunction subscale version 2; range 0 to 100; highest score is bestfunction)

  • Safety outcomes (non‐serious adverse events, serious adverseevents, serious adverse reactions to trial treatments, seriousdeterioration and active withdrawals from treatment)

  • Adverse events (i.e. any clinical change, disease or disorderreported, whether or not related to treatment)


Secondary outcomes
  • Changes in overall health (Global Impression Scale, score between 1and 7, where 1 = very much better, 4 = no change)

  • Overall disability: work and social adjustment scale

  • 6‐Minute walking test (distance in meters walked)

  • Sleep (Jenkins Sleep Scale score for disturbed sleep)

  • Anxiety and depression (Hospital Anxiety and Depression Scale,HADS)

  • Number of chronic fatigue syndrome symptoms (individual symptoms ofpostexertional malaise and poor concentration or memory)

  • Use of health service resources


Outcomes assessed at 12 weeks, 24 weeks (end of treatment) and 52 weeks(follow‐up)
Notes
Risk of bias
BiasAuthors' judgementSupport for judgement
Random sequence generation (selection bias)Low riskQuote: "Participants were allocated to treatment groups through theMental Health and Neuroscience Clinical Trials Unit (London, UK) afterbaseline assessment and obtainment of consent. A database programmerundertook treatment allocation, independently of the trial team. The firstthree participants at each of the six clinics were allocated withstraightforward randomisation. Thereafter allocation was stratified bycentre, alternative criteria for chronic fatigue syndrome and myalgicencephalomyelitis and depressive disorder (major or minor depressive episodeor dysthymia), with computer‐generated probabilisticminimisation"
Allocation concealment (selection bias)Low riskQuote: "Once notified of treatment allocation by the Clinical TrialsUnit, the research assessor informed the participant andclinicians"
Blinding (performance bias and detection bias) of participantsand personnel?High riskQuote: "As with any therapy trial, participants, therapists, anddoctors could not be masked to treatment allocation and it was alsoimpractical to mask research assessors. The primary outcomes were rated byparticipants themselves"
Blinding (performance bias and detection bias) of outcomeassessors?High riskQuote: "The statistician undertaking the analysis of primary outcomeswas masked to treatment allocation"
Incomplete outcome data (attrition bias) All outcomesLow riskNone found
Selective reporting (reporting bias)Low riskQuote: "These secondary outcomes were a subset of those specified inthe protocol, selected in the statistical analysis plan as most relevant tothis report." Our primary interest is the primary outcome reported inaccordance with the protocol, so we do not believe that selective reportingis a problem
Other biasLow riskWe do not suspect other types of bias

ACT, anaerobic activity therapy.

APT, adaptive pacing therapy.

BAI, Beck Anxiety Inventory.

BDI‐II, Beck Depression Inventory.

BPI, Brief Pain Inventory.

CBT, cognitive‐behavioural therapy.

CDC, Centers for Disease Control and Prevention.

CFS, chronic fatigue syndrome.

COG, cognitive therapy.

ET, exercise therapy.

FS, Fatigue Scale.

FSS, Fatigue Severity Scale.

GET, graded exercise therapy.

HADS, Hospital Anxiety and Depression Scale.

HR, heart rate.

IQR, interquartile range.

ME, myalgic encephalitis.

MOS, Medical Outcome Survey.

NS, Not stated.

PSQI, Pittsburgh Sleep Quality Index.

PSS, Perceived Stress Scale.

RCT, randomised controlled trial.

RELAX, relaxation treatment.

RPE, rating of perceived exertion.

SD, standard deviation.

SF‐36, Short Form 36.

SMC, specialist medical care.

VO2, oxygen consumption.

Characteristics of excluded studies [ordered by studyID]

StudyReason for exclusion
Evering 2008RCT
The trial was excluded, as the intervention was feedback on physicalactivity
Gordon 2010RCT
Compares the relative effectiveness of 2 different types of exercisetherapy. Even though this is an interesting question, it was beyond thescope of this version of the review
Guarino 2001The trial was excluded, as the population was "Gulf Warveterans"
Nunez 2011RCT
Combination treatment of which exercise therapy is a minor part
Ridsdale 2004RCT
No clinical diagnosis of chronic fatigue syndrome. Our inclusion criteriastate that the duration of fatigue needs to > 6 months, whereas inclusioncriteria in Risdale 2004 is > 3 months
The trial was excluded, as the intervention did not include exercise:"cognitive behaviour therapy (CBT) with counselling"; thepopulation was "patients with chronic fatigue"
Ridsdale 2012RCT
The trial was excluded, as the population was "people presenting withchronic fatigue in primary care"
Russel 2001RCT
The trial was excluded, as exercise was not the main part of theintervention: "Group rehabilitation (psycho‐education, gradedexercise, goal setting and pacing, breathing control and challengingunhelpful thoughts)"
Stevens 1999RCT
The PhD was excluded, as exercise was a minor component of theintervention: "conducted to implement the use of sleep hygieneeducation, biofeedback assisted relaxation and breathing retraining, gradedaerobic exercise, and cognitive therapy...."
Taylor 2004RCT
The trial was excluded, as exercise was not the main component of theintervention: "In our program, group topics included activity pacingusing the Envelope Theory (Jason et al., 1999), cognitive coping skillstraining, relaxation and meditation training, employment issues and economicself‐sufficiency, personal relationships, traditional andcomplementary medical approaches, and nutritional approaches"
Taylor 2006The trial was excluded, as the study used a "cross‐sectionaldesign"
Thomas 2008The trial was excluded, as "between‐group comparisons wereused." This was a controlled trial, but participants were not randomlyassigned
Tummers 2012RCT
The trial was excluded, as interventions included variations of CBT:"additional CBT (stepped care) or regular CBT (care as usual)"
Viner 2004The trial was excluded, as the population consisted of "young people(aged 9–17 years) with CFS/ME"
Wright 2005The trial was excluded, as the population included young people 0 to 19years of age

Characteristics of studies awaiting assessment [ordered by studyID]

MethodsRCT, 2 arms
ParticipantsPatients with chronic fatigue syndrome
InterventionsDothiepin and graded activity
OutcomesNot found
NotesNot able to identify published paper nor study author
MethodsRCT, 3 arms, N = 90
ParticipantsPatients with chronic fatigue syndrome
InterventionsTuina group
Taijiquan (take exercise) group
Fluoxetine group
OutcomesTherapeutic effects and changes in malondialdehyde (MDA) content and inactivity of serum superoxide dismutases (SOD) and serum glutathioneperoxidase (GSH‐Px) were observed
NotesPublished paper does not report outcomes that are relevant for thisreview
Study authors were contacted to clarify whether relevant outcomes weremeasured, but we are still awaiting response
MethodsRCT, 2 parallel arms
ParticipantsPatients with chronic fatigue syndrome, N = 70
InterventionsSports group received gradual exercise
Comparison group rested
OutcomesFatigue symptoms of chronic fatigue syndrome (CFS), sleeping time; symptomsfor ears and eyes, muscle and bone system, nervous system and quality oflife
NotesInformation from English abstract. Waiting for translation

Characteristics of ongoing studies [ordered by studyID]

Trial name or titlePilot study on the effects of intermittent and graded exercise comparedwith no exercise for optimising health and reducing symptoms in chronicfatigue syndrome (CFS) patients
MethodsRandomised controlled trial, parallel
ParticipantsInclusion criteria: medical diagnosis of chronic fatigue syndrome:persistent and disabling, and/or recurring, fatigue lasting longer than 6months, which does not result from physical exertion and is not alleviatedby rest. Other symptoms include muscle weakness and pain, ongoing medicalsymptoms such as swollen lymph nodes and fever, poor sleep, poorconcentration and reduced quality of life
Exclusion criteria: diagnosed cardiac and/or respiratory disease; joint ormuscle condition/disease other than CFS that is contraindicated forexercise; any mental health condition that may affect exercise participationor safety of participants and researchers
Age minimum: 18 years
Age maximum: 60 years
Gender: both male and female
InterventionsRandomised controlled trial of intermittent exercise training compared withgraded exercise and standard care. Graded exercise is the currentrecommended exercise approach to CFS; it consists of self‐paced (e.g.low‐intensity) steady state exercise at a constant workload for ashort time; as the patient's fitness gradually improves, the length oftime and eventually the intensity are increased in a gradual graded manner,provided no adverse symptoms occur. Intermittent or interval exerciseconsists of short blocks of exercise at low to moderate intensity with arest interval in between bouts of exercise (e.g. 1 minute oflow‐intensity cycling, followed by 1 minute of rest, followed by 1minute of cycling); total time spent exercising can be gradually increasedwhilst rest or unloaded exercise intervals are maintained. Participants willbe randomly allocated to 1 of 3 groups. Each group will consist of 20participants to provide a power of 80% for the study (based on data fromGordon 2010), with an a priortest used to compute required sample size, given alpha (P value 0.05), powerand effect size for an F test, and looking at ANOVA fixed effects, maineffects and interactions (GPower). Volunteers will participate in 3 aerobicexercise sessions (cycling on a cycle ergometer) per week, consisting of thefollowing.
  • Warm‐up of 5 minutes of unloaded cycling for both ITE and GEgroups

  • Either a steady state (constant effort) low‐ tomoderate‐intensity cycling period (50% VO2peak, RPE3 Modified Borg Scale) initially for 10 minutes (GE group) OR anintermittent exercise block of 1 minute of moderate‐intensitycycling (60% VO2peak, RPE 4 to 5) alternated with 1 minuteof unloaded or very low‐intensity/unloaded cycling (20% to 30%VO2peak, RPE 1 to 2), totaling 20 minutes

  • Cool‐down of 5 minutes unloaded cycling plus stretching ofmain muscle groups for both groups


Over the 12 weeks of the project, we aim to progress the duration of SSexercise towards 20 minutes, as tolerated by the participant, and toprogress ITE participants towards intervals of 2 to 3 minutes ofmoderate‐intensity cycling, alternated with 1‐minute intervalsof low‐intensity cycling, totaling 25 to 30 minutes in duration. Allgroup sessions will be supervised by a member of the research team(consisting of accredited exercise physiologists) with assistance frompostgraduate Masters of Clinical Exercise Physiology students, who arestudying to become accredited exercise physiologists
Total intervention duration will be 12 weeks for graded, intermittent andcontrol groups
OutcomesImproved physiological adaptations to exercise (reduced RPE, heart rate andblood pressure). Rate of perceived exertion (RPE) is assessed using astandard 10‐point Borg Scale on which participants are asked how hardthey feel they are exercising; heart rate will be measured using a12‐lead ECG during prestudy and poststudy exercise tests, and duringexercise sessions, by using a Polar heart rate monitor; blood pressure willbe monitored constantly during prestudy and poststudy exercise testing, andduring exercise sessions, using a standard sphygmomanometer and anadult‐sized cuff and stethoscope
Increased lymphocyte function and reduced inflammatory cytokines measuredprestudy and poststudy by comparison of immune cell counts, lymphocyte (CD4,CD8, CD19, NK) function and inflammatory cytokines (IFN‐λ,IL‐1) in both exercise groups and control groups. Cell counts will bemeasured by full blood count (standard pathology); lymphocyte subsets willbe measured by cell count using a FACSCanto flow cytometer (BectonDickinson); lymphocyte function will be analysed using proliferative assayswith flow cytometric fluorescent analysis; and inflammatory cytokines willbe assessed using standard ELISA assays
Increased VO2peak, as measured prestudy and poststudy by opencircuit spirometry (Sensormedics) metabolic cart and bybreath‐by‐breath analysis. The test protocol is a cycle teststarting with a 3‐minute warm‐up of unloaded cycling, followedby 1‐minute increments of 10 watts (W) until a VO2 plateauis achieved (i.e. VO2 does not increase, although workloadcontinues to increase and/or RER > 1.15 and/or peak heart rate within 10beats per minute of age‐predicted maximum and/or volitionalexhaustion). The test may also be stopped at the request of participants ifthey feel too fatigued. If a submaximal value is achieved at this stage, apeak VO2 value can be extrapolated by using a linearregression
Reduced fatigue and symptoms (Cummins Fatigue Scale)
Starting date10/02/2013
Contact informationsuzanne.broadbent@scu.edu.au
Noteshttp://apps.who.int/trialsearch/Trial.aspx?TrialID=ACTRN12612001241820http://www.anzctr.org.au/ACTRN12612001241820.aspx
Trial name or titlePacing activity self‐management for patients with chronic fatiguesyndrome: randomized controlled clinical trial
MethodsRCT
ParticipantsInclusion criteria
  • Adults between 18 and 65 years of age

  • Female gender

  • Willing to sign informed consent form

  • Fulfilling 1994 Centers for Disease Control and Prevention criteriafor the diagnosis of chronic fatigue syndrome


Exclusion criteria
  • Not fulfilling each of the inclusion criteria listed above

InterventionsBehavioural: pacing
Behavioural: relaxation therapy
OutcomesChange in score on the Canadian Occupational Performance Measure (COPM)
Change in autonomic activity at rest and following 3 activities of dailyliving
Change in CFS Symptom List
Change in Checklist of Individual Strength (CIS)
Change in subscale scores on the Medical Outcomes Short Form‐36Health Status Survey (SF‐36)
Starting dateAugust 2011
Contact informationJo.Nijs@vub.ac.be
Noteshttp://clinicaltrials.gov/show/NCT01512342
Trial name or titleProtocol for the "four steps to control your fatigue(4‐STEPS)" randomised controlled trial: a self‐regulationbased physical activity intervention for patients with unexplained chronicfatigue
MethodsMulti‐centre, randomised controlled trial (RCT)
ParticipantsFulfilling operationalised criteria for idiopathic chronic fatigue (ICF)and for chronic fatigue syndrome (CFS)
Patients visiting their physician with a main complaint of unexplainedfatigue of at least 6 months' duration are recruited for the study
Inclusion criteria: meeting the operationalised criteria for ICF or CFS(CDC criteria); between 18 and 65 years of age; fluent in spoken Portuguese;capacity to provide informed consent Exclusion criteria: presence of aconcurrent somatic condition that can explain the fatigue symptoms; severepsychiatric disorders
InterventionsStandard care (SC) or standard care plus a self‐regulation basedphysical activity programme (4‐STEPS)
In addition to standard care, participants in the intervention groupreceived the 4‐STEPS programme consisting of the following.
  • 2 face‐to‐face individual motivational interviewing(MI) sessions aimed at exploring important health and life goals,increasing participants' motivation and confidence to bephysically active and setting a specific personal physical activitygoal. The first MI session takes place 1 week after the baselineassessment, and the second MI session takes place 2 weeks after thefirst. The MI session is delivered by a psychologist with MI training(member of the research team). The duration of the sessions isapproximately 1 hour. Details on topics addressed during the MIsessions are presented in Table 1

  • 2 brief telephone counselling sessions: Sessions take about 20minutes and are provided 2 weeks and 6 weeks after the last MIsession. Details on topics addressed during the telephone sessions arepresented in Table 1

  • Self‐regulation (SR) booklets: 2 booklets were designed tohelp patients change their level of physical activity (informationalbooklet and workbook). The informational booklet was provided at theend of the baseline assessment; the "Step 1" part of theworkbook is provided at the first MI session, and parts "Step2," "Step 3" and "Step 4" are given duringthe second MI session. Details on topics addressed in the SR bookletsare presented in Table 2

  • A pedometer to register physical activity on a daily basis (stepstaken) during the 3‐month intervention period. Instructions onhow to use the pedometer are given during the baseline assessmentsession (Table 2)

  • Daily activities record (Table 2): Participants received severaldaily activity records (physical activities, mental activities andrest). The first daily activity record was given to the participant atthe end of the first MI session; participants were asked to fill outthe activity record during the time between the first and second MIsessions. This homework assignment aimed to evaluateparticipants' daily activities management while possiblyrecognising an erratic pattern of rest and activity (boom and bustcycle). At the end of the second MI session, participants receiveddaily activities records that could be used to monitor changes indaily activity patterns during the subsequent 9 weeks

  • Leaflet for family: At the end of the first MI session, participantsreceived a leaflet for their partner or significant other to increasesocial support

OutcomesThe primary outcome was the reduction in perceived fatigue severity, whichwas assessed by using the Checklist of Individual Strength (CIS‐20R).A difference of 7 points between intervention and control groups for themain dimension (the subjective feeling of fatigue subscale) of theCIS‐20R was considered to be clinically significant
Starting dateThe 4‐STEPS RCT started in January 2011
Contact informationMarta Marques: mmarques@ispa.pt
NotesISRCTN: ISRCTN70763996
Copied from the published protocol:http://www.biomedcentral.com/1471‐2458/12/202
Trial name or titleIs a multi‐disciplinary rehabilitation treatment more effective thanmono‐disciplinary cognitive behavioural therapy for patients withchronic fatigue syndrome? A multi‐centre randomised controlledtrial
MethodsRCT
ParticipantsPatients were included if they fulfilled the CDC‐94 criteria for CFSand had a score ≥ 40 on the Checklist of Individual Strength(CIS)‐fatigue questionnaire. CDC‐94 criteria for CFS are asfollows.
  • At least 6 months of persistent or recurring fatigue for which nophysical explanation was found and that

    • was of new onset, that is to say, it had not been lifelong

    • was not the result of ongoing exertion

    • was not substantially alleviated by rest and

    • severely limited functioning


In combination with 4 or more of the following symptoms, persistent orregularly recurring over a period of 6 months and that must not havepredated the fatigue.
  • Self‐reported impairment in memory or concentration

  • Sore throat

  • Tender cervical lymph nodes

  • Muscle pain

  • Multi‐joint pain

  • Headache

  • Unrefreshing sleep

  • Postexertional malaise lasting 24 hours or longer


Additional inclusion criteria for this study follow here
  • Participants are willing to participate in a treatment that is set upto change behaviour

  • Participants are between 18 and 60 years of age, of either sex

  • Participants can speak, understand and write the Dutch language

InterventionsAfter intake, participants will be randomly divided into 2 groups:cognitive‐behavioural therapy (CBT) and multi‐disciplinaryrehabilitation therapy (MRT)
  • Cognitive‐behavioural therapy (CBT)


CBT is based on process variables of a CFS model. This model shows thathigh physical attributions will decrease physical activity and increasefatigue and functional impairment. A low level of sense of control oversymptoms and focusing on physical sensations have a direct causal effect onfatigue. In CFS precipitating and perpetuating factors are important. Theperpetuating factors become the focus of the intervention in CBT. Animportant subject in the therapy is the balance between activity and restand the patients' responsibility to see to it. Negative beliefsregarding the symptoms of fatigue, self‐expectations orself‐esteem are identified and patients are encouraged to challengethem the conventional way. Specific lifestyle changes are encouraged ifdeemed appropriate. At the end of the therapy relapse prevention isaddressed. Patients who are assigned to this group will attend 16 individualtherapy sessions of one hour duration, spread out over 6 months with apsychologist or behavioural therapist.
  • Multi‐disciplinary rehabilitation therapy (MRT): MRT includesCBT, GET, pacing and body awareness therapy (investigationaltreatment)

    • CBT: as above

    • Graded exercise therapy (GET): structured and supervised activitymanagement that aims at a gradual but progressive increase inaerobic activities. It is completed by graded activity and gradedexercise in which a gradual and progressive increase in physicaland mental activities is trained. Activities include activities ofdaily living and occupational and social or leisure activities

    • Pacing: helps the patient divide energy over the day/week.Eventually patients are encouraged to carry out a gradual increasein physical and mental activity

    • Body awareness therapy: teaches the patient to be aware ofhealthy physical sensations and to link them in the mind (bodymentalisation). Patients are taught to react adequately todisturbances in the balance between daily workload and thecapacity to deal with it. The balance between activity and rest islinked to the patient's inner control and to healthy physicalsensations


MRT includes the following
  • 2 weeks: observation (2 sessions of 1 hour with psychology, 2sessions of 1 hour with a social worker, 2 sessions of 1/2 hour withoccupational therapy, 2 sessions of 1/2 hour with physiotherapy)

  • 2 weeks: no therapy

  • 10 weeks therapy (5 sessions of 1 hour with psychology, 4 sessions of1 hour with a social worker, 26 sessions of 1/2 hour withphysiotherapy and 20 sessions of 1/2 hour with occupationaltherapy)

  • 6 weeks: no therapy

  • 1 session of 1 hour with a social worker (after 6 weeks of notherapy)

  • 2 sessions of both 1/2 and 1 hour of therapy with the therapistchosen by participants


During MRT, a participant sees the physician during rehabilitation 3 times(20 minutes per visit) Total duration of both treatments is 6months. Duration of follow‐up for both treatments is also 6months
OutcomesPrimary outcomes
  • Fatigue severity as measured using the Checklist of IndividualStrength at baseline, 6 months and 12 months after start oftherapy


Secondary outcomes
  • Quality of life as measured using the 36‐itemShort‐Form Health Survey (SF‐36)

  • Psychological well‐being as measured using Symptom CheckList‐90

  • Sense of control in relation to CFS complaints as measured using aself‐efficacy scale

  • Somatic attributions as measured using the Causal AttributionList

  • Mindfulness as measured using the Mindfulness Attention AwarenessScale

  • Functional activities (the most important) that a patient wants toimprove during treatment as measured using the Patient‐SpecificComplaints and Goals Questionnaire

  • Impact of disease on both physical and emotional functioning asmeasured using the Sickness Impact Profile

  • Physical activity as measured using the Body Media Sensewear ActivityMonitor

  • Self‐rated improvement as measured using 5 questions on the5‐ and 10‐point Likert scale

  • Life satisfaction as measured using the Life SatisfactionQuestionnaire

  • Utility as measured using EuroQol 6‐D

  • Treatment expectancy and credibility as measured using the Devillyand Borkovec Questionnaire


All outcomes are measured at baseline and at 6 and 12 months after start oftherapy. Treatment costs and additional expenses (work‐related costs,healthcare and non‐healthcare costs) are measured using theTrimbos/iMTA Questionnaire for Costs Associated With Psychiatric Illness;will be measured every month (from baseline until 12 months after start oftherapy)
Starting date27/11/2008
Recruitment status: completed
Contact informationd.vos‐vromans@rcbreda.nl
Noteshttp://isrctn.org/ISRCTN77567702
Trial name or titleGraded Exercise Therapy guided SElf‐help Treatment (GETSET) forpatients with chronic fatigue syndrome/myalgic encephalomyelitis: arandomised controlled trial in secondary care (GETSET)
MethodsRandomised interventional trial
ParticipantsInclusion
  • Patients attending 2 CFS/ME specialist clinics in London

  • Patients receiving a diagnosis of CFS/ME from a specialist doctor andgoing onto a waiting list for clinic treatment

  • Patients 18 years of age or older

  • Speak and read English adequately to provide informed consent andread the guided support booklet

  • Target gender: male and female

  • Lower age limit: 18 years


Exclusion
  • Not receiving a diagnosis of CFS/ME

  • Co‐morbid condition that requires that exercise be performedonly in the presence of a doctor

  • Younger than age 18

  • Active suicidal thoughts

InterventionsGuided support, a copy of the GETSET booklet, a 30‐minuteconsultation face‐to‐face by Skype or by telephone, 3 furtherSkype telephone contacts
Intervention over 9 weeks: follow‐up length: 3 month(s); studyentry: single randomisation only
OutcomesPrimary: SF‐36 physical function subscale (SF‐36PF) measured12 weeks from randomisation
Secondary: Clinical Global Impression Change Scale (CGI) score measured 12weeks from baseline
Starting date16/05/2012
Contact informationProf PD White; p.d.white@qmul.ac.uk
Noteshttp://www.controlled‐trials.com/ISRCTN22975026/GETSET

ANOVA, analysis of variance.

CFS, chronic fatigue syndrome.

CGI, Clinical Global Impression scale.

CIS, Checklist of Individual Strength.

COPM, Canadian Occupational Performance Measure.

ELISA, enzyme‐linked immunosorbent assay.

EuroQol 6‐D: Short Form 6‐D of the standard measure of healthoutcomes of the EuroQol Group.

GE, Graded exercise.

ICF, idiopathic chronic fatigue.

IFN, interferon.

IL, interleukin.

ITE, intermittent exercise training.

MI, motivational interviewing.

MRT, multi‐disciplinary rehabilitation therapy.

NK, natural killer cell.

RER, respiratory exchange ratio.

RPE, rating of perceived exertion.

SC, standard care.

SS, steady state.

VO2, oxygen consumption

Contributions of authors

LL, KGB, JO‐J: checked trials for inclusion. LL, KGB, JO‐J:extracted data for the update. LL, JO‐J, KGB: analysed data for theupdate. LL, JO‐J, JRP, KGB: wrote the update.

Sources of support

Internal sources

  • University of Oxford Department of Psychiatry, UK.

  • Norwegian Knowledge Centre for Health Services, Norway.

External sources

  • No sources of support supplied

Declarations of interest

LL: nothing to declare. KGB: nothing to declare. JO‐J:nothing to declare. JRP: nothing to declare.

Notes

Cochrane’s editors and the review author team have jointly agreed thatthere will be a further period up to the end of May 2019, in which time the author team willamend the review to address changes aimed at improving the quality of reporting of thereview and ensuring that the conclusions are fully defensible and valid to inform healthcare decision making. The changes will also address concerns raised in feedback since theRobert Courtney complaint. The amendment will not include a full update, but a decisionabout this will made subsequently.

Previously published notes

November 2018

The author team has re‐submitted a revised version of this review following thecomplaint by Robert Courtney. The Editor in Chief and colleagues recognise that the authorteam has sought to address the criticisms made by Mr Courtney but judge that further workis needed to ensure that the review meets the quality standards required, and as a resulthave not approved publication of the re‐submission. The review is alsosubstantially out of date and in need of updating.

Cochrane recognises the importance of this review and is committed to providing a highquality review that reflects the best current evidence to inform decisions.

The Editor in Chief is currently holding discussions with colleagues and the author teamto determine a series of steps that will lead to a full update of this review. Thesediscussions will be concluded as soon as possible.

October 2018

This review is subject to an ongoing process of review and revision following thesubmission of a formal complaint to the Editor in Chief. Cochrane considers all feedbackand complaints carefully, and revises or updates reviews when it is appropriate. Thereview author team have advised us that a resubmission of this review is imminent. Adecision on the status of this review will be made once this resubmission has been througheditorial process, which we anticipate will be towards the end of November 2018.

February 2015

A protocol for an accompanying individual patient data review on chronic fatigue syndromeand exercise therapy has been published (Larun2014).

Edited (no change to conclusions)

References

References to studies included in this review

  1. Fulcher KY, White PD. Chronic fatigue syndrome: adescription of graded exercise treatment.Physiotherapy 1998;84(9):223‐6. [Google Scholar];Fulcher KY, White PD. Randomised controlled trial ofgraded exercise in patients with chronic fatigue syndrome.BMJ 1997;314(7095):1647‐52. [DOI] [PMC free article] [PubMed] [Google Scholar];White PD, Fulcher KY. A randomised controlled trial ofgraded exercise in patients with a chronic fatigue.Royal College of Psychiatrists Winter Meeting, Cardiff.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Hlavaty LE, Brown MM, Jason LA. The effect of homework complianceon treatment outcomes for participants with myalgic encephalomyelitis/chronicfatigue syndrome. RehabilitationPsychology 2011;56(3):212‐8. [DOI] [PMC free article] [PubMed] [Google Scholar];Jason L, Torres‐Harding S, Friedberg F, Corradi K, Njoku M Donalek J, et al.Non‐pharmacologic interventions for CFS: a randomizedtrial. Journal of Clinical Psychology in MedicalSettings 2007;172:485‐90. [Google Scholar]
  3. Moss‐Morriss R, Sharon C, Tobin R, Baldi JC. A randomized controlled gradedexercise trial for chronic fatigue syndrome: outcomes and mechanisms ofchange. Journal of Health Psychology 2005;10(2):245‐59. [DOI] [PubMed] [Google Scholar]
  4. Powell P, Bentall ROP, Nye FJ, Edwards RHT. Patient education to encouragegraded exercise in chronic fatigue syndrome: 2‐year follow‐up ofrandomised controlled trial. The British Journal ofPsychiatry 2004;184:142‐6. [DOI] [PubMed] [Google Scholar];Powell P, Bentall RP, Nye FJ, Edwards RH. Randomised controlled trial ofpatient education to encourage graded exercise in chronic fatiguesyndrome. BMJ 2001;322(7283):387‐90. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Wallman KE, Morton AR, Goodman C, Grove R. Exercise prescription forindividuals with chronic fatigue syndrome. MedicalJournal of Australia 2005;183(3):142‐3. [DOI] [PubMed] [Google Scholar];Wallman KE, Morton AR, Goodman C, Grove R, Guilfoyle AM. Randomised controlled trial ofgraded exercise in chronic fatigue syndrome. MedicalJournal of Australia 2004;180(9):444‐8. [DOI] [PubMed] [Google Scholar]
  6. Appleby L. Aerobic exercise and fluoxetine inthe treatment of chronic fatigue syndrome. NationalResearch Register1995.;Morriss R, Wearden A, Mullis R, Strickland P, Appleby L, Campbell I, et al. A double‐blindplacebo‐controlled treatment trial of fluoxetine and graded exercise forchronic fatigue syndrome (CFS). 8th Congress of theAssociation of European Psychiatrists, London. 1996. [DOI] [PubMed] [Google Scholar];Wearden AJ, Morriss RK, Mullis R, Strickland PL, Pearson DJ, Appleby L, et al. Randomised,double‐blind, placebo‐controlled treatment trial of fluoxetine andgraded exercise for chronic fatigue syndrome. BritishJournal of Psychiatry 1998;178:485‐92. [DOI] [PubMed] [Google Scholar]
  7. Wearden AJ. Randomised controlled trial ofnurse‐led self‐help treatment for patients in primary care withchronic fatigue syndrome. The FINE trial (Fatigue Intervention by Nurses Evaluation)ISRCTN74156610, 2001.http://www.controlled‐trials.com/ISRCTN74156610/ISRCTN74156610(accessed 2 September 2014). [DOI] [PMC free article] [PubMed];Wearden AJ, Dowrick C, Chew‐Graham C, Bentall RP, Morriss RK, Peters S, et al. Nurse led, home based selfhelp treatment for patients in primary care with chronic fatigue syndrome:randomised controlled trial. BMJ 2010;340(1777):1‐12.[DOI: 10.1136/bmj.c1777] [DOI] [PMC free article] [PubMed] [Google Scholar];Wearden AJ, Dowrick C, Chew‐Graham C, Bentall RP, Morriss RK, Peters S, et al. Nurse led, home based selfhelp treatment for patients in primary care with chronic fatigue syndrome:randomised controlled trial. BMJ, rapidresponse 27 May 2010.;Wearden AJ, Riste L, Dowrick C, Chew‐Graham C, Bentall RP, Morriss RK, et al. Fatigue interventions bynurses evaluation—The FINE Trial. A randomised controlled trial of nurse ledself‐help treatment for patients in primary care with chronic fatiguesyndrome: study protocol (ISRCTN74156610). BMCMedicine 2006;4(9):1‐12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. McCrone P, Sharpe M, Chalder T, Knapp M, Johnson AL, Goldsmith KA, et al. Adaptive pacing, cognitivebehaviour therapy, graded exercise, and specialist medical care for chronic fatiguesyndrome: a cost‐effectiveness analysis. PLoSONE 2012;7(7):e40808. [DOI:10.1371/journal.pone.0040808] [DOI] [PMC free article] [PubMed] [Google Scholar];Sharpe MD, Goldsmith KA, Johnson AL, Chalder T, Walker J, White PD. Rehabilitative treatments forchronic fatigue syndrome: long‐term follow‐up from the PACEtrial. Lancet Psychiatry 2015:ePub ahead of print. [DOI:10.1016/S2215-0366(15)00317-X] [DOI] [PubMed] [Google Scholar];White P, Chalder T, McCrone P, Sharpe M. Non‐pharmacologicalmanagement of chronic fatigue syndrome: efficacy, cost effectiveness and economicoutcomes in the PACE trial [conference abstract].Journal of Psychosomatic Research. Proceedings of the 15th AnnualMeeting of the European Association for Consultation‐Liaison Psychiatry andPsychosomatics, EACLPP and 29th European Conference on Psychosomatic Research, ECPR;2012 Jun 27‐30; Aarhus Denmark. 2012; Vol. 72, issue 6:509. [Google Scholar];White PD. A randomised controlled trial ofadaptive pacing, cognitive behaviour therapy, and graded exercise, as supplements tostandardised specialist medical care versus standardised specialist medical carealone for patients with the chronic fatigue syndrome/myalgic encephalomyelitis orencephalopathy [PACE], 2014.http://www.controlled‐trials.com/ISRCTN54285094(accessed 1 September 2014). [DOI] [PMC free article] [PubMed];White PD, Goldsmith KA, Johnson AL, Potts L, Walwyn R, DeCesare JC, et al. Comparison of adaptivepacing therapy, cognitive behaviour therapy, graded exercise therapy, and specialistmedical care for chronic fatigue syndrome (PACE): a randomised trial.The Lancet 2011;377:611‐90. [DOI] [PMC free article] [PubMed] [Google Scholar];White PD, Goldsmith KA, Johnson AL, et al. on behalf of the PACE TrialManagement Group. Supplementary web appendix. Comparisonof adaptive pacing therapy, cognitive behaviour therapy, graded exercise therapy,and specialist medical care for chronic fatigue syndrome (PACE): a randomisedtrial. The Lancet 2011;377:832‐6.[DOI: 10.1016/S0140-6736(11)60096-2] [DOI] [PMC free article] [PubMed] [Google Scholar];White PD, Sharpe MC, Chalder T, DeCesare JC, Walwyn R, the PACE Trial Group.Protocol for the PACE trial. A randomised controlled trial of adaptivepacing, cognitive behaviour therapy, and graded exercise as supplements tostandardised specialist medical care versus standardised specialist medical carealone for patients with the chronic fatigue syndrome/myalgic encephalomyelitis orencephalopathy. BMC Neurology 2007;7(6):1‐20.[DOI: 10.1186/1471-2377-7-6] [DOI] [PMC free article] [PubMed] [Google Scholar]

References to studies excluded from this review

  1. Evering RMH. Ambulatory feedback at dailyphysical activity patterns. A treatment for the chronic fatigue syndrome in the homeenvironment?. Universitet Twente,Netherlands2013:1‐223.;Evering RMH. Optimalization of cognitivebehavioral therapy (CBT) for CFS patients in rehabilitation by means of ambulatoryactivity‐based feedback (ABF).trialregister.nl/trialreg/admin/rctview.asp?TC=1513(accessed 7 May 2013).
  2. Gordon BA, Knapman LM, Lubitz L. Graduated exercise training andprogressive resistance training in adolescents with chronic fatigue syndrome: arandomized controlled pilot study. ClinicalRehabilitation 2010;24:1072‐9.[DOI: 10.1177/0269215510371429] [DOI] [PubMed] [Google Scholar]
  3. Guarino P, Peduzzi P, Donta ST, Engel CC Jr, Clauw DJ, Williams DA, et al. A multicenter two by twofactorial trial of cognitive behavioral therapy and aerobic exercise for gulf warveterans' illnesses: design of a Veterans Affairs cooperative study (CSP#470). Controlled Clinical Trials 2001;22:31032. [DOI] [PubMed] [Google Scholar]
  4. Nunez M, Fernandez Soles J, Nunez E, Fernandez Huerta JM, Godas Sieso T, Gomez Gil E. Health‐related quality oflife in patients with chronic fatigue syndrome: group cognitive behavioural therapyand graded exercise versus usual treatment. A randomised controlled trial with 1year of follow‐up. ClinicalRheumatology 2011;30(3):381‐9. [DOI] [PubMed] [Google Scholar]
  5. Risdale L, Darbishire L, Seed T. Is graded exercise better thancognitive behaviour therapy for fatigue? A UK randomized trial in primarycare. Psychological Medicine 2003;34:37‐49. [DOI] [PubMed] [Google Scholar]
  6. Ridsdale L, Hurley M, King M, McCrone P, Dobalson N. The effect of counselling, gradedexercise and usual care for people with chronic fatigue in primary care: arandomized trial. Psychological Medicine 2012;42:2217‐24.[DOI: 10.1017/S0033291712000256] [DOI] [PMC free article] [PubMed] [Google Scholar];Sabes‐Figuera R, McCrone P, Hurley M, King M, Donaldson AN, Risdale L. Cost‐effectiveness ofcounselling, graded‐exercise and usual care for chronic fatigue: evidencefrom a randomised trial in primary care. BMC HealthServices Reserach 2012;12:264. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Russel V, Gaston AM, Lewin RJP. Atkinson CM, Champion PD. Group rehabilitation for adultchronic fatigue syndrome. Unpublishedarticle2001.
  8. Stevens MW. Chronic Fatigue Syndrome: AChronobiologically Oriented Controlled Treatment Outcome Study.San Diego: California School ofProfessional Psychology, 1999. [UMI9928180] [Google Scholar]
  9. Taylor RR. Quality of life and symptomseverity for individuals with chronic fatigue syndrome: findings from a randomizedclinical trial. American Journal of OccupationalTherapy 2004;58:35‐43. [DOI] [PubMed] [Google Scholar]
  10. Taylor RR, Jason LA, Shiraishi Y, Schoeny ME, Keller J. Conservation of resources theory,perceived stress, and chronic fatigue syndrome: outcomes of a consumer‐drivenrehabilitation program. RehabilitationPsychology 2006;51:157‐65. [Google Scholar];Taylor RR, Thanawala SG, Shiraishi Y, Schoeny ME. Long‐term outcomes of anintegrative rehabilitation program on quality of life: a follow‐upstudy. Journal of Psychosomatic Research 2006;61:835‐9. [DOI] [PubMed] [Google Scholar]
  11. Thomas M, Sadlier M, Smith A. A multiconvergent approach to therehabilitation of patients with chronic fatigue syndrome: a comparativestudy. Physiotherapy 2008;94(1):35‐42. [Google Scholar];Thomas MA, Sadlier MJ, Smith AP. The effect of multi convergenttherapy on the psychopathology, mood and performance of chronic fatigue syndromepatients: a preliminary study. Counselling andPsychotherapy Research 2006;6:91‐9. [Google Scholar]
  12. Tummers M, Knoop H, Dam A, Bleijenberg G. Implementing a minimal interventionfor chronic fatigue syndrome in a mental health centre: a randomized controlledtrial. Psychological Medicine 2012;42:2205‐15.[DOI: 10.1017/S0033291712000232] [DOI] [PubMed] [Google Scholar]
  13. Viner R, Gregorowski A, Wine C, Bladen M, Fisher D, Miller M, et al. Outpatient rehabilitativetreatment of chronic fatigue syndrome (CFS/ME).Archives of Disease in Childhood 2004;89(7):615‐9.[DOI: 10.1136/adc.2003.035154] [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Wright B, Ashby B, Beverley D, Calvert E, Jordan J, Miles J, et al. A feasibility studycomparing two treatment approaches for chronic fatigue syndrome inadolescents. Archives of Disease inChildhood 2005;90(4):369‐72.[DOI: 10.1136/adc.2003.046649] [DOI] [PMC free article] [PubMed] [Google Scholar]

References to studies awaiting assessment

  1. Hatcher S. A randomised double‐blindplacebo controlled trial of dothiepin and graded activity in the treatment ofchronic fatigue syndrome. Personalcommunication, 1998. [Google Scholar]
  2. Liu CZ, Lei B. Effect of Tuina on oxygen freeradicals metabolism in patients with chronic fatigue syndrome[Chinese]. Zhongguo Zhenjiu 2010;11:946‐8. [PubMed] [Google Scholar]
  3. Zhuo J‐X, Gu L‐Y. Relative research ontreating chronic fatigue syndrome with gradual exercise.Journal of Beijing Sport University 2007;30(6):801‐3. [Google Scholar]

References to ongoing studies

  1. Broadbent S, Coutts R. The protocol for a randomisedcontrolled trial comparing intermittent and graded exercise to usual care forchronic fatigue syndrome patients. BMC Sports Science,Medicine & Rehabilitation 2013;5(1):1‐6. [DOI] [PMC free article] [PubMed] [Google Scholar];Broadbent S.A pilot study on the effects of intermittent and graded exercisecompared to no exercise for optimising health and reducing symptoms in chronicfatigue syndrome (CFS) patients.anzctr.org.au/Trial/Registration/TrialReview.aspx?ACTRN=12612001241820(accessed 7 May 2013).
  2. Kos D, Nijs J. Pacing activityself‐management for patients with chronic fatigue syndrome: randomizedcontrolled clinical trial, 2012.clinicaltrials.gov/show/NCT01512342 (accessed 7 May 2013).
  3. Marques M, Gucht V, Maes S, Leal I. Protocol for the "four stepsto control your fatigue (4‐STEPS)" randomised controlled trial: aself‐regulation based physical activity intervention for patients withunexplained chronic fatigue. BMC PublicHealth 2012;12:202. [DOI:10.1186/1471-2458-12-202] [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Vos‐Vromans D. Is a multidisciplinaryrehabilitation treatment more effective than mono disciplinary cognitive behaviouraltherapy for patients with chronic fatigue syndrome? A multi centre randomisedcontrolled trial [FatiGo, ISRCTN77567702].http://www.controlled‐trials.com/isrctn/pf/77567702(accessed 7 May 2013). [ISRCTN77567702 ];Vos‐Vromans DCWM, Smeets RJEM, Rijnders LJM, Gorrissen RRM, Pont M, Köke AJA, et al. Cognitive behaviouraltherapy versus multidisciplinary rehabilitation treatment for patients with chronicfatigue syndrome: study protocol for a randomized controlled trial(FatiGo). Trials [electronic resource] 2012;13:71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. White PD. Therapy guided self‐helptreatment (GETSET) for patients with chronic fatigue syndrome/myalgicencephalomyelitis: a randomised controlled trial in secondary care. ISRCTN22975026,2012.http://www.controlled‐trials.com/ISRCTN22975026/GETSET(accessed 30 Octrober 2014).

Additional references

  1. American College of Sports Medicine. ACSM`sResource Manual for Guidelines for Exercise Testing and Prescription.4th Edition. Baltimore, MD:Lippincott Williams & Wilkins, 2001. [Google Scholar]
  2. Adams D, Wu T, Yang X, Tai S, Vohra S. Traditional Chinese medicinal herbsfor the treatment of idiopathic chronic fatigue and chronic fatiguesyndrome. Cochrane Database of SystematicReviews 2009;4:1‐16.[DOI: 10.1002/14651858.CD006348.pub2] [DOI] [PubMed] [Google Scholar]
  3. Alderson P, Green S, Higgins JP, editors. CochraneReviewers’ Handbook 4.2.2 [updated December 2003].The Cochrane Library, Issue 1. Chichester,UK: John Wiley & Sons Ltd,2004. [Google Scholar]
  4. Bagnall AM, Whiting P, Richardson R, Sowden AJ. Interventions for the treatmentand management of chronic fatigue syndrome/myalgic encephalomyelitis.Quality & Safety in Health Care 2001;11(3):284‐8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Beck AT, Steer RA, Brown GK. Manual for the Beck DepressionInventory‐II. Manual for the Beck DepressionInventory‐II. San Antonio:Psychological Cooperation, 1996. [Google Scholar]
  6. Blair SN, Morris JN. Healthy hearts—and theuniversal benefits of being physically active: physical activity andhealth. Annals of Epidemiology 2009;19(4):253‐6. [DOI] [PubMed] [Google Scholar]
  7. Bland JM, Kerry SM. Statistics notes. Trialsrandomised in clusters. BMJ 1997;315:600. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Burckhardt CS, Anderson KL. The Quality of Life Scale (QOLS):reliability, validity and utilization. Health andQuality of Life Outcomes 2003;1:60. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Buysse DJ, Reynolds CF, Monk TH, Berman SR, Kupfer DJ. The Pittsburgh Sleep QualityIndex: a new instrument for psychiatric practice and research.Psyciatric Research 1989;28:193‐213. [DOI] [PubMed] [Google Scholar]
  10. Carruthers BM, an de Sande MI, Meirleir KL, Klimas NG, Broderick G, Mitchell T, et al. Myalgic encephalomyelitis:international consensus criteria. Journal of InternalMedicine 2011;270(4):327‐38. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Castell BD, Kazantzis N, Moss‐Morris RE. Cognitive behavioral therapy andgraded exercise for chronic fatigue syndrome: a meta‐analysis.Clinical Psychology: Science and Practice 2011;18:311‐24. [Google Scholar]
  12. Chalder T, Berelowitz G, Pawlikowska T, Watts L, Wessely S, Wright D, et al. Development of a fatiguescale. Journal of Psychosomatic Research 1993;37(6):147‐53. [DOI] [PubMed] [Google Scholar]
  13. Clark LV, White PD. The role of deconditioning andtherapeutic exercise in chronic fatigue syndrome (CFS).Journal of Mental Health 2005;14(3):237‐52. [Google Scholar]
  14. Cleeland CS, Ryan KM. Pain assessment: the global use ofthe Brief Pain Inventory. Annals Academy of MedicineSingapore 1994;23:123‐38. [PubMed] [Google Scholar]
  15. Edmonds M, McGuire H, Price J. Exercise therapy for chronicfatigue syndrome. Cochrane Database of SystematicReviews 2004;3(3):1‐28.[DOI: 10.1002/14651858.CD003200.pub2] [DOI] [PubMed] [Google Scholar]
  16. Egger M, Davey‐Smith G, Schneider M, Minder C. Bias in meta‐analysisdetected by a simple, graphical test. British MedicalJournal 1997;315:629‐34. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Elbourne DR, Altman DG, Higgins JP, Curtin F, Worthington HV, Vail A. Meta‐analyses involvingcross‐over trials: methodological issues.International Journal of Epidemiology 2002;31:140‐9. [DOI] [PubMed] [Google Scholar]
  18. The Europien Parliament and the Councel of the EuropeanUnion. DIRECTIVE 2001/20/EC Europien Parliament and theCouncel of the European Union of 4 April 2001. OfficialJournal of the European Communities2001; Vol. L 121/34.[http://www.eortc.be/services/doc/clinical‐eu‐directive‐04‐april‐01.pdf]
  19. Fønhus MS, Larun L, Brurberg KG. Diagnostic criteria for chronicfatigue syndrome [Diagnosekriterier for kronisk utmattelsessyndrom.Notat fra Kunnskapssenteret 2011]. Norwegian Knowledge Centre for theHealth Services2011.
  20. Fukuda K, Straus SE, Hickie I, Sharpe MC, Dobbins JG, Komaroff A. The chronic fatigue syndrome: acomprehensive approach to its definition and study.Annals of Internal Medicine 1994;121(12):953‐9. [DOI] [PubMed] [Google Scholar]
  21. Fulcher KY, White PD. Strength and physiologicalresponse to exercise in patients with chronic fatigue syndrome.Journal of Neurology Neurosurgery & Psychiatry 2000;69:302‐7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Gulliford MC, Ukoumunne OC, Chinn S. Components of variance andintraclass correlations for the design of community‐based surveys andintervention studies: data from the Health Survey for England 1994.American Journal of Epidemiology 1999;149:924‐6. [DOI] [PubMed] [Google Scholar]
  23. Guy W. ECDEU assessment manual forpsychopharmacology. ECDEU Assessment Manual forPsychopharmacology. Rockville, MD:National Institute of Mental Health, 1976:218‐222. [Google Scholar]
  24. Hard K, Rickards HE, Haque MS, Ward C. Pharmacological treatments forchronic fatigue syndrome in adults. Cochrane Databaseof Systematic Reviews 2007, Issue 4. [DOI: 10.1002/14651858.CD006788.pub2] [DOI] [Google Scholar]
  25. Hewitt PL, Norton GR. The Beck Anxiety Inventory: apsychometric analysis. PsychologicalAssessment 1993;5:408‐12. [Google Scholar]
  26. Higgins JPT, Thompson SG, Deeks JJ, Altman DG. Measuring inconsistency inmeta‐analyses. BMJ 2003;327(7414):557‐60. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Higgins JPT, Altman DG, Sterne JAC (editors). Cochrane Handbook forSystematic Reviews of Interventions Version 5.1.0 [updated February 2011]. TheCochrane Collaboration, 2011. Available fromwww.cochrane‐handbook.org. The Cochrane Collaboration.
  28. Jenkins D, Stanton B, Niemcryk S, Rose R. A scale for the estimation of sleepproblems in clinical research. Journal of ClinicalEpidemiology 1988;41:313‐21. [DOI] [PubMed] [Google Scholar]
  29. Johnston S, Brenu EW, Staines D,Marshall‐Gradisnik S. The prevalence of chronic fatiguesyndrome/ myalgic encephalomyelitis: a meta‐analysis.Clinical Epidemiology 2013;5:105‐10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Krupp LB, LaRocca NG, Muir‐Nash J, Steinberg AD. The fatigue severity scale:application to patients with multiple sclerosis and systemic lupuserythematosus. Archives of Neurology 1989;46:1121‐3. [DOI] [PubMed] [Google Scholar]
  31. Larun L, Malterud K. Exercise therapy for patients withchronic fatigue syndrome [Treningsbehandling ved kroniskutmattelsessyndom]. Tidsskr Nor Laegeforen 2011;138(8):231‐6. [DOI] [PubMed] [Google Scholar]
  32. Larun L, Odgaard‐Jensen J, Brurberg KG, Chalder T, Dybwad M, Moss‐Morris RE, et al. Exercise therapy forchronic fatigue syndrome (individual patient data).Cochrane Database of Systematic Reviews 2014, Issue 4. [DOI: 10.1002/14651858.CD011040] [DOI] [Google Scholar]
  33. Moncrieff J, Churchill R, Drummond C, McGuire H. Development of a quality assessmentinstrument for trials of treatments for depression and neurosis.International Journal of Methods in PsychiatricResearch 2001;10(3):126‐33. [Google Scholar]
  34. Mosby. Mosby's MedicalDictionary. 8th Edition.Philadelphia: Elsevier,2009. [Google Scholar]
  35. National Institute for Health and Clinical Excellence.Chronic fatigue syndrome/myalgic encephalomyelitis (orencephalopathy): diagnosis and management of CFS/ME in adults and children,2007.http://guidance.nice.org.uk/CG53/guidance/pdf/English (last accessedNovember 2009). London: National Institute for Health and ClinicalExcellence.
  36. Nijs J, Meeus M, Oosterwijck J, Ickmans K, Moorkens G, Hans G, et al. In the mind or the brain?Scientific evidence for central sensitisation in chronic fatiguesyndrome. European Journal of ClinicalInvestigation 2011;42:203‐11.[DOI: 10.1111/j.1365-2362.2011.02575.x] [DOI] [PubMed] [Google Scholar]
  37. OED Online. December 2014. Oxford University Press."therapy, n.".http://www.oed.com/view/Entry/200468?redirectedFrom=therapy(accessed January 21, 2015).
  38. Paul LM, Wood L, Maclaren W. The effect of exercise on gait andbalance in patients with chronic fatigue syndrome. Gaitand Posture 2001;14:19‐27. [DOI] [PubMed] [Google Scholar]
  39. Price JR, Mitchell E, Tidy E, Hunot V. Cognitive behaviour therapy forchronic fatigue syndrome in adults. Cochrane Databaseof Systematic Reviews 2008, Issue 3. [DOI: 10.1002/14651858.CD001027.pub2] [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Prins JB, Meer JW, Bleijenberg G. Chronic fatiguesyndrome. Lancet 2006;367:346‐55. [DOI] [PubMed] [Google Scholar]
  41. Reeves WC, Lloyd A, Vernon SD, Klimas N, Jason LA, Bleijenberg G, and the International Chronic FatigueSyndrome Study Group. Identification of ambiguities inthe 1994 chronic fatigue syndrome research case definition and recommendations forresolution. BMC Health Services Research 2003;3(25):1‐9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Reeves WC, Jones JF, Heim C, Hoaglin DC, Boneva RS, Mirrissey M, et al. Prevalence of chronicfatigue syndrome in metropolitan, urban, and rural Georgia.Population Health Metrics 2007;5:1‐10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Reyes M, Nisenbaum R, Hoaglin DC, Unger ER, Emmons C, Randall B, et al. Prevalence and incidence ofchronic fatigue syndrome in Wichita, Kansas. Archivesof Internal Medicine 2003;163(13):1530‐6. [DOI] [PubMed] [Google Scholar]
  44. Sharpe M, Archard L, Banatvala J, Borysiewicz LK, Clare AW, David A, et al. Chronic fatigue syndrome:guidelines for research. Journal of the Royal Societyof Medicine 1991;84(2):118‐21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. The National Task Force on Chronic Fatigue Syndrome.Report from the National Task Force on Chronic Fatigue Syndrome (CFS),Post Viral Fatigue Syndrome (PVFS), Myalgic Encephalomyelitis (ME). AppendixB. Bristol: Westcare,1994.
  46. Wallman K. Confirmation of ages (means andSDs) of groups in trial. Personal correspondence (emailto Lillebeth Larun), 2009 2 November.
  47. Ware JE, Sherbourne CD. The MOS 36‐item short formhealth survey (SF‐36). MedicalCare 1992;30:473‐83. [PubMed] [Google Scholar]
  48. Wearden AJ. Raw data to facilitatecalculations for meta‐analysis. Personalcommunication (email),2009 March.
  49. Zigmond AS, Snaith RP. The Hospital Anxiety andDepression Scale. Acta PsychiatricaScandinavica 1983;67(6):361‐70. [DOI] [PubMed] [Google Scholar]

Articles from The Cochrane Database of Systematic Reviews are provided here courtesy ofWiley

ACTIONS

RESOURCES


[8]ページ先頭

©2009-2025 Movatter.jp