Next Article in Journal
The Relationship Between Intolerance of Uncertainty and Alcohol Use in First Responders: A Cross-Sectional Study of the Direct, Mediating and Moderating Role of Generalized Resistance Resources
Previous Article in Journal
Biodigesters for Sustainable Food Waste Management
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Systematic Review

Reliability and Validity of Laboratory and Field Cardiorespiratory Exercise Tests for Wheelchair Users: A Systematic Review

1
Physical Education and Sport Department, Faculty of Education and Sport, University of the Basque Country, UPV/EHU, 01007 Vitoria-Gasteiz, Spain
2
Society, Sports and Physical Exercise Research Group (GIKAFIT), Physical Education and Sport Department, Faculty of Education and Sport, University of the Basque Country, UPV/EHU, 01007 Vitoria-Gasteiz, Spain
3
AKTIBOki: Research Group in Physical Activity, Physical Exercise and Sport, Physical Education and Sport Department, Faculty of Education and Sport, University of the Basque Country, UPV/EHU, 01007 Vitoria-Gasteiz, Spain
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2025, 22(3), 384; https://doi.org/10.3390/ijerph22030384
Submission received: 13 December 2024 / Revised: 26 February 2025 / Accepted: 3 March 2025 / Published: 6 March 2025
(This article belongs to the Section Global Health)

Abstract

:
Background: cardiorespiratory fitness is one of the most important components of physical fitness. In this paper, we set out to identify cardiopulmonary tests evaluated for measurement properties in wheelchair users and determine which are reliable and valid for this population. Methods: Articles were collected from PubMed, Scopus, SPORTDiscus, and Web of Science. The initial search was conducted in October 2022 and updated in July 2023 for recent publications. From 1257 screened studies, 42 met the criteria: (a) participants were wheelchair users, (b) tests measured cardiorespiratory fitness, (c) test reliability or validity was reported, (d) articles were original, and (e) full text was in English. Two independent researchers extracted participant details (number, gender, age, disability) and test information, with a third researcher resolving disagreements. Statistical analyses of test reliability and validity were documented. Results: Methodological quality was assessed using the COSMIN checklist. Evidence levels for reliability and validity were established. Moderate evidence was found for reliability in one field test, and for validity in two lab and two field tests. Conclusions: While most tests show good reliability and validity, sample sizes are small, limiting conclusions. Further research is needed to strengthen the evidence and identify the most suitable tests for wheelchair users. Researchers are encouraged to replicate validation studies to support future testing.

1. Introduction

The wheelchair-using population comprises a wide range of people with disabilities, such as tetraplegia, paraplegia, spina bifida, cerebral palsy, sclerosis or amputations [1], Nevertheless, physical activity is highly recommended for all of them in order to improve their physical fitness and also their quality of life [1,2,3,4]. One of the most important components of physical fitness is cardiorespiratory fitness (CRF), a significant factor in morbidity and mortality [5,6,7]. CRF can be measured directly, expressed as maximal oxygen consumption (VO2max), or estimated from the peak work rate achieved on a treadmill. Adequate cardiorespiratory fitness may improve the quality of life of wheelchair users, because it is associated with better health [6], reduced mortality [5,6], greater functional capacity and greater autonomy to move around without depending on others [8]. Due to different factors, such as impairments, activity limitations or participation restrictions, people with disabilities are more likely to be physically inactive [9,10,11], have poorer cardiorespiratory fitness, and tend to develop more chronic diseases and comorbidities [9,12,13]. For this reason, measuring cardiorespiratory fitness in wheelchair users is crucial.
Cardiorespiratory fitness is commonly measured in a laboratory test [14,15,16]. These tests are performed in controlled and stable situations, and they typically measure variables such as gas exchange [16,17,18], blood lactate concentration [16,17,18,19,20,21] or power output [22,23,24]. However, as laboratory tests require expensive and sophisticated equipment, qualified personnel and a high investment of time, field tests have also been used for wheelchair users to measure or estimate cardiorespiratory fitness, because they allow a large number of participants to be tested cheaply, in less time and with greater ecological validity [25,26]. As in the general population, both laboratory [27] and field [28] protocols have been used for wheelchair users. Due to differences in this population’s functional and situational capacities, the protocols used to measure cardiorespiratory fitness in people with disabilities have often been adapted from protocols used for the general population [29]. For example, Eriksson et al. [30] adapted the traditional maximal protocols used in able-bodied people for wheelchair users by creating a fixed resistance roller and measuring wheel speed. Later, more sophisticated systems were utilized for the same purpose. Klaesner et al. [31] developed a dynamometer, adjustable to any wheelchair, that could simulate different resistances and slopes and measure the forces. Arm-crank ergometers [24] or treadmills with special harnesses that attach to wheelchairs to stabilize the person in them [32] have also been used. In the same way, the field protocols used in the ordinary population have also been adapted to the characteristics and needs of wheelchair users [33]. For example, Vanderthommen et al. [34] adapted Leger and Boucher’s shuttle run test [35] for wheelchair tennis players and Yanci et al. [36] did the same with the Yo-Yo recovery test in wheelchair basketball.
Regardless of whether they are laboratory or field tests, when any variable is measured, the tool used (in this case a cardiopulmonary test) must be valid and reliable [37]. This means that it has to measure what it is designed to measure, and whenever it measures the same thing, it must give the same result [37]. The availability of valid and reliable laboratory and field protocols for wheelchair users allows their cardiorespiratory capacity to be measured adequately. This is important for identifying people who could benefit from a prevention program [38] or even for sporting purposes [33]. This allows wheelchair users to be classified by level, and in the case of large groups, to be sorted into subgroups for individualized programs. Whatever the objective of the exercise program, testing is necessary to prescribe, monitor and evaluate it [22,39] correctly. For this reason, it seems essential to deepen out scientific knowledge on the validity and reliability of laboratory and field tests for wheelchair users.
Most of the protocols used in the general population have been widely validated; however, they suffer from important variations in adjusting to the needs/characteristics of wheelchair users [36,40,41,42,43]. Therefore, the validity and reliability of these new/adapted protocols should be analyzed. Nevertheless, although there are studies on the validity of cardiopulmonary tests for wheelchair users, the conclusions that can be drawn are limited due to the reduced number of participants [22,44,45]. For this reason, this study aims to unify the information obtained in those studies, assess their quality, and show the existing scientific evidence for using each test. Similar reviews have been performed in other populations or with other types of tests as targets [29,46,47,48,49], but not for cardiorespiratory tests for wheelchair users. Eerden et al. [27] gathered the existing cardiopulmonary laboratory tests for wheelchair users with spinal cord injury, but did not assess their reliability and/or validity. The only review that is close to our topic is the one by Gossey-Tolfrey et al. [28], but this is not a systematic review, so it only includes a few articles on this topic, and it does not assess the quality of the studies; it only focuses on field tests, and the validity and reliability of the tests is not the main research topic. Therefore, given that there is no comparable systematic review and the topic to be addressed is relevant, the main aim of this review is to analyze the existing scientific evidence on the validity and reliability of laboratory and field tests for measuring cardiorespiratory fitness in wheelchair users.

2. Materials and Methods

2.1. Search Strategy

This systematic review was conducted using the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analysis) recommendations [50]. Research articles were gathered using the PubMed, Scopus, SPORTDiscus, and Web of Science database platforms, representing databases from multiple health and physical activity disciplines. The literature search was conducted in October 2022 and repeated in July 2023 for recently published articles. The search strategy included the following keywords with the relevant Boolean operators inserted: (“disability” OR “physical impairment” OR “wheelchair” OR “cerebral palsy” OR “amputation”) AND (“cardio*” OR “aerobic capacity” OR “respiratory”) AND “test” AND (“reliability” OR “validity”). The protocol was registered and published in OSF (https://doi.org/10.17605/OSF.IO/G45BR, accessed on 1 February 2025).

2.2. Eligibility Criteria

To be included in this systematic review, studies needed to meet the following criteria: (a) at least 50% of participants had to be regular wheelchair users, either for everyday life or sporting purposes; (b) the test under study had to measure or estimate cardiorespiratory fitness; (c) the article had to provide information on reliability or validity of the test; (d) studies had to be original articles published in peer-reviewed impact journals; and (e) full-text articles had to be written in English. No publication date limit was set. Failure to meet any of the criteria meant being excluded from the review. No cardiovascular cycling tests were included in this systematic review.

2.3. Screening

The selection process is presented in Figure 1. The search results were merged, and duplicate records of the same document were removed. This resulted in 1257 articles. Identified articles on the systematic search were initially checked for relevance by 2 independent researchers (the first and last authors). Articles were selected after a sequential reading of the title and abstract, always in this order. Subsequently, the researchers reviewed the full texts of potentially eligible articles. A third researcher (the second author) resolved reviewer disagreements regarding the study’s inclusion. The references of the articles that were fully read were consulted to identify possible additional studies. In the case of articles found in systematic reviews, they were considered for inclusion only when the full text was available.

2.4. Data Extraction

Data extraction was performed by 2 independent researchers (the first and last authors) and supported by a third researcher (the second author) when necessary (Figure 1). The authors critically analyzed the selected articles and extracted the data on participants (number, gender, age, disability, and characteristics), the tests studied and statistical analysis on reliability and validity.

2.5. Quality Assessment

The methodological quality of the articles was evaluated using the COSMIN (Consensus-Based Standards for Selecting Health Measurement Instruments) checklist [51]. This methodology was used in previous reviews to assess the quality of studies on the measurement properties of different tests in other populations [29,47,48]. The checklist consists of 10 boxes that evaluate different measurement properties as “inadequate”, “doubtful”, “adequate” and “very good”. For this review, Boxes 6 (relative reliability), 7 (absolute reliability), 8 (criterion validity) and 9 (convergent validity) were rated if applicable. The overall score for each box is determined by the lowest score obtained on any of the items [29].

2.6. Level of Evidence

Only studies with “doubtful” or better methodological quality were used for the evidence level analysis [29]. Evidence was considered “strong” if there was 1 study with “very good” quality or multiple studies with “adequate” quality with similar results; “moderate” if there was 1 study with “adequate” quality or multiple studies with “doubtful” quality with similar results; and “limited” if there was only 1 study with “doubtful” quality [29].

3. Results

3.1. Quality of Included Studies

A total of 42 studies were included in the review (Table 1): 16 assessed both the reliability and validity of a test, 13 assessed only reliability and 13 assessed only the validity; therefore, 29 studies assessed reliability and 29 assessed validity. The quality of reliability studies was rated as “inadequate” 24 times, “doubtful” 4 times and “adequate” once. The quality of validity studies was rated as “inadequate” 11 times, “doubtful” 14 times and “adequate” 3 times.

3.2. Characteristics of the Participants

A total of 42 studies were included in the review. Regarding gender, 22 studies were mixed, 19 included only men, and 1 article was composed entirely of women. The total number of participants was 1065 (236 females and 829 males). The studies’ sample sizes varied from 6 persons [18,20] to 102 persons [57], and the age range was 4–83 years. A total of 6 articles were focused on young people (4–18 years) [43,61,64,66,67,71], 1 article studied young and adult people together (14–46 years) [56], and the participants in the remaining 35 articles were adults (>18 years). Only three studies included people older than 60 years in their samples [17,69,73]. Concerning the disabilities of the participants, the samples in 15 papers were composed of people with spinal cord injuries (2 of them included people with poliomyelitis too), 5 articles were focused on people with cerebral palsy, 2 studies’ participants had unilateral amputations and 1 article studied people with osteogenesis imperfecta. The remaining 19 articles had people with different disabilities in the sample. In 18 studies, only wheelchair athletes were included.

3.3. Laboratory Tests

Twenty of the included studies evaluated laboratory tests. The total number of laboratory tests studied was 29. Thirteen protocols were performed in a wheelchair ergometer: eleven were incremental maximal tests (six by increasing the resistance, four by increasing the speed and one by increasing the cadence), and two were simulated races. A further six protocols were carried out with an arm-crank ergometer, of which four were incremental maximal tests, and two were submaximal tests. Another seven protocols were conducted by pushing a wheelchair on a treadmill, including six incremental maximal tests (two by increasing inclination and speed, three by increasing inclination only and one by increasing speed only) and one submaximal test. The last three protocols were incremental maximal tests performed on an arm–crank ergometer or recumbent stepper. All laboratory protocols are summarized in Table 2.
Among the 20 articles, 7 assessed both the reliability and validity of a test, 9 assessed only the reliability, and 4 assessed only the validity. A total of 16 reliability reports and 11 validity reports were registered. The quality of reliability reports was evaluated as “inadequate” in 13 cases and “doubtful” in 3 cases. The quality of validity reports was “inadequate” in six cases, “doubtful” in three cases and “adequate” in two cases. There were no “very good” reports. Considering the best quality articles, two tests showed a moderate evidence level for validity: the maximal wheelchair ergometer resistance test [23] and the 6-min arm test [17]. Both tests had limited evidence for reliability. No laboratory test showed moderate or strong evidence for reliability (Table 3).

3.4. Field Tests

Twenty-two of the included studies evaluated field tests. The total number of field tests studied was 18. A total of 12 protocols were maximal incremental wheelchair tests (8 performed in a straight line, 2 in a “figure of eight”, 1 in an octagon and 1 on a 400 m track), of which 3 were intermittent tests, and the rest were continuous. Another five tests involved covering a distance as long as possible over a limited time (5, 6 or 12 min), but in two of them, the pushing cadence was marked with a metronome. In the other four, the participants were free to perform as best they could. The remaining test was a submaximal incremental test. All the field protocols are summarized in Table 4.
Among the 22 articles, 9 assessed both the reliability and validity of a test, 4 assessed only the reliability and 9 assessed only the validity. A total of 13 reliability reports and 18 validity reports were registered. The quality of reliability reports was evaluated as “inadequate” in 11 cases, “doubtful” in 1 case and “adequate” in 1 case. The quality of validity reports was “inadequate” in 5 cases, “doubtful” in 11 cases and “adequate” in 2 cases. There were no “very good” reports. Considering the best quality articles, the shuttle wheelchair test [67] and the adapted Léger and Boucher test [58] showed a moderate evidence level for validity. Only one test showed a moderate evidence level for reliability: the 6-min push test [43] (Table 5).

4. Discussion

The main aim of this review was to analyze the existing scientific evidence on the reliability and validity of laboratory and field tests for measuring cardiorespiratory fitness in wheelchair users. To the authors’ knowledge, this is the first systematic review to analyze the level of evidence of measurement properties of cardiopulmonary tests for wheelchair users. The validity and reliability of cardiorespiratory fitness assessments are of paramount importance when conducting laboratory and field tests. High validity ensures that measures reflect true fitness levels, while reliability guarantees consistent results across different testing conditions, enhancing the effectiveness of fitness evaluations [74]. The results showed that although several studies had measured the reliability and validity of cardiopulmonary tests for wheelchair users, there was no test with strong evidence. The main reason for this was the small sample size of the studies, which is very common when studying a particular population, because it is often difficult to gather many participants. Therefore, different studies should analyze the same tests, instead of trying to evaluate new ones, as most existing studies have positive results. However, according to Janssen et al. [75], there were too many tests (57 different tests in 42 articles), and a need for unification between them.
As is often the case in general physical activity studies [76], with only 75 women (22.2%), women had less presence than men (77.8%) in the included studies. Most of the studies were mixed, with 19 involving only men and 1 involving only women. The sample sizes varied widely, ranging between 6 and 102 persons. Regarding age, most of the investigations were conducted in adults; some studies focused on young people, and very few focused on people older than 60 years. Regarding disability type, most studies were carried out with samples of individuals with different impairments or people with spinal cord injuries; there were few studies on other disabilities or impairments, such as cerebral palsy, amputations or osteogenesis imperfecta. Another important thing to consider is that 16 studies were conducted with an entirely athletic population. It is likely that, in some cases, tests with good reliability and validity values in a concrete sample may not be suitable for other samples [24]. Moreover, given that they are a population that can benefit significantly in terms of health from understanding aerobic fitness [1,2,3,4], more studies on validity and reliability in wheelchair users are needed, especially for women, in people under 18 and over 60 years of age and for some concrete disabilities.
In relation to the laboratory tests, many different tests were analyzed (29 tests) in the included studies (21 studies). Thus, the only test studied in 2 different articles was the maximal wheelchair ergometer speed test [20,21]. Therefore, although most of the studies have shown positive results, due to the limited samples and lack of corroboration, there is currently a low level of evidence to determine whether one laboratory test is more appropriate. The highest level of evidence was obtained for the maximal wheelchair ergometer resistance test [17] and 6-min arm test [23], with both obtaining a moderate level of evidence for validity. However, both had limited evidence for reliability. This shows the need for further studies on the reliability and validity of laboratory tests for wheelchair users. Furthermore, it is unclear which type of laboratory test is the gold standard for this population, so more research should be carried out on this issue. Thus, Bloemen et al. [64] recommended using a wheelchair ergometer over an arm ergometer because they obtained higher oxygen consumption and heart rate values. This is likely due to both the specificity of the movement and the involvement of more muscle mass. The results of Hartung et al. [44] suggested that when performing a wheelchair treadmill laboratory test, combining increases in both incline and resistance was more appropriate, because higher peak values were obtained by increasing both of them than by increasing the incline or resistance alone (in the resistance protocol, they obtained the lowest peak values). Other studies [69,77] suggested that for certain individuals, such as unilateral amputees, it may be more appropriate to use a recumbent stepper, as this would involve more muscle mass, obtaining higher peak values, as in the aforementioned cases. These results highlight the possible need to use different laboratory tests depending on the type of population and measurement objectives. Also, there may not be a single gold-standard test.
Regarding the field tests, the three most analyzed tests appeared in three articles each: the shuttle wheelchair test [61,66,67], the adapted Léger and Boucher test [54,55,58] and the multistage octagonal field test [34,63,65]. All three tests were continuous maximal multistage wheelchair tests: the first over a 10 m straight, the second on a 400 m athletics track, and the third in a 15 × 15 m octagon. The 6-min push test (performed over a 10-m straight) was analyzed in two articles, and the remaining tests appeared in only one. Most field tests analyzed were maximal multistage wheelchair tests (12 tests), which increased speed until exhaustion. These tests can be intermittent or continuous, and can be performed on different routes (e.g., going around an athletics track, going back and forth in a straight line, turning around an octagon or performing a figure eight-shaped route). Having rests between bouts or changes in direction can affect the outcome of the test. De Groot et al. [33] found that the test they used with tennis players was more related to skills (such as turning and the ability to propel the chair correctly at high speeds) than cardiorespiratory fitness. They argued that the increase in speed is the primary limiting factor of the test, and that an increase in resistance, often seen in laboratory tests, is the most appropriate way to obtain maximal oxygen consumption values. This is in line with the aforementioned results of Hartung et al. [44], who obtained worse values in the laboratory test when they increased only the speed. The remaining six tests included a submaximal field test [52] and five time-limited tests, two of which required participants to maintain a constant cadence [39], whereas the other three had no cadence limitations [40,41,43,71]. Different routes were also used in this type of test, such as 200 m indoor tracks, basketball courts or 10 and 15 m straights, which could affect the result. Only two time-limited tests obtained positive results: the 6-min push tests, with one over a 10 m straight [43,71] and the other over a 15 m straight [40]. This might suggest that 6 min is more appropriate than 12 min for this type of test, and that limiting the cadence may not be the most appropriate. This agrees with Christensen et al. [22], who report that upper limb tests should last between 5 and 9 min, unlike lower limb tests. Regarding evidence levels, there were two field tests with moderate evidence for validity: the shuttle wheelchair test [67] and the adapted Léger and Boucher test [58]. On the other hand, there was only one field test with a moderate level of evidence for reliability: the 6 min push test over a 10 m straight [43]. Therefore, although good results were obtained in most of the articles, there is not even moderate evidence to consider any field test as having good reliability and validity for wheelchair users, so future studies should study the psychometric properties of the above-mentioned tests for this population.

Study Limitations

Although this review was conducted with high methodological and scientific rigor standards, it is not without limitations. The literature search was conducted in English, so studies in other languages will have been left out of this review; four databases were used for this purpose, and, undoubtedly, some studies will have been missed. Furthermore, the limitation that at least 50% of participants had to be wheelchair users also meant that interesting studies with non-disabled people were not included in this review.

5. Conclusions

The main conclusion of this review is that most studies appear to obtain good levels of reliability and validity, but are sparse and lack large samples to draw solid conclusions, so more studies evaluating existing tests are needed to strengthen the level of evidence. Furthermore, many similar tests, but different protocol variations, make it impossible to group the results and draw more consistent conclusions. This is why we encourage researchers to replicate studies that assess the validity and reliability of cardiopulmonary tests in wheelchair users in order to establish which laboratory and field tests are most appropriate for this population. In particular, there is a lack of studies on validity and reliability for wheelchair users in women, children, adolescents, people over 60 years of age and people with some specific disabilities.

Author Contributions

Conceptualization, C.G. and I.G.; methodology, C.G. and I.G.; validation, C.G. and J.Y.; formal analysis, I.G., J.Y. and I.G.; investigation, I.G.; resources, I.G.; data curation, I.G. and J.Y.; writing—original draft preparation, I.G.; writing—review and editing, J.A. and A.I.; visualization, J.A. and A.I.; supervision, C.G. and J.Y.; funding acquisition, C.G. and J.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research has been funded through the University-Company-Society 2023 Project convened by the University of the Basque Country (UPV/EHU) entitled “Muévete 2.0: análisis de la capacidad funcional de personas con discapacidad (code US23/14)” and requested in collaboration with the Fundación GaituzSport Fundazioa. Also this study was supported by a pre-doctoral grant (PIF21/008) from the University of the Basque Country.

Institutional Review Board Statement

The study did not require ethical approval.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

No new data were created.

Acknowledgments

Fundación GaituzSport Fundazioa.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Selph, S.S.; Skelly, A.C.; Wasson, N.; Dettori, J.R.; Brodt, E.D.; Ensrud, E.; Elliot, D.; Dissinger, K.M.; McDonagh, M. Physical Activity and the Health of Wheelchair Users: A Systematic Review in Multiple Sclerosis, Cerebral Palsy, and Spinal Cord Injury. Arch. Phys. Med. Rehabil. 2021, 102, 2464–2481.e33. [Google Scholar] [CrossRef] [PubMed]
  2. Gauthier, C.; Arel, J.; Brosseau, R.; Hicks, A.L.; Gagnon, D.H. Reliability and minimal detectable change of a new treadmill-based progressive workload incremental test to measure cardiorespiratory fitness in manual wheelchair users. J. Spinal Cord. Med. 2017, 40, 759–767. [Google Scholar] [CrossRef] [PubMed]
  3. Kressler, J.; Cowan, R.E.; Bigford, G.E.; Nash, M.S. Reducing cardiometabolic disease in spinal cord injury. Phys. Med. Rehabil. Clin. N. Am. 2014, 25, 573–604. [Google Scholar] [CrossRef]
  4. Van der Scheer, J.W.; Martin Ginis, K.A.; Ditor, D.S.; Goosey-Tolfrey, V.L.; Hicks, A.L.; West, C.R.; Wolfe, D.L. Effects of exercise on fitness and health of adults with spinal cord injury: A systematic review. Neurology 2017, 89, 736–745. [Google Scholar] [CrossRef] [PubMed]
  5. Al-Mallah, M.H.; Sakr, S.; Al-Qunaibet, A. Cardiorespiratory Fitness and Cardiovascular Disease Prevention: An Update. Curr. Atheroscler. Rep. 2018, 20, 1. [Google Scholar] [CrossRef]
  6. Bermejo-Cantarero, A.; Álvarez-Bueno, C.; Martinez-Vizcaino, V.; García-Hermoso, A.; Torres-Costoso, A.I.; Sánchez-López, M. Association between physical activity, sedentary behavior, and fitness with health related quality of life in healthy children and adolescents: A protocol for a systematic review and meta-analysis. Medicine 2017, 96, e6407. [Google Scholar] [CrossRef]
  7. Kodama, S.; Saito, K.; Tanaka, S.; Maki, M.; Yachi, Y.; Asumi, M.; Sugawara, A.; Totsuka, K.; Shimano, H.; Ohashi, Y.; et al. Cardiorespiratory fitness as a quantitative predictor of all-cause mortality and cardiovascular events in healthy men and women: A meta-analysis. JAMA 2009, 301, 2024–2035. [Google Scholar] [CrossRef]
  8. Bouzas, S.; Molina, A.J.; Fernández-Villa, T.; Miller, K.; Sanchez-Lastra, M.A.; Ayán, C. Effects of exercise on the physical fitness and functionality of people with amputations: Systematic review and meta-analysis. Disabil. Health J. 2021, 14, 100976. [Google Scholar] [CrossRef]
  9. Carroll, D.D.; Courtney-Long, E.A.; Stevens, A.C.; Sloan, M.L.; Lullo, C.; Visser, S.N.; Fox, M.H.; Armour, B.S.; Campbell, V.A.; Brown, D.R.; et al. Vital signs: Disability and physical activity—United States, 2009–2012. MMWR Morb. Mortal. Wkly Rep. 2014, 63, 407–413. [Google Scholar]
  10. Tawashy, A.E.; Eng, J.J.; Krassioukov, A.V.; Miller, W.C.; Sproule, S. Aerobic exercise during early rehabilitation for cervical spinal cord injury. Phys. Ther. 2010, 90, 427–437. [Google Scholar] [CrossRef]
  11. Van den Berg-Emons, R.J.; Bussmann, J.B.; Haisma, J.A.; Sluis, T.A.; van der Woude, L.H.; Bergen, M.P.; Stam, H.J. A prospective study on physical activity levels after spinal cord injury during inpatient rehabilitation and the year after discharge. Arch. Phys. Med. Rehabil. 2008, 89, 2094–2101. [Google Scholar] [CrossRef] [PubMed]
  12. Remes, L.; Isoaho, R.; Vahlberg, T.; Hiekkanen, H.; Korhonen, K.; Viitanen, M.; Rautava, P. Major lower extremity amputation in elderly patients with peripheral arterial disease: Incidence and survival rates. Aging Clin. Exp. Res. 2008, 20, 385–393. [Google Scholar] [CrossRef] [PubMed]
  13. Wezenberg, D.; van der Woude, L.H.; Faber, W.X.; de Haan, A.; Houdijk, H. Relation between aerobic capacity and walking ability in older adults with a lower-limb amputation. Arch. Phys. Med. Rehabil. 2013, 94, 1714–1720. [Google Scholar] [CrossRef] [PubMed]
  14. Davis, G.M. Exercise capacity of individuals with paraplegia. Med. Sci. Sports Exerc. 1993, 25, 423–432. [Google Scholar] [CrossRef]
  15. Stoboy, H.; Rich, B.W.; Lee, M. Workload and energy expenditure during wheelchair propelling. Paraplegia 1971, 8, 223–230. [Google Scholar] [CrossRef]
  16. Wicks, J.R.; Lymburner, K.; Dinsdale, S.M.; Jones, N.L. The use of multistage exercise testing with wheelchair ergometry and arm cranking in subjects with spinal cord lesions. Paraplegia 1977, 15, 252–261. [Google Scholar] [CrossRef]
  17. Langbein, W.E.; Robinson, C.J.; Reid, C.M. Validation of the wheelchair aerobic fitness trainer and test protocol. Proc. Annu. Conf. Eng. Med. Biol. 1991, 13, 1839–1840. [Google Scholar] [CrossRef]
  18. Bhambhani, Y.N.; Holland, L.J.; Eriksson, P.; Steadward, R.D. Physiological responses during wheelchair racing in quadriplegics and paraplegics. Paraplegia 1994, 32, 253–260. [Google Scholar] [CrossRef]
  19. Bhambhani, Y.N.; Eriksson, P.; Steadward, R.D. Reliability of peak physiological responses during wheelchair ergometry in persons with spinal cord injury. Arch. Phys. Med. Rehabil. 1991, 72, 559–562. [Google Scholar]
  20. Bhambhani, Y.N.; Holland, L.J.; Steadward, R.D. Maximal aerobic power in cerebral palsied wheelchair athletes: Validity and reliability. Arch. Phys. Med. Rehabil. 1992, 73, 246–252. [Google Scholar]
  21. Bhambhani, Y.N.; Holland, L.J.; Steadward, R.D. Anaerobic threshold in wheelchair athletes with cerebral palsy: Validity and reliability. Arch. Phys. Med. Rehabil. 1993, 74, 305–311. [Google Scholar] [PubMed]
  22. Christensen, J.; Tang, L.; Doherty, P.; Langhorn, C.; Langberg, H. Test-retest reliability of a maximal arm cycle exercise test for younger individuals with traumatic lower limb amputations. Eur. J. Physiother. 2019, 22, 115–120. [Google Scholar] [CrossRef]
  23. Hol, A.T.; Eng, J.J.; Miller, W.C.; Sproule, S.; Krassioukov, A.V. Reliability and validity of the six-minute arm test for the evaluation of cardiovascular fitness in people with spinal cord injury. Arch. Phys. Med. Rehabil. 2007, 88, 489–495. [Google Scholar] [CrossRef]
  24. Morgan, K.A.; Taylor, K.L.; Tucker, S.M.; Cade, W.T.; Klaesner, J.W. Exercise testing protocol using a roller system for manual wheelchair users with spinal cord injury. J. Spinal Cord. Med. 2019, 42, 288–297. [Google Scholar] [CrossRef]
  25. Granados, C.; Yanci, J.; Badiola, A.; Iturricastillo, A.; Otero, M.; Olasagasti, J.; Bidaurrazaga-Letona, I.; Gil, S.M. Anthropometry and Performance in Wheelchair Basketball. J. Strength. Cond. Res. 2015, 29, 1812–1820. [Google Scholar] [CrossRef] [PubMed]
  26. Mayorga-Vega, D.; Bocanegra-Parrilla, R.; Ornelas, M.; Viciana, J. Criterion-Related Validity of the Distance- and Time-Based Walk/Run Field Tests for Estimating Cardiorespiratory Fitness: A Systematic Review and Meta-Analysis. PLoS ONE 2016, 11, e0151671. [Google Scholar] [CrossRef]
  27. Eerden, S.; Dekker, R.; Hettinga, F.J. Maximal and submaximal aerobic tests for wheelchair-dependent persons with spinal cord injury: A systematic review to summarize and identify useful applications for clinical rehabilitation. Disabil. Rehabil. 2018, 40, 497–521. [Google Scholar] [CrossRef]
  28. Goosey-Tolfrey, V.L.; Leicht, C.A. Field-based physiological testing of wheelchair athletes. Sports Med. 2013, 43, 77–91. [Google Scholar] [CrossRef]
  29. Wouters, M.; Evenhuis, H.M.; Hilgenkamp, T.I. Systematic review of field-based physical fitness tests for children and adolescents with intellectual disabilities. Res. Dev. Disabil. 2017, 61, 77–94. [Google Scholar] [CrossRef]
  30. Eriksson, P.; Löfström, L.; Ekblom, B. Aerobic power during maximal exercise in untrained and well-trained persons with quadriplegia and paraplegia. Scand. J. Rehabil. Med. 1988, 20, 141–147. [Google Scholar]
  31. Klaesner, J.; Morgan, K.A.; Gray, D.B. The development of an instrumented wheelchair propulsion testing and training device. Assist. Technol. 2014, 26, 24–32. [Google Scholar] [CrossRef]
  32. Knechtle, B.; Köpfli, W. Treadmill exercise testing with increasing inclination as exercise protocol for wheelchair athletes. Spinal Cord. 2001, 39, 633–636. [Google Scholar] [CrossRef] [PubMed]
  33. De Groot, S.; Valent, L.J.; Fickert, R.; Pluim, B.M.; Houdijk, H. An Incremental Shuttle Wheel Test for Wheelchair Tennis Players. Int. J. Sports Physiol. Perform. 2016, 11, 1111–1114. [Google Scholar] [CrossRef] [PubMed]
  34. Vanderthommen, M.; Francaux, M.; Colinet, C.; Lehance, C.; Lhermerout, C.; Crielaard, J.M.; Theisen, D. A multistage field test of wheelchair users for evaluation of fitness and prediction of peak oxygen consumption. J. Rehabil. Res. Dev. 2002, 39, 685–692. [Google Scholar]
  35. Léger, L.; Boucher, R. An indirect continuous running multistage field test: The Université de Montréal track test. Can. J. Appl. Sport. Sci. 1980, 5, 77–84. [Google Scholar]
  36. Yanci, J.; Granados, C.; Otero, M.; Badiola, A.; Olasagasti, J.; Bidaurrazaga-Letona, I.; Iturricastillo, A.; Gil, S.M. Sprint, agility, strength and endurance capacity in wheelchair basketball players. Biol. Sport 2015, 32, 71–78. [Google Scholar] [CrossRef]
  37. Currell, K.; Jeukendrup, A.E. Validity, reliability and sensitivity of measures of sporting performance. Sports Med. 2008, 38, 297–316. [Google Scholar] [CrossRef] [PubMed]
  38. Ruiz, J.R.; Castro-Piñero, J.; Artero, E.G.; Ortega, F.B.; Sjöström, M.; Suni, J.; Castillo, M.J. Predictive validity of health-related fitness in youth: A systematic review. Br. J. Sports Med. 2009, 43, 909–923. [Google Scholar] [CrossRef]
  39. Laskin, J.J.; Slivka, D.; Frogley, M. A cadence based sub-maximal field test for the prediction of peak oxygen consumption in elite wheelchair basketball athletes. J. Exerc. Physiol. Online 2004, 7, 8–18. [Google Scholar]
  40. Cowan, R.E.; Callahan, M.K.; Nash, M.S. The 6-min push test is reliable and predicts low fitness in spinal cord injury. Med. Sci. Sports Exerc. 2012, 44, 1993–2000. [Google Scholar] [CrossRef]
  41. Dwyer, G.B.; Davis, R.W. The relationship between a twelve minute wheelchair push test and VO2peak in women wheelchair athletes. Sport. Med. Train. Rehabil. 1998, 8, 1–11. [Google Scholar] [CrossRef]
  42. Longmuir, P.E.; Shephard, R.J. Refinement of the arm CAFT protocol for assessing aerobic fitness in adults with mobility impairments. Adapt. Phys. Act. Q. 1995, 12, 362–376. [Google Scholar] [CrossRef]
  43. Verschuren, O.; Ketelaar, M.; De Groot, J.; Vila Nova, F.; Takken, T. Reproducibility of two functional field exercise tests for children with cerebral palsy who self-propel a manual wheelchair. Dev. Med. Child. Neurol. 2013, 55, 185–190. [Google Scholar] [CrossRef]
  44. Hartung, G.H.; Lally, D.A.; Blancq, R.J. Comparison of treadmill exercise testing protocols for wheelchair users. Eur. J. Appl. Physiol. Occup. Physiol. 1993, 66, 362–365. [Google Scholar] [CrossRef]
  45. Kelly, V.G.; Chen, K.K.; Oyewale, M. Reliability of the 30-15 intermittent fitness test for elite wheelchair rugby players. Sci. Med. Footb. 2018, 2, 191–195. [Google Scholar] [CrossRef]
  46. Ayán Pérez, C.; Martínez-Lemos, I.; Lago-Ballesteros, J.; Cancela Carral, J.M.; Loira-Camiña, N. Reliability and Validity of Physical Fitness Field-Based Tests in Down Syndrome: A Systematic Review. J. Policy Pract. Intellect. Disabil. 2016, 13, 142–156. [Google Scholar] [CrossRef]
  47. Balemans, A.C.; Fragala-Pinkham, M.A.; Lennon, N.; Thorpe, D.; Boyd, R.N.; O’Neil, M.E.; Bjornson, K.; Becher, J.G.; Dallmeijer, A.J. Systematic review of the clinimetric properties of laboratory- and field-based aerobic and anaerobic fitness measures in children with cerebral palsy. Arch. Phys. Med. Rehabil. 2013, 94, 287–301. [Google Scholar] [CrossRef] [PubMed]
  48. Lennon, N.; Thorpe, D.; Balemans, A.C.; Fragala-Pinkham, M.; O’Neil, M.; Bjornson, K.; Boyd, R.; Dallmeijer, A.J. The clinimetric properties of aerobic and anaerobic fitness measures in adults with cerebral palsy: A systematic review of the literature. Res. Dev. Disabil. 2015, 45–46, 316–328. [Google Scholar] [CrossRef]
  49. Oppewal, A.; Hilgenkamp, T.I.M.; van Wijck, R.; Evenhuis, H.M. Cardiorespiratory fitness in individuals with intellectual disabilities-A review. Res. Dev. Disabil. 2013, 34, 3301–3316. [Google Scholar] [CrossRef]
  50. Moher, D.; Liberati, A.; Tetzlaff, J.; Altman, D.G. Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement. J. Clin. Epidemiol. 2009, 62, 1006–1012. [Google Scholar] [CrossRef]
  51. Mokkink, L.B.; Prinsen, C.A.; Patrick, D.; Alonso, J.; Bouter, L.M.; de Vet, H.C.W.; Terwee, C.B. COSMIN Study Design Checklist for Patient-Reported Outcome Measurement Instruments; Department of Epidemiology and Biostatistics, Amsterdam Public Health Research Institute, Amsterdam University Medical Centers, Location VUmc: Amsterdam, The Netherlands, 2019; pp. 1–32. [Google Scholar]
  52. Kofsky, P.R.; Davis, G.M.; Shephard, R.J.; Jackson, R.W.; Keene, G.C. Field testing: Assessment of physical fitness of disabled adults. Eur. J. Appl. Physiol. Occup. Physiol. 1983, 51, 109–120. [Google Scholar] [CrossRef]
  53. Holland, L.J.; Bhambhani, Y.N.; Ferrara, M.S.; Steadward, R.D. Reliability of the maximal aerobic power and ventilatory threshold in adults with cerebral palsy. Arch. Phys. Med. Rehabil. 1994, 75, 687–691. [Google Scholar] [CrossRef] [PubMed]
  54. Vinet, A.; Bernard, P.L.; Poulain, M.; Varray, A.; Le Gallais, D.; Micallef, J.P. Validation of an incremental field test for the direct assessment of peak oxygen uptake in wheelchair-dependent athletes. Spinal Cord. 1996, 34, 288–293. [Google Scholar] [CrossRef]
  55. Poulain, M.; Vinet, A.; Bernard, P.L.; Varray, A. Reproducibility of the adapted leger and boucher test for wheelchair-dependent athletes. Spinal Cord. 1999, 37, 129–135. [Google Scholar] [CrossRef] [PubMed]
  56. Vanlandewijck, Y.C.; Daly, D.J.; Theisen, D.M. Field test evaluation of aerobic, anaerobic, and wheelchair basketball skill performances. Int. J. Sports Med. 1999, 20, 548–554. [Google Scholar] [CrossRef]
  57. Stewart, M.W.; Melton-Rogers, S.L.; Morrison, S.; Figoni, S.F. The measurement properties of fitness measures and health status for persons with spinal cord injuries. Arch. Phys. Med. Rehabil. 2000, 81, 394–400. [Google Scholar] [CrossRef]
  58. Vinet, A.; Le Gallais, D.; Bouges, S.; Bernard, P.L.; Poulain, M.; Varray, A.; Micallef, J.P. Prediction of VO2peak in wheelchair-dependent athletes from the adapted Léger and Boucher test. Spinal Cord. 2002, 40, 507–512. [Google Scholar] [CrossRef] [PubMed]
  59. Goosey-Tolfrey, V.L.; Tolfrey, K. The multi-stage fitness test as a predictor of endurance fitness in wheelchair athletes. J. Sports Sci. 2008, 26, 511–517. [Google Scholar] [CrossRef]
  60. Leicht, C.A.; Tolfrey, K.; Lenton, J.P.; Bishop, N.C.; Goosey-Tolfrey, V.L. The verification phase and reliability of physiological parameters in peak testing of elite wheelchair athletes. Eur. J. Appl. Physiol. 2013, 113, 337–345. [Google Scholar] [CrossRef]
  61. Verschuren, O.; Zwinkels, M.; Ketelaar, M.; Reijnders-van Son, F.; Takken, T. Reproducibility and validity of the 10-meter shuttle ride test in wheelchair-using children and adolescents with cerebral palsy. Phys. Ther. 2013, 93, 967–974. [Google Scholar] [CrossRef]
  62. Goosey-Tolfrey, V.L.; Paulson, T.A.W.; Tolfrey, K.; Eston, R.G. Prediction of peak oxygen uptake from differentiated ratings of perceived exertion during wheelchair propulsion in trained wheelchair sportspersons. Eur. J. Appl. Physiol. 2014, 114, 1251–1258. [Google Scholar] [CrossRef]
  63. Weissland, T.; Faupin, A.; Borel, B.; Berthoin, S.; Leprêtre, P.M. Effects of modified multistage field test on performance and physiological responses in wheelchair basketball players. Biomed. Res. Int. 2015, 2015, 245378. [Google Scholar] [CrossRef]
  64. Bloemen, M.A.T.; De Groot, J.F.; Backx, F.J.G.; Westerveld, R.A.; Takken, T. Arm cranking versus wheelchair propulsion for testing aerobic fitness in children with spina bifida who are wheelchair dependent. J. Rehabil. Med. 2015, 47, 432–437. [Google Scholar] [CrossRef] [PubMed]
  65. Weissland, T.; Faupin, A.; Borel, B.; Lepretre, P.-M. Comparison Between 30-15 Intermittent Fitness Test and Multistage Field Test on Physiological Responses in Wheelchair Basketball Players. Front. Physiol. 2015, 6, 380. [Google Scholar] [CrossRef]
  66. Bongers, B.C.; Rijks, E.B.G.; Harsevoort, A.G.J.; Takken, T.; van Brussel, M. 10-m shuttle ride test in youth with osteogenesis imperfecta who use wheelchairs: Feasibility, reproducibility, and physiological responses. Phys. Ther. 2016, 96, 679–686. [Google Scholar] [CrossRef] [PubMed]
  67. Bloemen, M.A.T.; De Groot, J.F.; Backx, F.J.G.; Benner, J.; Kruitwagen, C.L.J.J.; Takken, T. Wheelchair shuttle test for assessing aerobic fitness in youth with spina bifida: Validity and reliability. Phys. Ther. 2017, 97, 1020–1029. [Google Scholar] [CrossRef] [PubMed]
  68. Pérez-Tejero, J.; Coterón, J.; Rabadán, M.; Sampedro, J. Field test validation for wheelchair basketball players’ aerobic capacity assessment. Eur. J. Hum. Mov. 2018, 40, 136–148. [Google Scholar]
  69. Simmelink, E.K.; Wempe, J.B.; Geertzen, J.H.B.; van der Woude, L.H.V.; Dekker, R. Feasibility, safety, and reliability of exercise testing using the combined arm-leg (Cruiser) ergometer in subjects with a lower limb amputation. PLoS ONE 2018, 13, e0202264. [Google Scholar] [CrossRef]
  70. Qi, L.; Zhang, L.; Lin, X.-B.; Ferguson-Pell, M. Wheelchair propulsion fatigue thresholds in electromyographic and ventilatory testing. Spinal Cord. 2020, 58, 1104–1111. [Google Scholar] [CrossRef]
  71. Damen, K.M.S.; Takken, T.; De Groot, J.F.; Backx, F.J.G.; Radder, B.; Roos, I.C.P.M.; Bloemen, M.A.T. 6-minute push test in youth who have spina bifida and who self-propel a wheelchair: Reliability and physiologic response. Phys. Ther. 2020, 100, 1852–1861. [Google Scholar] [CrossRef]
  72. Goosey-Tolfrey, V.L.; Groot, S.; De Tolfrey, K.; Paulson, T.A.W. Criterion Validity of a Field-Based Assessment of Aerobic Capacity in Wheelchair Rugby Athletes. Int. J. Sports Physiol. Perform. 2021, 16, 1341–1346. [Google Scholar] [CrossRef] [PubMed]
  73. Holm, N.J.; Biering-Sorensen, F.; Schou, L.H.; Møller, T. The test-retest reliability of individualized VO2peak test modalities in people with spinal cord injury undergoing rehabilitation. Spinal Cord. 2021, 59, 82–91. [Google Scholar] [CrossRef] [PubMed]
  74. Shushan, T.; Lovell, R.; McLaren, S.J.; Buchheit, M.; Dello Iacono, A.; Arguedas-Soley, A.; Norris, D. Assessing criterion and longitudinal validity of submaximal heart rate indices as measures of cardiorespiratory fitness: A preliminary study in football. J. Sci. Med. Sport. 2024, 27, 565–571. [Google Scholar] [CrossRef] [PubMed]
  75. Janssen, R.J.F.; De Groot, S.; Van Der Woude, L.H.V.; Houdijk, H.; Vegter, R.J.K. Toward a Standardized and Individualized Laboratory-Based Protocol for Wheelchair-Specific Exercise Capacity Testing in Wheelchair Athletes: A Scoping Review. Am. J. Phys. Med. Rehabil. 2023, 102, 261–269. [Google Scholar] [CrossRef]
  76. Ham, S.A.; Yore, M.M.; Fulton, J.E.; Kohl, I.I.I.H.W. Prevalence of No Leisure-Time Physical Activity—35 States and the District of Columbia, 1988–2002. JAMA 2004, 291, 1693. [Google Scholar] [CrossRef]
  77. Jack, L.P.; Purcell, M.; Allan, D.B.; Hunt, K.J. Comparison of peak cardiopulmonary performance parameters during robotics-assisted treadmill exercise and arm crank ergometry in incomplete spinal cord injury. Technol. Health Care 2010, 18, 285–296. [Google Scholar] [CrossRef]
Figure 1. Study selection flow-chart.
Figure 1. Study selection flow-chart.
Ijerph 22 00384 g001
Table 1. Evaluation of the included studies according to the COSMIN checklist.
Table 1. Evaluation of the included studies according to the COSMIN checklist.
StudyOROVR1R2R3R4R5R6R7R8R9V1V2V3V4V5V6 *V7 *
Wicks et al. [16]-43214221
Kofsky et al. [52]-322311
Bhambhani et.al. [19]4-112114-31
Langbein et al. [17]32112132-3121211
Bhambhani et al. [20]43112124-3133111
Bhambhani et al. [21]4-112124-31
Hartung et al. [44]44112324-213214221
Bhambhani et al. [18]-313211
Holland et al. [53]4-112144-31
Longimuir et al. [42]34111113-3122441
Vinet et al. [54]-323311
Dwyer et al. [41]-313313
Poulain et al. [55]4-112114141
Vanlandewijck et al. [56]43112114-3123211
Stewart et al. [57]4-113421-21
Vinet et al. [58]-222111
Vanderthomen et al. [34]4311111412123111
Laskin et al. [39]4311212411123211
Hol et al. [23]3211212321122211
Goosey-Tolfrey et al. [59]4311211411113211
Leicht et al. [60]4-111114221
Cowan et al. [40]3411212311112243
Verschuren et al. [61]4311211411123211
Verschuren et al. [43]2-112112111
Goosey-Tolfrey et al. [62]-413241
Weissland et al. [63]-41114221
Bloemen et al. [64]4411111411133341
Yanci et al. [36]4-111114-21
Weissland et al. [65]-43134221
de Groot et al. [33]-323111
Bongers et al. [66]4311211411123111
Gauthier et al. [4]4-111114111
Bloemen et al. [67]4211211411112211
Kelly et al. [45]4-111114221
Pérez Tejero et al. [68]-323211
Simmelink et al. [69]4-112134111
Christensen et al. [22]4-112124111
Morgan et al. [24]4311211411123211
Qi et al. [70]-413141
Damen et al. [71]4411211411111241
Goosey-Tolfrey et al. [72]-423241
Holm et al. [73]4-111114-11
Abbreviations: 1 = very good; 2 = adequate; 3 = doubtful; 4 = inadequate; OR = overall reliability score; OV = overall validity score; R1 = uses at least two measurements; R2 = ensures that the administrations will be independent; R3 = ensures that the patients will be stable in the interim period on the construct to be measured, R4 = uses an appropriate time interval between the two measurements, which is long enough to prevent recall, and short enough to ensure that patients remain stable; R5 = ensures that the test conditions will be similar for the measurements (e.g., type of administration, environment, instructions); R6 = performs the analysis in a sample with an appropriate number of patients (taking into account expected number of missing values); R7 = for continuous scores: calculate the standard error of measurement, smallest detectable change or limits of agreement; R8 = for continuous scores: calculates an intraclass correlation coefficient (ICC); R9 = provides a clear description of how missing items will be handled; V1 = describes whether the proposed criterion can be considered as a reasonable ‘gold standard’/formulate hypotheses about expected relationships between the PROM under study and other outcome measurement instrument(s); V2 = performs the analysis in a sample with an appropriate number of patients (taking into account expected number of missing values)/provides a clear description of the construct(s) measured by the comparator instrument(s); V3 = uses an appropriate time schedule for assessments of the PROM of interest and ‘gold standard’/Use comparator instrument(s) with sufficient measurement properties; V4 = for continuous scores: calculates correlations, or the area under the receiver operating curve/performs the analysis in a sample with an appropriate number of patients (taking into account expected number of missing values); V5 = provides a clear description of how missing items will be handled/used an appropriate time schedule for assessments of the PROM of interest and comparison instruments; V6 = uses statistical methods that are appropriate for the hypotheses to be tested; V7 = provides a clear description of how missing items will be handled. * if columns V6 and V7 are filled in, it means that convergent validity was evaluated, so the items evaluated in columns V1–V5 are those indicated in second position.
Table 2. Characteristics and reliability and validity results of laboratory tests.
Table 2. Characteristics and reliability and validity results of laboratory tests.
StudyTestTests CharacteristicsDisability/SportParticipants Sex and Age (Years)Reliability (Relative and/or Absolute)Criterion (CRI) or Convergent (CON) Validity
Wicks et al. [16] Maximal WE speed test20 rpm + 10 or 5 rpm/min with low resistanceSCI (n = 2M were able-bodied)n = 3F 4M
28.1 ± 4.1 (23–34)
CON
No significant differences
Maximal WE speed test20 rpm + 10 or 5 rpm/min with high resistance
Maximal ACE resistance test60 rpm + 25-50-100 kpm/min
Bhambhani et al. [19]Maximal WE cadence test40 strikes/min + 8 strikes/min/2 minPP and TTn = 2F 5M
26.5 ± 3.5 (22–32)
r = 0.98 *
t test no differences
Langbein et al. [17]Maximal WE resistance test6 W + 5-20 W/3 min (continuous and intermittent)TT, PP, amputations and lower limb fracturesn = 51M
(17–69)
r = 0.9 *CRI
No significant differences
r = 0.91 *
Bhambhani et al. [20]Maximal WE speed test5 km/h + 2 km/h/2 minCP athletes, national and international leveln = 6M
24.8 ± 3.7 (19–29)
r = 0.89 *CRI
r < 0.31 (n = 4)
Bhambhani et al. [21]Maximal WE speed test5 km/h + 2 km/h/2 min
with 1 min rest
CP athletes, national and international leveln = 11M
25.1 ± 4.3 (19–33)
r = 0.89 *, 0.9 *
Hartung et al. [44]Maximal TM grade and speed test3.2 km/h + 1.6 km/h &
0.5% every 2 min
PP and low extremity polio disabilityn = 7M
33 (21–44)
ICC = 0.86 *, 0.87 *CON
ANOVA, no differences
Maximal TM grade test0 + 1%/2 min at 4.8 km/hICC = 0.41 *, 0.44 *
Maximal TM speed test3.2 + 3.2 km/h/2 minICC = 0.40 *, 0.83 *
Bhambhani et al. [18]Simulated 1.6 km race in WE1.6 km simulated raceTT Activesn = 7M
30.6 ± 5.2
CRI
r = 0.79 *, 0.81 *, 0.23, 0.32
Simulated 3.2 km race in WE3.2 km simulated racePP Activesn = 6M
29.0 ± 4.6
CRI
r = 0.82 *, 0.88 *, 0.19, 0.61
Holland et al. [53]Maximal WE speed test4 km/h + 1 km/h/2 minCPn = 1F 4M
25.6 ± 3.7
(21–31)
r = 0.79 *, 0.83 *
(Data mixed with cycle ergometer test)
Longimuir et al. [42]Arm CAFTSubmaximal 3 × 3 min in ACE
(Stopping when achieving a certain HR, load dependant on sex and age)
Different physical disabilitiesn = 18F 17 M
(21–59)
r = 0.97 * (n = 30)CRI
Paired t test, no differences
Stewart et al. [57]Maximal ACE resistance test/maximal WE resistance test15 W + 15 W/3 min
+7.5 W/3 min (after RPE15)
with 2 min recovery
SCIn = 19F 83M
29.7 ± 10.3
ICC = 0.82 *
Hol et al. [23]6-MATSubmaximal 6 min at 60–70% HR max in ACEPP and TTn = 5F 25M
36.3 ± 9.3 (19–49)
ICC = 0.81 * (0.58–0.92)
SEM = 1.62 mL/kg/min
CRI
r = 0.92 *, 0.73 *, 0.63 *
Leicht et al. [60]Maximal TM grade testIndividualized speed 1% + 0.3/min or 0.1/40 sWR, WB and WT; national and international n = 24M
28.1 ± 5.2 (TT)
31.7 ± 8.7 (PP)
24.0 ± 6.2 (non-SCI)
ICC = 0.84–0.99 *
CV = 9.3% (TT), 4.5% (PP), 3.3% (non-SCI)
Goosey-Tolfrey et al. [62]Submaximal TM speed test6 × 4 minWR and WB players; elite leveln = 26M
30 ± 5 (TT)
29 ± 9 (PP)
27 ± 8 (non-SCI)
CRI
TE = 0.14–0.16 L/min (TT) 0.19–0.22 L/min (PP) 0.54–68 L/min (non-SCI)
CV = 10.1–11.1% (TT) 6.5–7.5% (PP) 16.8–20.2% (non-SCI)
Bloemen et al. [64]Maximal WE resistance test0 + 0.1 torque/min
at 60–80 rpm
Spina bifida wheelchair usersn = 4F 9M
13.4 ± 3.5 (8–17)
CRI
t test, no difference
VO2peak 15% greater than ACE test
n = 11F 13M
14.8 ± 3.0 (8–19)
ICC = 0.93 * (0.83–0.97)
SEM = 1.87 mL/kg/min
SDC = 5.18 mL/kg/min
Gauthier et al. [4]Maximal TM grade and speed testDifferent slope and speed increases every 1 minWheelchair usersn = 21F 4M
35.3 ± 14.9 (18–63)
ICC = 0.84 * (0.66–0.92),
0.86 * (0.70–0.93)
SDC = 2.27 mL/kg/min,
157.9 mL/min
SEM = 5.3 mL/kg/min,
368.5 mL/min
Simmelink et al. [69]Maximal ALE resistance test3 min 20 W + 10 W/min
at 50 rpm
Unilateral lower limb amputationn = 3F 14M
54.5 ± 18.6 (25–80)
ICC = 0.84 * (0.61–0.94),
0.91 * (0.77–0.97)
LoA = −0.56 to 0.60 L/min
Christensen et al. [22]Maximal ACE resistance test40 W+ 20 W/minUnilateral lower limb amputationn = 8M
32.5 ± 4.6 (18–40)
ICC = 0.51 * (0.11–0.85), 0.73 * (0.40–0.93), 0.74 * (0.40–0.93)
CV = 14.48%
SEM = 0.18 L/min
LoA = −0.66 to 0.40 L/min
Morgan et al. [24]Maximal WE resistance testIncreasing resistance every minute at 70% of max speedSCI
70% athletes
n = 10M
33 ± 19.6 (18–60)
ICC = 0.82 *, 0.97 *
LoA = −6 to 4.5 mL/kg/min
CRI
r = 0.79 *, 0.77 *
t test, no differences
LoA = −4.1 to 3.6 mL/kg/min
Qi et al. [70]Maximal WE resistance test10 W + 5 or 10 W/min
at 1 m/s
SCI n = 3F 7M
42.1 ± 8.4 (29–55)
CRI
ICC = 0.91 *
Holm et al. [73]Maximal ACE resistance test5 W + 10 W/min
at 60 rpm
SCI n = 1F 4M
47 ± 25.6 (21–83)
ICC = 0.99 *
mTBRS-XT25 W + 15 W/2 min in ALESCI n = 3F 6M
61.7 ± 13.3 (41–76)
ICC = 0.96 *
TBRS-XT50 W + 25 W/2 min in ALESCI n = 1F 7M
44.3 ± 15.6 (27–75)
ICC = 0.99 *
* p < 0.05; abbreviations: 6-MAT = six-minute arm test; ACE = arm-crank ergometer; ALE = arm–leg ergometer; CAFT = Canadian aerobic fitness test; CP = cerebral palsy; CV = coefficient of variation; ICC = intraclass correlation coefficient; LoA = limits of agreement; mTBRS-XT = modified total body recumbent stepper exercise test; PP = paraplegia; r = correlation coefficient; SCI = spinal cord injury; SDC = smallest detectable change; SEM = standard error measurement; TBRS-XT = total body recumbent stepper exercise test; TM = treadmill; TT = tetraplegia; VO2 = oxygen consumption volume; WB = wheelchair basketball; WE = wheelchair ergometer; WR = wheelchair rugby; WT = wheelchair tennis.
Table 3. Evidence level of reliability and validity of laboratory tests (with a limited or greater evidence level).
Table 3. Evidence level of reliability and validity of laboratory tests (with a limited or greater evidence level).
StudiesTestReliabilityValidity
Langbein et al. [17]Maximal WE resistance testLimitedModerate
Hol et al. [23]6-MATLimitedModerate
Bhambhani et al. [20]Maximal WE speed test ?Limited
Morgan et al. [24]Maximal WE resistance test ?Limited
Bhambhani et al. [18]Simulated 1.6 km race in WEN/ALimited
Bhambhani et al. [18]Simulated 3.2 km race in WEN/ALimited
Longimuir et al. [42]Arm CAFTLimited?
Abbreviations: ? = inadequate methodology; 6-MAT = six-minute arm test; CAFT = canadian aerobic fitness test; N/A = not available; WE = wheelchair ergometer. Note: those tests that did not reach a level of evidence of at least limited reliability or validity are not presented in this table.
Table 4. Characteristics and reliability and validity results of field tests.
Table 4. Characteristics and reliability and validity results of field tests.
StudyTestTests CharacteristicsDisability/SportParticipants Sex and Age (Average, Standard Deviation and Range)Reliability (Relative and Absolute)Criterion (CRI) or Convergent (CON) Validity
Kofsky et al. [52]Submaximal field test3 × 5 min at 40-60-80% HRmax
with 2 min rest
Lower limb disabilities;
sedentary individuals, active individuals and athletes
n = 7F
28.7 ± 3.4 (18–55)
CRI
r = 0.61 *
n = 42M
28.2 ± 1.3 (18–55)
CRI
r = 0.67 *
Vinet et al. [54]ALBT4 km/h + 1 km/h/min
in 400 m tartan
WT and WRA n = 9M
28.9 ± 4.2 (24–35)
CRI
r = 0.65 *
Wilcoxon, no differences
LoA ≃ −7 to 14 mL/kg/min
Dwyer et al. [41]12WPTMaximal distance in 12 min in 200 m indoor tartanWB national leveln = 13F
26 ± 6 (19–40)
CRI
r = 0.46 *, 0.30
Poulain et al. (1999) [55]ALBT 4 km/h + 1 km/h/min
in 400 m tartan
WT, WRA, WTT; regional, national and international leveln = 8M
30.8 ± 5.1 (24–39)
No significant differences (MANOVA)
LoA ≃ −2 to 1.2 km/h
CV = 2%
Vanlandewijck et al. [56]SWT5 km/h + 0.5 km/h/min
between two marks 25 m away
WB national leveln = 20M
31.7 ± 10.4 (14–46)
r = 0.97 *CRI
r = 0.64 *, 0.87 *, (n = 15)
Vinet et al. [58]ALBT4 km/h + 1 km/h/min
in 400 m tartan
WB, WRA, WF or AS n = 9F 40M
30.3 ± 0.4 (18–47)
CRI
r = 0.81 *
Vanderthomen et al. [34]MFT in octagon6 km/h + 0.37 km/h/min
in a 15 m × 15 m octagon
PP and post-polion = 10M
38.2 ± 13.0
ICC = 0.99 *, 0.88 *
LoA ≃ −1.4 to 1.8 stages
−4.4/4.6 mL/kg/min
CRI
r = 0.77 *
Laskin et al. [39]Cadence-based submaximal field test5 min 60 pushes/min
in a basketball court
WB, elite leveln = 24M
26.1 ± 6.6
ICC = 0.50 *
LoA = −0.83/1.05 L/min
(n = 16)
CRI
r = 0.49 *
t test, no differences
Cadence-based submaximal field test5 min 80 pushes/min
in a basketball court
WB, elite leveln = 24M
26.1 ± 6.6
ICC = 0.62 *
LoA = −0.66 to 1.06 L/min
(n = 16)
CRI
r = 0.56 *
t test, no differences
Goosey-Tolfrey et al. [59]SWT8.5 km/h + 0.5 km/h/min
between two marks 20 m away
WB, international leveln = 24M
29 ± 6
ICC = 0.88 * (0.58–0.97), 0.91 * (0.69–0.98)
SEM = 2.4 beats/min, 86 m
CV = 1.3%, 4.4%
(n = 10)
CRI
r = 0.57 *
Cowan et al. [40]6MPTMaximal distance in 6 min surrounding two cones 15 m awayPP and TT n = 6F 34M
34 ± 10 (20–45)
ICC = 0.97 * (0.94–0.98) (All), 0.93 (0.80–0.98) (TT), 0.97 (0.93–0.99) (PP)
LoA ≈ −70 to 65 m (All)
CRI
ICC = 0.86 * (0.75–0.92) (All), 0.86 (0.70–0.93) (PP), 0.65 (0.17–0.87) (TT)
Verschuren et al. [61]SWT2 km/h + 0.25 km/h/min
between two marks 10 m away
Spastic CP n = 5F 18M
13.3 ± 3.6
ICC = 0.99 * (0.98–1.00)
SEM = 0.5 min
SDC = 1.4 min
LoA ≃ −1.45 to 1.45 stages
CRI
r = 0.84 * (n = 15)
Verschuren et al. [43]6MPTMaximal distance in 6 min surrounding two marks 10 m awaySpastic CPn = 22F 51M
11.8 ± 3.6 (4–18)
ICC = 0.97 * (0.96–0.98)
SEM = 20.9 m
SDC = 57.9 m
LoA ≃ −54.8 to 62.2 m
Weissland et al. [63]MFT in octagon6 km/h + 0.37 km/h/min
in a 15 m × 15 m octagon
WB, national leveln = 2F 14M
32.4 ±5.3 (23–41)
CON
Student’s t test, differences
r = 0.93 *, 0.84 *
MFT in 8 figure6 km/h + 0.37 km/h/min
in 32 m × 15 m “8 figure”
Yanci et al. [36]Yoyo intermittent recovery test2 × 10 m with 10 s active recovery increasing speedWB, national leveln = 2F 14 M
33.1 ± 7.4 (21–46)
ICC = 0.94 *
Weissland et al. [65]MFT in octagon6 km/h + 0.37 km/h/min
in a 15 m × 15 m octagon
WB, national leveln = 2F 16M
32.0 ± 5.7 (22–41)
CON
Student’s t test, no differences
r = 0.84 *
LoA ≃ −8 to 7 mL/kg/min
30-15ITF6 km/h + 0.5 km/h/45 s
in 40 m (30 s push–15 s rest)
de Groot et al. [33]SWT5 km/h + 0.32 km/h/min
between two marks 20 m away
WT, national and international leveln = 15M
21.2 ± 8.4
CON
r = 0.40, 0.47
Bongers et al. [66]SWT2 km/h + 0.25 km/h/min
between two marks 10 m away
Osteogenesis imperfecta
wheelchair users
n = 5F 8 M
15.5 ± 6.4 (9–25)
ICC = 0.95 * (0.83–0.98), 0.97 * (0.89–0.99), 0.92 * (0.71–0.98)
SEM = 0.7 stages
SDC = 1.9 stages
LoA = −2.5 to 1.35 stages
t test no differences
CRI
r = 0.61 *, 0.45
Bloemen et al. [67]SWT2 km/h + 0.25 km/h/min
between two marks 10 m away
Spina bifida
wheelchair users, in daily life or in sports
n = 17F 16M
14.5 ± 3.1 (5–18)
CRI
r = 0.85 *, 0.84 *
t test, no differences
n = 12F 16M
14.7 ± 3.3 (5–18)
ICC = 0.96 * (0.92–0.98), 0.93 * (0.84–0.97), 0.98 * (0.96–0.99)
CV = 6.2%, 6.4%
SDC = 1.5 shuttles
SEM = 0.5 shuttles
Kelly et al. [45]30-15IFT-28m8 km/h + 0.5 km/h/min
in 28 m (30 s push–15 s rest)
WR, international leveln = 10M
31.8 ± 7.3 (20–44)
ICC = 0.99 *
SEM = 1.02 km/h
CV = 1.9%
LoA = −0.51 to 0.61 km/h
Pérez Tejero et al. [68]SWT6 km/h + 0.5 km/h/min
between two marks at 28 m
WB, elite leveln = 7M
33.3 ± 6.0 (27–44)
CRI
r = 0.85 *
Damen et al. [71]6MPTMaximal distance in 6 min surrounding two cones 10 m awaySpina bifida
wheelchair users, in daily life or in sport
n = 21F 32M (n = 8 lost)
13.6 ± 3.8 (5–19)
CRI
86% (18.5) of maximal VO2
n = 14F 12M (n = 5 lost)
13.6 ± 3.8 (5–19)
ICC = 0.95 * (0.83–0.98)
SDC = 60.7 m
SEM = 21.9 m
Goosey-Tolfrey et al. [72]MFT in 8 figure6.5 km/h + 0.36 km/h/min
in 28.2 m × 10.8 m “8 figure”
WR, national leveln = 16M
28 ± 6
CRI
Paired t test, no differences
* p < 0.05; abbreviations: 6MPT = six-minute push test; 12WPT = twelve-minute wheelchair push test; ALBT = adapted Léger and Boucher test; AS = adapted swimming; CP = cerebral palsy; CV = coefficient of variation; HR = heart rate; ICC = intraclass correlation coefficient; ITF = intermittent fitness tests; LoA = limits of agreement; MFT = multistage field test; PP = paraplegia; r = correlation coefficient; SDC = smallest detectable change; SEM = standard error measurement; SWT = shuttle wheelchair test; TT = tetraplegia; VO2 = oxygen consumption volume; WB = wheelchair basketball; WF = wheelchair fencing; WR = wheelchair rugby; WRA = wheelchair racing; WT = wheelchair tennis; WTT = wheelchair table tennis.
Table 5. Evidence level of reliability and validity of field tests (with a limited or greater evidence level).
Table 5. Evidence level of reliability and validity of field tests (with a limited or greater evidence level).
StudiesTestReliabilityValidity
Bloemen et al. [67]; Bongers et al. [66]; Verschuren et al. [61]SWT?Moderate
Vinet et al. [54]; Vinet et al. [58]ALBTN/AModerate
Verschuren et al. [43]6MPTModerate?
Vanlandewijck et al. [56]SWT?Limited
Vanderthomen et al. [34]MFT in octagon?Limited
Laskin et al. [39]Cadence-based submaximal field test?Limited
Goosey-Tolfrey et al. [59]SWT?Limited
Kofsky et al. [52]Submaximal field testN/ALimited
Dwyer et al. [41]12WPTN/ALimited
de Groot et al. [33]SWTN/ALimited
Pérez Tejero et al. [68]SWTN/ALimited
Cowan et al. [40]6MPTLimited?
Abbreviations: ? = inadequate methodology; 6MPT = six-minute push test; 12WPT = twelve-minute wheelchair push test; ALBT = adapted Léger and Boucher test; MFT = multistage field test; N/A = not available; SWT = shuttle wheelchair test. Note: those tests that did not reach a level of evidence of at least limited reliability or validity are not presented in this table.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Garate, I.; Yanci, J.; Ascondo, J.; Iturricastillo, A.; Granados, C. Reliability and Validity of Laboratory and Field Cardiorespiratory Exercise Tests for Wheelchair Users: A Systematic Review. Int. J. Environ. Res. Public Health 2025, 22, 384. https://doi.org/10.3390/ijerph22030384

AMA Style

Garate I, Yanci J, Ascondo J, Iturricastillo A, Granados C. Reliability and Validity of Laboratory and Field Cardiorespiratory Exercise Tests for Wheelchair Users: A Systematic Review. International Journal of Environmental Research and Public Health. 2025; 22(3):384. https://doi.org/10.3390/ijerph22030384

Chicago/Turabian Style

Garate, Iker, Javier Yanci, Josu Ascondo, Aitor Iturricastillo, and Cristina Granados. 2025. "Reliability and Validity of Laboratory and Field Cardiorespiratory Exercise Tests for Wheelchair Users: A Systematic Review" International Journal of Environmental Research and Public Health 22, no. 3: 384. https://doi.org/10.3390/ijerph22030384

APA Style

Garate, I., Yanci, J., Ascondo, J., Iturricastillo, A., & Granados, C. (2025). Reliability and Validity of Laboratory and Field Cardiorespiratory Exercise Tests for Wheelchair Users: A Systematic Review. International Journal of Environmental Research and Public Health, 22(3), 384. https://doi.org/10.3390/ijerph22030384

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop