1. Introduction
Today, healthcare systems around the world are facing significant challenges related to staff shortages [
1], increasing workloads [
2], the risk of professional burnout [
3], and maintaining the quality of patient care [
1]. Demographic changes [
4], an ageing population [
5], an increase in chronic diseases [
6], and the impact of pandemics further exacerbate this situation [
7]. In these circumstances, healthcare institutions must pay particular attention to how they plan, evaluate, and maintain their human resources. Effective staff planning, work environment assessment, and the introduction of professional support mechanisms are essential for ensuring patient safety and the quality of healthcare [
8,
9,
10]. Patient outcomes are determined by both quantitative and qualitative factors, such as competence, emotional stability, decision-making ability, and the ability to work in an interdisciplinary team [
11,
12].
The common phenomenon of work overload, stress, and emotional exhaustion leads to high staff turnover, as illustrated by data from Latvia, where approximately 12% of nurses leave the sector each year [
13,
14]. In 2023, over 40% of nurses in Latvia regularly experienced overwork, while regional hospitals often had 30% fewer nurses on duty than planned [
15]. Without adequate measurement, analysis and management of staff adequacy, the sustainability of the health system is seriously threatened [
16].
In Latvia, as in many other countries worldwide, there is no systematically validated, localised health system tool available to assess healthcare workers’ subjective experiences in relation to staff adequacy, workload, and quality of care. Although various international scales [
16,
17,
18,
19,
20] offer opportunities to assess professional competence, they rarely take into account the specifics of local healthcare systems, language, culture, and the psychosocial aspects of perceptions among staff. Unlike in Scandinavian countries, where both Nurse Professional Monitoring (NPM) [
21] and revalidation tools exist, professional supervision in Latvia is limited to maintaining registration data.
Both in Latvia and abroad, the assessment tools used are often limited to keeping track of superficial indicators such as the number of patients per nurse or work schedule structure, ignoring the subjective experience of the staff [
16,
17,
18,
19,
20]. Such tools do not provide a sufficient basis for making decisions about improving the working environment. The use of existing tools in Latvia is sporadic, with some institutions using fragmented assessments, but there is a lack of systematically validated tools [
22]. During the COVID-19 pandemic, healthcare organisations lacked systematic and reliable tools for assessing the subjective experience of employees, making it difficult to respond promptly to shortages of resources and psycho-emotional stress [
23]. This issue has become central to health policy and public trust in the wake of the pandemic.
A targeted and critical literature review was conducted to justify the development of the SRAQ-HP (Staff Resource Adequacy Questionnaire for Healthcare Professionals) tool. The reviewed tools include ASCOP (Actual Scope of Nursing Practice) [
17], NWI (Nursing Work Index) [
18], NPC (Nurse Professional Competence) [
19], and the Misscare Scale [
20]. While these tools provide information about the work environment, workload, and competence, they have the disadvantage of being unable to capture the local context, subjective perceptions, and organisational aspects simultaneously. The instruments reviewed in our conceptualisation phase address related but distinct constructs and therefore do not directly capture subjective staffing adequacy in our context. ASCOP focuses on the enactment and breadth of nurses’ roles and has clear strengths in delineating professional activities and task domains; however, it provides only an indirect view on resource sufficiency and does not quantify perceived adequacy of staffing in day-to-day practice [
17]. NWI is widely used to assess the practice environment (e.g., leadership, participation in hospital affairs, foundations for quality care), offering robust evidence on organisational climate; yet it does not specifically target staff members’ subjective evaluation of staffing adequacy or workload balance within a unit [
18]. NPC concentrates on competencies (knowledge, skills, attitudes) and thus informs professional preparedness, but it is not designed to evaluate perceived workload pressure or availability of human resources at the point of care [
19]. Finally, the Misscare Scale sensitively captures missed or rationed care and is valuable as an outcome-oriented signal of strain; still, it does not assess the structural and organisational support conditions that precede such omissions nor staff’s felt adequacy of resources [
20].
Rationale for SRAQ-HP. In summary, existing tools illuminate important adjacent domains (scope of practice, work environment, competence, and missed care) but do not jointly quantify staff’s subjective perception of resource adequacy together with workload and support in a way that is sensitive to local system features (language, culture, service organisation). The SRAQ-HP was therefore developed to fill this gap by integrating three theory-driven dimensions—Staffing Adequacy & Workload, Quality of Care, and Working Conditions & Support, and by providing locally adapted wording suitable for routine use in Latvian healthcare settings [
17,
18,
19,
20].
The SRAQ-HP was developed to address this issue. The development process involved an interdisciplinary team including healthcare practitioners, researchers, educational institution representatives, and data analysts. In the initial phase of the project, a much broader set of questions was developed, initially comprising more than 30 items covering various topics, such as the impact of organisational culture, communication between employees, emotional exhaustion, the number of patient complaints, shift frequency, work schedule flexibility, available resources, management support, and others.
Particular attention was paid to theoretical justification and empirical validity during the development. A three-dimensional model was chosen for analysing staff adequacy, the quality of care, working conditions, and support mechanisms. The theoretical basis is rooted in Kanter’s empowerment theory [
24], Besner’s concept of role realisation in practice [
25], and the professional competence models of the International Council of Nurses (ICN) and the World Health Organization (WHO) [
26]. Each item of the tool was therefore formulated in relation to a specific theoretical approach to ensure content and construct validity.
The tool was developed in a context where indicators of employee well-being and organisational effectiveness were becoming increasingly important. The new tool will help to identify organisational weaknesses such as insufficient shift allocation of staff, lack of support, excessive workload, and emotional exhaustion. It will facilitate the setting of priorities for personnel planning, the monitoring of quality improvement initiatives, and the justification of investments in human resources.
In such circumstances, evidence-based tools that provide an accurate and systematic assessment of the current situation play a crucial role. Such data enables resource shortages to be identified, the performance of units and institutions at different levels to be compared, and essential support to be provided for decision-making, organisational change planning and strategies aimed at quality patient care and staff retention. Until now, human resource policy decisions in Latvian healthcare have often been based on limited and fragmented information, such as qualitative assessments, expert opinions, or one-off studies, rather than systematic, data-driven analysis [
27]. This tool provides an opportunity to standardise the assessment of resource adequacy, an essential step towards evidence-based decision-making in workforce planning and improving care quality.
During the development process, particular emphasis was placed on taking an inclusive approach that considers the needs of managers, care workers, and researchers. The initial questions covered both objective indicators, such as the number of employees and the time taken to complete tasks, and subjective factors, such as emotional exhaustion, job satisfaction, trust in management, access to support mechanisms, and opportunities for professional development.
The study aimed to develop and conduct an initial validation of a multidimensional tool for assessing the subjective experience of healthcare professionals regarding the staff adequacy.
4. Interpretation of the Scale
Likert-type scales are usually questionnaires comprising several items and a set of response options ranging, e.g., from ‘never’ to ‘always’. These are often summed or averaged to obtain a total score. To make the results interpretable in practice, researchers often divide this total score into qualitative categories such as ‘very low’, ‘low’, ‘medium’, and ‘high’ [
42]. There are several methodological approaches to determining the thresholds between these levels:
Normative data (comparative standards) [
44];
Empirical distribution in a pilot study [
45];
Criterion-based thresholds [
46].
The interpretation levels for the SRAQ-HP scale are based on a multi-level methodological approach combining theoretical, empirical, and practical aspects. As there are no universally standardised thresholds for Likert-type scales, several scientifically based strategies were employed to develop reliable and practical assessment levels [
42,
43,
44,
45,
46]. Initially, the structure of the interpretation scales was developed based on theoretical foundations and approaches used in the international literature on tools such as the CBI [
47], MBI [
48], and PES-NWI [
49], in which the level classification was developed with the help of experts or through the analysis of empirical data distribution. This approach ensures compliance with proven and validated psychometric practice. In parallel, expert consultations were organised involving healthcare professionals with experience in personnel management and research. These experts were tasked with helping to define thresholds that realistically reflect significant changes in staff perceptions of resource adequacy and quality of care. The experts identified the average ratings that would be considered optimal, acceptable, or critical, taking into account staff workload management and management support availability. This was followed by an empirical phase, during which the proposed levels were compared with the actual data distribution obtained from both the pilot and main studies. The histograms, dispersion, and central tendency analysis of the results provided an insight into how proportionally the respondents were distributed across the different points on the scale, as well as the value ranges in which significant accumulations were formed. To ensure simplicity and consistency of interpretation, a simple mathematical calculation was used to assign each of the five rating categories an interval of approximately 0.80 on a scale from 1 to 5. This method is widely used in practical research, particularly when normative data or results from ROC analysis are not yet available [
46]. The interpretation levels were adjusted according to the data distribution and expert recommendations, ensuring they were mathematically symmetrical and semantically meaningful.
The current cut-off values are considered an initial standard, providing guidance for self-assessment by institutions and offering practical utility. It should be emphasised, however, that these levels do not yet have any clinical or normative significance and require further validation. Planned future studies will use ROC analysis to determine optimal thresholds in relation to external criteria, such as burnout and patient satisfaction. Quartile and z-score analyses will also be used to construct a normative distribution. Additionally, data must be collected again across different institutions and contexts to strengthen the comparability of the scale and standardise cut-off values for future use. In summary, this combined approach—starting with a literature- and expert-driven structure, complemented by empirical adaptation and validation planning—provides a reliable basis for developing the levels of interpretation. This enables the SRAQ-HP tool to be used as both a data collection tool and an evidence-based mechanism for assessing and improving the healthcare work environment.
The questionnaire is based on a 5-point Likert scale: Completely disagree (1); Disagree (2); Neutral (3); Agree (4); Completely agree (5). This provides easily interpretable data on staff satisfaction and resource adequacy.
Scores are averaged per subscale; higher values indicate better staffing adequacy, working conditions and quality of care, while lower values flag potential risk. The ‘Staffing Adequacy and Workload’ section assesses whether staff consider human resources to be sufficient to ensure a balanced workload.
Table 3 shows the scale rating levels.
The ‘Quality of Care’ section reveals how employees assess the impact of adequate staffing on quality of care and patient safety.
Table 4 shows the scale rating levels.
The ‘Working Conditions and Support’ section assesses how effectively management provides support to the staff, and the impact of this on their ability to perform their duties adequately.
Table 5 shows the scale rating levels.
After obtaining the average results in each section, these can be used to draw general conclusions about the adequacy of human resources and their impact on the quality of care in the hospital (see
Table 6).
Average scores can also be analysed by department or job title to identify groups with the greatest need for additional resources or support. Repeating the survey regularly makes it possible to observe changes in employee attitudes and identify trends to help management make long-term decisions about resources and work organisation.
The results can be compared with other data, such as employee turnover, sick leave, and patient satisfaction indicators, to understand the relationship between adequate resources and the organisation’s overall effectiveness. The data is analysed to identify trends and potential problems in each section. This helps management determine whether improvements are needed in staffing or working conditions.
This questionnaire has been developed to assess how effectively hospitals and healthcare institutions provide the necessary human resources and working conditions to ensure quality patient care. The tool helps identify the main challenges faced by healthcare staff and analyse the adequacy of resources and support affecting the quality of care and staff well-being. It focuses specifically on assessing the adequacy of resources in relation to quality of care, workload, and management support. This allows for a very specific focus on the availability of resources and staff support rather than general working conditions or satisfaction. This specific approach may be more effective than some international tools, which cover a broader range of topics but can sometimes lose focus on specific aspects of resources and quality of care.
The questionnaire is designed to assess three main aspects. Staffing Adequacy and Workload gauges staff perceptions of the workload, staffing levels, and whether they have the necessary resources to perform care tasks effectively. 9 statements. The Quality of Care section analyses the impact of human resource adequacy on the quality of care and patient safety. 7 statements. The Working Conditions and Support section assesses the support provided by management and the working environment, which affects staff resilience and satisfaction. 10 statements.
Potential usage scenarios have been developed. The questionnaire can help hospital management to identify departments or shifts where staffing levels need to be increased or workloads optimised. The questionnaire provides data on employee satisfaction with the work environment and management support, which can serve as a basis for planning improvements. The questionnaire allows to assess the impact of resource adequacy on the quality of care and patient safety, enabling management to take corrective action.
The scale has certain advantages and limitations. The positive and negative aspects are presented in
Table 7.
5. Results
The Results section provides a detailed overview of the development, adaptation, and psychometric validation of the SRAQ-HP tool. The obtained data reflect the results of each development stage, including CVI and FVI assessments, pilot study results, and psychometric analysis indicators from the main study. Both qualitative and quantitative methods were employed in the analysis to ensure content validity, reliability, and construct validity. The results are organised chronologically according to the stages of the development process.
Four representative groups of respondents were analysed within the study. In total, there were 10 experts who performed the initial CVI of the scale, 10 focus group participants who took part in a discussion on the content and understanding of the tool, 35 respondents in a pilot study who completed the initial version of the scale, and 5 experts who, in addition to CVI, also assessed the clarity and discrimination indicators (FVI) of the tool.
The first ten experts were all women (100%), with an average age of 45.2 years (min = 34 years; max = 59 years) and an average of 17.4 years’ experience working in healthcare. Most of the experts represented medical institutions in the Riga region (80%), while the rest came from other regions of Latvia. All of the experts had a higher education qualification in healthcare and 90% held managerial positions such as head nurses, managers, or teaching staff. 60% of the experts reported a total workload exceeding 1.0 FTE, while 30% reported shift work. All of the experts worked in inpatient care settings and 90% were also involved in other duties such as methodological work, staff training, or quality control.
The focus group participants were also all women, with an average age of 38.6 years and an average of 12.1 years work experience. In Latvia, most nurses are women, so it is common for women to be in the majority. 100% of the participants were from Riga and 90% of the focus group participants had higher education, but the distribution of positions was more diverse: 50% held managerial positions, while the remainder were senior nurses or experienced practising nurses. Relatively fewer respondents worked with a total workload above 1.0 FTE (40%) or outside normal working hours (60% worked shifts). The work environment was inpatient care in 90% of cases and 70% of respondents were involved in additional professional development activities.
The main group of respondents (n = 35) who participated in the initial scale completion were women (100%). The group’s average age was 35.6 years (min = 22; max = 65), and the average length of service was 9.46 years (min = 2; max = 40). 65% of the respondents were from the Riga region, while the rest were from other regions. 85% of the respondents had higher education, but only 20% held managerial positions. Only 30% of the respondents had a workload exceeding 1.0 FTE. However, 80% worked shifts and 88% indicated that they worked in inpatient care. Additional responsibilities were reported by 60% of the respondents.
The additional experts (n = 5) involved in both the CVI and FVI assessments were also 100% women. They had an average age of 41.0 years and an average length of service of 14.8 years. They all represented inpatient facilities in Riga, had a higher education, and held managerial positions. Of this group, 80% worked with more than 1.0 FTE workload, but only 20% indicated that they worked shifts. 100% of the respondents were involved in professional development or training activities.
To assess the compliance of the developed SRAQ-HP items with the proposed theoretical dimensions, a CVI calculation was performed based on an assessment by 10 experts. This process was carried out in accordance with internationally recognised policy and tool development recommendations [
50]. In Round 1 (CVI), I-CVI ranged 0.531–0.719; following decision rules, 3 items were removed and 9 reworded. In Round 2, CVI/FVI indicated improved relevance and clarity (S-CVI/Ave = 0.931; FVI = 0.976). The results are presented in
Table 8.
The overall CVI for the entire scale was 0.653. The CVI of the experts ranged from 0.531 to 0.719. The lowest ratings were given by Experts 1 and 6 (CVI = 0.531), which are below average. Perhaps they were stricter in their assessment or misunderstood the criteria. Experts 2, 4, and 9 gave moderately high ratings ranging from 0.62 to 0.72. Experts 3, 5, 7, 8, and 10 gave high ratings that were consistently positive with a CVI of 0.719.
When the CVI was reviewed for each statement in the survey, it was found that items 1, 8, 11, 14, and 21 received the highest ratings, with full consistency among the experts (CVI = 1.0). Of course, there were some problematic items. Items 3 (CVI = 0.3), 19 (CVI = 0), and 30–32 (CVI = 0) raise significant concerns as they are below the validity threshold. Items 12 and 18 (CVI = 0.4) are also considered problematic and may need to be excluded or clarified. Items with a CVI of 0.5 and 0.6 were also reviewed to ensure the reliability and accuracy of the questions.
Taking the obtained data into account, the problematic questions (3, 5, 12, 18, 19, and 29) were reviewed by analysing their wording and rephrasing them to better suit the respondents. Based on the analysis, questions 30, 31, and 32 were removed from the scale. No new questions were added, since all existing questions reflect the essence of the survey qualitatively. Questions that had a positive impact (e.g., questions 6, 13, 14, and 20) were left unchanged because they improve the overall quality of the scale.
To supplement the quantitative content validation process and ensure the semantic and conceptual relevance of the initial SRAQ-HP scale to the clinical context, a focus group discussion was organised during the tool development phase. The focus group served as a qualitative validation method, allowing for a deeper understanding of healthcare professionals’ perceptions of the concept of staff adequacy and assessing the relevance of the proposed items to real-life practice. The discussion aimed to identify unclear or overlapping wording, clarify the appropriateness of the language, and ensure each clause reflected relevant practical aspects.
Ten healthcare professionals were selected for the focus group using a targeted sampling approach. The participants included five nurses, two head nurses, two healthcare managers, and one university lecturer with experience in nursing education and healthcare research. Such diversity ensured that views from different levels of the healthcare system were obtained.
The discussion took place on Zoom in November 2024 and lasted approximately 90 min. It was organised according to a semi-structured interview scenario that included open-ended questions about staff workload, management support, emotional care options, and interprofessional collaboration. Written informed consent was obtained from all participants, and the discussion was recorded, transcribed, and anonymised.
The results showed a high level of overall agreement and demand for this type of tool. The participants stated that there are currently no structured tools available that enable managers to objectively assess subjective perceptions of staff adequacy. Several improvements to the wording of the items were suggested, including the simplification of technical language, the revision of negatively worded statements, and the use of more specific terms. For instance, the statement ‘During shifts, our department has sufficient staff’ was linked to the need for a clearer distinction between objective and subjective perceptions of adequacy. Participants also recommended adding a dimension to the tool that would cover interprofessional collaboration, as several of them emphasised that ‘the availability of doctors and the team’s response affect our ability to provide effective care’.
The focus group results were systematised and used as a basis for revising the items prior to the quantitative content validation phase. This ensured that the items included in subsequent development stages were conceptually appropriate, relevant in practice, and linguistically accurate. The contribution of the focus groups to the development of the tool significantly improved the validity and usability of the SRAQ-HP in clinical practice.
Following content validation and initial focus group discussions, the next stage of the tool’s development involved an empirical evaluation of the items to analyse reaction indices (item difficulties) and discrimination indices. This stage involved 35 healthcare professionals from different regions and medical specialties who filled in the initial version of the tool. Respondents’ ages ranged from 22 to 65 years, with an average age of 35.6 years (SD = 11.74). The length of service in healthcare ranged from 2 to 40 years, with an average of 9.46 years (SD = 8.98).
The reaction index (mean item endorsement) was calculated as the average rating of each question, while the discrimination index (item-total correlation) was calculated as the correlation between each item’s rating and the total scale score (excluding that item). These indicators helped to determine how effectively each item distinguished respondents with high and low perceptions of staff adequacy. The reaction and discrimination indices are shown in
Table 9.
Overall, the questionnaire consists of three dimensions. The internal consistency analysis revealed that Cronbach’s alpha was 0.620 for the first dimension, 0.817 for the second, and 0.888 for the third. The overall Cronbach’s alpha value for the entire scale is 0.841, indicating good reliability. While the alpha value for the first dimension is relatively low, it is still considered acceptable for initial studies. This may indicate a need to revise individual items or refine the content focus of the dimensions. Further analysis with a larger sample is recommended in subsequent stages of development to strengthen the psychometric properties of this dimension and possibly improve its internal consistency.
The reaction index (mean) measures the average response of respondents to each question. It provides information on how high or low the average reaction to the content of the question is. The Likert 5-point scale reaction index reflects the average rating given by respondents to a question, where 1 is the lowest rating (‘completely disagree’) and 5 is the highest rating (‘completely agree’). Results ranging from 1.8 to 4.2 cover a wide range, allowing for detailed interpretation of the respondents’ answers.
A low reaction value (mean = 1.8–2.4) indicates low agreement; in some cases it may reflect reverse-wording, limited relevance, or a mismatch with respondents’ context. For example, the question may not correspond to the respondent’s situation, or the question may be negatively worded, or the respondents may consider the topic unimportant. When analysing the n = 35 group, questions 1, 3, 5, 21, and 27 corresponded to the given limits.
A moderately low reaction value (mean = 2.5–3.0) indicates neutral or slightly negative responses. Respondents may be undecided or consider the question partially irrelevant. Questions 2, 4, 8, 16, 20, 22, 24, 26, 28, and 29 corresponded to these limits for the n = 35 group.
Moderately high reaction value (mean = 3.1–3.8). This shows a positive assessment or tendency to agree. The question is fairly well accepted and correlates with the overall conception. Questions 7, 9, 10, 11, 12, 19, 23, and 25 corresponded to these limits for the n = 35 group.
A high reaction value (mean = 3.9–4.2) indicates strong agreement or high evaluation. These questions are usually clear and appropriate. Questions 6, 13, 14, 15, 17, and 18 corresponded to these limits for the n = 35 group.
Corrected Item-Total Correlation, also known as the discrimination index, measures how well each question correlates with the total scale when the question itself is excluded. This metric allows problematic questions to be identified. The discrimination coefficient ranges from −1 to +1. A positive coefficient indicates that respondents with higher values on the scale answer this question affirmatively. A negative coefficient indicates that respondents with lower values on the scale answer this question affirmatively. When analysing the n = 35 group, all the indicators were positive, ranging from 0.104 to 0.631.
The optimal discrimination index ranges from 0.2 to 0.8. Values below 0.2 indicate a low correlation, meaning the question does not correspond well to the overall scale concept. Values between 0.2 and 0.5 reflect an average correlation—the question is acceptable but could be improved. Values above 0.5 indicate a good correlation and show that the question corresponds well to the overall scale. The analysis of the data revealed the following problem questions: Question 3 (0.181), Question 5 (0.139), Question 12 (0.062), Question 16 (0.116), Question 18 (0.104), and Question 29 (0.108). These questions correlated poorly with the overall scale, and the low correlation may indicate an issue with the wording or context. Seven questions were highly discriminative (with a score above 0.5): Question 11 (0.580), Question 13 (0.631), Question 14 (0.510), Question 21 (0.544), Question 22 (0.558), Question 23 (0.523), and Question 28 (0.545).
A combined analysis of the lowest reaction and discrimination indices revealed which questions were the most problematic, as they do not correspond to respondents’ experiences and were inconsistent with the overall scale. Question 3 had a very low reaction (2.09) and discrimination (0.181). This question is inappropriate and should be removed or reworded. Question 5 was weak in terms of both reaction (2.20) and discrimination (0.139) and should be reworded or removed. Although the reaction to Question 12 was average (3.26), the discrimination was very low (0.062), thus the question should be re-evaluated. Question 18 had a high reaction (4.09), but very low discrimination (0.104), indicating a problem with the wording. Question 29 had a low reaction (2.23) and low discrimination (0.108), which significantly weakens the overall reliability.
Standard deviation (SD) shows the variation in responses. Higher SD indicates greater variation, meaning that respondents answered differently, while lower SD indicates unanimity. Question 4 (SD = 1.323), Question 9 (SD = 1.431), and Question 28 (SD = 1.350) all showed high SD (above 1.3). Questions with high variation may indicate various opinions or different understandings. Question 6 (SD = 1.051) and Question 13 (SD = 1.014) showed low SD (below 1.1). Questions with low variation may be too unambiguous, reducing scale discrimination. The sum of the results (Sum) shows the total number of points for each question from all respondents. This indicator reflects the extent to which a particular question has been assessed overall and can be used to identify questions with extremely low or high ratings. Question 3 (73), Question 5 (77), Question 21 (78), Question 27 (78), and Question 29 (78) showed the lowest values. These questions could weaken the overall scale and should be re-examined with particular care. Question 6 (144), Question 13 (141), Question 14 (145), Question 17 (141), and Question 18 (143), on the other hand, were popular among respondents, being clear and meaningful. However, it should be analysed whether these questions provide sufficient discrimination (e.g., Question 18).
Cronbach’s Alpha if Item Deleted’ shows how the overall Cronbach’s Alpha would change if a specific question were removed. An increase in the Alpha value after removing a question indicates that the question has a negative impact on overall reliability. This indicates that the question may need to be removed or reworded. If the Alpha value remains unchanged or decreases, this indicates that the question positively contributes to the overall reliability and should be retained. Questions 6, 9, 13, 14, and 20 would reduce the Cronbach’s Alpha value if they were removed. This indicates that these questions strengthen the overall reliability and should be retained in the scale. Question 7 (0.840), Question 17 (0.837), and Question 24 (0.835) had a neutral effect on Cronbach’s alpha. These questions would have a minimal effect on the Alpha value if they were removed. They may be acceptable on the scale but could be improved. Question 3 (0.841), Question 5 (0.843), Question 12 (0.846), Question 18 (0.844), and Question 29 (0.844) would increase the Cronbach’s Alpha value if they were removed. This indicates that these questions reduce the overall reliability and should be re-evaluated or removed from the scale.
Squared Multiple Correlation (SMC) indicates the extent to which a particular question can be explained by the other questions on the scale. SMC measures how well the question fits the overall concept and its ability to discriminate between respondents. All questions had an SMC above 0.777. Question 15 had the highest SMC (0.958).
Taking into account all data in
Table 5, the problematic questions were: Questions 3, 5, 12, 18, and 29. Questions that worked well and had a positive impact on reliability: Questions 6, 9, 13, 14, and 20. Questions with good discrimination (above 0.5): Questions 11, 13, 14, 21, 22, 23, and 28. Questions that worked moderately well and have potential for improvement: Questions 7, 17, and 24.
After determining the CVI, reaction, and discrimination indices, the following questions were removed: Questions 3, 5, and 18. The following questions were revised: Questions 12 and 29 We improved the wording of the questions with average performance. Questions 7, 17, and 24. We reviewed questions with a low standard deviation (SD) to achieve greater variation: Questions 6 and 13.
Taking this psychometric evaluation into account, a new version of the scale was created which only includes items with adequate statistical indicators and theoretical significance. This improved version of the scale is shown in
Table 10 and was used in subsequent stages.
After the revision of the initial survey scale a second version containing 26 points was made. This scale was presented to five experts who met the inclusion criteria for completing the survey, in order to assess the FVI and CVI.
Table 11 below shows the results of evaluating each item of the scale, the total number of points, and the CVI indicators.
The overall average CVI for all items is 0.931, which indicates high content validity. The final version, which has a CVI of 0.931, proves that the scale redesign was successful. The initial review showed heterogeneous item-level CVI (I-CVI range 0.531–0.719); after revision, the S-CVI/Ave = 0.931. The final version (CVI = 0.931) had high content validity, indicating that the questions were clearer and more relevant.
Content validity was ensured using the content validity index from expert evaluations. The individual CVI (I-CVI) for all statements included exceeded the traditionally recommended threshold of 0.78, meaning that the experts considered them to be appropriate for assessing the target construct [
51]. The average CVI (S-CVI/Ave) calculated for the scale as a whole was also high (almost 0.9), confirming that the expert panel generally rated the scale items as highly relevant [
51]. The CVI is critical in scale development as it indicates whether the included items cover the essential aspects that the scale is trying to measure [
51]. In this study, the high CVI scores indicate that the SRAQ-HP scale covers the main aspects of human resource adequacy in healthcare and that no significant content gaps were identified in any of the retained questions. Some initially proposed questions with lower expert ratings (e.g., questions that duplicated the content of other items or were ambiguous) were excluded prior to the pilot study. This ensured that only valid statements were included in the pilot study.
Five experts participated in the FVI determination process. When measuring the FVI, the experts were asked the following question: ‘Did you understand what each point meant when you completed the Staff Adequacy Scale?’ Each expert rated the clarity of each item (0 = unclear, 1 = clear). The results are presented in
Table 12.
The overall FVI was 0.976, indicating that the majority of assessed criteria are appropriate for evaluating resources. This suggests a high level of understanding of the tool and its suitability for the intended users. Only a few items (12, 14, and 24) received a rating below the maximum value (FVI = 0.8), suggesting possible ambiguity or differences in understanding these items. These items may need to be clarified or reworded. While the ratings were similar, Experts 2–4 had FVI = 0.961, whereas Experts 1 and 5 = 1.000.
Suitability of the tool for use, given that the overall FVI was very high, it can be concluded that the tool’s assessment of staffing adequacy and the clarity of item wording were well received by the expert panel.
Brief Factor-Analytic Findings
Data were suitable for factor analysis (KMO = 0.892; Bartlett’s χ2 (325) = 33,046.524, p < 0.001). In an independent large-sample CFA (n = 1369), a correlated three-factor model showed high comparative fit (CFI = 0.968; TLI = 0.964); full model diagnostics (including RMSEA/SRMR and robustness checks) are reported in the companion paper.
7. Discussion
The Latvian healthcare system has long suffered from human resource problems, particularly a shortage of qualified nurses and doctors. Statistics show that, for example, there are only ~4.2 nurses for every 1000 inhabitants in Latvia, which is less than half of the EU average (~8.5 per 1000) [
52]. This situation limits the availability of services and even threatens the successful implementation of health reforms [
52]. In practice, this means that, in many hospitals, existing staff are forced to work excessive hours to fill vacancies. Nurses often ‘fill in’ for absent colleagues, resulting in sleepless nights and burnout syndrome [
53,
54]. In the long term, such work overload can lead to professional burnout and a decline in work quality, causing some staff to leave the sector [
54,
55]. As patients may not directly experience the effects of staff shortages in the short term, there is a risk that this issue will be overlooked or underestimated at a systemic level [
52,
56].
Until now, staff planning decisions in Latvia have mainly been based on quantitative indicators such as the number of staff positions, budget costs, and short-term solutions rather than on the assessments of staff subjective experiences and working conditions [
52,
57]. Healthcare reforms have often focused on optimising spending rather than staff well-being. For instance, during periods of budgetary constraints, the focus has typically been on reducing costs rather than improving staff well-being and the working environment [
58]. This approach is dangerous because ignoring staff needs undermines the system’s long-term capacity [
13,
54]. International studies show that in health systems where management focuses excessively on performance indicators and cost savings rather than staff support, the risks of burnout and staff turnover increase [
58]. Until now no unified tool has been developed in Latvia for regularly and systematically measuring healthcare professionals’ satisfaction with human resource adequacy and their subjective workload and work experience. Consequently, management lacks evidence-based data on staff sentiment and the adequacy of resources on the front line. Decisions on personnel policy are often made without sufficient information or based on isolated incidents and complaints rather than systematic data analysis.
Given this, the introduction of the SRAQ-HP tool fills a significant gap in Latvia’s healthcare data ecosystem. The SRAQ-HP has been developed to assess staff subjective views on aspects of the work environment that could not previously be quantified. This tool provides the first opportunity to measure subjective workload, opportunities for professional role fulfilment, and the level of management support in everyday work objectively and in a standardised way. Initial research results confirm the high potential of the tool. The SRAQ-HP survey has been found to have sufficient internal consistency and content validity, indicating that the questions in the questionnaire are reliable and fit for purpose. The consistency and reliability of the tool, such as high Cronbach’s α values, indicate that the survey reliably measures the intended dimensions and can serve as a reliable source of data for further analysis. While it is not necessary to list all the quantitative indicators again here, it should be emphasised that validating the questionnaire in the Latvian context confirms its usefulness. Respondents generally indicated that they feel the significant impact of workload and resource shortages in their daily work, which aligns with long-standing concerns in the sector. In other words, SRAQ-HP data empirically confirm issues that were previously known mainly through anecdotal experience. This tool brings the hidden to light, quantifies subjective fatigue and overwork, and identifies where management support is most lacking. The SRAQ-HP therefore provides an important basis for a more informed discussion on human resource policy in healthcare.
The importance of the tool is also confirmed by the context in which it was developed: the COVID-19 pandemic and the post-pandemic period have exacerbated workload and burnout issues for staff worldwide [
59,
60]. In Latvia, the pandemic revealed how critical it is to have sufficient and motivated staff [
27,
61]. Unfortunately, the absence of objective data on staff workload and satisfaction meant that only fragmented solutions were adopted after the crisis, such as short-term bonuses and campaign-style recruitment. These do not solve systemic problems [
57]. SRAQ-HP can serve as an early warning system by regularly collecting data to identify critical increases in workload or staff dissatisfaction, which would otherwise only become apparent at a late stage—for example, through mass resignations or strikes. In summary, introducing this tool is a step towards developing an evidence-based personnel policy. This could mark a turning point for the Latvian healthcare system, shifting the focus from crisis management to proactive monitoring and planning of the working environment.
The situation in Latvia with regard to staff resource adequacy and the use of data in decision-making should be compared with international best practice, particularly in countries with well-developed healthcare systems where workforce planning is data-driven. The Scandinavian countries are a good benchmark in this regard. Finland, for example, has been using the RAFAELA system, a special tool for hospital staff planning and workload management, since the 1990s [
62]. RAFAELA involves the daily measurement of patient care intensity and links this to the number of nurses required to provide an adequate level of care. The system is based on three elements: a patient classification tool, information on available human resources, and a professional assessment of the optimal workload (the PAONCIL method) [
62]. This allows for the daily monitoring of whether there are enough staff to provide quality care for a given patient volume, providing immediate feedback to managers. Consequently, this system is now used in almost all Finnish hospitals, with similar approaches being introduced elsewhere in Europe and Asia [
62]. The widespread use of the RAFAELA system in the Nordic countries confirms that data-based workload management is being taken seriously, with the system becoming the standard for determining optimal staffing levels rather than relying solely on intuitive management decisions [
63]. RAFAELA also serves as a professional monitoring system that enables benchmarking of departmental work intensity between hospitals and over time, facilitating data analysis and targeted improvements. Other tools based on a similar philosophy include the Safer Nursing Care Tool used in British hospitals, which helps determine the number of nurses needed per shift based on patient care intensity data [
64]. This tool is also effectively a system for monitoring appropriate workloads and staffing levels. It is widely used in NHS hospitals to ensure patient safety.
In OECD countries, public sector employers are increasingly organising regular surveys to measure staff satisfaction and working conditions [
65]. Research shows that governments in many countries are introducing staff surveys to improve performance and service quality. The most popular focus of these surveys is staff job satisfaction [
66]. In several countries, survey questions focus on staff’s subjective views of the working environment, as there is recognition that staff engagement and well-being directly impact organisational performance [
65]. For example, the US Office of Personnel Management conducts an annual survey of federal employees to monitor perceptions of working conditions in the civil service [
67]. This is a familiar practice for many healthcare institutions, with hospitals and clinics regularly surveying their staff about their perception of workload, support from management, and satisfaction. They then use this data to plan interventions to improve staff well-being [
68,
69,
70,
71]. Staff feedback thus becomes a valuable resource for organisational management, enabling targeted improvements to be initiated based on the real needs and recommendations of staff [
65]. Latvia has lagged behind in this regard, as there have been neither regular nationwide hospital staff surveys nor an integrated system for collecting workload data. Therefore, introducing SRAQ-HP will bring Latvia closer to the good practices of OECD countries and ensure that the voice of employees is heard at the policy level.
It is important to note that in other countries there are also official staff monitoring systems in place at the national level. For instance, Finnish Institute for Health and Welfare (THL) collects and analyses data on the healthcare workforce in both the public sector and overall. This includes monitoring the adequacy of the number of nurses and doctors, as well as tracking their migration [
72]. Other Nordic countries also carry out regular health workforce forecasting and monitoring activities. Norway, for example, conducts Arbeidsgivermonitoren surveys to analyse the situation of public sector employees, including those in healthcare [
73]. Frameworks have been developed at OECD [
64] and WHO [
60] levels to encourage countries to develop human resource monitoring mechanisms. For example, the World Health Organization’s Global Strategy on Human Resources for Health: Workforce 2030 emphasises the need to strengthen health workforce data by introducing National Health Workforce Accounts and other tools to ensure regular data collection and analysis for human resource planning [
74]. All of this demonstrates that the international approach is shifting towards data-driven methods, both at the micro level in hospitals, where tools such as RAFAELA [
62] and regular surveys are used, and at the macro level, where national and international monitoring systems are employed.
Comparing the situation in Latvia, we have one of the lowest ratios of nurses and doctors per capita in Europe, and at the same time, there has been a lack of modern tools to manage this problem [
54,
56]. In Scandinavia, where healthcare is better, active investments are being made in monitoring and improving working conditions. This recognises that even adequate staffing is meaningless if employees are overworked or dissatisfied [
62,
72,
73]. Following these examples, the Latvian healthcare system should not only use SRAQ-HP as a research tool, but also integrate it into daily management processes, as the Nordic countries have done with their monitoring tools. This will gradually bring us closer to the international standard, where decisions are based on data and the staff perspective is systematically taken into account.
One innovative aspect of the SRAQ-HP tool is its inclusion of multiple dimensions in a complex assessment. The survey covers subjective perceptions of workload, aspects of professional role fulfilment, and management support. When discussing the integration of these dimensions, it is important to analyse why each one is important and how they together form a holistic picture of the state of the workforce.
Objective indicators, such as the number of patients per nurse, do not always reflect how employees perceive their workload themselves [
75]. Two identical departments with a similar number of patients may experience different subjective workloads depending on factors such as the severity of the patients, the organisation of the team’s work, technical support, and so on [
76]. Studies confirm a strong link between an excessive workload and burnout as well as decline in the quality of care [
69]. For instance, structured studies have revealed [
77,
78,
79] that high work intensity promotes the implied rationalisation of care, whereby nurses unintentionally omit less critical activities due to time constraints, thereby worsening the overall quality of care. Another consequence is increased emotional exhaustion. Employees who constantly work beyond their capacity lose motivation and become emotionally numb, which negatively impacts their health and the patient experience [
79,
80,
81]. Staff fatigue and dissatisfaction caused by high workloads have been found to reduce patient safety and the quality of care. For example, systematic reviews have found a correlation between nurse burnout and lower patient satisfaction and safety indicators [
82]. Therefore, the Subjective Workload dimension within the SRAQ-HP provides a vital signal: if survey data consistently show that staff in a particular department report very high workloads, this indicates the need for immediate action to prevent serious consequences.
The Professional Role Fulfilment dimension covers the extent to which employees can fully perform their professional roles, whether they have the opportunity to use their knowledge and skills, whether they have to perform unnecessary additional tasks, and whether the boundaries of their roles and responsibilities are clear. In situations where there is a shortage of staff, employees often have to perform a wider range of tasks. For example, nurses may be forced to perform the tasks of sanitary workers, administrative staff, or doctors’ assistants, which goes beyond the scope of their direct role [
59,
83]. Similarly, a heavy workload can mean nurses are unable to devote the optimal amount of time to each patient, leading to internal dissatisfaction with the quality of their work [
84]. International studies on role conflict and ambiguity demonstrate that these factors directly impact job satisfaction. When staff have unclear roles or conflicting responsibilities, job satisfaction decreases and stress increases [
85]. In the context of Latvian hospitals, nurses often have to care for a large number of patients while also completing documentation, resolving logistical issues, etc., which negatively impacts their direct care duties. Such situations create a role conflict: formally, nurses are expected to provide empathetic, high-quality care, but in reality, they are forced to work in a ‘conveyor belt’ manner [
86]. Data collected by SRAQ-HP on the extent to which staff feel restricted in performing their professional duties provides important information for management. For example, if the majority of nurses in a department indicate that they are unable to perform their work to a high standard due to a lack of time, this suggests a systemic issue that requires attention, whether through increasing staff numbers, improving process organisation, or reviewing the scope of nurses’ roles. In short, the Professional Role Fulfilment dimension helps us understand how healthy the work environment is from a professional point of view: do employees feel proud of their work, or are they frustrated and demotivated because they are unable to work to the best of their ability?
The third important dimension is Organisational and Managerial Support. This dimension covers staff perceptions of the extent to which management and immediate superiors support them in their daily work, in crisis situations, and in their professional development. Good management support is evident in factors such as reasonable workload distribution, addressing employee concerns, providing regular feedback, recognising professional achievements, and adopting an empathetic approach [
87,
88]. Several international studies confirm [
87,
88] that management style and support are critical factors influencing staff retention and satisfaction.
Unfortunately, Latvian hospitals are often criticised for a lack of communication and support at management level. For example, staff opinion is not taken into account when schedules are drawn up, employees do not feel safe reporting problems, and good work results go unnoticed [
89]. SRAQ-HP survey data on the Managerial Support dimension enables this previously difficult-to-measure aspect to be assessed quantitatively. If the survey shows a low rating for management support, this should be a serious red flag for the institution’s management as it may indicate poor internal communication, inconsistent management practices between departments, or even an increased risk of bullying. Overall, when integrated with the other two dimensions, the Managerial Support dimension provides a more complete picture. Even at high workload, good management support can mitigate the negative effects [
85]. Conversely, poor management can exacerbate the situation, even at moderate workload. Therefore, the strength of SRAQ-HP lies in its integrated assessment of these three dimensions (workload, role, and support). Together, they reveal where workload problems are most acute, whether these are exacerbated by role or organisational problems, and whether management is able to compensate for difficulties.
In summary, the discussion on the integration of the SRAQ-HP dimensions shows that the tool addresses both the ‘hard’ and ‘soft’ aspects of the workforce: workload and psychosocial factors, respectively. This approach aligns with the contemporary understanding of the quality of the working environment in healthcare, where staff well-being is influenced by a complex set of factors beyond just salaries or position numbers [
90]. Together, data on subjective workload, professional fulfilment, and management support tell the story of how our nurses, doctors, and medical assistants are doing on a daily basis. This story is important to ensure that the healthcare system does not rely on the illusion that ‘everything is fine’ as long as there are enough staff on paper. In reality, people’s feelings and motivation are directly linked to patient care outcomes [
91]. Therefore, using the SRAQ-HP integrated approach will help develop more people-centred and sustainable human resources policies.
In summary, introducing the SRAQ-HP tool in Latvian healthcare is an important step towards systematically identifying and addressing long-standing human resource adequacy issues. The multidimensional data provided by the tool enables comparison of the situation in Latvia with international trends, as well as providing a deeper understanding of weaknesses within the system. These weaknesses range from excessive workloads and unfulfilled professional ambitions to communication gaps between staff and management. However, a more in-depth analysis of the data is required to draw detailed conclusions about the structure of the SRAQ-HP questionnaire and its measurement properties. In the future, a detailed factor analysis, i.e., statistical research will be carried out to investigate how the survey questions are grouped into dimensions and what latent factors form the constructs of staff workload and satisfaction. This analysis will provide a scientific basis for further refining the tool and interpreting the results more accurately. For instance, it will allow to ascertain whether the SRAQ-HP confirms the three intended dimensions or if additional factors emerge in practice. At this stage, we can conclude that the development of the tool and the initial results have made a significant contribution to discussions on staff policy.
In conclusion, it should be emphasised that the SRAQ-HP is not an end goal in itself, but rather a tool for promoting change. It will only realise its true potential if the data obtained is used in practice to improve the experience of both staff and patients. By working together in an evidence-based manner, we can hope to achieve long-term positive change that will benefit healthcare professionals, patients, and society as a whole [
65].