Abstract
Background and Objectives: Evaluating osteoarthritis (OA) care quality is increasingly relevant for service improvement and benchmarking purposes. The Osteoarthritis Quality Indicator questionnaire (OA-QI) measures patient-reported guideline-concordant care; however, no version has been tested in Italian primary care or bilingual contexts. This study aimed to introduce the OA-QI version 3 (OA-QI v3) in German and Italian, assess its applicability in practice, and examine its acceptability and reliability. Materials and Methods: A cross-sectional survey was conducted using the South Tyrolean General Practice Research Network. Thirty-eight general practitioners recruited 266 patients with hip or knee OA. Patients completed the OA-QI v3 in German or Italian, with subsamples for comprehensibility testing (n = 38) and retest reliability after 14 days (n = 36). Test–retest reliability was analyzed using percent agreement, Cohen’s κ, intraclass correlation coefficients (ICC), and standard error of measurement. The smallest detectable change was analyzed to estimate factual change. Results: Response rate reached 95% of the targeted patients. Patient feedback showed good comprehensibility and ease of use in both languages. Adherence to recommended quality indicators varied, with strengths in physical activity advice, NSAID prescription, and pain assessment, but gaps in weight management, occupational counseling, and assistive devices. Test–retest reliability ranged from fair to substantial at the item level (κ = 0.33–0.69) and was moderate for the total score (ICC = 0.55, 95% CI 0.28–0.74). While measurement error restricted individual-level interpretation, reliability at the practice or institutional level supports application for benchmarking and quality monitoring. Conclusions: The OA-QI v3 was feasible, acceptable, and reliable for group-level assessments in South Tyrol. These findings position OA-QI v3 as a practical tool for identifying care gaps and guiding quality improvement, while providing important lessons for the full validation of the German and Italian versions in larger cross-national samples.