Validation of the Dutch translation of the quality of recovery-15 scale

de Vlieger, Johannes C. N.; Luiting, Willem H.; Lockyer, Jessica; Meyer, Peter; Fleer, Joke; Sanderman, Robbert; Wietasch, J. K. Götz

doi:10.1186/s12871-022-01784-5

Research
Open access
Published: 01 August 2022

Validation of the Dutch translation of the quality of recovery-15 scale

Johannes C. N. de Vlieger¹,
Willem H. Luiting¹,
Jessica Lockyer²,
Peter Meyer¹,
Joke Fleer³,
Robbert Sanderman³ &
…
J. K. Götz Wietasch¹

BMC Anesthesiology volume 22, Article number: 243 (2022) Cite this article

2240 Accesses
1 Citations
1 Altmetric
Metrics details

Abstract

Background

The 15-item Quality of Recovery-15 (QoR-15) scale is strongly recommended as a standard patient-reported outcome measure assessing the quality of recovery after surgery and anesthesia in the postoperative period. This study aimed to validate the Dutch translation of the questionnaire (QoR-15NL).

Materials and methods

An observational, prospective, single-centre cohort study was conducted. Patients who underwent surgery under general anesthesia completed the QoR-15NL (preoperatively (t1) and twice postoperatively (t2 and t3)) and a visual analogue scale (VAS) for general recovery at t2. A psychometric evaluation was performed to assess the QoR-15NL’s validity, reliability, responsiveness, reproducibility and feasibility.

Results

Two hundred and eleven patients agreed to participate (recruitment rate 94%), and 165 patients were included (completion rate 78%). The QoR-15NL score correlated with the VAS for general recovery (rs = 0.59). Construct validity was further demonstrated by confirmation of expected negative associations between the QoR-15NL and duration of surgery (rs = -0.25), duration of Post Anesthesia Care Unit stay (rs = -0.31), and duration of hospital stay (rs = -0.27). The QoR-15NL score decreased significantly according to the extent of surgery. Cronbach’s alpha was 0.87, split-half reliability was 0.8, and the test–retest intra-class coefficient was 0.93. No significant floor- or ceiling effect was observed.

Conclusion

The QoR-15NL scale is a valid, easy-to-use, and reliable outcome assessment tool with high responsiveness for patient-reported quality of recovery after surgery and general anesthesia in the Dutch-speaking population. The QoR-15NL’s measurement properties are comparable to the original questionnaire and other translated versions.

Trial registration

not applicable.

Peer Review reports

Background

Recovery after surgery and anesthesia is a complex process dependent on patient, surgical, and anesthetic characteristics, as well as the presence of any adverse sequelae [1]. In the past, commonly reported outcome measures were recovery times and function, avoidance of common adverse effects (i.e. pain, nausea and vomiting) and healthcare resource utilisation (i.e. duration of intensive care unit and hospital stay) [2]. Although these parameters are essential and should be measured, they mostly ignore the quality of recovery (QoR) from the patient’s perspective [1].

QoR scales have been developed for the immediate postoperative period to provide a quantitative measure of overall health status after surgery and anesthesia. One of the strengths of these scales is the integration of a more complete range of patient experiences after surgery to avoid undue emphasis on one, or some over others (e.g. opioid pain reduction at the expense of nausea or delirium). The 40-item QoR scale, 15-item QoR scale and 9-item QoR score have been studied most extensively [2].

The multidimensional 15-item QoR (QoR-15) scale was initially developed in English and translated and validated in several European and Asian countries [1, 3,4,5,6,7,8]. The questionnaire assesses both physical and mental well-being. The 15 items incorporate five dimensions of health: physical comfort (n = 5), physical independence (n = 2), pain (n = 2), emotional state (n = 4) and psychological support (n = 2). All items are scored by an 11-point numerical rating scale. Consequently, after summing up all items, the total score ranges from 0 to 150 (ideal health status) [1, 7].

The QoR-15 scale is a valid, reliable and easy-to-use patient-reported outcome measure (PROM) with high responsiveness [1, 9, 10]. Furthermore, a systematic review following the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist showed that the QoR-15 fulfilled the requirements for outcome measurement instruments in clinical trials [9, 11]. Currently, the QoR-15 is strongly recommended as a standard outcome measure of QoR in clinical research relating to surgery and anesthesia [9]. Perioperative interventions that result in a change in QoR-15 score of 6 signify a clinically important improvement or deterioration [10, 12]. Furthermore, it could be a useful outcome measure for assessing the impact of healthcare delivery changes for quality assurance purposes [1]. Finally, the QoR-15 offers the opportunity for a standardised feedback measure for healthcare team members, especially anesthesiologists and surgeons, to acquire additional insights into their patient’s outcome.

Although validated in various linguistic and cultural contexts, the QoR-15 has never been translated into Dutch according to international standards for translating a questionnaire [4, 7, 13, 14]. Therefore, this study aims to validate the Dutch translation of the QoR-15 scale questionnaire (QoR-15NL). It was hypothesised that the QoR-15NL scale’s measurement properties would be satisfactory and comparable with the original and subsequently translated versions of the questionnaire.

Materials and methods

Prior to commencement, the study was registered in the University Medical Center Groningen (UMCG) Research Register (201,900,402). The study protocol was reviewed and declared to be outside the scope of the Medical Research Involving Human Subjects Act by the Medical Ethics Review Board of the UMCG (METc 2019/331, chairperson Prof W.A. Kamps) on June 18^th 2019.

Translation and cultural adaption

First, two independent translators from the University of Groningen Language Centre conducted the forward and backward translation of the original QoR-15 questionnaire (1). An expert panel consisting of two anesthesiology residents (JdV, JL), two senior anesthesiologists (PM, GW) and two experienced clinical psychologists (JF, RS) critically reviewed the resulting QoR-15NL pilot version and applied two modifications. Subsequently, cognitive interviews about the pilot version with patients who underwent various inpatient elective surgical procedures under general anesthesia were conducted approximately 24 h postoperatively at the surgical ward, using a structured interview guide. These interviews assessed the questionnaire’s instructions, recall, items, response options, format and length [15]. Interview transcripts were transcribed verbatim, coded (inductive) and analysed by two authors independently (JdV, JL) [16]. Eighteen patients were interviewed in three interview rounds until no new comments arose. After the first (n = 7) and second round (n = 5) of interviews, the expert panel modified the pilot version to address relevant patient comments. Consensus was reached about adding a short instruction and example about completing the questions, and three questions (6,9 and 10) were slightly modified. All relevant comments and modifications made during the translation and cultural adaption are summarised (see supplementary file 1). The resulting final version of the QoR-15NL is available at https://www.umcg.nl/-/medisch-wetenschappelijk-onderzoek/gaps.

Validation study

During the validation study, an observational, prospective, single-centre cohort study was conducted at a tertiary referral centre between August 24^th and November 29^th 2020. Adult patients, who underwent various inpatient elective surgical procedures under general anesthesia, were fluent in Dutch, and available for follow-up at the hospital on the first postoperative day were eligible for inclusion. Patients were ineligible or excluded if they did not give consent, were admitted on the Intensive Care Unit (ICU) (both scheduled and unscheduled) postoperatively, were admitted on the Postoperative Anesthesia Care Unit (PACU) for the first night postoperatively; or if they had either poor Dutch comprehension, a psychiatric disturbance that precluded complete cooperation, a known history of alcohol or drug dependence, any severe pre-existing medical condition that limited objective assessment after surgery, any life-threatening postoperative complication or a postoperative delirium [1]. Eligible patients were contacted by phone one week prior to the intended surgery. Participating patients received an information letter, an informed consent form and two QoR-15NL questionnaires by mail. As per the development study, patients completed the informed consent form and the first QoR-15NL questionnaire preoperatively (t1, baseline) and the second (t2) on the first postoperative day.

Additionally, at t2, a 100 mm visual analogue scale (VAS), marked from ‘poor recovery’ to ‘excellent recovery’, for general recovery was added to assess validity [1]. Approximately 24 h postoperatively, a researcher visited participating patients on the surgical ward. Written informed consent was obtained from all patients, and both questionnaires were collected during the visit. Every second patient was asked to repeat the QoR-15NL(t3) 30 to 60 min after t2 to measure test–retest reliability. The time interval between measurements was in line with the development study, and visiting half of the patients would result in an adequate sample size for the analysis [17].

Patient demographics, pre-, intra- and postoperative data were collected from the electronic hospital information system. The following data were recorded: gender, age, American Society of Anesthesiologists (ASA) physical status score, time of admission, duration of surgery, type of surgical procedure, duration of postoperative stay, and postoperative complications within the first postoperative day. The extent of surgery was classified as minor, intermediate, or major depending on the type of surgical procedure and the expected surgical stress response [1, 4]. The type of surgery was classified according to the surgical subspecialty [1]. The duration of surgery was determined by using the surgery start and stop times from the hospital’s perioperative information system [1]. The duration of postoperative stay in the PACU and the length of postoperative admission at the hospital were calculated using the surgery stop time and discharge time to the surgical ward and from the hospital, respectively [4].

Statistical analysis

The recommended sample size to validate a questionnaire is 10 participants per item [17, 18]. This study aimed to include 165 patients, accounting for a 10% loss to follow-up. Data are presented as mean ± standard deviation (SD), median (interquartile range (IQR)) or number (percentage) as appropriate. The recruitment rate represents the percentage of eligible patients who were contacted by phone and agreed to participate. The completion rate represents the number of patients who agreed to participate and were included in the study. Normal distribution was assessed with the Shapiro–Wilk test. Changes from baseline were compared by the paired t-test. Differences in QoR-15NL score for gender, complicated cases versus uncomplicated cases, and poor versus good recovery were compared by the unpaired t-test. Differences in the QoR-15NL score between the extent of surgery were compared by the one-way ANOVA test. Correlation coefficients were used to assess associations between variables: Pearson (r) for normally distributed and Spearman rank (rs) for non-normally distributed variables, respectively [19]. Statistical analyses were performed with SPSS Statistics version 23.0 (IBM Corp, Armonk, NY, USA). The null hypothesis was rejected if two-tailed p < 0.05.

Psychometric evaluation

The psychometric evaluation of the QoR-15NL was performed similarly to the original publication and the subsequent translation and validation studies [1, 4, 7].

Construct validity

Construct validity was assessed using convergent- and discriminant validity. Convergent validity was determined by comparing the QoR-15NL with the VAS for general recovery, and inter-item correlations were measured [1]. Additionally, it was further tested by the hypothesis that there would be a negative association between the QoR-15NL (t2) and duration of surgery, duration of stay in the PACU, and duration of postoperative hospital stay [1]. The association between the QoR-15NL and age was also determined, although previous studies reported contradictory results regarding the degree and magnitude of this association [1, 4, 7]. Finally, it was hypothesised that the QoR-15NL score would be inversely related to the extent of endured surgery and that women would have a lower score than men; since women generally have a worse postoperative recovery [4, 20].

Discriminant validity was tested by the hypothesis that patients with complications and those who had undergone a poor postoperative recovery (defined as a VAS for general recovery of < 70 mm versus > 70 mm for a good recovery) would have a lower QoR-15NL score [1].

Reliability, responsiveness and reproducibility

Reliability was tested with internal consistency (Cronbach’s alpha) and split-half reliability [17, 21, 22]. Responsiveness was assessed with Cohen’s effect size and the standardised response mean (SRM) [17, 23]. Reproducibility was tested by evaluating agreement (smallest detectible change (SDC individual)) and the test–retest reliability (intraclass correlation coefficient (ICC) for agreement (two-way random effect model)) [17, 24]. Patients with a time interval of > 90 min between t2 and t3 were excluded from the test–retest analysis to assure that the remaining patients’ clinical condition was stable between measurements, which is required for a reliable test–retest analysis.

Clinical feasibility

Clinical feasibility was determined by the recruitment- and completion rate (see above). Finally, floor or ceiling effects were present if more than 15% of the respondents achieved the lowest or highest possible score, respectively [17]. Missing items were handled as follows: in case of one missing QoR-15NL item, the worst possible score (0) was selected. Two or more missing items resulted in an invalid QoR-15NL score and exclusion. Table 1 summarises the statistical methods used for the psychometric evaluation of the QoR-15NL scale.

Table 1 Summary of statistical methods used for the psychometric evaluation of the QoR-15NL scale

Full size table

Results

Of the 224 eligible patients approached by phone, 211 agreed to participate (recruitment rate: 94%).

One patient was unable to complete the postoperative QoR-15NL scale. Thirteen patients returned QoR-15NL scores with missing items: nine at t1, four at t2 and none at t3. Most patients omitted one item, but three QoR-15NL scores (all t1) were considered invalid due to the omission of two (n = 2) or three items (n = 1). After excluding 46 patients, 165 patients were included in the study (completion rate: 78%), as shown in the flow diagram (Fig. 1).

All patients underwent general anesthesia, and 22 patients received additional analgesia with an epidural catheter (n = 13), a peripheral nerve catheter (n = 4), single-shot peripheral nerve block (n = 4) or wound catheter (n = 1). Table 2 shows the demographics and clinical characteristics of the study population.

Table 2 Demographic and clinical characteristics of the study population (n = 165)

Full size table

The mean ± SD preoperative (t1) and postoperative (t2) QoR-15NL scores were 124 ± 18 (n = 158) and 100 ± 25 (n = 165), respectively. The mean difference between t1 and t2 was 23.5 ± 26 (p < 0.01). The distribution of the postoperative (t2) QoR-15NL scores was skewed to the left (skewness -0.402) and is presented in Fig. 2. Detailed data about each item of the QoR-15NL is shown in Table 3. The median (IQR) time of postoperative assessment (t2) was 21 h (IQR 18, 22) (n = 165), and the median interval between t2 and t3 was 56 (IQR 45, 90) (n = 79) minutes.

Table 3 Mean values, change and responsiveness of the Dutch Quality of Recovery-15 scale (QoR-15NL)

Full size table