Skip to main content

Neuraxial versus general anesthesia in elderly patients undergoing hip fracture surgery and the incidence of postoperative delirium: a systematic review and stratified meta-analysis

Abstract

Background

Evidence-based effect of anesthetic regimens on postoperative delirium (POD) incidence after hip fracture surgery is still debated. Randomized trials have reported inconsistent contradictory results largely attributed to small sample size, use of outdated drugs and techniques, and inconsistent definitions of adverse outcomes. The primary objective of this meta-analysis was to investigate the impact of different anesthesia regimens on POD, cognitive impairment, and associated complications including mortality, duration of hospital stay, and rehabilitation capacity.

Methods

We identified randomized controlled trials (RCTs) published from 2000 to December 2021, in English and non-English language, comparing the effect of neuraxial anesthesia (NA) versus general anesthesia (GA) in elderly patients undergoing hip fracture surgery, from PubMed, EMBASE, Google Scholar, Web of Science and the Cochrane Library database. They were included if POD incidence, cognitive impairment, mortality, duration of hospital stay, or rehabilitation capacity were reported as at least one of the outcomes. Study protocols, case reports, audits, editorials, commentaries, conference reports, and abstracts were excluded. Two investigators (KYC and TXY) independently screened studies for inclusion and performed data extraction. The risk of bias was assessed using the Cochrane Collaboration risk-of-bias tool. The quality of the evidence for each outcome according to the GRADE working group criteria. The odds ratio (OR) and 95% confidence intervals (CI) were calculated to assess the pooled data.

Results

A total of 10 RCTs with 3968 patients were included in the present analysis. No significant differences were found in the incidence of POD comparing NA vs GA [OR 1.10, 95% CI (0.89 to 1.37)], with or without including patients with a pre-existing condition of dementia or delirium, POD incidence from postoperative day 2–7 [OR 0.31, 95% CI (0.06 to -1.63)], in mini-mental state examination (MMSE) score [OR 0.07, 95% CI (-0.22 to 0.36)], or other neuropsychological test results. NA appeared to have a shorter duration of hospital stay, especially in patients without pre-existing dementia or delirium, however the observed effect did not reach statistical significance [OR -0.23, 95% CI (-0.46 to 0.01)]. There was no difference in other outcomes, including postoperative pain control, discharge to same preadmission residence [OR 1.05, 95% CI (0.85 to 1.31)], in-hospital mortality [OR 1.98, 95% CI (0.20 to 19.25)], 30-day [OR 1.03, 95% CI (0.47 to 2.25)] or 90-day mortality [OR 1.08, 95% CI (0.53–2.24)].

Conclusions

No significant differences were detected in incidence of POD, nor in other delirium-related outcomes between NA and GA groups and in subgroup analyses. NA appeared to be associated with a shorter hospital stay, especially in patients without pre-existing dementia, but the observed effect did not reach statistical significance. Further larger prospective randomized trials investigating POD incidence and its duration and addressing long-term clinical outcomes are indicated to rule out important differences between different methods of anesthesia for hip surgery.

Trial registration

10.17605/OSF.IO/3DJ6C.

Peer Review reports

Background

Hip fractures are among the most frequent causes of hospitalization and disability in the geriatric population. Since 2010, an estimated 2.7 million patients a year were diagnosed with hip fractures [1], and this number has been projected to increase to over 6 million worldwide by 2050 [2].

Hip fracture is an acute surgical condition and approximately 98% of patients warrant operation requiring anesthesia [3]. As the geriatric group is often associated with frailty and multiple comorbidities, patients undergoing hip fracture surgery are at a significantly higher risk of perioperative mortality and morbidity, including POD, respiratory complications, myocardial ischemia, and cerebrovascular accidents [4,5,6].

Emphasis on multi-disciplinary efforts have been made over the last decade to improve geriatric hip fracture outcomes; however, evidence-based medicine in orthogeriatric anesthesia has yet to identify an optimal anesthetic technique for their management. Observational studies suggested NA is associated with lower 30-day mortality risk, in-hospital stay, readmission rates [7, 8], delirium [9], and major medical complications [10,11,12,13]. Contrarily, randomized trials have reported inconsistent contradictory results for the same outcomes depending on different anesthesia techniques; this was largely attributed to a reduced number of trial participants, small sample size, use of outdated drugs and techniques that are irrelevant to current practice, and inconsistent definitions of adverse outcomes [3].

POD, in particular, represents one of the most common postoperative complications in patients with hip fractures [14]. POD incidence has been reported in 16.9–28% of patients and has been associated with increased 30-day mortality risk, prolonged hospital stay, difficulty in regaining daily function, and a higher risk of future cognitive dysfunction [14,15,16]. The effects of NA versus GA on the incidence of POD in older patients undergoing hip fracture surgery are largely uncertain. Recent systemic reviews and meta-analyses found no evidence to suggest that anesthesia technique influences POD but noted that evidence-based results is lacking [17].

Recently, several comparative studies have been published [9,10,11,12,13] and two large RCTs investigating this outcome, namely the REGAIN trial [18] and the RAGA trial [19], have been performed. In light of the newly published data, we conducted a systematic review and stratified meta-analysis of the recent comparative studies to investigate whether NA or GA affected the incidence of POD and its associated clinical outcomes in elderly patients undergoing hip fracture surgery, including POD from day 2–7, postoperative MMSE score and neuropsychological test results, postoperative pain control, length of hospital stay, incidence of discharge to same preadmission residence, in-hospital mortality, 30-day and 90-day mortality rates.

Study design and methodology

This study protocol was pre-registered on PROSPERO (The International Prospective Register of Systematic Reviews) and on OSF.io. The meta-analysis and systematic review were conducted according to PRISMA guidelines (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) [20].

Eligibility criteria

We applied a strict inclusion and exclusion criteria. We included all randomized controlled studies published in the last 20years, from the year 2000 to December 2021 comparing NA versus GA in elderly patients undergoing hip fracture surgeries if they included delirium as an outcome. We only included traditionally published journal articles. We excluded studies that did not report the primary outcomes, as well as grey literature such as non-traditional articles including study protocols, case reports, audits, editorials, commentaries, conference reports and abstracts. No language restriction was placed on our search.

Information source and search strategy

Two authors (KYC and TXY) independently searched PubMed, EMBASE, Google Scholar, Cochrane Library, and Web of Science for relevant results, using a combination of specific keywords and terms, including “spinal anesthesia”, “regional anesthesia”, “neuraxial anesthesia’, “general anesthesia”, “hip fracture”, “surgery”, “delirium”, and others. The search was updated to December 31, 2022, and the full search strategy and results can be found in the Appendix. We then scanned the reference lists and citations of the studies and any other relevant systematic reviews for further results.

Data extraction

Both authors (KYC and TXY) performed initial data extraction from the included RCTs into a predesigned data spreadsheet separately. This data was then combined, and any disagreements were adjudicated by the third author (ES). Including the outcome measures, we extracted the following data: year of publication, country, sample size, age, sex, American Society of Anesthesiologists (ASA) physical status, type of surgery, anesthesia techniques, adjunct interventions, including use of nerve blocks and sedation, provided definitions of postoperative delirium, inclusion or exclusion of patients with pre-existing dementia or delirium, day of baseline and postoperative assessment of delirium. Due to challenges in the translation of the full study, the data from one RCT reported in Japanese [21] was obtained from Guay et al. [3] instead. The patient characteristics, study characteristics as well as the outcome data were tabulated using standardized forms. The appendices of the studies were checked for any missing data and the authors of the studies were contacted for missing data.

Outcomes

The outcomes evaluated were the incidence of postoperative delirium, postoperative delirium incidence from day 2–7, postoperative MMSE (mini-mental state examination) score and other neuropsychological tests results, postoperative pain control, length of hospital stay, incidence of discharge to same preadmission residence, in-hospital mortality, 30-day and 90-day mortality rates.

Risk of bias and methodological quality assessment

Using the Cochrane Collaboration risk-of-bias tool [22,23,24,25], both KYC and TXY independently assessed all the studies and evaluated them for bias, including selection bias, including random sequence generation and allocation concealment, performance bias such as blinding of participants and personnel, detection bias such as blinding of outcome assessment, attrition bias due to incomplete outcome data and reporting bias. We categorized the risk of bias as high risk, low risk, or unclear. Any difference in opinion was adjudicated by a third author DYC. We also evaluated the quality of evidence for each outcome according to the Grading of Recommendations, Assessment, Development and Evaluation (GRADE) tool [26, 27].

Statistical analysis and discussion

The risk of bias table and meta-analysis were performed using the latest version of RevMan (5.4), available on the Cochrane website [22,23,24,25]. If the data reported median and inter-quartile ranges, the data was converted to estimated mean and standard deviation using the method described by Wan et al. [28]. We utilized a random effects model (DerSimonian and Laird method) [29, 30] for summary estimates. Pooled estimates for dichotomous outcomes were presented as Odds ratios (ORs) with 95% confidence intervals (CIs), and for continuous outcomes, mean differences with standard deviations were used. For transparency and to balance between the potential for bias and the loss of precision when studies at high or unclear risk of bias were excluded, we stratified the studies by the summary of risk of bias and presented the estimates of the intervention effects from the studies at low risk of bias and from all studies. Studies were assessed for heterogeneity using Cochran’s Q test and I2 tests. Substantial heterogeneity was defined as I2 > 50%, and in case of significant heterogeneity, we performed robustness tests to determine the source of heterogeneity, as well as further post hoc subgroup analyses to look for confounders as means of investigating heterogeneous results [31, 32]. The risk of publication bias was assessed by funnel plots and Egger’s regression test [33,34,35].

We performed a trial sequential analysis (TSA), a cumulative meta-analysis method to control both type I and type II errors and determine if the results of the meta-analysis are reliable, utilizing TSA software version 0.9.5.10 Beta [36]. For the primary outcome, the calculation was based on an anticipated 5% relative risk reduction in the intervention group using a two-sided alpha of 0.05 and a power of 80%. We estimated the heterogeneity level and control-group event rates from the meta-analyses.

Ethics

None required.

Outcomes of meta-analysis

Study identification and selection

A total of 1075 studies were identified via database searches and screening of reference lists included in retained studies. 438 studies were removed for duplication, 606 papers were excluded by screening of titles and abstracts, 31 papers were assessed in detail, and 21 articles were excluded for the following reasons: ongoing trials, different intervention, and irrelevant outcomes of interest for this study. A total of 10 RCTs were finally included in this meta-analysis [18, 19, 21, 37,38,39,40,41,42,43]. The flow diagram of the study selection is shown in Fig. 1.

Fig. 1
figure 1

PRISMA study flow diagram. PRISMA = preferred reporting items for systematic reviews and meta-analysis

Study characteristics

The characteristics of each included RCT are summarized in Table 1 [18, 19, 21, 37,38,39,40,41,42,43]. The 10 studies included 3968 patients with hip fractures who received either NA (spinal, epidural, or combined spinal epidural) or GA. Five out of ten studies allowed for sedation if required or additional regional blocks. Patients over 40years old with an ASA status ranging from I to IV were included. All RCTs were conducted after the year 2000; six studies were conducted in Asia (China, Iran, Japan and Korea), three in Europe (UK, Greece, Italy), and one in North America (USA and Canada). Nine out of ten trials demonstrated low to moderate risk of bias except for blinding of participants and personnel (Fig. 2).

Table 1 Summary of characteristics of study
Fig. 2
figure 2

Risk of bias graph and summary for included studies. (“ + ” = low risk of bias; “-s” = high risk of bias; and “?” = unclear risk of bias)

Postoperative delirium incidence

Nine studies [18, 19, 21, 38,39,40,41,42,43] which included 3227 patients reported POD incidence as an outcome. We found no significant difference in incidence of POD between the NA and GA groups as reported in Fig. 3 [OR 1.10, 95% CI (0.89 to 1.37)] (GRADE: High). There was also no overall significant heterogeneity with an I2 of 0% and Z-value was insignificant (p = 0.82). Subgroup analysis performed also showed no significant difference in POD incidence [I2 = 0%, Chi2 = 0.03, df = 1 (p = 0.86)] between the group of studies which included patients with pre-existing dementia and those who did not. Visual examination of the funnel plot (see appendix, eFig. 1) found no evidence of publication bias.

Fig. 3
figure 3

Forest plot for all study outcomes (MMSE = mini mental state of examination; NA = neuraxial anesthesia; GA = general anesthesia)

Postoperative delirium incidence from day 2–7

Two studies [42, 43] including 417 patients reported incidence of POD persisting after postoperative day one to up to one week. No significant difference was found between NA and GA as reported in Fig. 3 [OR 0.31, 95% CI (0.06 to 1.63)] (GRADE: High). Both heterogeneity with an I2 of 0% and a Z-value of p = 0.17 were insignificant. Visual examination of the funnel plot (see appendix, eFig. 1) found no evidence of publication bias.

Postoperative MMSE score

Three studies [37, 40, 43] including 381 patients also reported postoperative MMSE score as an outcome. There was no significant difference in the MMSE score between NA and GA groups as reported in Fig. 3 [OR 0.07, 95% CI (-0.22 to 0.36)] (GRADE: Moderate). There was moderate heterogeneity with an I2 of 41% and an insignificant Z-value (p = 0.63). No sensitivity analysis was performed given the small number of patients reported for this outcome. Visual examination of the funnel plot (see appendix, eFig. 1) found no evidence of publication bias.

Postoperative neuropsychological tests

Two studies [37, 40] reported on postoperative neuropsychological tests including the Beck Depression Inventory, Trail-making Test Part A and B as outcomes, and a total of 351 patients were included. There were no significant differences between NA and GA groups for all three neuropsychological tests as shown in Fig. 3. For Beck Depression Inventory, the OR was 0.92, 95% CI (-0.55 to 2.40) (GRADE: Low). Visual examination of the funnel plot (see appendix, eFig. 1) found no evidence of publication bias. There was significant heterogeneity with I2 = 80%. For Trail-Making Test Part A, the OR was 9.90, 95% CI (-9.23 to 29.04) (GRADE: Low) and showed significant heterogeneity with I2 = 62%. For Trail-Making Test Part B, the OR was 15.34, 95% CI (-24.89 to 55.56) (GRADE: Low), and again there was significant heterogeneity with I2 = 64%. Z-values for all three tests were insignificant (p = 0.22 to 0.45). No sensitivity analysis was performed in consideration of the limited number of studies.

Length of hospital stay

Seven studies [18, 19, 37, 39,40,41,42] including 3717 patients reported length of hospital stay. Although NA appeared to be associated with a shorter duration of hospital stay, especially in patients without pre-existing dementia or delirium, the observed effect did not reach statistical significance as reported in Fig. 3 [OR -0.23, 95% CI (-0.46 to 0.01)] (GRADE: Moderate). Heterogeneity was moderate with an I2 = 55% while Z-value was insignificant (p = 0.06). Subgroup analyses were performed and showed no significant difference. Visual examination of the funnel plot (see appendix, eFig. 1) found no evidence of publication bias.

Incidence of postoperative discharge to same preadmission residence

Two studies [18, 41] that included 1876 patients reported incidence of postoperative discharge to same preadmission residence as an outcome. We found no significant difference in incidence of discharge to same preadmission residence between the NA and GA groups, as reported in Fig. 3 [OR 1.05, 95% CI (0.85 to 1.31)] (GRADE: High). There was no heterogeneity with I2 of 0% and an insignificant Z-value (p = 0.64) for overall effect. Visual examination of the funnel plot (see appendix, eFig. 1) found no evidence of publication bias.

In-hospital mortality

Three studies [18, 39, 42] including 2135 patients reported on in-hospital mortality as an outcome. There was no significant difference in mortality rate between the NA and GA groups, as shown in Fig. 3 [OR 1.98, 95% CI (0.20 to 19.25)] (GRADE: Low). There was significant heterogeneity with I2 = 72% and Z-value was insignificant (p = 0.56). Sensitivity analysis showed Neuman 2021 as the source of the heterogeneity. Visual examination of the funnel plot (see appendix, eFig. 1) found no evidence of publication bias.

30-day mortality

Three studies [19, 39, 41] reported 30-day mortality as an outcome and included 1431 patients. We did not find any significant difference between the NA and GA groups with regard to 30-day mortality, as reported in Fig. 3 [OR 1.03, 95% CI (0.47 to 2.25)] (GRADE: Moderate). There was no significant heterogeneity with I2 = 0, and Z-value for overall effect was insignificant (p = 0.94). Visual examination of the funnel plot (see appendix, eFig. 1) found no evidence of publication bias.

90-day mortality

Two studies [39, 41] including 498 patients reported 90-day mortality rate as an outcome. There was no significant difference in 90-day mortality rate between the NA and GA groups, as reported in Fig. 3 [OR 1.08, 95% CI (0.53–2.24)] (GRADE: Moderate). There was negligible heterogeneity with I2 = 0 and Z-value was insignificant (p = 0.83). Visual examination of the funnel plot (see appendix, eFig. 1) found no evidence of publication bias.

Postoperative pain control

Postoperative pain control was reported in 6 studies included in this meta-analysis [18, 19, 38, 40, 42,43,44] (see appendix, Table S1). However, the measuring tools and time of measurement of this outcome varied significantly across studies.

Heidari et al. [42] reported better pain relief immediately after surgery in the Post-Anesthesia Care Unit (PACU). Casati et al. [43] also reported better pain control in the NA group in the immediate postoperative period (1h) and at PACU discharge, but there was no significant difference in pain relief between the two groups 3h after the end of surgery. The REGAIN trial [18, 44], however, reported greater worst pain over the first 24 postoperative hours after surgery in the SA group (mean difference 0.40 [95% CI, 0.12 to 0.68]).

For pain control after the first 24 postoperative hours, Heidari et al. [42] reported no difference on the second and fifth postoperative days, while Tang et al. [38] and Tzimas et al. [40] also reported no significant difference in visual analogue scale (VAS) pain scores over 24h (p = 0.186) and the first 3days (p = 0.058 to 0.208) respectively. Li et al. [19] reported no significant difference in terms of worst pain score evaluated by VAS over the first 7 postoperative days. Contrarily, the REGAIN trial [18, 44] reported higher prescription analgesic use in the SA group at 60days (relative risk 1.33 [CI, 1.06 to 1.65]) compared to the GA group. Pain did not differ across groups at other time points, and overall satisfaction was similar across groups.

Trial sequential analysis

Trial sequential analysis [36] is a methodology, similar to interim analyses performed in single randomized trials, that provides adjusted wider thresholds for significance in RCT when the required number of participants and the corresponding number of trials in a meta-analysis has not been reached. It can be used to decide whether a trial should be terminated early because of a sufficiently small P-value. It is utilized in meta-analyses to control random errors and to assess the need for further trials by weighing type I and II errors and to estimate when the effect is large enough to be unaffected by further studies, and it is a useful tool to assess the definitiveness of a meta-analysis. The cumulative z-statistic curve is drawn by adding the included studies chronologically. If this line does not cross the monitoring boundaries, there is likely no statistical significance and more studies are needed. If it remains in the inner wedge within the conventional boundaries, it is unlikely that further studies would change the effect. The required information size is a predefined calculation of the number of participants and events necessary to detect or reject an a priori assumed intervention effect.

TSA was performed for the findings of POD incidence, MMSE score, POD incidence from postoperative day 2–7, duration of hospital stay, incidence of discharge to same residence, and 30-day and 90-day mortality rates.

For incidence of POD in Fig. 4, the Z-curve (blue line) remained within the futility boundaries and did not reach the required information size. More randomized trials are required to reach definitive conclusions.

Fig. 4
figure 4

Trial sequential analysis of postoperative delirium incidence for neuraxial anesthesia versus general anesthesia

The length of hospital stay favoured NA in the Forest plot although it was overall statistically insignificant. In the TSA analysis as shown in Fig. 5, it did not reach the required information size, hence no concrete conclusion can be drawn, and more trials are needed.

Fig. 5
figure 5

Trial sequential analysis of length of hospital stay for neuraxial anesthesia versus general anesthesia

Regarding other outcomes including MMSE score, persistent POD from postoperative day 2–7, discharge to same preadmission residence and 30-day mortality rate, the Z-curve (blue line) remained within the boundaries for conventional benefit and monitoring boundaries, but remained outside the futility boundaries; thus we cannot conclude on the effect of NA and GA on these outcomes unless sufficient statistical power is achieved (see Appendix, eFigs. 2, 3, 4, 5, 6, and 7).

Discussion

This meta-analysis was performed with a primary focus on investigating the impact of different anesthesia regimens on POD, cognitive impairment, and associated clinical implications including mortality, duration of hospital stay, and functional recovery.

In this meta-analysis, we showed there was no statistically significant difference in POD incidence between NA and GA groups. Subgroup analysis showed no difference in outcome either including or excluding patients with pre-existing dementia or preoperative delirium. TSA indicated the required information size was not reached and further large-sized randomized trials will be needed to reach definitive conclusions on this outcome. There were no significant differences in the other outcomes between the two anesthesia groups, but most were underpowered.

Our findings were similar to those of previous systematic reviews [17], and meta-analyses [3, 4]. However, Guay et al. [3] highlighted that most studies had poorly defined definitions of POD, and the time point of its evaluation was often unclear. These factors contributed to a widely varied outcome incidence and a statistically large heterogeneity among studies. Another concern was that most included studies were conducted before 2000 and referred to outdated clinical practices such as the use of long-acting benzodiazepines as induction agents [45]. Larger RCTs with standardized outcome definitions were recommended to eliminate the effect of such factors for recent analysis [3].

In response to the highlighted issues, this meta-analysis only included contemporary RCTs from 2000–2021 including 2 large multi-centre RCTs published in the past two years, the RAGA [19] and REGAIN trials [18], and included 3227 patients in the pooled analysis. Included RCTs provided a clear definition, despite not being uniform across considered studies, of POD and most used the CAM criteria [46] or other well-validated assessment tools for diagnosis of delirium. Nine out of ten trials demonstrated a low to moderate risk of bias except for blinding of participants and personnel. GRADE quality of evidence for seven out of eight reported outcomes were moderate to high. TSA was also performed to monitor for potential random errors and assess need for further trials.

Our study was able to demonstrate reduced heterogeneity for POD incidence compared to previous meta-analyses, which may have partially addressed the issue stemming from non-standardized outcome definitions and smaller sample size, resulting in a wide variability of reported outcomes. Although 5 RCTS used CAM test for its evaluation and 3 RCTS used other well-validated methods, 1 RCT [39] did not provide a clear definition of delirium nor information regarding its evaluation. Ren’s RCT [37] was not included in our evaluation of POD incidence because it did not have a clear definition nor actual data provided for this outcome. The information size was also not reached, indicating the need for further future trials with uniform and clear definitions of POD is essential. In addition, the development of and persistence of delirium is a multi-factorial process and confounding factors should be considered.

Pre-existing delirium or cognitive impairment is a predictor of postoperative delirium [14, 47]. Four RCTs, including the RAGA trial, adopted a broader inclusion criterion to include patients with preoperative dementia or delirium to enhance generalizability and reflect current practice. Five other studies excluded this group of patients to eliminate potential confounding factors. Subgroup analysis was conducted and showed no statistically significant difference in POD risk between these two subgroups.

The potential effect of sedatives given during NA and regional blocks performed during GA must also be considered [48,49,50,51,52]. More recent RCTS allowed the use of short-acting sedatives such as midazolam, if required. Both RAGA and REGAIN trials reported the use of regional blocks in both anesthetic groups. Although studies that prohibited sedatives did not find significant differences in POD incidence, this shift to multi-modal anesthesia regimens may mask any actual differences between NA and GA groups. Nevertheless, it is reflective of advancements in anesthesia practice and is more clinically relevant compared to previous meta-analyses.

Suboptimal pain control is another key factor to POD [14, 47, 53]. Although six studies reported on postoperative pain control, the measuring tools, time, and duration of pain score measurements varied significantly across studies, and data could not be pooled for analysis. Some studies reported better pain control in the NA group in the immediate postoperative period. The REGAIN trial, however, reported greater worse pain scores over the first 24 postoperative hours and higher prescription analgesic use at 60days for the NA group. A significant proportion of patients in both groups reported high levels of pain over the first 3 postoperative days (70% of enrolled patients). Contrary to other studies, the authors acknowledged this discrepancy and attributed it to possible undertreatment of pain for patients in the NA group, highlighting the need for additional efforts across both groups to better manage postoperative pain. Overall, none of the studies showed any significant difference in pain control between anesthetic regimens. Future studies reporting pain outcomes with standardized pain assessment methods, postoperative analgesic regimens, at standardized time points during the rehabilitation period would be beneficial for future analysis.

Regarding other delirium-related outcomes, there were no significant differences in POD prevalence after postoperative day one, MMSE scores, and postoperative cognitive functioning. Our study also did not find significant differences between groups for associated clinical implications including mortality and functional recovery during or after rehabilitation. However, most of these outcomes had limited data and were underpowered.

For duration of hospital stay, it approached statistical significance favouring NA in the Forest Plot, but the overall result was statistically insignificant. TSA showed that the required information size was not reached; hence more studies should be conducted.

We also took note of the recently published meta-analysis by Kunutsor et al. published in the BJA September 2022 issue [54] which compared several perioperative outcomes of NA vs GA for hip fractures including POD. Our findings were compatible with their data analysis results on common outcomes, including POD incidence, pain control, length of hospital stay (as proxy of being out of bed on the first postoperative day), incidence of return to preoperative preadmission residence, 30-day and 90-day mortality rates, but our meta-analysis focused more in detail on the effects of anesthesia regimens on POD incidence, neuropsychological functioning, and related clinical outcomes.

Considering the ongoing large Improve Hip Fracture Outcome in the Elderly Patient (iHOPE) trial [55] which intends to recruit 1032 patients, and thus reaching the required information size for POD based on our TSA, we hope more conclusive evidence could be made regarding POD and its associated outcomes to guide future clinical practice. Furthermore, we noted that the duration of POD was not well documented across studies, despite having significant impact on patient outcomes. Delirium-related implications including rehabilitation capacity or post-discharge to same preadmission residence were also not often evaluated. Inclusion of such outcomes in future trials are of great importance to guide future evidence-based practice.

Geriatric hip fracture anesthesia in the geriatric population remains a highly challenging part of anesthesia practice. Current studies and our meta-analysis have been unable to identify an ideal anesthetic regimen for hip fracture surgery, thus deliberate consideration should also be given to individual comorbidities, the expertise of anesthetists and surgeons, as well as patients’ wishes to achieve better care for our geriatric population.

Limitations

One limitation of this study is the lack of statistical power for key outcomes. Despite including two recent large-scale RCTs, Forest plot analysis and TSA indicated that the required information size was not reached to make a conclusive judgement. Secondary delirium-associated outcomes were often reported only in a limited number of RCTs and were mostly underpowered.

While in this meta-analysis statistical heterogeneity for the primary outcome was insignificant, moderate heterogeneity was observed in a small number of secondary outcomes of our study. The Heidari trial [42] was a potential source for heterogeneity due to its inclusion of a younger hip fracture population (study inclusion criteria as age above 40years old). Nevertheless, this subgroup of patients below 50years old (total 85 participants) was only a small fraction of the total 3968 participants in the pooled analysis and hence would have likely had a minimal impact on the pooled analyses.

Furthermore, in this meta-analysis we noted that duration of POD was not well documented and its effect on postoperative functionality was also not well explored. Reporting on pain scores and postoperative pain regimens were also varied. Limited data were reported and a lack of standardized assessment tools prevented any pooled analysis and subgroup analyses or pooled analyses for the above-mentioned outcomes. The inclusion of such standardized outcomes would be desirable in future trials.

Conclusions

This meta-analysis did not find any statistically significant difference in POD incidence between NA and GA groups or in any subgroup analyses. There was no difference in delirium incidence regardless of inclusion or exclusion of patients with pre-existing dementia or preoperative delirium. NA appeared to be associated with a shorter hospital stay, especially in patients without pre-existing dementia, but the observed effect did not reach statistical significance. There were no significant differences in other delirium-associated clinical outcomes. Larger prospective randomized trials with a uniform and clear definition of POD, outcomes on delirium prevalence or duration, rehabilitation and postoperative functionality are required.

Availability of data and materials

All data generated or analyzed during this study are included in this published article.

Abbreviations

ASA:

American Society of Anesthesiologists

CAM:

Confusion assessment method

CSE:

Combined-spinal epidural

CI:

Confidence interval

EA:

Epidural anesthesia

GRADE:

The grading of recommendations, assessment, development, and evaluation methodology

GA:

General anesthesia

MMSE:

Mini mental state examination

NA:

Neuraxial anesthesia

N/A:

Not available

OAAS:

Observer’s Assessment of Alertness/Sedation Scale

POD:

Postoperative delirium

PONV:

Postoperative nausea and vomiting

PRISMA:

Preferred reporting items for systematic reviews and meta-analyses

RA:

Regional Anesthesia

RCT:

Randomized controlled trial

OR:

Odds ratio

SMD:

Standardized mean difference

TSA:

Trial Sequential Analysis

References

  1. Odén A, McCloskey EV, Johansson H, Kanis JA. Assessing the impact of osteoporosis on the burden of hip fractures. Calcif Tissue Int. 2013;92:42–9.

    Article  PubMed  Google Scholar 

  2. Cooper C, Campion G, Melton LJ. Hip fractures in the elderly: a world-wide projection. Osteoporos Int. 1992;2:285–9.

    Article  CAS  PubMed  Google Scholar 

  3. Parker MJ, Handoll HH, Griffiths R. Anaesthesia for hip fracture surgery in adults. Cochrane Database Syst Rev. 2004;2017:CD000521.

    Google Scholar 

  4. Chen DX, Yang L, Ding L, Li SY, Qi YN, Li Q. Perioperative outcomes in geriatric patients undergoing hip fracture surgery with different anesthesia techniques: a systematic review and meta-analysis. Medicine (Baltimore). 2019;98: e18220.

    Article  PubMed  Google Scholar 

  5. Zheng X, Tan Y, Gao Y, Liu Z. Comparative efficacy of neuraxial and general anesthesia for hip fracture surgery: a meta-analysis of randomized clinical trials. BMC Anesthesiol. 2020;20:162.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Van Waesberghe J, Stevanovic A, Rossaint R, Coburn M. General vs. neuraxial anaesthesia in hip fracture patients: a systematic review and meta-analysis. BMC Anesthesiol. 2017;17:87.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Desai V, Chan PH, Prentice HA, Zohman GL, Diekmann GR, Maletis GB, et al. Is anesthesia technique associated with a higher risk of mortality or complications within 90 days of surgery for geriatric patients with hip fractures? Clin Orthop Relat Res. 2018;476:1178–88.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Guay J, Choi P, Suresh S, Albert N, Kopp S, Pace NL. Neuraxial blockade for the prevention of postoperative mortality and major morbidity: an overview of Cochrane systematic reviews. Cochrane Database Syst Rev. 2014;2014:CD010108.

    PubMed  PubMed Central  Google Scholar 

  9. Ahn EJ, Kim HJ, Kim KW, Choi HR, Kang H, Bang SR. Comparison of general anaesthesia and regional anaesthesia in terms of mortality and complications in elderly patients with hip fracture: a nationwide population-based study. BMJ Open. 2019;9: e029245.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Neuman MD, Silber JH, Elkassabany NM, Ludwig JM, Fleisher LA. Comparative effectiveness of regional versus general anesthesia for hip fracture surgery in adults. Anesthesiology. 2012;117:72–92.

    Article  CAS  PubMed  Google Scholar 

  11. Chu CC, Weng SF, Chen KT, Chien CC, Shieh JP, Chen JY, et al. Propensity score-matched comparison of postoperative adverse outcomes between geriatric patients given a general or a neuraxial anesthetic for hip surgery: a population-based study. Anesthesiology. 2015;123:136–47.

    Article  PubMed  Google Scholar 

  12. Shih Y-J, Hsieh C-H, Kang T-W, Peng S-Y, Fan K-T, Wang L-M. General versus spinal anesthesia: which is a risk factor for octogenarian hip fracture repair patients? Int J Gerontol. 2010;4:37–42.

    Article  Google Scholar 

  13. Fields AC, Dieterich JD, Buterbaugh K, Moucha CS. Short-term complications in hip fracture surgery using spinal versus general anaesthesia. Injury. 2015;46:719–23.

    Article  PubMed  Google Scholar 

  14. Wu J, Yin Y, Jin M, Li B. The risk factors for postoperative delirium in adult patients after hip fracture surgery: a systematic review and meta-analysis. Int J Geriatr Psychiatry. 2021;36:3–14.

    Article  PubMed  Google Scholar 

  15. Bai J, Zhang P, Liang X, Wu Z, Wang J, Liang Y. Association between dementia and mortality in the elderly patients undergoing hip fracture surgery: a meta-analysis. J Orthop Surg Res. 2018;13:298.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Witlox J, Eurelings LS, de Jonghe JF, Kalisvaart KJ, Eikelenboom P, Van Gool WA. Delirium in elderly patients and the risk of postdischarge mortality, institutionalization, and dementia: a meta-analysis. JAMA. 2010;304:443–51.

    Article  CAS  PubMed  Google Scholar 

  17. Patel V, Champaneria R, Dretzke J, Yeung J. Effect of regional versus general anaesthesia on postoperative delirium in elderly patients undergoing surgery for hip fracture: a systematic review. BMJ Open. 2018;8: e020757.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Neuman MD, Feng R, Carson JL, Gaskins LJ, Dillane D, Sessler DI, et al. Spinal anesthesia or general anesthesia for hip surgery in older adults. N Engl J Med. 2021;385:2025–35.

    Article  CAS  PubMed  Google Scholar 

  19. Li T, Li J, Yuan L, Wu J, Jiang C, Daniels J, et al. Effect of regional vs general anesthesia on incidence of postoperative delirium in older patients undergoing hip fracture surgery: the RAGA randomized trial. JAMA. 2022;327:50–8.

    Article  CAS  PubMed  Google Scholar 

  20. Moher D, Shamseer L, Clarke M, Ghersi D, Liberati A, Petticrew M, et al. Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015 statement. Syst Rev. 2015;4:1.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Kamitani K, Higuchi A, Asahi T, Yoshida H. Postoperative delirium after general anesthesia vs. spinal anesthesia in geriatric patients. Masui. 2003;52:972–5.

    PubMed  Google Scholar 

  22. Higgins JP, Savović J, Page MJ, Elbers RG, Sterne JA. Assessing risk of bias in a randomized trial. In: Higgins JPT, Thomas J, Chandler J, Cumpston M, Li T, Page MJ, Welch VA, editors. Cochrane handbook for systematic reviews of interventions. Hoboken, NJ: Wiley; 2019. p. 205–28.

    Chapter  Google Scholar 

  23. Higgins JP, Altman DG, Gøtzsche PC, Jüni P, Moher D, Oxman AD, et al. The Cochrane collaboration’s tool for assessing risk of bias in randomised trials. BMJ. 2011;343: d5928.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Sterne JAC, Savović J, Page MJ, Elbers RG, Blencowe NS, Boutron I, et al. RoB 2: a revised tool for assessing risk of bias in randomised trials. BMJ. 2019;366: l4898.

    Article  PubMed  Google Scholar 

  25. Sterne JA, Hernán MA, Reeves BC, Savović J, Berkman ND, Viswanathan M, et al. ROBINS-I: a tool for assessing risk of bias in non-randomised studies of interventions. BMJ. 2016;355: i4919.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Guyatt G, Oxman AD, Akl EA, Kunz R, Vist G, Brozek J, et al. GRADE guidelines: 1. Introduction-GRADE evidence profiles and summary of findings tables. J Clin Epidemiol. 2011;64:383–94.

    Article  PubMed  Google Scholar 

  27. Schünemann H, Brożek J, Guyatt G, Oxman A. GRADE handbook for grading quality of evidence and strength of recommendations. The GRADE Working Group. 2013. https://gdt.gradepro.org/app/handbook/handbook.html . Updated October 2013.

  28. Wan X, Wang W, Liu J, Tong T. Estimating the sample mean and standard deviation from the sample size, median, range and/or interquartile range. BMC Med Res Methodol. 2014;14:135.

    Article  PubMed  PubMed Central  Google Scholar 

  29. DerSimonian R, Laird N. Meta-analysis in clinical trials. Control Clin Trials. 1986;7:177–88.

    Article  CAS  PubMed  Google Scholar 

  30. DerSimonian R, Laird N. Meta-analysis in clinical trials revisited. Contemp Clin Trials. 2015;45:139–45.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Higgins JP, Li T, Deeks JJ. Choosing effect measures and computing estimates of effect. In: Higgins JPT, Thomas J, Chandler J, Cumpston M, Li T, Page MJ, Welch V, editors. Cochrane handbook for systematic reviews of interventions. Hoboken, NJ: Wiley; 2019. p. 143–76.

    Chapter  Google Scholar 

  32. Higgins JP, Thompson SG. Quantifying heterogeneity in a meta-analysis. Stat Med. 2002;21:1539–58.

    Article  PubMed  Google Scholar 

  33. Sterne JA, Egger M, Smith GD. Systematic reviews in health care: investigating and dealing with publication and other biases in meta-analysis. BMJ. 2001;323:101–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Rücker G, Schwarzer G, Carpenter JR, Binder H, Schumacher M. Treatment-effect estimates adjusted for small-study effects via a limit meta-analysis. Biostatistics. 2011;12:122–42.

    Article  PubMed  Google Scholar 

  35. Egger M, Smith GD, Schneider M, Minder C. Bias in meta-analysis detected by a simple, graphical test. BMJ. 1997;315:629–34.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Wetterslev J, Jakobsen JC, Gluud C. Trial sequential analysis in systematic reviews with meta-analysis. BMC Med Res Methodol. 2017;17:39.

    Article  PubMed  PubMed Central  Google Scholar 

  37. Ren W-X, Wu R-R. Effect of general and sub-arachnoid anesthesia on the incidence of postoperative delirium and cognitive impairments in elderly Chinese patients. Trop J Pharm Res. 2021;20:433–9.

    Article  Google Scholar 

  38. Tang L, Fang P, Fang Y, Lu Y, Xu G, Liu X. Comparison of effects between combined lumbar-sacral plexus block plus general anesthesia and unilateral spinal anesthesia in elderly patients undergoing hip fracture surgery: a pilot randomized controlled trial. Evid Based Complement Alternat Med. 2021;2021:6685497.

    Article  PubMed  PubMed Central  Google Scholar 

  39. Shin S, Kim SH, Park KK, Kim SJ, Bae JC, Choi YS. Effects of anesthesia techniques on outcomes after hip fracture surgery in elderly patients: a prospective, randomized, controlled trial. J Clin Med. 2020;9:1605.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Tzimas P, Samara E, Petrou A, Korompilias A, Chalkias A, Papadopoulos G. The influence of anesthetic techniques on postoperative cognitive function in elderly patients undergoing hip fracture surgery: general vs spinal anesthesia. Injury. 2018;49:2221–6.

    Article  PubMed  Google Scholar 

  41. Parker MJ, Griffiths R. General versus regional anaesthesia for hip fractures. a pilot randomised controlled trial of 322 patients. Injury. 2015;46:1562–6.

    Article  PubMed  Google Scholar 

  42. Heidari SM, Soltani H, Hashemi SJ, Talakoub R, Soleimani B. Comparative study of two anesthesia methods according to postoperative complications and one month mortality rate in the candidates of hip surgery. J Res Med Sci. 2011;16:323–30.

    PubMed  PubMed Central  Google Scholar 

  43. Casati A, Aldegheri G, Vinciguerra E, Marsan A, Fraschini G, Torri G. Randomized comparison between sevoflurane anaesthesia and unilateral spinal anaesthesia in elderly patients undergoing orthopaedic surgery. Eur J Anaesthesiol. 2003;20:640–6.

    Article  CAS  PubMed  Google Scholar 

  44. Neuman MD, Feng R, Ellenberg SS, Sieber F, Sessler DI, Magaziner J, et al. Pain, analgesic use, and patient satisfaction with spinal versus general anesthesia for hip fracture surgery : a randomized clinical trial. Ann Intern Med. 2022;175:952–60.

    Article  PubMed  Google Scholar 

  45. Bigler D, Adelhøj B, Petring OU, Pederson NO, Busch P, Kalhke P. Mental function and morbidity after acute hip surgery during spinal and general anaesthesia. Anaesthesia. 1985;40:672–6.

    Article  CAS  PubMed  Google Scholar 

  46. Wei LA, Fearing MA, Sternberg EJ, Inouye SK. The confusion assessment method: a systematic review of current usage. J Am Geriatr Soc. 2008;56:823–30.

    Article  PubMed  PubMed Central  Google Scholar 

  47. Pompei P, Foreman M, Rudberg MA, Inouye SK, Braund V, Cassel CK. Delirium in hospitalized older persons: outcomes and predictors. J Am Geriatr Soc. 1994;42:809–15.

    Article  CAS  PubMed  Google Scholar 

  48. Guay J, Kopp S. Peripheral nerve blocks for hip fractures in adults. Cochrane Database Syst Rev. 2020;11:CD.

    Google Scholar 

  49. Mouzopoulos G, Vasiliadis G, Lasanianos N, Nikolaras G, Morakis E, Kaminaris M. Fascia iliaca block prophylaxis for hip fracture patients at risk for delirium: a randomized placebo-controlled study. J Orthop Traumatol. 2009;10:127–33.

    Article  PubMed  PubMed Central  Google Scholar 

  50. Guay J, Parker MJ, Griffiths R, Kopp SL. Peripheral nerve blocks for hip fractures: a Cochrane review. Anesth Analg. 2018;126:1695–704.

    Article  PubMed  Google Scholar 

  51. Sieber FE, Zakriya KJ, Gottschalk A, Blute MR, Lee HB, Rosenberg PB, et al. Sedation depth during spinal anesthesia and the development of postoperative delirium in elderly patients undergoing hip fracture repair. Mayo Clin Proc. 2010;85:18–26.

    Article  PubMed  PubMed Central  Google Scholar 

  52. Sieber FE, Neufeld KJ, Gottschalk A, Bigelow GE, Oh ES, Rosenberg PB, et al. Effect of depth of sedation in older patients undergoing hip fracture repair on postoperative delirium: the STRIDE randomized clinical trial. JAMA Surg. 2018;153:987–95.

    Article  PubMed  PubMed Central  Google Scholar 

  53. Vaurio LE, Sands LP, Wang Y, Mullen EA, Leung JM. Postoperative delirium: the importance of pain and pain management. Anesth Analg. 2006;102:1267–73.

    Article  PubMed  Google Scholar 

  54. Kunutsor SK, Hamal PB, Tomassini S, Yeung J, Whitehouse MR, Matharu GS. Clinical effectiveness and safety of spinal anaesthesia compared with general anaesthesia in patients undergoing hip fracture surgery using a consensus-based core outcome set and patient-and public-informed outcomes: a systematic review and meta-analysis of randomised controlled trials. Br J Anaesth. 2022;129:788–800.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  55. Kowark A, Adam C, Ahrens J, Bajbouj M, Bollheimer C, Borowski M, et al. Improve hip fracture outcome in the elderly patient (iHOPE): a study protocol for a pragmatic, multicentre randomised controlled trial to test the efficacy of spinal versus general anaesthesia. BMJ Open. 2018;8: e023609.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

Not applicable.

Funding

There was no funding source in this study.

Author information

Authors and Affiliations

Authors

Contributions

KYC and TXY designed the study, performed the statistical analysis, and drafted the manuscript. Any difference in opinion was adjudicated by DYC and EHS participated in overall coordination of the study. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Karis Yui-Lam Cheung.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cheung, K.YL., Yang, T.X., Chong, D.YC. et al. Neuraxial versus general anesthesia in elderly patients undergoing hip fracture surgery and the incidence of postoperative delirium: a systematic review and stratified meta-analysis. BMC Anesthesiol 23, 250 (2023). https://doi.org/10.1186/s12871-023-02196-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12871-023-02196-9

Keywords