Skip to main content

Machine learning approach for predicting post-intubation hemodynamic instability (PIHI) index values: towards enhanced perioperative anesthesia quality and safety

Abstract

Background

Adequate preoperative evaluation of the post-intubation hemodynamic instability (PIHI) is crucial for accurate risk assessment and efficient anesthesia management. However, the incorporation of this evaluation within a predictive framework have been insufficiently addressed and executed. This study aims to developed a machine learning approach for preoperatively and precisely predicting the PIHI index values.

Methods

In this retrospective study, the valid features were collected from 23,305 adult surgical patients at Peking Union Medical College Hospital between 2012 and 2020. Three hemodynamic response sequences including systolic pressure, diastolic pressure and heart rate, were utilized to design the post-intubation hemodynamic instability (PIHI) index by computing the integrated coefficient of variation (ICV) values. Different types of machine learning models were constructed to predict the ICV values, leveraging preoperative patient information and initiatory drug infusion. The models were trained and cross-validated based on balanced data using the SMOTETomek technique, and their performance was evaluated according to the mean absolute error (MAE), root mean square error (RMSE), mean absolute percentage error (MAPE) and R-squared index (R2).

Results

The ICV values were proved to be consistent with the anesthetists’ ratings with Spearman correlation coefficient of 0.877 (P < 0.001), affirming its capability to effectively capture the PIHI variations. The extra tree regression model outperformed the other models in predicting the ICV values with the smallest MAE (0.0512, 95% CI: 0.0511–0.0513), RMSE (0.0792, 95% CI: 0.0790–0.0794), and MAPE (0.2086, 95% CI: 0.2077–0.2095) and the largest R2 (0.9047, 95% CI: 0.9043–0.9052). It was found that the features of age and preoperative hemodynamic status were the most important features for accurately predicting the ICV values.

Conclusions

Our results demonstrate the potential of the machine learning approach in predicting PIHI index values, thereby preoperatively informing anesthetists the possible anesthetic risk and enabling the implementation of individualized and precise anesthesia interventions.

Peer Review reports

Background

Anesthetic management is accompanied by high risks associated with perioperative complications [1]. In assessing these potential risks, there is growing concern regarding the preanesthetic evaluation, which is implemented via methods such as medical record analysis, patient interviews, physical examinations and preoperative tests [2]. Thus, designing effective and predictable evaluation indices to quantify the perioperative risk is of vital importance for improving the situational awareness [3] skills of the anesthetist and promoting safety management in anesthesiological practice [4] that relies on high levels of human performance [5].

Data-driven analysis and machine-learning models have become possible with increasingly structured data, such as preoperative electronic health/medical records [6]. Recently, various models have been developed to predict postoperative mortality [6, 7], postoperative pain [8], surgical acceptance [9], hypotension during anesthesia [10,11,12], surgery time [13, 14], intraoperative bradycardia [15], postoperative delirium [16, 17] and other risk outcomes [18,19,20]. Nevertheless, there is a paucity of endeavors dedicated to preoperative prediction of post-intubation hemodynamic instability (PIHI), despite its critical significance in enhancing the perioperative anesthesia quality and safety, as well as mitigating postoperative risks [21].

The patient’s hemodynamic variables exhibit rapid responsiveness in the operative procedure, thereby providing valuable feedback for anesthetists to make appropriate adjustments in subsequent treatment steps, such as drug administration. However, this is not applicable at the time of initial drug infusion, as there is no patient response yet. Following intubation, the patient is at higher risk for hemodynamic instability, and thus, the anesthetist’s empirical determination of the initial dose, based on preoperative influencing factors [22], is critical for ensuring anesthesia safety. Therefore, preoperatively and precisely evaluating the risk of hemodynamic instability to different drug dosages in diverse patient populations holds paramount reference value for anesthesiologists in the context of perioperative care management and closed-loop intravenous drug administration [23].

The aim of this study is to propose a post-intubation anesthetic risk index of hemodynamic instability and develop a machine-learning model for predicting it. To the best of our knowledge, no prior investigation has integrated perioperative hemodynamic parameters into a comprehensive index value. Moreover, with the help of the predictive ability of machine-learning models [24, 25], we hypothesized that we could develop a data-driven approach for preoperatively predicting the proposed index utilizing preoperative patient information and planned initial drug infusion data. The prediction performances of different models were compared, and the contribution of the variables were evaluated in the secondary analysis.

Materials and methods

Study sample and data description

This single center retrospective study was approved by the institutional ethics committee of Chinese Academy of Medical Sciences & Peking Union Medical College (I-23PJ746), with waivers for consent. All data underwent de-identification procedures for the purpose of preserving privacy and ensuring data confidentiality. The manuscript adheres to the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guidelines for observational studies.

The original dataset comprises data from a cohort of 56,083 adult (age ≥ 18 yr) surgical patients who underwent general anesthesia with intravenous induction and endotracheal intubation at Peking Union Medical College Hospital, collected over a span of 9 years from 2012 to 2020. Given our primary objective of establishing a general framework to evaluate post-intubation hemodynamic instability, independent of specific surgical categories, we selectively reserved the patients with American Society of Anesthesiologists (ASA) scores of 1 and 2. This inclusion criterion was employed to mitigate the disproportionate influence of severe complications of the patients with ASA score 3 or 4 on the model complexity and study outcomes. Furthermore, by excluding the outliers and the data with missing values for key variables, the dataset was refined to encompass a cohort of 23,305 patients, thereby ensuring data integrity and enhance the robustness of subsequent analysis. By leveraging the Electronic Health Record System (EHRS) and Anesthesia Information Management System (AIMS) of the hospital, we retrospectively obtained comprehensive data of the study samples including preoperative patient information, initial drug infusion details, and perioperative hemodynamic profiles.

Preoperative patient information

The differences in patient characteristics greatly influence the process of the operation and notably affects the patient anesthetic response. A fundamental tenet of feature selection involves prioritizing attributes derived from readily obtainable basic information during each surgical procedure. This approach ensures the applicability of the prediction framework across a wider spectrum of clinical scenarios. The study utilized basic patient information from the EHRS, including age (median = 47 years, range = 18–91 years), sex (0 = female, 1 = male, female proportion 67.39%), height (median = 164 cm, 25th percentile = 160 cm, 75th percentile = 170 cm), weight (median = 64 kg, 25th percentile = 56 kg, 75th percentile = 72 kg), body mass index (BMI) (median = 23.71 kg/m2, 25th percentile = 21.45 kg/m2, 75th percentile = 26.16 kg/m2) and hypertension (0 = no hypertension, 1 = hypertension, hypertension proportion 14.49%). These parameters were considered in the modeling process according to the empirical experience of anesthetist and data availability. Furthermore, the preoperative physiological characteristics of patients, including systolic pressure (SP), diastolic pressure (DP) and heart rate (HR), were retrospectively obtained from the first stable measurements recorded in the AIMS. The median (25th percentile, 75th percentile) values of these preoperative physiological characteristics were SP = 130 (117, 144) mmHg, DP = 78 (70, 86) mmHg, and HR = 78 (69, 88) BPM (beats per minute), respectively.

Initiatory drug infusion

The initiatory drug infusion, which is the primary pharmacological intervention undertaken by the anesthetist during surgery, constitutes a fundamental step towards achieving favorable anesthesia safety. The collection of initiatory drug infusion data included the dosages of four frequently utilized drugs in general anesthesia, specifically fentanyl, lidocaine, propofol, and rocuronium. The determination of drug dosages is based on the anesthetist’s evaluation of the patient’s clinical status, and is conventionally expressed as the amount of drug administered (in microgram or milligrams) per kilogram (kg) of the patient’s body weight. Table 1 displays the ranges of both absolute and converted drug dosages in the whole dataset, along with their corresponding means and standard deviations. In this study, the converted drug dosages of fentanyl, lidocaine, propofol, and rocuronium are considered as the input features of the model.

Table 1 Drug characteristics. The range and converted range of the drug doses are represented as (minimum, maximum). The median (25th percentile, 75th percentile) of the converted doses for different drugs are also provided

Integrated coefficient of variation

Hemodynamic profiles serve as the primary foundation for constructing the PIHI index. The systolic pressure, diastolic pressure and heart rate were measured six times in 1 minute during each operation and stored in the AIMS, comprising sequences of hemodynamic characteristics. As shown in Eq. 1, by describing the extent of variability concerning the mean value, the coefficient of variation (CV) value were applied to indicate the instability of a sequence. The inherent property of ratio values of the CV serves to eliminate the adverse effects of varying scales and dimensions in different types of sequences. To comprehensively integrate the CV value of different hemodynamic characteristics, we estimate their weights according to Eq. 2. The weight of each hemodynamic parameter is calculated based on the corresponding information entropy [26], which generates a smaller weight for a more homogeneous sequence set. In this study, the CV values captured from the three hemodynamic response sequences were merged to obtain the integrated coefficient of variation (ICV) value as the PIHI index, according to Eq. 3.

$${y}_{is}=\frac{\sqrt{\frac{1}{K}\sum \limits_{k=1}^K{\left({x}_{ik}^s-\frac{1}{K}\sum \limits_{k=1}^K{x}_{ik}^s\right)}^2}}{\frac{1}{K}\sum \limits_{k=1}^K{x}_{ik}^s}$$
(1)
$${w}_s=1+\frac{1}{\log (N)}\sum \limits_{i=1}^N\left[\left({y}_{is}/\sum \limits_{i=1}^N{y}_{is}\right)\times \log \left({y}_{is}/\sum \limits_{i=1}^N{y}_{is}\right)\right]$$
(2)
$${z}_i=\sum \limits_{s=1}^S\left({w}_s{y}_{is}/\sum \limits_{s=1}^S{w}_s\right)$$
(3)

where yis is the CV value of the s-th sequence of patient i; \({x}_{ik}^s\) is the k-th measured value in the s-th sequence of patient i; ws is the overall weight of the s-th sequence sets; zi is the ICV value of patient i; K represents the sequence length; N is the number of patients; and S is the number of hemodynamic characteristics, which in this paper is 3.

In general, a larger ICV value could indicate that the patient may be at higher risk of an unexpected PIHI. The original minimum and maximum values were determined to be 0.0055 and 0.6746, respectively. For convenience, the ICV values are normalized to values between 0 and 1, and proposed to be the anesthetic risk index of PIHI in this study. To depict the whole picture of the sequence around the intubation, 5 minutes of data (around 30 data points) before and after the intubation time are collected and utilized to calculate the ICV index. Figure 1 shows the sequences for different patients and the achieved ICV values. After intubation, patient 5 experienced significant unexpected changes in systolic pressure (Fig. 1A), diastolic pressure (Fig. 1B) and heart rate (Fig. 1C), while comparatively, patient 1 showed more stable anesthetic responses. Figure 1D illustrates a discernible contrast in the ICV value between patient 1, exhibiting a lower ICV value of 0.10, and patient 5, demonstrating a larger ICV value of 0.99 due to the unstable or sudden change of hemodynamic characteristics.

Fig. 1
figure 1

Sequences of hemodynamic characteristics around intubation and the calculated ICV values. Plots were generated by using the anesthesia monitoring data of 5 specific patients. In A, B and C, the sequences of the systolic pressure, diastolic pressure and heart rate of the 5 patients are presented with a time step of 10 seconds. The time of intubation is plotted as a dotted line to distinguish the pre- and postintubation periods. The calculated ICV values of the patients, as the anesthetic risk index of PIHI, are provided in D. BPM beats per minute, ICV integrated coefficient of variance

Statistical analysis

A correlation analysis was conducted on the various features of the original data, and their Pearson correlation coefficients are depicted in Fig. 2. It is evident that there exists a certain degree of correlation among the variables, with weight exhibiting the strongest negative correlation with rocuronium, yielding a correlation coefficient of − 0.582 (p < 0.01). Additionally, weight demonstrates the highest positive correlation with BMI, with a correlation coefficient of 0.862 (p < 0.01). The proposed ICV index demonstrates a moderate correlation with other variables. Specifically, it displays a relatively higher correlation with age, indicated by a significant Pearson correlation coefficient of 0.204 (p < 0.01).

Fig. 2
figure 2

Pearson correlation coefficients among the variables. Different colors are assigned to symbolize the diverse coefficient values. Fentanyl, Lidocaine, Propofol, and Rocuronium represent the converted doses of drugs at initial infusion. PreSP preoperative systolic pressure, PreDP preoperative diastolic pressure, PreHR preoperative heart rate, BMI body mass index

Model architecture

To deeply understand the nonlinear impact of different variables on post-intubation hemodynamic instability, a machine learning framework was constructed for predicting ICV. The model architecture is given in Fig. 3. The input features of the prediction models consist of preoperative patient information (age, sex, height, weight, BMI, hypertension, preoperative systolic pressure, preoperative diastolic pressure, and preoperative heart rate) and the converted doses for different drugs in initiatory infusion (fentanyl, lidocaine, propofol, and rocuronium), which are extracted from the EHRS and AIMS. The PIHI index, that is the ICV value of the patient, is taken as the output feature in the models. We choose the possible variables that may affect the post-intubation hemodynamic stability according to the collective experience of anesthetist and data availability. For example, the preoperative volemia assessment results are not always available in our dataset and can hardly be applied in the machine-learning training process for building up a more general model. Meanwhile, other comorbidities except for hypertension are not taken into account. Because chronic high blood pressure can lead to increased collagen production in the arterial wall [27], resulting in arterial rigidity and decreased vascular elasticity, which may influence the hemodynamic instability. Even though the hypertension may overlap with preoperative systolic pressure to some extent, the machine learning methods with more complex architecture were recommended to better mitigate multicollinearity [28].

Fig. 3
figure 3

Model architecture with preoperative patient information, perioperative initiatory drug infusion and ICV. EHRS electronic health record system, BMI body mass index, AIMS anesthesia information management system, MLR multiple linear regression, SVR support vector regression, ETR extra tree regression, MLP multilayer perceptron, XGBoost extreme gradient boosting, SMOTETomek synthetic minority over-sampling technique with Tomek links identification, ICV integrated coefficient of variance

Furthermore, it should be noted that the original dataset is insufficient to facilitate comprehensive model training, owing to the data imbalance pertaining to the predicted ICV value. In light of this, balancing the data distribution could be helpful, e.g., by using oversampling and undersampling methods. Given the certified and widely used techniques for addressing class imbalance, the continuous ICV values are first converted to a typical class imbalance problem. The ICV values are classified into 5 classes, with ranges of [0, 0.2), [0.2, 0.4), [0.4, 0.6), [0.6, 0.8) and [0.8, 1.0]; this establishes a class imbalance in the dataset, with class 1 (10.08%), class 3 (30.29%), class 4 (3.32%) and class 5 (0.16%) as minority classes and class 2 (56.15%) as the majority class. The SMOTETomek (synthetic minority over-sampling technique with Tomek links identification) technique [29] is then utilized to solve this class imbalance problem; SMOTE creates new minority class data by interpolating the adjacent original data in the minority class, and Tomek is used to identify and remove noisy or borderline samples caused by the creation of these new data.

Following the resolution of the imbalanced data issue, the extended dataset was randomly divided into training and testing sets in a 9:1 ratio. The statistics of the input features in the training and testing sets are given in Table 2. We found that the training and testing sets exhibited a comparable distribution, with no significant differences. Ten-fold cross-validation was performed with the training dataset to ensure the robustness of the prediction models by assessing the model performance with averaged evaluation indices over different fold partitions.

Table 2 The statistics of the input features in the training and testing sets

Machine learning models

Five typical machine learning models were created to predict the ICV value using multiple linear regression (MLR), support vector regression (SVR), extra tree regression (ETR), multilayer perceptron neural network (MLP), and eXtreme Gradient Boosting (XGBoost) regression methods. MLR was used as a baseline method, as it establishes a linear relationship between the input features and output features. The SVR method produces a classical model that provides a nonlinear solution by mapping input features into a higher-dimensional feature space [30]. As the most commonly used mapping kernel, the radial basis function (RBF) was used in this work to establish the SVR model. ETR [31] is an extension of random forest regression and has been shown to be more reliable than random forest regression in terms of resisting overfitting [32]. MLP is a typical artificial neural network with multiple hidden layers and neuron units and is mainly trained by the backpropagation algorithm [33]. The network parameters were adjusted and updated through iterative computation of their gradients, that is, partial derivative calculations. Finally, XGBoost is a method that operates under the gradient boosting framework [34]. Its decision tree ensembles are composed of sequentially additive trees that learn the residual errors of predictions. All these models were implemented using the mature packages in Python 3.7 [35, 36].

Model hyperparameters

Grid search and cross-validation were adopted to determine the model hyperparameters. The SVR model was trained with RBF kernel coefficient of 100 and a penalty parameter of 1.5 (L2 penalty) to ensure robust model regularization. The optimal ETR model was trained with 890 estimators and without assigning a maximum tree depth. Two hidden layers constructed by 66 and 65 neuron units were set for the MLP model with the Relu activation function. The learning rate and momentum coefficient for the MLP model were 0.001 and 0.7, respectively. The optimal hyperparameters of the XGBoost model were identified with 648 estimators, a maximum tree depth of 16 and a minimum child weight of 9.

Performance metrics

The mean absolute error (MAE), root mean square error (RMSE), mean absolute percentage error (MAPE) and R-squared index (R2) were used to evaluate the prediction performance of the different models based on Eqs. 47.

$$MAE=\frac{1}{n}\sum \limits_{i=1}^n\left|{\hat{y}}_i-{y}_i\right|$$
(4)
$$RMSE=\sqrt{\frac{1}{n}\sum \limits_{i=1}^n{\left({\hat{y}}_i-{y}_i\right)}^2}$$
(5)
$$MAPE=\frac{100\%}{n}\sum \limits_{i=1}^n\left|\frac{{\hat{y}}_i-{y}_i}{y_i}\right|$$
(6)
$${R}^2=1-\frac{\sum \limits_{i=1}^n{\left({\hat{y}}_i-{y}_i\right)}^2}{\sum \limits_{i=1}^n{\left(\overline{y}-{y}_i\right)}^2}$$
(7)

in which \({\hat{y}}_i\) and yi are the predicted and observed ICV values of the anesthetic responses, respectively, of patient i; n is the number of patients; and \(\overline{y}\) is the average ICV value of the anesthetic responses.

Results

Feature difference analysis

To assess the differences in feature values associated with post-intubation hemodynamic instability (PIHI), the ICV values were categorized into 5 classes, with ranges of [0, 0.2), [0.2, 0.4), [0.4, 0.6), [0.6, 0.8) and [0.8, 1.0]. Utilizing ANOVA analysis, as detailed in Table 3, it was observed that none of the features satisfied the assumption of equal variances based on Levene’s test. Consequently, the Welch variance and Brown-Forsythe variance were employed for further analysis. Notably, the features of age, height, BMI, hypertension, preoperative systolic pressure, preoperative diastolic pressure, preoperative heart rate, as well as the initial doses of propofol and rocuronium exhibited significant differences (P < 0.05) across the various ICV ranges. Refer to Fig. 4 for a graphical representation of the statistical distribution of continuous features within the different ICV ranges.

Table 3 The feature difference analysis for different ICV ranges
Fig. 4
figure 4

The statistics of the continuous features within the different ICV ranges. Fentanyl, Lidocaine, Propofol, and Rocuronium represent the converted doses of drugs at initial infusion. PreSP preoperative systolic pressure, PreDP preoperative diastolic pressure, PreHR preoperative heart rate, BMI body mass index, BPM beats per minute, SD standard deviation

Evaluation of ICV effectiveness

The weights of the hemodynamic response sequences are achieved for designing objective and reasonable structure of the ICV. As mentioned above, three hemodynamic response sequences, including systolic pressure (SP) sequence, diastolic pressure (DP) sequence and heart rate (HR) sequence, were utilized to describe the hemodynamic instability after intubation. To calculate the ICV value, the weight of each hemodynamic response sequence is obtained by measuring variable homogeneity according to Eq. 2. After normalization, the weights of the SP, DP and HR sequences were 0.342, 0.355 and 0.303, respectively. This result helps to take a look into the data-driven structure of the ICV. For the cases in this study, the variability extent of SP, DP and HR fluctuation among the populations are basically at the same level, while with DP higher (0.355), SP moderate (0.342) and HR lower (0.303), indicating their significances in constructing the risk index. The weights may exhibit slight variability in the event of alterations to the distributions of SP, DP, and HR values across populations.

In order to validate the effectiveness of the proposed ICV value in quantifying hemodynamic instability, we randomly selected 30 data samples of the hemodynamic response sequences for a questionnaire. Twenty-five expert anesthetists were asked to rate the plotted samples of the sequences on a scale ranging from level 1 (indicating stability) to level 5 (indicating instability). The average ratings provided by the anesthetists and the computed ICA values are depicted in Fig. 5. It is found that the ICA values are consistent with the human ratings, with Spearman correlation coefficient of 0.877 (P < 0.001), affirming the indicator’s capability to accurately capture the level of instability.

Fig. 5
figure 5

The average ratings of anesthetists and the computed ICA values. The ICA values are consistent with the human ratings, with Spearman correlation coefficient of 0.877 (P < 0.001). Rs Spearman correlation coefficient, ICV integrated coefficient of variance

Machine learning model performance

The prediction performance of different machine learning models was evaluated for picking out the most fitted one. Using appropriate models, the ICV values which reflect the post-intubation anesthetic risk of hemodynamic instability, could be predicted accordingly. Table 4 shows a detailed comparison of the performance of the different models on the whole testing datasets according to the MAE, RMSE, MAPE and R2 with their corresponding confidence intervals (CIs). It is noted that the ETR and XGBoost models follow the observed data better than the MLR, SVR and MLP models. The ETR model achieved the smallest MAE (0.0512, 95% CI 0.0511–0.0513), RMSE (0.0792, 95% CI 0.0790–0.0794), and MAPE (0.2086, 95% CI 0.2077–0.2095) and the largest R2 (0.9047, 95% CI 0.9043–0.9052). In other words, the ETR model outperformed the other models, achieving higher accuracy and reliability.

Table 4 Performance of the machine learning models on the testing datasets

To clearly distinguish the hemodynamic instability and provide simplified instruction to anesthetist, we further classify the ICV values using 0.3 as threshold into stable and instable classes (stable: [0.0, 0.3); instable: [0.3, 1.0]). Figure 6 shows the receiver operating characteristic (ROC) curve of the ETR prediction model employed for the classification of hemodynamic instability. The achieved area under curve (AUC) value of ROC reaches 0.958 (95% CI 0.954–0.962), indicating a substantial improvement over random guess. Notably, the model exhibited impressive performance both in prediction and classification, satisfying the requirement of heightened accuracy for decreasing anesthetic risk involved.

Fig. 6
figure 6

ROC curve of ETR prediction model for classification on ICV values. The AUC value exceeded 0.95, indicating a substantial improvement over random guess. Larger AUC value close to 1 represent better classification performance. ROC receiver operating characteristic, AUC area under curve

Feature importance

The model was further interpreted to get insight on the feature importance, which could tell how the considered input features affect the anesthetic risk. Based on the computation results of ETR model, the Gini importance of the different features can be extracted. Figure 7 shows the feature importance of the ETR model, with higher value indicating more important feature; according to the figure, age, preoperative systolic pressure, preoperative diastolic pressure and preoperative heart rate are more important for accurately predicting the post-intubation anesthetic risk index of hemodynamic instability, with normalized importance value of 0.1524, 0.1145, 0.0962 and 0.0952 respectively.

Fig. 7
figure 7

Feature importance of 13 features in the ETR model. The importance values of the features sum to 1 with higher value indicating more important feature. The error bars represent the double standard deviation of the importance values obtained from 10-fold cross-validation. Fentanyl, Lidocaine, Propofol, and Rocuronium represent the converted doses of drugs at initial infusion. PreSP preoperative systolic pressure, PreDP preoperative diastolic pressure, PreHR preoperative heart rate, BMI body mass index

Discussion

In this study, we proposed the ICV index as a means of quantifying the patient post-intubation hemodynamic instability (PIHI). The inner structure of the ICV was objectively designed through analyzing the hemodynamic instability on systolic pressure, diastolic pressure and heart rate sequences. Furthermore, we conducted a comparative analysis of various types of machine-learning models to predict the ICV index. The ETR prediction model performed best on the given dataset and was proved to be effective in accuracy, reliability and interpretability.

This retrospective study makes two primary contributions that address the existing gaps in the domain of perioperative anesthesia quality and safety. One contribution is the adoption of the ICV as a PIHI index by merging the various hemodynamic changes after intubation. In previous literatures, most of the attentions were focused on the postoperative outcomes [6,7,8, 13, 16,17,18, 20] and perioperative binary judgement on specific symptom using classification models [10, 11, 15, 19]. However, few studies have attempted to quantify the post-intubation anesthetic risk with continuous value and build up the prediction model. Traditionally, the assessment of potential post-intubation risk relies upon the clinical expertise of the anesthetist. The ICV index, which ranges from 0 to 1, provides an opportunity to quantify how a patient physiologically reacts to the intubation under general anesthesia with a specific initial drug infusion. In other words, it could quantitatively and properly describe the extent to which the patient hemodynamic status could be changed during the operation, thereby providing the perioperative anesthetic risk reference.

Another contribution of this study regards the construction of the predictive models and knowledge discovery of the complex anesthetic response system. Machine learning models typically involve a ‘black box’ problem, in which the model performs its function without providing insight into the decision-making process [37]. Additionally, a pressing problem for anesthetists is understanding how the preoperative patient status could affect the perioperative quality [38] and the role of the different drugs in accomplishing their perioperative goals [39]. The knowledge on different importance degrees of the model features implies what factors should be greatly concerned in what order of priority to prevent possible anesthetic risk from data-driven empirical perspective. From this perspective, the ensemble learning methods (ETR and XGBoost) not only presented better prediction accuracy than SVR and MLP but could also help to interpret the variable importance, helping to shed a light on understanding the ‘black box’. As discovered in this study, the feature of age presents evident importance for estimating the patient PIHI; the features of preoperative hemodynamic characteristics, including SP, DP and HR, also show considerable impacts. The availability of other preoperative parameters throughout the dataset would be beneficial to construct a more specific model and achieve a more comprehensive parameter importance evaluation in future works. In the generally available measures, the preoperative physiological status of patient can be deemed as partially controllable characteristic and could be readjusted accordingly with the appropriate instruction; comparatively, as the fully controllable feature, the drug dosage of initial infusion manifests appreciable importance and can be properly manipulated by anesthetists. The medicinal effectiveness of propofol and lidocaine on attenuating hemodynamic responses to intubation/extubation have been noted in the previous researches [40,41,42,43]. Other drugs’ influence could be further considered with the current model architecture on an extended dataset.

Furthermore, the study also provides insights on the development of individualized and precise anesthetic treatments. The proper manual evaluation of a professional anesthetist is highly dependent on his/her empirical experience and the comprehensiveness of the given information. From the perspective of artificial intelligence, the proposed method is learning the rich empirical experience of many anesthetists from the readily obtainable basic information of a large number of surgeries. Comparatively, one anesthetist’s experience is limited, which may compromise the evaluation. An erroneous decision has the potential to elicit abrupt and severe hemodynamic changes in the patient, necessitating repeated interventions and resulting in an unstable physiological workload. Facing to each individual patient with unique characteristics, the expertise of a lone anesthetist is inherently constrained, thereby potentially compromising the evaluation process and leading to unfavorable treatment outcomes. To address this concern, the proposed machine-learning model could help anesthetist accurately comprehending the possible perioperative anesthetic risk and taking rational treatment accordingly.

In practical, the constructed machine-learning model could be applied before an operation to verify whether the assumed initial infused drugs are appropriate for the current patient. The prediction could greatly help anesthetists manage the perioperative anesthesia quality and safety right from the start. Additionally, better perioperative hemodynamic profiles may influence postoperative pain [44] and adverse outcomes [45] over the long term, so future studies could investigate the subsequent impacts. For example, the proposed index could be further utilized to predict major adverse cardiovascular events (MACEs) with estimated metabolic equivalents (METs) [46]. The model could also be used to train inexperienced anesthetists, since the wide variability in the characteristics of the patients and the anesthetic drugs doses increase the difficulty of assessing the post-intubation hemodynamic instability. It would be meaningful to fully utilize historical big data to guide inexperienced anesthetists in a substantially shorter duration of time, which is one of the original intentions of this retrospective research. Furthermore, it would be worthwhile to pursue further studies on individualized medication in closed-loop intravenous drug administration by analyzing the heterogeneous hemodynamic responses observed during subsequent infusions. Follow-up studies could also consider the potential impact of patient’s higher ASA scores (i.e., 3 or 4) and the varying levels of expertise among anesthetists on the perioperative anesthesia quality and safety.

Conclusion

In conclusion, our results suggest that machine learning models are able to preoperatively and precisely predict the post-intubation hemodynamic instability based on the preoperative patient information and planned initial drug infusion data. The ETR model performs best and indicates the varied influence of the different variables on predicting ICV values. Further investigations are warranted to explore more influencing factors in the model, assess the subsequent impacts of the extended models, evaluate its training performance for inexperienced anesthetists and examine the model feasibility in practical implementation of closed-loop intravenous drug administration.

Availability of data and materials

The datasets used and/or analysed during the current study available from the corresponding author on reasonable request.

Abbreviations

AIMS:

Anesthesia information management system

ASA:

American Society of Anesthesiologists

AUC:

Area under curve

BMI:

Body mass index

BPM:

Beats per minute

CI:

Confidence interval

CV:

Coefficient of variation

DP:

Diastolic pressure

EHRS:

Electronic health record system

ETR:

Extra tree regression

HR:

Heart rate

ICV:

Integrated coefficient of variation

MAE:

Mean absolute error

MAPE:

Mean absolute percentage error

MLP:

Multilayer perceptron neural network

MLR:

Multiple linear regression

RMSE:

Root mean square error

ROC:

Receiver operating characteristic

R2 :

R-squared index

SMOTETomek:

Synthetic minority over-sampling technique with Tomek links identification

SP:

Systolic pressure

STROBE:

Strengthening the reporting of observational studies in epidemiology

SVR:

Support vector regression

XGBoost:

Extreme gradient boosting

References

  1. Yoon U, Topper J, Elia E, Goldhammer J. Preoperative evaluation and anesthetic management of patients with liver cirrhosis undergoing cardiac surgery. J Cardiothor Vascular Anesthes. 2022;36:1429–48. https://doi.org/10.1053/j.jvca.2020.08.022.

    Article  Google Scholar 

  2. Prabhu M, Valchanov K. Pre-anaesthetic evaluation of the patient with end-stage lung disease. Best Pract Res Clin Anaesthesiol. 2017;31:249–60. https://doi.org/10.1016/j.bpa.2017.05.002.

    Article  PubMed  Google Scholar 

  3. Fioratou E, Flin R, Glavin R, Patey R. Beyond monitoring: distributed situation awareness in anaesthesia. Br J Anaesth. 2010;105:83–90. https://doi.org/10.1093/bja/aeq137.

    Article  CAS  PubMed  Google Scholar 

  4. Patriarca R, Di Gravio G, Costantino F, Fedele L, Tronci M, Bianchi V, et al. Systemic safety management in anesthesiological practices. Saf Sci. 2019;120:850–64. https://doi.org/10.1016/j.ssci.2019.08.021.

    Article  Google Scholar 

  5. Kelly FE, Frerk C, Bailey CR, Cook TM, Ferguson K, Flin R, et al. Human factors in anaesthesia: a narrative review. Anaesthesia. 2023;78(4):479–90. https://doi.org/10.1111/anae.15920.

    Article  CAS  PubMed  Google Scholar 

  6. Hill LB, Brown R, Gabel E, Rakocz N, Lee C, Cannesson M, et al. An automated machine learning-based model predicts postoperative mortality using readily-extractable preoperative electronic health record data. Br J Anaesth. 2019;123:877–86. https://doi.org/10.1016/j.bja.2019.07.030.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Fritz BA, Cui Z, Zhang M, He Y, Chen Y, Kronzer A, et al. Deep-learning model for predicting 30-day postoperative mortality. Br J Anaesth. 2019;123:688–95. https://doi.org/10.1016/j.bja.2019.07.025.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Morisson L, Nadeau-Vallée M, Espitalier F, et al. Prediction of acute postoperative pain based on intraoperative nociception level (NOL) index values: the impact of machine learning-based analysis. J Clin Monit Comput. 2023;37:337–44. https://doi.org/10.1007/s10877-022-00897-z.

    Article  PubMed  Google Scholar 

  9. Sobrie O, Lazouni ME, Mahmoudi S, Mousseau V, Pirlot M. A new decision support model for preanesthetic evaluation. Comput Methods Prog Biomed. 2016;133:183–93. https://doi.org/10.1016/j.cmpb.2016.05.021.

    Article  Google Scholar 

  10. Mizota T, Matsukawa S, Fukagawa H, Daijo H, Tanaka T, Chen F, et al. Preoperative hypercapnia as a predictor of hypotension during anesthetic induction in lung transplant recipients. J Cardiothorac Vasc Anesth. 2015;29:967–71. https://doi.org/10.1053/j.jvca.2014.10.027.

    Article  PubMed  Google Scholar 

  11. Zhou CM, Xue Q, Liu P, Duan W, Wang Y, Tong J, et al. Construction of a predictive model of post-intubation hypotension in critically ill patients using multiple machine learning classifiers. J Clin Anesth. 2021;72:110279. https://doi.org/10.1016/j.jclinane.2021.110279.

    Article  PubMed  Google Scholar 

  12. Zhang G, Yuan J, Yu M, Wu T, Luo X, Chen F. A machine learning method for acute hypotensive episodes prediction using only non-invasive parameters. Comput Methods Prog Biomed. 2021;200:105845. https://doi.org/10.1016/j.cmpb.2020.105845.

    Article  Google Scholar 

  13. Bartek MA, Saxena RC, Solomon S, Fong CT, Behara LD, Venigandla R, et al. Improving operating room efficiency: a machine learning approach to predict case-time duration. J Am Coll Surg. 2019;229:346–54. https://doi.org/10.1016/j.jamcollsurg.2018.07.317.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Gabriel RA, Harjai B, Simpson S, Goldhaber N, Curran BP, Waterman RS. Machine learning-based models predicting outpatient surgery end time and recovery room discharge at an ambulatory surgery center. Anesth Analg. 2022;135(1):159–69. https://doi.org/10.1213/ANE.0000000000006015.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Solomon SC, Saxena RC, Neradilek MB, et al. Forecasting a crisis: machine-learning models predict occurrence of intraoperative bradycardia associated with hypotension. Anesth Analg. 2020;130(5):1201–10. https://doi.org/10.1213/ANE.0000000000004636.

    Article  PubMed  Google Scholar 

  16. Menzenbach J, Guttenthaler V, Kirfel A, Ricchiuto A, Neumann C, Adler L, et al. Estimating patients' risk for postoperative delirium from preoperative routine data - trial design of the PRe-operative prediction of postoperative DElirium by appropriate SCreening (PROPDESC) study - a monocentre prospective observational trial. Contemp Clin Trials Commun. 2020;17:100501. https://doi.org/10.1016/j.conctc.2019.100501.

    Article  PubMed  Google Scholar 

  17. Herman JA, Urman RD, Urits I, Kaye AD, Viswanath O. A prediction model for delirium after cardiac surgery: another step towards prevention? J Clin Anesth. 2021;79:110238. https://doi.org/10.1016/j.jclinane.2021.110238.

    Article  PubMed  Google Scholar 

  18. Tomlinson JH, Moonesinghe SR. Risk assessment in anaesthesia. Anaesthes Intens Care Med. 2016;17:486–91. https://doi.org/10.1016/j.mpaic.2016.07.003.

    Article  Google Scholar 

  19. Zhou CM, Xue Q, Ye HT, Wang Y, Tong J, Ji MH, et al. Constructing a prediction model for difficult intubation of obese patients based on machine learning. J Clin Anesth. 2021;72:110278. https://doi.org/10.1016/j.jclinane.2021.110278.

    Article  PubMed  Google Scholar 

  20. Kiyatkin ME, Aasman B, Fazzari MJ, Rudolph MI, Melo MFV, Eikermann M, et al. Development of an automated, general-purpose prediction tool for postoperative respiratory failure using machine learning: a retrospective cohort study. J Clin Anesth. 2023;90:111194. https://doi.org/10.1016/j.jclinane.2023.111194.

    Article  PubMed  Google Scholar 

  21. Aronson S, Dyke CM, Levy JH, et al. Does perioperative systolic blood pressure variability predict mortality after cardiac surgery? An exploratory analysis of the ECLIPSE trials. Anesth Analg. 2011;113(1):19–30. https://doi.org/10.1213/ANE.0b013e31820f9231.

    Article  PubMed  Google Scholar 

  22. Qu GZ, Wu H, Hartrick CT, Niu JW. Local analgesia adverse effects prediction using multi-label classification. Neurocomput. 2012;92:18–27. https://doi.org/10.1016/j.neucom.2011.08.038.

    Article  Google Scholar 

  23. Coeckelenbergh S, Joosten A, Cannesson M, et al. Closing the loop: automation in anesthesiology is coming. J Clin Monit Comput. 2023; https://doi.org/10.1007/s10877-023-01077-3.

  24. Gambus PL, Jaramillo S. Machine learning in anesthesia: reactive, proactive…predictive! Br J Anaesth. 2019;123:401–3. https://doi.org/10.1016/j.bja.2019.07.009.

    Article  PubMed  Google Scholar 

  25. van der Ven WH, Terwindt LE, Risvanoglu N, et al. Performance of a machine-learning algorithm to predict hypotension in mechanically ventilated patients with COVID-19 admitted to the intensive care unit: a cohort study. J Clin Monit Comput. 2022;36:1397–405. https://doi.org/10.1007/s10877-021-00778-x.

    Article  PubMed  Google Scholar 

  26. Shannon CE. A mathematical theory of communication. Bell Syst Tech J. 1948;27:379–423. https://doi.org/10.1002/j.1538-7305.1948.tb01338.x.

    Article  Google Scholar 

  27. Renna NF, de Las HN, Miatello RM. Pathophysiology of vascular remodeling in hypertension. Int J Hypertens. 2013;2013:808353. https://doi.org/10.1155/2013/808353.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Chan JY-L, Leow SMH, Bea KT, Cheng WK, Phoong SW, Hong Z-W, et al. Mitigating the multicollinearity problem and its machine learning approach: a review. Mathematics. 2022;10(8):1283. https://doi.org/10.3390/math10081283.

    Article  Google Scholar 

  29. Batista GEAPA, Bazzan ALC, Monard MC, Balancing training data for automated annotation of keywords: a case study. Proceedings of the 2nd Brazilian Workshop on Bioinformatics. 2003; 10–18. https://www.inf.ufrgs.br/maslab/pergamus/pubs/balancing-training-data-for.pdf.

  30. Hasibuan D, Jaya IK, Rumahorbo B, Naibaho J, Napitupulu J, Rajagukguk E. Time series financial market forecasting based on support vector regression algorithm. Int Conf Comput Sci Informa Technol. 2019:1–4. https://doi.org/10.1109/ICoSNIKOM48755.2019.9111533.

  31. Geurts P, Ernst D, Wehenkel L. Extremely randomized trees. Mach Learn. 2006;63:3–42. https://doi.org/10.1007/s10994-006-6226-1.

    Article  Google Scholar 

  32. John V, Liu Z, Guo C, Mita S, Kidono K. Real-time lane estimation using deep features and extra trees regression. Image Video Technol. 2016;9431:721–33. https://doi.org/10.1007/978-3-319-29451-3_57.

    Article  Google Scholar 

  33. Rumelhart DE, McClelland JL. Parallel distributed processing: explorations in the microstructure of cognition. MIT Press; 1986.

    Book  Google Scholar 

  34. Friedman JH. Greedy function approximation: a gradient boosting machine. Ann Stat. 2001;29:1189–232. https://www.jstor.org/stable/2699986

    Article  Google Scholar 

  35. Chen T, Guestrin C. XGBoost: a scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. San Francisco, CA 2016; 785–94 https://doi.org/10.1145/2939672.2939785.

  36. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: machine learning in Python. J Mach Learn Res. 2011;12:2825–30. https://doi.org/10.48550/arXiv.1201.0490.

    Article  Google Scholar 

  37. Ven der Ven WH, Veelo DP, Wijnberge M, Van der Ster BJP, Vlaar APJ, Geerts BF. One of the first validations of an artificial intelligence algorithm for clinical use: the impact on intraoperative hypotension prediction and clinical decision-making. Surgery. 2021;169:1300–3. https://doi.org/10.1016/j.surg.2020.09.041.

    Article  Google Scholar 

  38. Sanders RD, Hughes F, Shaw A, Thompson A, Bader A, Hoeft A, et al. Perioperative quality initiative consensus statement on preoperative blood pressure, risk and outcomes for elective surgery. Br J Anaesth. 2019;122:552–62. https://doi.org/10.1016/j.bja.2019.01.018.

    Article  PubMed  Google Scholar 

  39. Joshi GP. General anesthetic techniques for enhanced recovery after surgery: current controversies. Best Pract Res Clin Anaesthesiol. 2021;35:531–41. https://doi.org/10.1016/j.bpa.2020.08.009.

    Article  PubMed  Google Scholar 

  40. Kawasaki S, Kiyohara C, Tokunaga S, Hoka S. Prediction of hemodynamic fluctuations after induction of general anesthesia using propofol in non-cardiac surgery: a retrospective cohort study. BMC Anesthesiol. 2018;18:167. https://doi.org/10.1186/s12871-018-0633-2.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Ferrier DC, Kiely J, Luxton R. Propofol detection for monitoring of intravenous anaesthesia: a review. J Clin Monit Comput. 2022;36(2):315–23. https://doi.org/10.1007/s10877-021-00738-5.

    Article  PubMed  Google Scholar 

  42. Bidwai AV, Bidwai VA, Rogers CR, Stanley TH. Blood pressure and pulse rate responses to endotracheal extubation with and without prior injection of lidocaine. Anesthesiol. 1979;51(2):171–3. https://doi.org/10.1097/00000542-197908000-00020.

    Article  CAS  Google Scholar 

  43. Wilson IG, Meiklejohn BH, Smith G. Intravenous lignocaine and sympathoadrenal responses to laryngoscopy and intubation. The effect of varying time of injection. Anaesthesia. 1991;46(3):177–80. https://doi.org/10.1111/j.1365-2044.1991.tb09403.x.

    Article  CAS  PubMed  Google Scholar 

  44. Turksal E, Alper I, Sergin D, Yuksel E, Ulukaya S. The effects of preoperative anxiety on anesthetic recovery and postoperative pain in patients undergoing donor nephrectomy. Brazilian J Anesthesiol. 2020;70:271–7. https://doi.org/10.1016/j.bjane.2020.06.004.

    Article  Google Scholar 

  45. Zeng J, Zheng GQ, Li YL, Yang YY. Preoperative pulse pressure and adverse postoperative outcomes: a meta-analysis. J Cardiothorac Vasc Anesth. 2020;34:624–31. https://doi.org/10.1053/j.jvca.2019.09.036.

    Article  PubMed  Google Scholar 

  46. Giovanna ALB, Eckhard M, Shaw A, Daniela L, Wojciech S, Stefan DH, et al. Risk assessment for major adverse cardiovascular events after noncardiac surgery using self-reported functional capacity: international prospective cohort study. Br J Anaesth. 2023;130(6):655–65. https://doi.org/10.1016/j.bja.2023.02.030.

    Article  Google Scholar 

Download references

Acknowledgements

Thanks to Dr. Le Shen, Dr. Lijian Pei, Wei Liu and Sumei Wang for their valuable insights on the research and kind support in the preparation of the paper.

Funding

This study was supported by the National Social Science Fund of China for Education Research, Youth Project “Research on big data reconstruction of anesthetic medical records and self-reliant simulative teaching of anesthesiologists for perioperative precise quality control” (Grant Number CCA220321).

Author information

Authors and Affiliations

Authors

Contributions

RT, BZ and XZ contributed to the study conceptualization and design. Data collection was performed by RT, XZ and YH. Model construction was performed by RT and GQ. The results were interpreted by RT, BZ and HM. The first draft of the manuscript was written by RT and all authors contributed to the manuscript revision and final version approval.

Corresponding author

Correspondence to Bo Zhu.

Ethics declarations

Ethical approval and informed consent

This study was approved by the institutional ethics committee of Chinese Academy of Medical Sciences & Peking Union Medical College (I-23PJ746), with waivers for consent.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Te, R., Zhu, B., Ma, H. et al. Machine learning approach for predicting post-intubation hemodynamic instability (PIHI) index values: towards enhanced perioperative anesthesia quality and safety. BMC Anesthesiol 24, 136 (2024). https://doi.org/10.1186/s12871-024-02523-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12871-024-02523-8

Keywords