Skip to main content

External validation of the HACOR score and ROX index for predicting treatment failure in patients with coronavirus disease 2019 pneumonia managed on high-flow nasal cannula therapy: a multicenter retrospective observational study in Japan

Abstract

Background

The HACOR score for predicting treatment failure includes vital signs and acid–base balance factors, whereas the ROX index only considers the respiratory rate, oxygen saturation, and fraction of inspired oxygen (FiO2). We aimed to externally validate the HACOR score and ROX index for predicting treatment failure in patients with coronavirus disease 2019 (COVID-19) on high-flow nasal cannula (HFNC) therapy in Japan.

Methods

This retrospective, observational, multicenter study included patients, aged ≥ 18 years, diagnosed with COVID-19 and treated with HFNC therapy between January 16, 2020, and March 31, 2022. The HACOR score and ROX index were calculated at 2, 6, 12, 24, and 48 h after stating HFNC therapy. The primary outcome was treatment failure (requirement for intubation or occurrence of death within 7 days). We calculated the area under the receiver operating characteristic curve (AUROC) and assessed the diagnostic performance of these indicators. The 2-h time-point prediction was considered the primary analysis and that of other time-points as the secondary analysis. We also assessed 2-h time-point sensitivity and specificity using previously reported cutoff values (HACOR score > 5, ROX index < 2.85).

Results

We analyzed 300 patients from 9 institutions (median age, 60 years; median SpO2/FiO2 ratio at the start of HFNC therapy, 121). Within 7 days of HFNC therapy, treatment failure occurred in 127 (42%) patients. The HACOR score and ROX index at the 2-h time-point exhibited AUROC discrimination values of 0.63 and 0.57 (P = 0.24), respectively. These values varied with temporal changes—0.58 and 0.62 at 6 h, 0.70 and 0.68 at 12 h, 0.68 and 0.69 at 24 h, and 0.75 and 0.75 at 48 h, respectively. The 2-h time-point sensitivity and specificity were 18% and 91% for the HACOR score, respectively, and 3% and 100% for the ROX index, respectively. Visual calibration assessment revealed well calibrated HACOR score, but not ROX index.

Conclusions

In COVID-19 patients receiving HFNC therapy in Japan, the predictive performance of the HACOR score and ROX index at the 2-h time-point may be inadequate. Furthermore, clinicians should be mindful of time-point scores owing to the variation of the models’ predictive performance with the time-point.

Trial registration UMIN (registration number: UMIN000050024, January 13, 2023)

Background

Patients diagnosed with severe coronavirus disease 2019 (COVID-19) pneumonia sometimes require intensive care unit (ICU) admission [1]. In such patients, a high-flow nasal cannula (HFNC) has often been needed before intubation [2]. Failure of HFNC therapy may result in delayed intubation and increased mortality [2]. Therefore, an early prediction of HFNC failure and determination of the appropriate timing of endotracheal intubation are important strategies for patient management.

The ROX index has been validated and widely used as a predictor of treatment failure (intubation) in patients with COVID-19 pneumonia treated with an HFNC [3]. This score is calculated using only three variables: respiratory rate, oxygen saturation (SpO2), and fraction of inspired oxygen (FiO2) [4, 5]. Other critical indicators, such as the level of consciousness, blood pressure, and acid–base balance, are also important parameters in determining the need for intubation. The HACOR score, a tool for predicting failure of noninvasive ventilation (NIV) and HFNC therapy, incorporates these additional factors [6, 7]. Previous studies have reported a high discriminatory performance of the HACOR score in patients with heart failure and acute respiratory failure [6, 7]. However, only one single center study by Valencia et al. [8] has externally validated the HACOR score in patients with COVID-19. Since the study was conducted at a single center, its generalizability was limited. Moreover, calibration assessment was not performed. Additionally, the decision-making process for treatment strategies during a pandemic is influenced by the medical setting, including the availability of limited resources such as ICU beds, mechanical ventilators, and healthcare workers [9, 10].

To enhance generalizability and transportability, we conducted a multicenter study to externally validate the HACOR score and ROX index for predicting treatment failure in patients with COVID-19 managed with HFNC therapy in Japan.

Methods

Design and setting

This retrospective, observational, multicenter study was conducted according to the Transparent Reporting of a Multivariate Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) guidelines for prediction model validation [11]. The Institutional Review Board of Tohoku University (2022-I-265) and that of each center reviewed and approved the study.

The study was registered in the University Hospital Medical Information Network Clinical Trials Registry (UMIN-CTR ID UMIN000050024). The requirement for informed consent from all participants enrolled in this study was waived by the ethics committee owing to the retrospective study design.

External validation cohort

We performed an external validation of two clinical prediction models, HACOR score and ROX index, using a multicenter, retrospective cohort study involving nine tertiary hospitals in Japan. These hospitals were the Yokohama Medical Center in Kanagawa, Tohoku University Hospital in Miyagi, National Defense Medical College Hospital in Saitama, Tokyo Metropolitan Hiroo Hospital in Tokyo, Saga University Hospital in Saga, Kochi Health Sciences Center in Kochi, Nagoya University Hospital in Aichi, Kameda Medical Center in Chiba, and Jichi Medical University Saitama Medical Center in Saitama. Additional information regarding the characteristics of these institutions can be found in Additional file 1: S1. The data were collected from January 16, 2020, to March 31, 2022, and during this period, the circulating strains of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in Japan were as follows: wild-type (January 2020 to February 2021); B.1.1.7 (alpha, March to April 2021): B.1.617.2 (delta, July to December 2021); and B.1.1.529 (omicron, January to June 2022) [12, 13].

Study population

We included patients aged ≥ 18 years, who were diagnosed with COVID-19 and treated with an HFNC for > 2 h. The inclusion criteria were as follows: (1) patients aged ≥ 18 years; (2) confirmed SARS-CoV-2 infection detected using the real-time reverse transcription-polymerase chain reaction-loop-mediated isothermal amplification method or antigen test; and (3) treatment with an HFNC for at least 2 h. Patients who met any of the following conditions were excluded: (1) received NIV before HFNC therapy, (2) had do-not-intubate orders, and (3) were already extubated.

Data collection

We utilized baseline data collected immediately before initiating HFNC therapy, which included patient characteristics such as age, sex, height, weight, comorbidities, and vital signs. Blood sampling data, Sequential Organ Failure Assessment (SOFA) scores, and Acute Physiology and Chronic Health Evaluation (APACHE) II scores were collected within the first 24 h before initiating HFNC therapy. Furthermore, we collected the data including those on vital signs, arterial blood gas, and oxygen device settings at multiple time-points after the initiation of HFNC therapy, specifically at 2, 6, 12, 24, and 48 h.

Outcome measurement

The outcome, treatment failure, was defined as either intubation or death within 7 days. In a previous study [8], treatment failure was defined as HFNC therapy failure, including the need for mechanical ventilation and death within 7 days [14]. According to the study design, the decision regarding intubation was at the discretion of the clinicians at each participating site.

Predictor variables

The HACOR score and ROX index were calculated at the 2, 6, 12, 24, and 48-h time-points following the commencement of HFNC therapy. The scores were measured for each of these time periods until the withdrawal of HFNC.

Statistical analyses

Baseline characteristics are presented as descriptive statistics including the median (interquartile range) for continuous variables and frequency (percentage) for categorical variables.

Sample size

According to the TRIPOD guidelines [11], we defined the research period from January 16, 2020, to March 31, 2022. For external validation, we determined a study sample size that included a minimum of 100 patients with the outcome of interest and 100 without the outcome. All patients were recruited during the specified period with the objective of collecting data from more than 200 patients. Owing to the uncertainty regarding the required sample size required for each facility to achieve this target, we extended the maximum period to the time when COVID-19 was first confirmed in Japan.

Validation of the models

We evaluated the performance of the HACOR score and ROX index at each time-point. To avoid multiple testing, we predetermined using that the primary analysis would focus on the prediction at the 2-h time point, while predictions at other time-points would be considered the secondary analyses. The 2-h time point prediction was chosen as the primary analysis because of the clinical importance of early prediction and its use in existing studies [7, 8]. To estimate the discrimination ability between patients with and without treatment failure, we calculated the area under the receiver operating characteristic curve (AUROC). We interpreted AUROC values greater than 0.9 as high, 0.7–0.9 as moderate, 0.5–0.7 as low, and 0.5 as a chance result [15]. Additionally, we evaluated the predictive performance, including sensitivity, specificity, and positive and negative predicted values, utilizing the threshold criteria established in previous studies for the HACOR score versus those for the ROX index as follows: > 5 versus < 2.85 at 2 h; > 5 versus < 3.47 at 6 h; and > 5 versus < 3.85 at 12 h, respectively [4, 7]. Because of limited evidence regarding the cutoff points at 24 and 48 h, we employed the threshold established at 12 h. To evaluate the calibration, we compared the observed proportion of treatment failures with predicted risks. Calibration graphs were visually assessed and considered to be well calibrated if the observed treatment failures increased as the risk of the group increased. In clinical practice, the HACOR score is utilized as a point-based scoring system, whereas the ROX index is not assigned specific numerical values but is instead assessed using a three-tiered risk categorization [16]. Consequently, the calculated values are employed for assessing the HACOR score, whereas the ROX index is evaluated based on its categorization. For the HACOR score, patients were stratified according to individual score groups. Concerning the ROX index, we categorized patients into three risk groups based on their ROX index as follows: high (ROX index < 3.85), intermediate (3.85 ≤ ROX index ≤ 4.88), and low (ROX index > 4.88) [4, 16].

Comparison of the two models

The HACOR score and ROX index were compared by comparing the AUROC using the Delong method [17].

Sensitivity analysis

We conducted a sensitivity analysis to ensure the robustness of the results of the primary and secondary outcome analyses. Given the subjective nature of the decision to intubate, we defined clinically important intubation as having met at least one of the following criteria: loss or impaired level of consciousness (a Glasgow Coma Scale score of < 8), hypotension (a systolic arterial blood pressure of < 90 mmHg or a mean arterial blood pressure of < 65 mmHg), a respiratory rate of > 40 /min, or hypoxia with an SpO2 of < 92% despite an FiO2 of 1.0 or a pH < 7.35, by blood gas analysis [6]. We performed the same analysis using clinically significant intubation or death within 7 days as the outcome. As part of a sensitivity analysis, we conducted the same analysis with intubation as the outcome. In addition, a sensitivity analysis was performed excluding facilities that used the ROX index as a treatment guide. Multiple imputations were performed for missing values using multiple imputations by chained equation with 50 iterations that generated 100 datasets for imputed missing values [18, 19]. Analyses were performed using R software, version 4.1.1 (The R Foundation for Statistical Computing, Vienna, Austria, https://www.R-project.org/).

Results

Patient characteristics

During the study period, 652 patients received HFNC therapy. Of these, 352 were excluded; ultimately, the data of 300 patients were analyzed (Fig. 1). Seventy-six percent (228/300) patients were male, and the median age was 60 (IQR, 51–70) years. The median number of days from onset to hospitalization was 7 (IQR 5–10) days. Notably, vaccines against COVID-19 were not widely available in Japan for more than half of the study period (vaccination against COVID-19 began on February 17, 2021), and 89% (245/276) of the patients were unvaccinated (Table 1). While the HACOR score did not affect treatment decisions at any of the facilities, the ROX index influenced treatment policies at two participating sites (Additional file 1: S1).

Fig. 1
figure 1

Flow diagram of the study participant selection process. To achieve this enrollment, we set the research period from January 16, 2020, to March 31, 2022. Upon collecting cases during this period, the actual enrollment reached 300 participants. DNI: do-not-intubate

Table 1 Baseline characteristics of patients with COVID-19 pneumonia treated with high-flow nasal cannula

Treatment failure

The time to intubation was 27 (IQR 13–37) h, and 42% (127/300) of the patients were intubated by day 7. The 7-day and in-hospital mortality were 2% (6/300) and 14% (43/300), respectively. A total of 42% (127/300) patients experienced treatment failure (Table 2), and 45% (136/300) patients either required endotracheal intubation or died within 28 days (Table 2).

Table 2 Intubation, length of stay and mortality associated with COVID-19 patients with high-flow nasal cannula

Predictive performance of the HACOR score and ROX index

The HACOR score and ROX index values at each time point are presented in Additional file 1: S2. In primary analysis, the discrimination of the HACOR score and ROX index at the 2-h time-point was indicated by AUROC values of 0.63 and 0.57 (P = 0.24), respectively. In the secondary analyses, the temporal changes in the discrimination of the HACOR score and the ROX index were 0.58 and 0.62 at 6 h (P = 0.045), 0.70 and 0.68 at 12 h (P = 0.37), 0.68 and 0.69 at 24 h (P = 0.63), and 0.75 and 0.75 at 48 h (P = 0.84), respectively (Table 3). The sensitivity and specificity at 2 h were 18% and 91% for the HACOR score and 3.9% and 100% for the ROX index, respectively (Table 4).

Table 3 The area under the receiver operating characteristic curve of each predictions
Table 4 Other discriminations of the HACOR score and ROX index

Calibration of the HACOR score is shown in Fig. 2. The figure displays an observable trend of increasing intubation rate with an increasing score, although 24% of the patients required intubation even when the HACOR score was 0 at 2 h. By contrast, Fig. 3 illustrates that the ROX index was not well calibrated because approximately 40%, 57%, and 53% of the patients in the low-, intermediate-, and high-risk groups, respectively, experienced treatment failure in 2 h.

Fig. 2
figure 2

Calibration of the HACOR score. Calibration of the HACOR score is shown; 27% of patients experienced treatment failure even when the HACOR score was 0 at 2 h. An increasing probability in treatment failure was observed with increasing scores. Treatment failure was defined as either intubation or death within 7 days

Fig. 3
figure 3

Calibration of the ROX index. We categorized patients into three risk groups based on their ROX index: high (ROX index < 3.85), intermediate (3.85 ≤ ROX index ≤ 4.88), and low (ROX index > 4.88). Calibration of the ROX index is illustrated; at 2 h, approximately 40%, 57%, and 53% of patients in the low-, intermediate-, and high-risk groups, respectively, experienced treatment failure, indicating that the ROX index was not well calibrated. Treatment failure was defined as either intubation or death within 7 days

Sensitivity analyses

Of the 42% (127/300) patients who were intubated within 7 days, 80 met the criteria for clinically significant intubation (Additional file 1: S3). When the outcome was clinically important intubation or death within 7 days, the discrimination of the HACOR score and ROX index at the 2-h time-point, the primary outcome, was indicated by AUROC values of 0.65 and 0.63, respectively (P = 0.24). The temporal changes in the discrimination of the HACOR score and ROX index, the secondary outcome, were 0.57 and 0.65 at 6 h (P = 0.045), 0.69 and 0.73 at 12 h (P = 0.37), 0.67 and 0.74 at 24 h (P = 0.63), and 0.73 and 0.76 at 48 h (P = 0.84), respectively (Additional file 1: S4). I In a sensitivity analysis using intubation as an outcome, as no deaths occurred within day 7, treatment failure within day 7 and intubation within day 7 had the same results (Tables 3 and 4). The sensitivity and specificity at 2 h were 18% and 91% for the HACOR score and 3.9% and 100% for the ROX index, respectively (Table 4). The sensitivity analysis results excluding facilities that used the ROX index as a treatment guide were in Additional file 1: S5 and S6, showing AUROCs at 2 h of 0.64 for the HACOR score and 0.61 for the ROX index.

Discussion

Summary of the key findings

Our study showed that in Japanese patients with COVID-19 on HFNC therapy, the HACOR score and ROX index had low discrimination and poor calibration at 2 h; however, their AUROC tended to increase over time. The predictive performance of the HACOR score and ROX index in Japanese patients with COVID-19 on HFNC therapy may be inadequate due to low discrimination and poor calibration. The novelty of our study is that in addition to the discriminative ability of the HACOR score and ROX index, we also evaluated their calibration ability. At 2 h, even among patients with an HACOR score of 0, approximately 25% were intubated, and in evaluating the ROX index, even in the low-risk patient group, approximately 40% were intubated. Calibration of the HACOR score may improve over time; however, no improvement was observed in the ROX index.

Discussion of the results in relation to previous findings

A previous study [20] has suggested the utility of the ROX index in predicting the failure of HFNC therapy in patients with COVID-19. According to a prior systematic review [3] that included eight cohort studies with 1301 patients, the ROX index showed moderate ability to discriminate between outcomes, with a summary AUROC of 0.81. However, the ability of the HACOR score to predict failure of HFNC therapy in patients with COVID-19 receiving HFNC has only been investigated in one study [8].

Valencia et al. [8] reported the discriminatory ability of the HACOR score for treatment failure after 2 h of HFNC therapy in patients with COVID-19. In their study, the HACOR score demonstrated an AUROC of 0.71. Our data’s poorer discriminatory ability could be explained by several reasons. First, there were differences in the intubation criteria, which are a part of the treatment failure criteria. Valencia et al. [8] evaluated the HACOR score and ROX index after 2 h of HFNC therapy, and if there was no improvement in the signs of muscle fatigue, SaO2 (> 90%), and PaO2/FiO2, endotracheal intubation was performed. The criteria used to determine intubation were similar to those for the HACOR score and the ROX index. When the index test and outcome were closely related, the AUROC tended to be higher. In our study, owing to the nature of the study design, there were no standardized intubation criteria. This may have influenced the lower discrimination power. In the sensitivity analysis, an improvement in the AUROC was observed when clinically important intubation criteria were used. Second, there is a possibility of diagnostic review bias due to the lack of blinding during the evaluation [21]. In prediction model studies, outcomes should ideally be assessed in a blinded manner, without prior knowledge of the predictors [22]. This approach prevents the predictors from influencing the outcome assessment, thereby preventing biased estimation in the association between predictors and outcomes. In the study by Valencia et al. [8], physicians were aware of the patients' HACOR score and ROX index when deciding to intubate. In such a setting, there is a potential for these scores to influence the decision-making process, leading to an overestimation of reported accuracy estimates [21]. In our study, we performed a sensitivity analysis by excluding facilities that used the ROX index as a treatment indicator. The results showed a trend toward improved discrimination over time, similar to the main analysis.

The lack of significant differences in the predictive performance of the HACOR score and ROX index at 2-h time point could be attributed to the specific nature of COVID-19. In COVID-19, low oxygen levels without accompanying respiratory distress, increased breathing effort, or elevated respiratory rate are observed, a phenomenon known as “happy hypoxemia [23, 24]”. Moreover, in most of our patients, the median GCS was 15 and the heart rate did not exceed 120 bpm (Table 1). In such cases, the predictive performance of the HACOR score may be similar to that of the ROX index. Further research is required to evaluate the effectiveness of the HACOR score in patients with COVID-19.

The predictive performance of the ROX index improved over time. This finding is consistent with that of a previous study [20]. One possible explanation for the improved performance of the ROX index in later time windows is the initial undifferentiated state of patients. As HFNC therapy progresses, it allows the typical course of COVID-19 to unfold, offering an opportunity for specific treatments, such as steroids and antiviral drugs, to demonstrate their effectiveness [20]. Another contributing factor could be the inherent nature of predictive models; their accuracy typically improves when the predicted event is closer to the data’s time point. Essentially, more recent data tend to better reflect the current clinical scenario, leading to a more precise prediction.

A higher likelihood of treatment failure was observed in the intermediate-risk group. Although a previous study [25] indicated that prophylactic intubation does not improve patient outcomes, our findings suggest that early elective intubation may have been performed to protect healthcare workers from potential aerosol transmission of COVID-19 during HFNC therapy. This interpretation was supported by our sensitivity analysis, which revealed that 47 of 127 patients were intubated without meeting the criteria for clinically important intubation. On the other hand, the reason for the lower incidence of treatment failures in high-risk group as opposed to intermediate-risk group remains unclear. The decision for intubation during a pandemic may have taken into account not only the risk of the ROX index, but also other factors such as infection control, ventilator availability, facility policies, and manpower [9, 10]; however, data supporting these factors were not recorded in this study.

Implications for clinical practice and future research

The incidence of treatment failure among patients initiating HFNC therapy was 42%, and a significant number of patients experienced treatment failure despite having a HACOR score of 0 or being classified as low-risk according to the ROX index. These groups still present risks that should be carefully managed, suggesting that the use of predictive models may not change clinical decision-making. For example, if a patient scores 0 on the HACOR scale, there remains a 24% chance that they might require intubation. Therefore, a score of 0 does not justify overlooking careful observation. Moreover, it may be inappropriate to use these models to rule out treatment failure. As the predictive performance of these models varied depending on the time-point, clinicians need to be mindful of the time-point score they are utilizing. An improved version of the HACOR score, the updated HACOR score [26], has been recently published, but has only been assessed in NIV. More accurate models need to be developed to predict HFNC treatment failure and to support clinical decision-making.

Limitations

Our study had certain limitations. First, the data might exhibit heterogeneities. Especially, there was heterogeneity in the criteria for initiating HFNC therapy between facilities. However, external validation of predictive models must be performed in real-world settings. Second, the generalizability might be limited. Although we analyzed data from nine centers over two years, further validation over different time periods and geographical locations is essential. Third, the decision to perform intubation was subjective. Although sensitivity analysis demonstrated an improved AUROC for the ROX index, its predictive performance remained inadequate. The HFNC initiation settings were not standardized across the participating facilities, with HFNC flow settings varying from 30 to 60 L/min. Given that HFNC flow settings have been reported to play an important role in physiological effects, concerns remain regarding the possibility that differences in flow settings may have affected the results [27, 28]. However, the influence of flow settings could be minimal, as previous systematic reviews that did not account for HFNC flow settings have still demonstrated reasonable predictive accuracy [20]. Fourth, the study design was retrospective, resulting in missing data. We adhered to the recommended methods for handling missing data and conducted our analysis based on the TRIPOD guidelines [11]. Finally, in  Japan, vaccinations have been administered since July 2021; therefore, the study included data obtained before the availability of vaccines, potentially limiting the generalizability of our findings to the current era.

Conclusions

In patients with COVID-19 managed on HFNC therapy in Japan, the predictive performance of the HACOR score and ROX index at 2 h may be inadequate.

Consequently, these models may not be reliable for excluding early treatment failure. However, the discrimination of both scores tended to improve over time. It is important for healthcare providers to consider the timing of score assessment, as the accuracy of these predictive models varied depending on when they were used.

Availability of data and materials

The datasets generated and/or analyzed during the current study are not publicly available, as they may be used for a post hoc analysis by the co-authors, but are available from the corresponding author upon reasonable request.

Abbreviations

AUROC:

Area under the receiver operating characteristic curve

APACHE:

Acute Physiology and Chronic Health Evaluation

CI:

Confidence interval

COVID-19:

Coronavirus disease 2019

FiO2 :

Fraction of inspired oxygen

HFNC:

High-flow nasal cannula

ICU:

Intensive care unit

IQR:

Interquartile range

NIV:

Noninvasive positive pressure ventilation

NPV:

Negative predictive value

PPV:

Positive predictive value

SOFA:

Sequential Organ Failure Assessment

SpO2 :

Oxygen saturation

References

  1. Grasselli G, Pesenti A, Cecconi M. Critical care utilization for the COVID-19 outbreak in Lombardy, Italy: early experience and forecast during an emergency response. JAMA. 2020;323:1545–6. https://doi.org/10.1001/jama.2020.4031.

    Article  CAS  PubMed  Google Scholar 

  2. Kang BJ, Koh Y, Lim CM, Huh JW, Baek S, Han M, et al. Failure of high-flow nasal cannula therapy may delay intubation and increase mortality. Intensive Care Med. 2015;41:623–32. https://doi.org/10.1007/s00134-015-3693-5.

    Article  PubMed  Google Scholar 

  3. Prakash J, Bhattacharya PK, Yadav AK, Kumar A, Tudu LC, Prasad K. ROX index as a good predictor of high flow nasal cannula failure in COVID-19 patients with acute hypoxemic respiratory failure: a systematic review and meta-analysis. J Crit Care. 2021;66:102–8. https://doi.org/10.1016/j.jcrc.2021.08.012.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Roca O, Caralt B, Messika J, Samper M, Sztrymf B, Hernández G, et al. An index combining respiratory rate and oxygenation to predict outcome of nasal high-flow therapy. Am J Respir Crit Care Med. 2019;199:1368–76. https://doi.org/10.1164/rccm.201803-0589OC.

    Article  PubMed  Google Scholar 

  5. Roca O, Messika J, Caralt B, García-de-Acilu M, Sztrymf B, Ricard JD, et al. Predicting success of high-flow nasal cannula in pneumonia patients with hypoxemic respiratory failure: the utility of the ROX index. J Crit Care. 2016;35:200–5. https://doi.org/10.1016/j.jcrc.2016.05.022.

    Article  PubMed  Google Scholar 

  6. Magdy DM, Metwally A. The utility of HACOR score in predicting failure of high-flow nasal oxygen in acute hypoxemic respiratory failure. Adv Respir Med. 2021;89:23–9. https://doi.org/10.5603/ARM.a2021.0031.

    Article  PubMed  Google Scholar 

  7. Duan J, Han X, Bai L, Zhou L, Huang S. Assessment of heart rate, acidosis, consciousness, oxygenation, and respiratory rate to predict noninvasive ventilation failure in hypoxemic patients. Intensive Care Med. 2017;43:192–9. https://doi.org/10.1007/s00134-016-4601-3.

    Article  CAS  PubMed  Google Scholar 

  8. Valencia CF, Lucero OD, Castro OC, Sanko AA, Olejua PA. Comparison of ROX and HACOR scales to predict high-flow nasal cannula failure in patients with SARS-CoV-2 pneumonia. Sci Rep. 2021;11:22559. https://doi.org/10.1038/s41598-021-02078-5.

    Article  CAS  PubMed  PubMed Central  ADS  Google Scholar 

  9. Anesi GL, Kerlin MP. The impact of resource limitations on care delivery and outcomes: routine variation, the coronavirus disease 2019 pandemic, and persistent shortage. Curr Opin Crit Care. 2021;27:513–9. https://doi.org/10.1097/MCC.0000000000000859.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Dar M, Swamy L, Gavin D, Theodore A. Mechanical-ventilation supply and options for the COVID-19 pandemic. Leveraging all available resources for a limited resource in a crisis. Ann Am Thorac Soc. 2021;18:408–16. https://doi.org/10.1513/AnnalsATS.202004-317CME.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ. 2015;131:211–9. https://doi.org/10.1136/bmj.g7594.

    Article  Google Scholar 

  12. Kurita J, Sugawara T, Ohkusa Y. Infectivity of omicron BA5 comparison with original strain and other mutated strain of SARS-CoV-2 in Japan. J Disaster Res. 2023;18:4–10. https://doi.org/10.20965/jdr.2023.p0004.

    Article  Google Scholar 

  13. Akaishi T, Kushimoto S, Katori Y, Sugawara N, Egusa H, Igarashi K, et al. COVID-19-related symptoms during the SARS-CoV-2 omicron (B.1.1.529) variant surge in Japan. Tohoku J Exp Med. 2022;258:103–10. https://doi.org/10.1620/tjem.2022.J067.

    Article  PubMed  Google Scholar 

  14. Blez D, Soulier A, Bonnet F, Gayat E, Garnier M. Monitoring of high-flow nasal cannula for SARS-CoV-2 severe pneumonia: less is more, better look at respiratory rate. Intensive Care Med. 2020;46:2094–5. https://doi.org/10.1007/s00134-020-06199-9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Akobeng AK. Understanding diagnostic tests 3: receiver operating characteristic curves. Acta Paediatr. 2007;96:644–7. https://doi.org/10.1111/j.1651-2227.2006.00178.x.

    Article  PubMed  Google Scholar 

  16. Myers LC, Mark D, Ley B, Guarnieri M, Hofmeister M, Paulson S, et al. Validation of respiratory rate-oxygenation index in patients with COVID-19-related respiratory failure. Crit Care Med. 2022;50:e638–42. https://doi.org/10.1097/CCM.0000000000005474.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Delong ER, Delong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44:837–45. https://doi.org/10.2307/2531595.

    Article  CAS  PubMed  Google Scholar 

  18. Rubin DB. Multiple imputation for nonresponse in surveys, vol. 81. John Wiley & Sons; 2004.

    Google Scholar 

  19. Groothuis-Van Buuren S, Groothuis-Oudshoorn K. Mice: multivariate imputation by chained equations in R. J Stat Soft. 2011;45:1–67.

    Google Scholar 

  20. Yau CE, Lee DYX, Vasudevan A, Goh KJ, Wong E, Ho AFW, et al. Performance of the ROX index in predicting high flow nasal cannula failure in COVID-19 patients: a systematic review and meta-analysis. Crit Care. 2023;27:320. https://doi.org/10.1186/s13054-023-04567-7.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Whiting PF, Rutjes AW, Westwood ME, Mallett S, QUADAS-2 Steering Group. A systematic review classifies sources of bias and variation in diagnostic test accuracy studies. J Clin Epidemiol. 2013;66:1093–104. https://doi.org/10.1016/j.jclinepi.2013.05.014.

    Article  PubMed  Google Scholar 

  22. Moons KG, Altman DG, Reitsma JB, Ioannidis JP, Macaskill P, Steyerberg EW, et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med. 2015;162:W1-73. https://doi.org/10.7326/M14-0698.

    Article  PubMed  Google Scholar 

  23. Dhont S, Derom E, Van Braeckel E, Depuydt P, Lambrecht BN. The pathophysiology of ‘happy’ hypoxemia in COVID-19. Respir Res. 2020;21:198. https://doi.org/10.1186/s12931-020-01462-5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Bepouka B, Odio O, Mayasi N, Longokolo M, Mangala D, Mandina M, et al. Prevalence and outcomes of COVID-19 patients with happy hypoxia: a systematic review. Infect Drug Resist. 2022;15:5619–28. https://doi.org/10.2147/IDR.S378060.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Lee HJ, Kim J, Choi M, Choi WI, Joh J, Park J, et al. Early intubation and clinical outcomes in patients with severe COVID-19: a systematic review and meta-analysis. Eur J Med Res. 2022;27:226. https://doi.org/10.1186/s40001-022-00841-6.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Duan J, Chen L, Liu X, Bozbay S, Liu Y, Wang K, et al. An updated HACOR score for predicting the failure of noninvasive ventilation: a multicenter prospective observational study. Crit Care. 2022;26:196. https://doi.org/10.1186/s13054-022-04060-7.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Maggiore SM, Grieco DL, Lemiale V. The use of high-flow nasal oxygen. Intensive Care Med. 2023;49:673–6. https://doi.org/10.1007/s00134-023-07067-y.

    Article  CAS  PubMed  Google Scholar 

  28. Li J, Albuainain FA, Tan W, Scott JB, Roca O, Mauri T. The effects of flow settings during high-flow nasal cannula support for adult subjects: a systematic review. Crit Care. 2023;27:78. https://doi.org/10.1186/s13054-023-04361-5.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

We thank the Japanese Society of Education for Physicians and Trainees in the Intensive Care Clinical Trial Group (JSEPTIC-CTG) for their suggestions and comments on the earlier concept of this study.

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Contributions

HO designed the study, acquired the data, performed statistical analyses, and interpreted the data. RY designed the study, performed statistical analyses, interpreted the data, and drafted the manuscript. DI, TT, TO, HN, AK, HT, MO, and AM acquired the data. YI designed the study and interpreted the data. DK, MY, and KA reviewed and edited the manuscript. The first draft of the manuscript was written by HO, and all authors commented on the previous versions of the manuscript. All the authors have read and approved the final version of the manuscript.

Corresponding author

Correspondence to Ryohei Yamamoto.

Ethics declarations

Ethics approval and consent to participate

The Institutional Review Board of Tohoku University (2022-I-265) and that of each center reviewed and approved the study. The requirement for informed consent from all participants enrolled in this study was waived by the committee because of the retrospective study design.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

S1: Characteristics of each institution and ICU. S2: Test values at each measurement point and HACOR score and ROX Index. S3: Clinical important intubation and treatment failure. S4: The area under the receiver operating characteristic curve of each predictions for the outcomes, which were clinically important intubation or death within 7 days. S5: The area under the receiver operating characteristic curve of each predictions for excluding facilities that used ROX index as a guide for treatment. S6: Discrimination of the HACOR score and ROX index predictions excluding facilities that used use ROX index as a guide for treatment.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Okano, H., Yamamoto, R., Iwasaki, Y. et al. External validation of the HACOR score and ROX index for predicting treatment failure in patients with coronavirus disease 2019 pneumonia managed on high-flow nasal cannula therapy: a multicenter retrospective observational study in Japan. j intensive care 12, 7 (2024). https://doi.org/10.1186/s40560-024-00720-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s40560-024-00720-8

Keywords