If you don't remember your password, you can reset it by entering your email address and clicking the Reset Password button. You will then receive an email that contains a secure link for resetting your password
If the address matches a valid account an email will be sent to __email__ with instructions for resetting your password
Address correspondence to: Adrian G Barnett, Institute of Health and Biomedical Innovation, Queensland University of Technology, 60 Musk Avenue, Kelvin Grove, Queensland, 4059, Australia
Many studies disregard the time dependence of nosocomial infection when examining length of hospital stay and the associated financial costs. This leads to the “time-dependent bias,” which biases multiplicative hazard ratios. We demonstrate the time-dependent bias on the additive scale of extra length of stay.
Methods
To estimate the extra length of stay due to infection, we used a multistate model that accounted for the time of infection. For comparison we used a generalized linear model assuming a gamma distribution, a commonly used model that ignores the time of infection. We applied these two methods to a large prospective cohort of hospital admissions from Argentina, and compared the methods' performance using a simulation study.
Results
For the Argentina data the extra length of stay due to nosocomial infection was 11.23 days when ignoring time dependence and only 1.35 days after accounting for the time of infection. The simulations showed that ignoring time dependence consistently overestimated the extra length of stay. This overestimation was similar for different rates of infection and even when an infection prolonged or shortened stay. We show examples where the time-dependent bias remains unchanged for the true discharge hazard ratios, but the bias for the extra length of stay is doubled because length of stay depends on the infection hazard.
Conclusions
Ignoring the timing of nosocomial infection gives estimates that greatly overestimate its effect on the extra length of hospital stay.
]. A patient with an NI is likely to stay longer in hospital, incurring extra cost. The cost arises because other patients are denied access to the hospital bed while it is used to treat the infected patient. There will be only small changes to financial expenditures from reducing NI because most are fixed within the cost structures of the hospital within the time frame of infection control decisions [
]. It is the short-run willingness to pay for the marginal “bed day” released by reducing risk of NI that represents the cost of bed days used to treat NI, and this value will vary by decision maker and jurisdiction.
Correctly quantifying the extra LOS (in days) due to NI is crucial for economic and policy decision making. However, this analysis is complicated by the fact that patients acquire NI during their hospital stay, thus they have already spent some time in hospital before they become infected. This time requires specific consideration in the analysis by treating NI infection as a time-dependent exposure.
Many studies are prone to the “time-dependent bias” [
]. The bias has recently been rigorously studied by comparing the discharge hazard between infected and noninfected patients using hazard ratios (via Cox regression) [
In this article, we demonstrate the time-dependent bias in terms of LOS using a simulation study and real data. We show how to avoid the bias and give computer code to fit appropriate statistical models.
Methods
Ignoring the time of infection
Length of stay (in days) is often modeled using a generalized linear model assuming a gamma distribution because this helps to model its strong skew distribution [
Using a longitudinal model to estimate the effect of methicillin-resistant Staphylococcus aureus infection on length of stay in an intensive care unit.
]. These include a multistate model in which NI is an intermediate event between admission and discharge. Our model has three states: 0 = “no NI”, 1 = “NI”, and 2 = “discharged,” and we model the hazards among them (Fig. 1). The hazard is approximately equal to the probability of changing states in the next short time interval divided by the length of the interval, given that patients have been in the current state up to that time. We denote λij(t) = λij as the hazard for moving from state i to state j. An example hazard is,
Fig. 1Multistate model with three states: 0, admission without nosocomial infection (unexposed); 1, nosocomial infection (exposed); 2, discharge/death. Every patient enters state 0 when he/she is admitted to the hospital, then he/she may acquire a nosocomial infection (moving to state 1) or is discharged without this infection (moving to state 2). Once infected, the patient can only move from state 1 to 2. The (possibly time-dependent) hazard rate λ01(t) is the hazard rate to acquire a nosocomial infection during the hospital stay; λ02(t) the hazard rate to be discharged without a nosocomial infection and λ12(t) the hazard rate to be discharged after acquiring a nosocomial infection.
The actual hazard λ01(t) is obtained by taking limits as Δt → 0.
For simplicity we assume that the hazards are constant in time, which allows us to demonstrate the key points concerning LOS. However, methods for the general case do exist, and we give references as appropriate.
Under a constant hazard assumption, one estimates the λij's using the usual ‘incidence rates’,
Using this model the mean extra LOS of an infected patient is 1/λ12. The mean extra LOS of an uninfected patient (who may acquire infection later on) is the mean extra time in state 0 of ‘no HI’, 1/(λ01 + λ02), plus the mean extra time in the infected state, 1/λ12, times the probability of infection, λ01/(λ01 + λ02). The extra LOS due to an infection then is the difference in extra LOS for an infected and non-infected patient,
(1)
Three important facts follow from formula 1. The leading term ([(λ02)/(λ12)] − 1) is the ratio of the end-of-stay hazards ‘uninfected versus infected’ minus 1. This ratio determines whether infection prolongs or shortens LOS. The extra LOS is zero if λ02 = λ12, is positive if λ02 > λ12, and is negative if λ02 < λ12. Second, it is a mathematical fact that not modeling the timing of infection will overestimate a genuine prolonging effect of infection on LOS in terms of the hazard ratio [
]. This is because the hazard ratio (comparing infected with uninfected patients) is underestimated because the risk set of infected patients is incorrectly increased and the risk set of uninfected patients is incorrectly decreased. Finally, the actual number of extra days following infection does not only depend on the end-of-stay hazards, but also on the infection hazard λ01.
These three facts demonstrate that estimation of extra LOS will be biased if the timing of infection is not taken into account. In practice, bias is usually introduced by retrospectively stratifying patients into infected and noninfected. Under a constant hazards assumption, the extra LOS can be unbiasedly estimated by plugging in the incidence rates into formula 1. If the hazards are time dependent, however, then formula 1 will also be time dependent. A time dependent version of formula 1 can be created by giving more weight to those days when most infections occurred. Below, we have simulated data using constant hazards, but the analysis of both the simulated data and the real data did not rely on such an assumption. R code to run the time dependent version formula 1 is given in the supplement (doi:10.1016/j.jval.2010.09.008).
The hazards for the multistate model were calculated using the R-package mvna [
] was used to estimate the mean extra length of stay and 95% confidence intervals for the mean. Both R-packages are available at: http://cran.r-project.org [
We simulated individual LOS data (in days) using constant hazards and the multistate model in Figure 1. For each individual the time spent in state 0 was randomly generated with hazard λ01 + λ02 using an exponential distribution. The individual was randomly infected with probability λ01/(λ01 + λ02). For an infected individual the time spent in state 1 was randomly generated with hazard λ12.
We generated 1,000 studies with 1,000 individuals for 18 scenarios (Table 1). These scenarios were chosen to mimic realistic lengths of stay from an intensive care unit (λ02 = 0.2) and longer average lengths of stay from a hospital (λ02 = 0.1). By varying λ12 we considered infections that: prolonged LOS (λ12 < λ02), shortened LOS (λ12 > λ02), and had no effect on LOS (λ12 = λ02). A shorter LOS after infection is possible if the infection seriously harms the patient and so hastens their death. By varying λ01 we examined three infection rate levels: rare (5%), moderate (9%), and common (13%).
Table 1Simulation results: estimated bias for the extra LOS (days).
We used prospectively collected data from an observational study of NI from the intensive care units of 11 hospitals in Buenos Aires, Argentina. The data were collected as part of the International Nosocomial Infection Control Consortium (INICC) [
The International Nosocomial Infection Control Consortium (INICC): goals and objectives, description of surveillance methods, and operational activities.
]. INICC is an international non-profit, multicenter collaborative health-care-acquired infection control program that uses a surveillance system based on the US National Healthcare Safety Network (NHSN) [
]. The laboratory techniques used, training programs for data-collectors, definitions of infection, and surveillance activities were previously described [
The International Nosocomial Infection Control Consortium (INICC): goals and objectives, description of surveillance methods, and operational activities.
]. All patients entering the intensive care unit for more than 24 hours were prospectively followed; detailed information was collected on each day. The following variables were used in this analysis: hospital name and location; date of admission, discharge, and death; date, type, and site of health-care acquired infection; Average Severity of Illness Score (ASIS); and age.
To investigate possible moderating or confounding effects of age and ASIS, we estimated the extra length of stay due to infection after stratifying on these variables.
Results
Simulation
The simulation results are shown in Table 1 and demonstrate the bias in realistic settings. For the “intensive care unit” scenario the gamma model overestimated the extra LOS by an average of around 5 days, and for the “hospital” scenario by around 10 days. In contrast, the multistate model estimated the correct extra LOS on average. The bias of the gamma model was consistent regardless of the rate of infection or the effect of infection. Changing the hazard for infection to discharge (λ12) had a slight impact on the bias for the gamma model when the rate of infection was very high. A further simulation with λ01 = λ02 = 0.1 (infection rate = 50%) and λ12 = 0.06, 0.1 and 0.2, had a mean bias for the gamma model of 7.5, 10.0 and 13.3 days. The bias of the gamma model was always positive, even when an infection shortened the length of stay. So the gamma model did not even correctly estimate the direction of change, let alone its size.
Our results, ignoring the time of infection, were based on a parametric gamma model. As a sensitivity analysis, we instead used a non-parametric Kaplan-Meier model and found exactly the same biased estimated extra LOS due to infection (results not shown).
Argentina
Summary statistics for the 9545 admissions are shown in Table 2. There were 826 admissions acquiring an NI (8.7%). The average LOS was 5.8 days, the median was 4 days.
Table 2Descriptive statistics for the intensive care unit data from Buenos Aires, Argentina, January 2003 to November 2008.
Variable
Category
Statistic(s)
Admissions, n
9,545
Age, mean (SD)
68.5 (17.8)
Gender, n (%)
Men
5,018 (52.6)
ASIS, n (%)
A
853 (9.7)
B
2,840 (32.4)
C
3,172 (36.2)
D
1,478 (16.9)
E
418 (4.8)
NI, n (%)
Yes
826 (8.7)
LOS (days), mean (median)
5.8 (4.0)
ASIS, average severity of illness score (see Table 4 for definitions); LOS, length of stay; NI, nosocomial infection.
Figure 2 shows the cumulative hazards for moving between three states for the multistate model. The cumulative infection hazard is roughly a straight line, meaning the slope is a constant infection hazard of about 0.02 (1 divided by 50 days). This means that each day an average of 2 patients out of 100 get an NI. The hazards of discharge are not straight lines, but the discharge hazard is consistently reduced for patients with an NI.
Fig. 2Cumulative hazards in the first 50 days for the three states (see Figure 1 for definition) for nosocomial infection (NI) using data from Argentina. The slope of each line corresponds to the actual hazard rate, e.g., a straight line would mean a constant hazard rate.
Using the multistate model and the real data from Argentina the expected length of stay on each day is shown for patients with and without an infection in Figure 3. The extra length of stay due to NI is greater for earlier days. The average extra length of stay over all days is calculated by weighting the difference in LOS on each day. This gives an estimated extra LOS of 1.35 days (95% confidence interval: 0.77–1.93 days). The gamma model estimated an extra LOS due to NI of 11.23 days (95% confidence interval: 10.10–12.44 days).
Fig. 3Weights and expected LOS for patients with and without a NI in the first 20 days using data from Argentina. The weights are given by the distribution of the time until group membership (infected with NI, not infected with NI) becomes definite, i.e., when a patient leaves the initial state. LOS, length of stay; NI, nosocomial infection.
The extra lengths of stay estimated by age group and ASIS are shown in Table 3. Lengths of stay due to infection were longer in the youngest age group (under 60), with an average of close to 3 extra days. By ASIS there was an increasing extra length of stay with increasing morbidity (groups ‘A’ to ‘D’; Table 4), but there was no extra length of stay for the sickest patients who acquired an infection (group ‘E’). Infection may have hastened death in this group.
Table 3Estimated extra length of stay by age group and ASIS for the intensive care unit data from Buenos Aires, Argentina, January 2003 to November 2008.
Variable
Group
Admissions
Extra LOS (days)
Mean
95% CI
Age group
<60
2,013
2.98
1.44, 4.49
(years)
60–79
4,655
0.76
0.07, 1.45
80+
2,610
0.83
0.04, 1.62
ASIS
A
853
−0.48
−1.19, 0.22
B
2,840
0.13
−0.85, 1.12
C
3,172
0.79
0.01, 1.58
D
1,478
1.89
0.92, 2.87
E
418
−1.87
−4.32, 0.58
All admissions
—
9,545
1.35
0.77, 1.93
ASIS, average severity of illness score (see Table 4 for definitions); LOS, length of stay.
Table 4Definitions for the five average severity of illness score categories.
Letter
Definition
A
Surgical admissions who require routine postoperative observation only
B
Physiologically stable non-surgical patients who require overnight observation
C
Admissions who need continuous nursing care and monitoring
D
Physiologically unstable patients who require intensive nursing and medical care and need frequent reassessment and adjustment of therapy
E
Physiologically unstable admissions who are in a coma or shock and require cardiopulmonary resuscitation or intensive medical and nursing care with frequent reassessment
Our results demonstrate the time-dependent bias in terms of extra LOS. The results show that the extra LOS is always overestimated when ignoring the time of infection. Using intensive care unit data from Argentina, the gamma model greatly overestimates the extra LOS due to infection. The consequences of this are compounded when extrapolating to economic costs.
The costs of NI-expressed in monetary values are used by advocates to justify extra infection control investments. Spending money to save money is a powerful argument that will resonate with decision makers. Typically a dollar value is applied to each bed day lost to NI and the aggregate cost outcome disseminated to decision makers; the promise is that cost savings will be enjoyed if cases of infection are prevented. A value of $700 assigned to each hospital bed day lost to NI in Argentina implies cost savings of $7861 (95% confidence interval: $7070–$8708) using biased estimates and $945 (95% confidence interval: $602–$1288) using unbiased estimates. Two possible outcomes from using biased estimates to make the economic case are that too many resources are allocated to infection control, and that decision makers are disappointed with the actual savings. Unbiased estimates should be used to inform decisions, and decisions should include information on changes to all costs and changes to health benefits, such as reduced risk of mortality and reduced morbidity [
To examine the bias in a hospital setting with generally longer length of stay compared with an intensive care unit, we multiplied all three constant hazards by a half. This multiplication does not alter the true hazard ratio, but the bias in terms of extra length of stay is roughly doubled (Table 1). This shows the additional value of examining the time-dependent bias in terms of the additive length of stay compared with multiplicative hazard ratios.
This study has a few limitations. In our simulations we assumed constant hazards between states; however, the multistate model allows hazards to change over time. We did not distinguish between patients dying while in the hospital and being discharged alive. However, the multistate model can be extended to incorporate such competing events. This makes it is possible to estimate extra LOS due to NI for patients who died, and a separate extra LOS for those who were discharged [
Nosocomial infections may develop during hospital stay, but may only become symptomatic and detected after discharge. If asymptomatic patients suffer the same morbidity as infected patients, then the methods used here will underestimate the increased length of stay due to infection because some patients in the non-infected group are misclassified. The size of underestimation will depend on how many patients are misclassified. An interesting approach to this problem is to model an imperfect sensitivity for the test of NI [
The methodology in this article is appropriate for studying statistical association. In various fields of epidemiology there has been increasing interest in statistical methodology for causal effects rather than association, typically motivated by time-dynamic treatment regimes. An investigation of the impact of time-dependent adverse events on LOS has been conducted [
]. Our study makes some headway into causation because the temporal sequence of events is accounted for. This is crucial to any concept of causality, because the cause has to precede the effect. It is precisely this temporal sequence that is ignored in the time-dependent bias. We have, however, refrained from a more in-depth treatment of the issue of causality, because the development of causal models for time-dependent exposures is ongoing.
Our findings are not limited to studies of infection; they are extendable to other settings with a time-dependent exposure; therefore, they are relevant for an appropriate risk-adjustment in other studies. Future studies should be aware of the time-dependent bias; ignoring it leads to overestimated lengths of stay and economic impacts. The bias applies to any method (Kaplan–Meier, Cox, and generalized linear [mixed] models with any type of distribution) unless the time of infection is taken into account.
Using a longitudinal model to estimate the effect of methicillin-resistant Staphylococcus aureus infection on length of stay in an intensive care unit.
The International Nosocomial Infection Control Consortium (INICC): goals and objectives, description of surveillance methods, and operational activities.
Funding: Martin Wolkewitz, Jan Beyersmann and Arthur Allignol were supported by Deutsche Forschungsgemeinschaft (FOR 534). Martin Wolkewitz's visit to Brisbane was supported by the Institute of Health and Biomedical Innovation visiting researcher program.