Home
HOME ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Services
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via HighWire
PubMed
Right arrow PubMed Citation
The Gerontologist 44:624-632 (2004)
© 2004 The Gerontological Society of America

Using Resident Reports of Quality of Life to Distinguish Among Nursing Homes

Robert L. Kane, MD1,, Boris Bershadsky, PhD1, Rosalie A. Kane, PhD1, Howard H. Degenholtz, PhD2, Jiexin (Jason) Liu, PhD3, Katherine Giles, RN, MBA1 and Kristen C. Kling, PhD4

Correspondence: Address correspondence to Robert L. Kane, MD, University of Minnesota School of Public Health, D351 Mayo (MMC 197), 420 Delaware Street SE, Minneapolis, MN 55455. E-mail: kanex001{at}umn.edu


    Abstract
 TOP
 Abstract
 Background
 Methods
 Results
 Discussion
 References
 
Purpose: We used measures created to assess the quality of life (QOL) of nursing home residents to distinguish among nursing facilities. Design and Methods: We statistically adjusted scores for 10 QOL domains derived from standardized interviews with nursing home residents for age, gender, activities of daily living functioning, cognitive functioning, and length of stay, and then we aggregated them to the facility level. We compared the patterns across a sample of 40 facilities. We correlated facility characteristics with QOL scores. Results: The pattern of QOL scores for each of the 10 domains was generally consistent within a given facility. Although resident characteristics played a major role in explaining variance, there were significant effects of facilities as well. Some modest relationships were found between facility characteristics such as ownership, percentage of private rooms, and rural–urban location and facility QOL scores. No effect of facility size was detected. Implications: This article shows that it is possible to differentiate among facilities on the basis of resident self-reported QOL. On the basis of our analysis, we find that a sample of 28 residents per facility is sufficient to generate a reliable QOL score for each of the domains studied.

Key Words: Quality of life • Nursing homes • Resident reports


Although concerns about the inadequacies in quality of life in nursing homes have been constant and ubiquitous since 1965, when public money became widely used to fund nursing homes (Institute of Medicine, 1986; Wunderlich & Kohler, 2001), measures to assess this construct have not been available and the emphasis, accordingly, has been on quality of care. Partially in response to this gap, the Centers for Medicare and Medicaid Services (CMS), formerly HCFA, created a contract to develop a set of measures to tap relevant quality-of-life domains for nursing home residents. A crucial part of that challenge was to determine how to aggregate the data received from individual residents into a score to characterize a particular facility.

The first step is, of course, to specify and test self-report measures of quality-of-life outcomes experienced by individual residents. That work was the first part of the CMS contract and is described elsewhere (Kane, 2001). However, even after reliable and valid measures of any given resident's quality of life are developed, many questions arise about whether such individual information can be summarized to characterize the quality of life of the nursing facility. For some accountability purposes, it is necessary to combine the individual data in a way that characterizes the nursing home as a whole. Moreover, such aggregation will facilitate studying the role of nursing facility characteristics in influencing the quality of its residents' lives. This article describes our method of aggregating individual data to the facility level and the extent to which such facility-level data could serve to differentiate among a sample of nursing homes. It is designed to test the feasibility of using this quality-of-life data to distinguish among nursing facilities.


    Background
 TOP
 Abstract
 Background
 Methods
 Results
 Discussion
 References
 
Why is it important to distinguish quality of life among nursing facilities? Such information can be used in several ways. It can be part of the quality-assurance and regulatory process; it can be used for internal quality-improvement efforts; and the results can be made available to potential users to help them in selecting a nursing home. Central to all of these efforts is the assumption that nursing homes should be held accountable for outcomes of interest. Nursing homes differ from many other health care programs in that, for many residents, they serve as dwelling places for long periods of time. The nursing home, therefore, has the potential to influence residents' lives for better or worse on outcomes that are far broader than health status. Arguably, the nursing homes' effects on their residents' quality of life are at least as important as the more traditional measures of quality of care that are routinely collected.

Some controversy exists, as yet uninformed by much data, about the extent to which a nursing home can actually influence the social and psychological domains of quality of life. For example, outcomes such as meaningful activity, relationships, and the like may be heavily determined by social factors outside facility control, such as family structure and availability, quality of family relationships, and resident's interests, education, and even personality. It is also reasonable to hypothesize that quality-of-life levels in a nursing home will be related to various health and disability characteristics such as health status, prognosis, functional abilities, sensory abilities, pain, persistent serious depressive illness, and cognitive abilities. These factors can affect how a resident appraises his or her quality of life. Nursing homes, however, can likely influence some of these potential mediators of quality of life directly, especially pain, functional status, and depression (particularly depressive affect that is not part of a long-standing psychiatric diagnosis). Although it is difficult for nursing homes to overcome the effects of extreme sensory impairment, high disease burden, irreversible cognitive impairment, and poor prognoses, we suggest that nursing homes can take effective steps to improve or maintain quality of life for those who may be at highest risk of poor quality of life, including those with dementia, those who are facing imminent death, and those who have limited external social support systems.

Indicators of quality of a health care organization may be expressed at several levels. One way to generate indicators is to identify facility-level structural and process factors that relate to outcomes of interest, such as nursing staff levels or infection control. Another approach is to aggregate individual-level data on outcomes of interest (e.g., proportion of bedsores or proportion of urinary tract infections) to describe the facility. When such outcomes are aggregated, adjustments are necessary for aspects of the case mix that are out of the control of the facility. With the use of the second approach, nursing home data generated through the Minimum Data Set (MDS) have been under study for more than a decade and various health-oriented outcome indicators have been developed by use of different case-mix adjustment strategies (Zimmerman et al., 1995). For quality-assurance purposes, and for presenting meaningful comparative data to the public, it is necessary to avoid describing nursing homes as better in quality if a substantial amount of the differences can be attributed to the characteristics of the residents who are admitted. In contrast, it is also imperative not to overadjust and, therefore, fail to hold nursing homes accountable for resident characteristics that they may be able to change (e.g., bed-bound or wheelchair-bound residents, or residents who are depressed).

A central task in creating facility-level measures of quality of life is to adjust appropriately for differences in case-mix across facilities, although this need is not always appreciated (Davis, 1991). The choice of case-mix adjusters is important; they should reflect elements that might influence quality of life, but they should not include items that are under the control of the nursing home, lest important differences in quality are adjusted away. Although a growing body of information is available on case-mix adjustment for quality of care in nursing homes (Braun, 1991; Mukamel, 1997; Phillips et al., 1996; Porell & Caro, 1998), little work has been done on quality of life.

The MDS assessment is performed by various staff members (nurses, social workers, therapists, and activities personnel) based on their knowledge of the resident during the time period of the assessment. Although considerable detail is provided about how to define each MDS element, ultimately the MDS is composed of staff observations and reflections, not resident reports; although the MDS manuals emphasize the value of resident input, neither specific instructions nor structured mechanism to obtain such input is provided. Quality of life as a phenomenon is poorly addressed by MDS 2.0. Among the 24 MDS-based Quality Indicators created by the Center for Health Services Research and Analysis at the University of Wisconsin, 2 were labeled as tapping quality of life: prevalence of daily physical restraints and prevalence of little or no activity (Zimmerman et al., 1995). However, quality of life is a more complex phenomenon than can be readily captured by these two statistics; moreover, it is a subjective phenomenon that requires direct statements from residents.

In this study, we envisaged quality of life as a multidimensional concept, and we developed measures for a spectrum of quality-of-life domains that the literature and focus groups had identified as important: comfort, autonomy, privacy, dignity, meaningful activity, relationships, food enjoyment, security, functional competence, and spiritual well-being (Kane, 2001; Kane et al., 2003). Conceptually, and on the basis of early analyses, we had reason to expect that the quality-of-life score on some domains would be worse for residents with poor cognitive abilities, though again this might be related to the kind of care and environments and general milieu ordinarily offered to those with worse cognitive abilities. In contrast, for some other domains that require more cognitive processing, such as privacy and dignity, we made no specific hypotheses about the influence of declining cognitive abilities on the quality-of-life scores. Therefore, an adjustment for cognitive status was necessary. Moreover, cognitive status is not a characteristic that a nursing home can easily influence through providing worse care.

To characterize a nursing home on any outcome, one must sample a sufficient number of residents to yield stable estimates (that is, the estimate should not be subject to the idiosyncratic outcomes of one or two residents). Given that data on quality of life would have to be collected directly from residents, a sampling approach seemed practical. Another goal we had for this study, therefore, was to determine how many resident appraisals were sufficient to yield stable estimates of the quality-of-life domains.


    Methods
 TOP
 Abstract
 Background
 Methods
 Results
 Discussion
 References
 
Sample
The sample for these analyses is composed of the 40 nursing homes that participated in the national study to develop quality-of-life measures and indicators. We selected the facilities from five states (California, Florida, Minnesota, New Jersey, and New York), which together reflect considerable diversity in nursing home environments. The facilities were evenly divided between large and small, and urban and rural facilities, and (to facilitate a different set of questions of interest to CMS) we overrepresented facilities with high proportions of private rooms. We derived the initial sampling frame from facilities that had responded to an initial contact survey about their quality-assurance activities related to quality of life and other descriptive characteristics. The questionnaire was mailed to all 1,745 nursing homes in the five geographic catchment areas accepting either Medicare or Medicaid payments, and 1,000 responded. We used these responses to identify those facilities with 70% or more private rooms. We used three basic sampling components in selecting facilities: percentage of private rooms, size, and rural–urban location. We thus created six sampling cells. We used a median split on bed size to classify facilities as small (fewer than 109) or large (109 or more). First, within each state, we randomly chose two facilities with 70% or more private rooms; these homes were assigned to a size and urban–rural stratum. We then randomly chose the remainder of the sample in each state to complete the needed strata.

At this stage in measure development, our procedures demanded a great deal from facilities, and their participation in the study was voluntary. If a facility declined to participate, we randomly made a replacement within the size and rural–urban designation. Because our goal in this paper is to examine the patterns of performance across the facilities in the sample and not to attempt any generalization to the larger universe of nursing facilities, we propose no effort at weighting to compensate for the selection procedures. Instead, we examine the effects of various facility characteristics on the facilities' performance.

Consistent with our belief that each resident is the best judge of his or her quality of life, we collected the data on quality of life directly from residents. The resident sampling plan within each facility yielded 50 residents from each of up to five units per facility. It was based on two factors: the percentage of private rooms and residents' cognition. In the first step we sought to ensure that we had at least 10 residents in each facility in private rooms if possible. These residents were randomly selected first and assigned to cognitive categories. Then we drew the rest of the sample to achieve a 50–50 split in those with high and low cognitive scores until the facility sample included 50 residents. (Comatose residents and residents under the age of 65 were not included.) Interviews were attempted with all residents who were sampled, regardless of cognitive level. We abstracted cognitive functioning information from the MDS data on all residents in the facility, using a variant of the Cognitive Performance Scale (CPS; Morris et al., 1994). Specifically, we eliminated from the CPS the item on being comatose or in a vegetative state (because those residents were excluded) and the item on feeding. Our scale used a short-term memory problem (yes–no), a long-term memory problem (yes–no), and a 0–3 rating of decision-making capability to form a 6-point scale (0–5), where 5 represents severe impairment. We formed two strata and evenly sampled from residents with scores of 0–2 and 3–5. Among the latter (i.e., those who were more impaired), 52.8% completed enough of the questionnaire to enable construction of at least one quality-of-life domain scale, and for 33.5% we could construct 9 or more of the 10 domains measured. Among the higher functioning residents (with scores from 0 to 2), we could construct at least one quality-of-life measure for 90.8% of residents and 9 or more domains for 78.7% of the residents.

Data
Quality-of-life data came from in-person interviews conducted in the participating facilities by teams of university data collectors who received a week of training and testing to establish their interrater reliability. They were supervised in the field by periodic visits, but no formal interrater reliability testing was done after their initial certification. As described elsewhere, we created individual scale scores for each of the 10 domains, which were created by using a combination of factor analysis (to create domain scales) and cluster analysis (to shorten the length of the scales; Kane et al., 2001; Kane et al., 2003). Although the primary response mode used a Likert scale, the questions allowed for dichotomous responses from residents who could not use the Likert responses. We used this dual response option to encourage participation of cognitively impaired respondents. We combined the two response modes by using a z-score adjustment described elsewhere. These scales were subjected to the usual psychometric tests for reliability and validity, the details of which are provided elsewhere (Kane et al., 2003). The original data collected included a large number of questions designed to create scales for the postulated 11 domains. A first analytic step in creating the scales for these domains was to identify the questions that best captured each of them. For this analysis, we used the short versions of the scales, which consisted of three to six items per domain. We tested the internal reliability of these scales by using Cronbach's alpha coefficients, which essentially measure the degree to which each item in a putative scale correlates with the total scale score. Cronbach's alpha coefficients for the scales and the number of items for each scale are shown in Table 1. Ideally, these scores should exceed.80. All but two alphas were above.60 and four were above.70. Confirmatory factor analyses showed that the presumed factor structure was supported for the 10 domains reported here (Kane et al., 2003).


View this table:
[in this window]
[in a new window]
 
Table 1. Facility Characteristics.

 
We derived other data for these analyses from the MDS. We acquired the MDS files for all the residents in all the 40 participating facilities for 18 months prior to our data collection. In addition to the cognitive score we created, the activities of daily living (ADL) score used for adjustment was based on a count of the number of ADLs in which the resident was rated as requiring assistance as recorded in the MDS evaluation closest to the time of interview.

Analysis
We implemented the comparison of quality-of-life domain scores among facilities on two levels: raw scores and scores adjusted for differences in patient characteristics. First, we calculated quality-of-life domain scores by using all available resident-level responses and aggregated them by facilities to obtain nonadjusted facility-level average quality-of-life domain scores. We also used resident-level data to develop a case-mix-adjusted model that regressed individual quality-of-life domain scores on selected resident characteristics obtained from MDS data. These included ADL, cognition, age, gender, and length of stay. We dichotomized length of stay into fewer than 3 months and 3 months or more.

To enable the comparison of facilities while adjusting for patient characteristics, we calculated differences between the observed individual quality-of-life domain scores and the expected ones calculated based on the case-mix-adjusted model already described. We averaged the standardized resident-level residuals by facility to obtain facility-level adjusted scores. We transformed the distribution of the facility-level adjusted scores for all 40 facilities by using z scores. As a result, we expressed the differences among facilities in units of the standard deviation of average scores for all 40 facilities. By construction (assuming close to a normal distribution), we could expect that approximately half of all facilities would have positive and another half would have negative standardized scores in the range from –3 to +3.

We used mixed-effect linear models with main effects to fit the data. We used 10 components of the quality-of-life instrument as dependent variables. Independent variables included various combinations of Facility and Interviewer factors, and covariates. The allocation of interviewers by facilities was not planned in advance, and post hoc cross-tabulation revealed a very unbalanced design with many empty cells. This imbalance dictated the use of a type of sum of squares that can accommodate the design with empty cells. Covariates included in the models served as risk adjustors and included length of stay in the nursing facility and MDS-based cognition score (six-level ordinal variable), the ADL score, age, education (five-level ordinal variable), and binary variables representing race (White, other), marital status, gender, and the presence of living children. We constructed these variables on the basis of the measured values to obtain reasonably unskewed distributions. We implemented calculations by using the Univariate General Linear Model and Variance Components procedures in SPSS 11.0.


    Results
 TOP
 Abstract
 Background
 Methods
 Results
 Discussion
 References
 
The characteristics of the facility sample are shown in Table 1, which also contrasts some of the characteristics of the facilities in this sample with the national profile. About half of the randomly sampled facilities declined to participate when contacted by telephone. Of the 49 homes that were visited for recruitment, 9 refused. The mean size of facilities in the resultant sample was 128 beds; the median was 109.

By design, half the sample was rural. There was a considerable range in staffing. The proportion of homes that were proprietary was much lower than the national average, partly because Minnesota and New York have high numbers of nonprofits, but also because most of the refusals occurred in for-profit facilities, whereas all the facilities oversampled for private rooms were nonprofit. There was considerable variation in the proportion of cognitively impaired residents. This distribution was generally similar across four of the states (averaging about 55%), but California homes had a much higher proportion (67%) than the rest of the sample.

The mean facility scores for each domain are shown in Table 2. We created an average score across domains by dividing the additive scale by the number of questions for the particular domain. Each quality-of-life domain could be scored between 4 and 1, with higher scores reflecting higher quality of life. Facility-level scores varied from 2.70 (meaningful activity) to 3.67 (dignity). The extent of between-facility variation in the scores is reflected in the standard deviations, which varies over 100% (from 0.101 for dignity to 0.212 for spiritual well-being).


View this table:
[in this window]
[in a new window]
 
Table 2. Quality-of-Life Domain Alpha Value and Facility Scores.

 
Table 3 is designed to show the overall patterns of performance for each facility, by displaying the results arrayed by facility according to whether the facility mean score was above or below the sample average on each domain. The facilities are ranked from best to worst. Each plus or minus sign represents a standard deviation; the more plus or minus signs, the greater the facility average deviates from the mean for that domain. The results show that facilities do indeed differ in their average quality of life on these domains. In 23 facilities, two or more domains were consistently positive or negative by at least 1 SD. Ten facilities showed a consistently positive pattern and 13 a consistently negative pattern.


View this table:
[in this window]
[in a new window]
 
Table 3. Facility-Level Deviations in Resident-Reported Quality of Life, Risk-Adjusted for Resident Characteristics.

 
The differences in facility quality-of-life domain scores can be due to differences in facility performance, but they can also be attributed to resident case mix and the differences in the way interviewers performed. Table 4 presents a comparison of the ability of various components of quality of life to discriminate among facilities by using raw nonadjusted measures and adjusted measures. The numbers in the table represent relative variance and p values associated with a corresponding random factor. Higher relative variance and lower p values indicate better chances to discriminate the analyzed objects (i.e., facilities and interviewers). When comparing p values, one has to apply the correction for multiple comparisons; the threshold p value of.005 instead of.05 should be used. Prior to risk adjustment, 6 domains out of 10 show a p value less than.005, which corresponds to a relative variance greater than 3%. In one case (spiritual well-being) this value is close to 6%. Two more domains (meaningful activity and autonomy) fall in the range of p values from.005 to.05. When we applied risk adjustment for patients' characteristics (covariates included into the model), we found no dramatic changes. After adjustment, the relative variance of the Facility factor increased in five cases and decreased in five cases. Six domains demonstrated p values less than.005 and three between.005 and.05. Two domains did not show any significant contribution of the Facility factor. As a result of risk adjustment, the overall level of significance associated with the same relative variance dropped because of the degrees of freedom consumed by risk adjustment. For example, the factor Facility, prior to adjustment, was able to explain 2.2% of total variance of AUT (autonomy) with p =.007. After adjustment, the relative variance increased to 2.4% but the corresponding significance became slightly worse (p =.009).


View this table:
[in this window]
[in a new window]
 
Table 4. Ability Various Components of Quality of Life to Differentiate Facilities.

 
The study design did not specify how interviewers were distributed across facilities. Consequently, some interviewers worked in as few as two facilities and some visited as many as seven facilities. All interviewers were trained and demonstrated their ability under the observation of the trainers. Nevertheless, it does not exclude the possibility that observed differences between facilities might be partially related to differences among interviewers. To test this assumption, we applied an additional correction for the random Interviewer factor (Table 5). Eight domains demonstrated a loss of relative variance; only two domains did not show any change. In four cases, variance associated with interviewers was even greater than variance associated with facilities (comfort, privacy, meaningful activity, and security). This trend was combined with the fewer degrees of freedom; therefore, just one Facility p value (domain dignity) fell in the range.005 to.05, and no one domain had a significant p value (p <.005). We can illustrate the effect of losing degrees of freedom by comparing models for autonomy (Tables 4 and 5). Table 4 shows that the model adjusted for patient characteristics explains 2.4% of total variance and the Facility factor has a p value equal to.009. Table 5 shows that the model with the Interviewer factor added has the same relative variance of 2.4% associated with the Facility factor and zero variance explained by the Interviewer factor. Nevertheless, significance of the Facility factor became much worse (.073 instead of.009).


View this table:
[in this window]
[in a new window]
 
Table 5. Ability of Various Components of Quality of Life to Differentiate Facilities.

 
We studied the effect of facility characteristics on various components of quality of life by using mixed-effect linear models with two random factors (Location–urban or rural and Ownership–for profit, private not for profit, or public) and two covariates (Percentage of Private Rooms and Numbers of Beds). As shown in Table 6, having a higher percentage of private rooms is associated with better average scores on comfort and privacy. Rural facilities demonstrate a better CMF (comfort) score, and private nonprofit facilities have better dignity, security, and spiritual well-being scores. We did not find the size of the facilities to be significant with respect to the studied components of quality of life.


View this table:
[in this window]
[in a new window]
 
Table 6. Effects of Facilities Characteristics on Various Components of Quality of Life.

 
We examined the effect of cognition in comparing the level of discrimination among facilities. More homogeneous but smaller groups of residents did not yield highly significant differences of quality-of-life components across facilities. Nevertheless, the pattern differs between these two populations. In the cognitively impaired population, the facilities could be significantly distinguished on the basis of functional competence, dignity, meaningful activity, and security. By contrast, among the cognitively intact population, the facilities could be compared on the basis of comfort, privacy, food enjoyment, relationships, and spiritual well-being.

An important question is the sample size needed to produce significantly different facility scores. To estimate the effect of different sample sizes, we used a method described by Dupont and Plummer (1990). This method, which produces results that are in close agreement with those of Pearson and Hartley (1970), uses the relationship of the between-group difference and the within-group difference. Because these data could be used to describe facility performance, we must take special care to ensure that false positives (Type II errors) are avoided. Hence, we should use a conservative alpha value. Table 7 shows the estimated sample sizes needed to detect differences between two facilities that exceed 1 SD within groups, but the choice of a 1-SD threshold difference is arbitrary. In the first case, the risk of declaring a difference simply as a result of chance variation in sampling (alpha) is less than 5%, and the chance of declaring no difference when there really might be one (beta or 1-power) is 20%. In the second case, the alpha level is set at 1% and the beta is set at 10%. The sample size varies among the domains. For the first case it runs from 7 (spiritual well-being) to 12 (dignity). For the second, more stringent case, it runs from 13 to 22. Thus, a sample size of 22 per nursing home would permit a reasonable comparison across facilities. This number represents the number of nursing home residents who responded to an adequate number of the questions about quality of life. Presumably, a somewhat larger sample would have to be approached to net this amount. If smaller units within a facility were to be compared, then the necessary sample size would increase considerably.


View this table:
[in this window]
[in a new window]
 
Table 7. Sample Size Needed to Detect a Mean Difference Across Facilities Equal to Different Levels of Between-Groups Standard Deviation.

 

    Discussion
 TOP
 Abstract
 Background
 Methods
 Results
 Discussion
 References
 
These analyses suggest that it is possible to create facility-level scores for quality of life and to compare facilities on this basis. Despite the unusual configuration of the sample, the facility scores did appear to identify facilities with better and worse quality of life. In general, the performance pattern showed consistency. That is, facilities that were below the mean in one area tended to show a similar pattern for other domains, which may result from correlated facility-level domain scores. The analysis of variance test in Table 4 revealed significant differences between facilities in 8 domains out of 10, indicating that, among all studied facilities, there is at least one pair with significantly different quality-of-life domain scores on 8 domains. It makes them the preferable choice in comparing facilities in practical applications when we are applying the six-sigma methodology (i.e., more than two standard deviations from the mean value) to find outliers. Two domains (functional competence and dignity) failed to show any significance, which indicates that all observed differences between facilities on these domains were likely to appear by chance. Although the practical value of these domains is very limited when we are comparing facilities, they might be useful in longitudinal studies organized to test the effects of facility-level organizational interventions.

At this point we have used a separate quality-of-life score for each of the 10 domains. A further step would be to create a weighted score to establish a single quality-of-life score for each facility. This would render comparisons simpler, but might, in fact, obscure differences of interest, especially when the nursing home profile has both positive and negative deviations, a situation that occurred 30% of the time in our data (See Table 3). Moreover, from the perspective of the quality-assurance requirements, nursing homes are accountable for outcomes in specific areas such as dignity, rather than the general construct Quality of Life.

One concern with using domain averages is that if these average scores are near the upper or lower ends of the possible range, a ceiling or floor effect could result, and we would not be able to detect any positive or negative deviations. However, the results in Table 2 show that this potential threat did not occur. None of the average facility-level domain scores approached either end of the possible range.

A next step will be to establish the validity of these results by comparing the facility performance in quality of life to other markers such as deficiencies in survey results and the quality indicators derived from the MDS. In addition, it is worth exploring whether quality of life is influenced by a variety of administrative policies, such as staffing and program emphasis. For this we will be using a different sample of facilities that is more likely to include facilities with bad regulatory histories, which will not have the bias that comes from using a volunteer sample.

The analysis of the effect of adjusting for residents' characteristics suggests two things: (a) The proportion of total variance explained by the facility is modest, so resident factors undoubtedly explain a substantial amount of the variance; however, (b) adjusting for the characteristics does not importantly influence the amount of variance explained by the facility. In other words, there is a small but important facility effect, which is separate from the effect of the residents.

Some limitations of this study should be acknowledged. We have used z-scoring techniques to simplify the data presentation, but these transformations could create apparently large differences when the actual differences in scores are small. Care must be taken in interpreting the relative performance of any facility. However, if attention is focused on patterns across domains, the risk of overinterpretation is minimized. The sample used for this study was not intended to be nationally representative. The emphasis here was on developing a method for aggregating resident-level quality-of-life domain scores to the facility level. The actual findings are perhaps less noteworthy than is the approach used, because this sample of facilities was artificially created to emphasize single rooms and rural status. The general lack of statistically significant relationships between quality-of-life domain scores and nursing home characteristics may be attributed in part to the limited sample size. The sample itself is atypical of the general distribution of nursing homes in the United States. The lower proportion of proprietary homes may reflect the effect of this sampling. Further work is needed to see how well this approach can discriminate among a larger, more representative sample of facilities.

While analyzing the possibility of various quality-of-life domains to discriminate facilities, we found that difference in patient characteristics between facilities (the most commonly used reason for adjustment) changed the resolution of the comparison very little. A much greater source of variation was attributable to the measurement process (i.e., allocation of interviewers to facilities). Because the study had not been planned to examine this effect, we had to rely on a post hoc analysis of this observational study with a very unbalanced design and a sample size that was insufficient to test this effect. Nevertheless, we demonstrated the importance of the measurement process by comparing the relative variances associated with two random factors (we found Interviewer and Facility factors to have comparable variance). If confirmed, this finding would affect the process of comparing quality-of-life domains among facilities. Such studies should be designed to prevent confounding the Facility and Interviewer factors to allow their effects to be separated.

The interviewer effect is troublesome. We spent a week training the interviewers in uniform approaches to interviewing and establishing that they met the performance criteria as observed by their trainers. The logistics of the study made it much easier to concentrate a few interviewers in each facility, thereby increasing the effects of inter-interviewer differences, but also making it harder to distinguish among interviewer, facility, and state effects. Because different groups of interviewers worked in different states, we cannot say for certain how much is an interviewer and how much is a state effect. In an operational model it will be important to use the same interviewers across many facilities to at least convert potential bias into error. We were pleased in later work in this contract to see that it was possible to train both nursing home staff and surveyors in how to conduct resident interviews. The levels of interrater reliability (based on a different sample of residents) for domain scores between those trainees and our own interviewer staff were of the order of 0.61 to 0.75.

Even though the amount of overall variance explained by facilities is small when other factors are considered, it is important. Whereas resident characteristics are critical in predicting quality of life, some portion of the variance remains under the control of facilities. They thus can be held accountable for this important aspect of nursing home life, if appropriate case-mix adjustments are made in the analysis of the results. The specific pattern of strengths and weakness across the various domains may be especially informative for quality-improvement efforts.

This project suggests that it is possible to collect data on nursing home residents' quality of life and to use that information to discriminate among facilities. Such data can expand the dimensions of quality and can play an important role in helping people choose among nursing facilities. The feasibility demonstrated by this work has led to active discussion about incorporating quality-of-life items into MDS 3.0 and surveyor procedures.


    Footnotes
 
This research was supported by a 1998 contract from the Centers for Medicare and Medicaid Services (CMS; under Master Contract 500-96-0008). The content and opinions are solely those of the authors and should not be construed as representing the policy of CMS or the University of Minnesota. We thank our CMS project officers, Mary Pratt and Karen Schoeneman. We also thank M. Powell Lawton, an investigator on this project until his death in 2001; the articles based on Wave 1 of this data collection are gratefully dedicated to his memory. Back

1 Division of Health Services Research and Policy, University of Minnesota School of Public Health, Minneapolis. Back

2 Department of Health Services Administration, Center for Bioethics and Health Law, University of Pittsburgh, PA. Back

3 Robert C. Byrd Health Sciences Center, West Virginia University Institute for Health Policy Research, Morgantown. Back

4 Department of Psychology, St. Cloud State University, MN. Back

Decision Editor: Linda S. Noelker, PhD

Received for publication April 4, 2003. Accepted for publication October 21, 2003.


    References
 TOP
 Abstract
 Background
 Methods
 Results
 Discussion
 References
 




This article has been cited by other articles:


Home page
Journal of Health Politics, Policy and LawHome page
E. A. Miller and V. Mor
Balancing Regulatory Controls and Incentives: Toward Smarter and More Transparent Oversight in Long-Term Care
Journal of Health Politics Policy and Law, January 1, 2008; 33(2): 249 - 279.
[Abstract] [PDF]


Home page
Journals of Gerontology Series B: Psychological Sciences and Social ScienceHome page
R. L. Kane, R. A. Kane, B. Bershadsky, H. Degenholtz, K. Kling, A. Totten, and K. Jung
Proxy Sources for Information on Nursing Home Residents' Quality of Life
J. Gerontol. B. Psychol. Sci. Soc. Sci., November 1, 2005; 60(6): S318 - S325.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Services
Right arrow Download to citation manager
Citing Articles
Right arrow Citing Articles via HighWire
PubMed
Right arrow PubMed Citation


HOME ARCHIVE SEARCH TABLE OF CONTENTS