
Performance Indicators
for
Mental Health Services
Values, Accountability, Evaluation, and Decision Support
Final Report of the Task Force on the Design of Performance
Indicators Derived from the MHSIP Content
June 5, 1993
TABLE OF CONTENTS
TASK FORCE ON THE DESIGN OF PERFORMANCE
INDICATORS DERIVED FROM THE MHSIP CONTENT
Rand L. Baker
Oklahoma Department of Mental Health and Substance Abuse Services
P.O. Box 53277
Oklahoma, Oklahoma 73152
(405)271-8666
(405)271-7413 FAX
Judith Cook, Ph.D.
Thresholds Research and Training Center
2001 North Clybourn, Suite 302
Chicago, IL 60614
(312)348-5522
(312)348-4416 FAX
Trevor R. Hadley, Ph.D.
Department of Psychiatry
University of Pennsylvania
3600 Market St., Room 717
Philadelphia, Pa. 19104
(215)662-2886
(215)349-8715 FAX
Edna Kamis-Gould, Ph.D.
632 Bryn Mawr Ave.
Penn Valley, PA 19072
(215)667-3569
(215)667-8415 FAX
Walter Leginski, Ph.D. Division of Demonstration Programs, CMHS
5600 Fishers Lane
Room 7C-08
Rockville, MD 20857
(301)443-3706
(301)443-6349 FAX
Ted Lutterman Director of Computer Operations & Research
NASMHPD
1101 King Street, Suite 160
Alexandria, VA 22314
(703)739-9333
(703)548-9517 FAX
Jack Morgenstern, M.D.
VP for Medical Affairs
Hallmark Health Care
P.O. Box 723049
Atlanta, GA 30339-0049
(404)933-5539
Ron Norris
316 Cox Road
Newark, DE 19711
(302)774-3504
(302)999-3969 FAX
Gregory B. Teague, Ph.D.
New Hampshire-Dartmouth Psychiatric Research Center
2 Whipple Place
Lebanon, NH 03766
(603)448-0126
(603)448-0129 FAX
Robin Turpin, Ph.D.
JCAHO
1 Renaissance Blvd.
Oakbrook Terrace, IL 60181
(708)916-5923
(708)916-5644 FAX
Jonas Waizer, Ph.D., Chair
Senior Vice President
F.E.G.S.
510 Sixth Ave.
New York, N.Y. 10011
(212)366-8024
(212)366-8033 FAX
EXECUTIVE SUMMARY
In its Seventeen year history, the Mental Health Statistics Improvement Program (MHSIP) has enjoyed advancements in two areas: enhancing mental health statistics and information Systems and supporting the use of statistical information in the management and study of mental health programs. Data-based decision making will probably not continue to advance, however, unless there is a significant increase in use of the data generated by MHSIP-consistent systems.
Input from the MHSIP community has suggested that performance indicators (PIs), derived from the content of MHSIP can provide managers with important information and analytic capability. PIs can also reinforce accountability, evaluation and data-based management decision making.
At the request of the MHSIP Ad Hoc Advisory Group, a task force was convened and charged with
o the development of a conceptual framework and a model of PIs that can be derived from the MHSIP content, and
o preparation of a report that incorporates the conceptual framework, choice of indicators and their use, and recommended format for presentation of indicator data.
The Task Force consisted of a broad representation of potential users of PIs, including SMHAs, public and private service providers, families of and advocates for persons with mental disorders, academia, and the Joint Commission on Accreditation of Health Organizations (JCAHO).
Early in its deliberation, the group decided to
o emphasize the multiple perspectives and differential needs for indicators,
o underscore the importance of going through the process of participatory development of a System of PIs, and
o present a set of scenarios that reflect multiple perspectives and
corresponding differential sets of indicators.
Purpose
The purpose of PIs is to communicate meaningful, important, data-based performance information in concise terms. The information communicated as ratios or rates can reflect available resources, processes, or outcomes, in order to assess general performance, assist and support management functions, and maximize responsiveness to service needs or legislative mandates.
PIs and values. PIs should reflect the performance of the mental health service system in the areas that are most valued by the different constituencies who feel ownership or have stakes in the service System.
PIs and policy. Policy is typically an articulation of what ought to be and of what is or is not desirable. In this sense, policy codifies the values in human services by guiding the actions of the service delivery System. Since PIs, by definition, reflect actions, they should also reflect policy implementation and its effects.
PIs and responsiveness to the needs of people with mental disorders. The mandate to the mental health system is to meet service needs of defined populations and subpopulations. PIs help determine how persons with mental disorders, their families and their communities are being served.
PIs and resources. Every translation of a policy into action requires the use of resources, whether money, physical property, or staff time. One of the most fruitful uses of PIs is to measure the volume and efficiency of the resources consumed.
PIs and impact. PIs are invaluable mechanisms for ascertaining whether policies and use of resources have produced the desired effects and whether the effects were of sufficient benefit.
PIs and decision support. PIs are powerful tools for decision support. They are a robust and parsimonious way of reducing and presenting a large volume of data in a way that assists in making decisions about, for example, allocation of and accountability for resources, compliance with mandates, choice of service providers, etc.
Effective PI systems. As PIs are developed, it is useful if the following principles are forefront.
o Explicit relationship to values and policy
o Sensitivity to context and environment
o Participatory development
Incorporating these principles into ongoing Systems is likely to increase their
effectiveness.
A Conceptual Framework for Performance Indicators
There are a few key principles that should govern P1 systems.
o PIs should reflect values and policies.
o The process of selecting PIs should ensure flexibility and the ability to shift to measures that reflect the most important issues and policies at a given time.
o The logical sequence and process of selecting PIs consists of four phases:
oo Identification of the "need to know," which is determined by policies, objectives and political dynamics;
oo Incorporation of information needs and a conceptual framework for PIs;
oo Articulation of management and stakeholder questions and concerns; and
oo Development of a corresponding set of agreed-upon PIs to assess policy implementation and answer the most important questions.
o PIs should be ratios and rates-not raw numbers-and should not be merely descriptive statistics; rather PIs should help show the degree to which service Systems perform as intended.
o PIs are largely organization-based because organizations are the source of the data. This should not diminish the importance of consumer-centered Systems of care. Being organization-based, PIs provide Systems managers with leverage for shaping the performance of individual organizations.
o Frequently, PIs raise questions, e.g., about causes of the performance level shown. A combination of related indicators could suggest an explanation for the revealed performance level.
o Performance is multi-faceted, and different aspects of performance are not independent of each other. Because of possible trade-offs, an emphasis on only one aspect of performance (e.g., efficiency) could be at the expense of another aspect (e.g., effectiveness) and, therefore, should be avoided.
There are several types of deteminants of Systems of PIs:
o The selection of specific PIs is influenced by the perspective of
the reviewing body, i.e., the entity conducting the analysis.
o The three primary uses of PI data are 1) shaping of behavior,
2) gauging performance in terms of congruence with standards or contractual agreements,
and 3) analytic, as in research.
o Performance is always measured from one of three basic comparison
points:
oo Performance of the same unit over time,
oo Performance level across units, or
oo Comparisons against an a priori value, such as a goal, standard, or
norm.
Concerns to be addressed via PI data could center around broad subjects of analysis, such
as target populations, services being provided, quality of services provided and the
viability of organizations providing the services.
Proposed Paradigm
Mental health system managers have concerns that reflect three dimensions of
performance - responsiveness to need for services, efficiency, and effectiveness -
and typically focus their concerns on two levels of measurement, or units of analysis -
client characteristics and behavior, and organizational characteristics and behavior.
Integrating the type-of-performance with level-of-concern results in a two dimensional
matrix of performance indicators which compare across either client type or across
organizations.
Responsiveness is the congruence of the service structure, activities, and clientele with
assessed needs; efficiency is the volume of output achieved, given the resources provided;
effectiveness is the extent to which the outcomes were achieved through use of the
available resources. Responsiveness, efficiency, and effectiveness are assessed across
client cohorts, or other groups of recipients of service, or across organizations and
organizational units.
System integration is another concern within each cell of the three-by-two matrix of
dimensions of performance by units of analysis. System integration includes clients' needs
for generic services, linkage, and referral patterns and other measures of the degree to
which performance has transcended a traditional organizationally-based system of care.
The matrix purposefully does not include measures of compliance. This exclusion is based
on the notion that any measure of performance can be built into legal requirements,
policies and procedures, and standards of care. Compliance is, therefore, not a dimension
of performance and its measures could reflect performance in any cell of the six~eli
matrix.
In sum, the proposed paradigm consists of a matrix of three dimensions of performance by
two levels, or units of analysis. System integration is another level within each cell of
the matrix. Compliance could involve any of the cells and subcells of the model.
Development and Utilization of Performance Indicators
Three themes permeate the topic of development and utilization of PIs:
o the need to focus on utilization and its relations to and impact on
policy,
o the importance of the process of the design and implementation of PI
systems,
o the logical and most productive sequence of development of a PI System.
The wide range of intended uses of PI data tend to fall into three categories:
oo continuous improvement of performance and service delivery through
both comparison of one's performance with that of others and through periodic assessment
and a self-correcting process,
oo gauging performance and service delivery against contractual
agreements, regulations, or standards -- to maintain or secure accreditation, licensure,
or future contracts, or
oo participation in basic or applied research for describing, predicting,
or explaining performance.
The motivation to develop and use PIs appears to be either theory-driven (i.e., how things
work best), or policy-driven (i.e., how things ought to work). Becoming fully aware of
these motivations and articulating the intent behind the development and implementation of
a P1 System are essential to maximum utilization of the resulting data.
The ideal environment for the development of a system of PIs is one in which:
o Intents of all stakeholders are articulated.
o There is a culture of respect for and constructive use of data.
o Changes are accomplished through participatory development.
o Resistance is reduced through an open discussion of any misgivings
about the PIs and implementation of safeguards that address those misgivings.
The development of a sound System is contingent on a shared process tailored to the
specific context and situation. A tailored, participatory process integrates the interests
of all stakeholders. It also maximizes the relevance of the formal set of indicators to
whatever is most important in a particular situation at a particular time.
A sound and logical sequence of developing a PI System consists of four components:
o identification of the developers of the system and their perspectives;
o articulation of stakeholders' questions, issues, and concerns to be
addressed by the PIs;
o selection of PIs; and
o advanced decisions about the use of resulting data.
Different audiences may require different presentation strategies. The presentation of PI
data should be designed to facilitate the reading and communicate the meaning of the
results. Often, this is best accomplished through graphical portrayal of the findings.
Carefully crafted decision rules should be developed in advance and applied uniformly in
the utilization of PI findings. The rules should be used to define high and low
performance and determine how levels of performance should be related to consequences,
such as the allocation of resources and service contracts.
The final step in the implementation and use of PIs is the evaluation and possible
revision of the PI system.
Practical and Technical issues
A host of practical and technical issues should be considered in the development and
implementation of a PI system. Addressing the following issues can help avoid common
pitfalls and enhance the system to be implemented.
o Planning - political and organizational considerations
related to the role of quantitative measurement in a given management system and how to go
about selecting and implementing PI's
o cost. burden and system capacity - direct and indirect
costs of the development, maintenance and management of both on-going and new data
collection that is necessary to produce PIs
o the need for multiple indicators that reflect all
important aspects of performance with minimum duplication and redundancy
o technical measurement issues related to error,
validity, reliability, reactivity, range, variation and sensitivity to change, sensitivity
and specificity of classification measures and appropriate interpretation of results
Illustrative Scenarios
Seven scenarios of PI use are identified to underscore the themes presented throughout the
report, and to illustrate the logic of relating perspectives, management questions,
corresponding PIs, and the use of resulting data. There are no "right answers"
or "right indicators" for each of the scenarios. The presented sets are examples
of potentially meaningful and useful measures.
The sample scenarios reflect a wide variety of organizational structures, perspectives,
concerns, stakeholders, and policy positions. The seven scenarios are:
o an SMHA;
o a local area, such as a county that contracts with private service
agencies for the provision of mental health services;
o the perspective and concerns of both consumers and their advocates;
o a psycho-social rehabilitation service agency;
o a private, multiple-location mental health service corporation;
o a private managed care organization; and
o assessment of compliance with two legislative requirements of PL99-660.
Congruence with MHSIP Data Standards
The process presented in this report is driven by MHSIP, but is likely to result in an
extensive set of useful indicators, some of which draw upon data that are outside the
current MHSIP content. Data not currently in MHSIP, but of interest to mental health
organizations in generating PIs include the following:
o need data that require general population statistics and
epidemiological findings;
o support and generic services statistics, consisting of data collected
by other health and human services entities that are involved in providing services to
persons with mental disorders;
o consumer outcome measures, such as increased level of functioning,
improved quality of life, and satisfaction with services received
Desirable systems of PIs will, therefore, have three relevant features:
o Their content will include data reflecting services and service
organizations beyond those operated and funded by SMHAs.
o The measures will reflect current trends to include information on all
services to consumers of the specialty mental health system, mental health services
provided to consumers of other health and human service agencies, information that fosters
decision support, and information that promotes a consumer-centered service system.
o As much as possible, data items will be consistent with the MHSIP
standards and will enable cross-system comparisons.
INTRODUCTION
Background
In its fifteen year history, the Mental Health Statistics Improvement Program (MHSIP) has
enjoyed advancements in two areas: enhancing mental health statistics and information
systems, and supporting the use of statistical information in the management and study of
mental health programs. Three factors bolstered MHSIP's recent accomplishments:
o endorsement of MHSIP standards by the National Association of State
Mental Health Program Directors (NASMHPD)
o documentation of both the philosophy and standards of MHSIP in the
National Institute of Mental Health (NIMH) publication Data Standards for Mental
Health Decision Support Systems (FN-10)
o funding by NIMH of MHSIP implementation grants to the states
Data-based decision making will probably not continue to advance, however, unless there is
a significant increase in the use of data generated by MHSIP-consistent systems.
Members of the 1990 MHSIP Implementation Task Force and many in the MHSIP community have
suggested that PIs, derived from the content recommendations of MHSIP, can provide
managers with important information and analytic capability. PIs can also reinforce
accountability, evaluation, and management decision making. The group that produced the
fiscal indicators in FN-10 took a preliminary step in this direction. A similar, but more
comprehensive approach could offer more, showing how individual items of content can be
combined into a variety of ratios, indices, and formulas to measure different aspects of
performance.
Most state mental health agencies (SMHAs) have made progress implementing MHSIP. Continued
support for implementation, however, will require the demonstration of management payoff,
that is, assurance that MHSIP content improves acquisition, distribution and defense of
resources, and the monitoring and evaluation of service programs. PIs derived from
MHSIP-consistent statistics enable managers to pose and analyze complex questions and
realize measurable benefits. To that end, the MHSIP Ad Hoc Advisory Group recommended the
convening of a task force consisting of representatives of SMHAs, public and private
service providers, families of and advocates for persons with severe mental disorders PSMD1,
academia and the Joint Commission on Accreditation of Health Organizations (JCAHO). In
response, Jack Burke, M.D., Director of DASR, NIMH(1),
approved in March 1991 the convening of and funds for the Task Force on the Design of
Performance Indicators Derived from MHSIP Content. The Task Force consisted of the eleven
members who prepared this report, whose names and affiliations are listed on page i.
Approach
The overall goal of the Task Force was to design, develop and deliver a document that
shows how the content of FN-10 can be used to generate a variety of PIs for mental health
program management and decision making. The charge to the group included the following:
o Review the charge and related documents, develop a plan and propose a
model for a set of mental health PIs.
Address the purpose of PIs, deciding whether the indicators should be descriptive or
valuative, whether the model should be multidimensional and, if so, what dimensions should
be represented, what additional data (other than MHSIP) to incorporate, etc.
o Develop a consensus concerning the number and choice of a minimum, core
set of indicators per category (component) of the model and suggest formats for the
presentation of indicator data.
o Prepare a report for the MHSIP Ad Hoc Advisory Group that incorporates
the conceptual framework of the PIs, choice of indicators and their use, recommended
format for presentation of indicator data, and recommended support activities and
processes.
As directed by the MHSIP Ad Hoc Advisory Group, the Task Force approached its assignment
in several phases. First the group reviewed its charge, work plan, expected schedule,
tasks, assignments, and general orientation. Next the group reviewed and discussed all
relevant background material, including the following:
o results of the survey of technical assistance needs
o pertinent developments in provider and auxiliary level
settings
o a sample of models of PIs
o other prior work in the area
The most pressing questions, the answers to which would determine the final product, were
identified as follows:
o Who will be the audience for PIs to be developed?
o What is the desired model?
o What specific data should be included?
The group's activities, individually and in five meetings over fifteen months, consisted
of the discussion and development of the answers to these questions and of alternative
models. The models varied in complexity (e.g., number of dimensions), content (specific
ratios), and emphasis (e.g., whether to include compliance with mandates). Deliberations
also weighed the merits of a general model against those of models tailored to specific
situations, as exemplified by The State Comprehensive Mental Health Services Plan Act of
1986 PL99-660).
Early in its deliberation, the group re-examined its charge and intended product. majority
of the group advocated, one, an emphasis on multiple perspectives and differential needs
for indicators and, two, a product consisting of scenarios of such perspectives and
corresponding differential sets of indicators. This decision and the rationale for it are
explained later in this document.
Literature Review
The task force reviewed the available literature on the development and use of performance
assessment systems and PIs for mental health services. while PI models have been used in
many areas (e.g., economics, engineering, agriculture), the review for this report is
limited to mental health services.
Overall, the literature on PI models in mental health reveals a peak of studies in the
early 1980s (Hadley, et al., 1983, Kimmel, 1983) and a renewed interest in the late 1980s
and early 1990s (Skinner, et al., 1988, Anderson, 1991, Barrett, et al., 1992). An
emerging literature on the use of PIs for Total Quality Management (TQM) has added to
emphasis on productivity and efficiency with a focus on consumers' values, employees'
contributions, and quality as an operating strategy (Deming 1986, Walton 1986, Peters
1987).
Windle (1986) defined program performance measures as "operational specification of
how well an organization is functioning along one or more dimensions that represent agreed
upon goals or values of the program." He added that "these measures are expected
to be quantitative, objective, and calibrated against some standard(s) that permit
comparison within organizations over time and between organizations participating in the
program."
What do managers and other stakeholders need to know about the performance of mental
health service programs? Several systems of PIs described in the literature present
different answers to this question. Jacobs and Thompson (1986) described the NIMH's
Operations Management System (OMS) for community mental health centers. In the OMS system,
PIs were selected as indicators of three goals: service accessibility, service
organization's financial viability, and productivity/efficiency. Sorenson et al. (1986)
have developed a set of indicators that addresses four areas of concern: revenues,
consumers, staff, and services. Kamis-Gould (1987a, 1987b) described the New Jersey PI
system, which incorporated Sorenson's areas of concern as the databases and as the sources
for performance measures. The emphasis in the design of the New Jersey system was on what
the measures were to indicate, i.e., the dimensions of program appropriateness, adequacy,
efficiency, and effectiveness.
The literature identifies multiple uses of PIs. Hadley, et al. (1983) described a system
of PIs that was implemented in Pennsylvania and used in allocation of state funds to
county mental health programs. The intent of that system of PIs was to reinforce the
reduction of the number of admissions to state hospitals, increase efficiency, expand the
range of services, especially to PSMD, and encourage prompt submission of reports.
Barrett, Berger, and Bradley (1992) described the Colorado model of performance assessment
and its implementation. That system consisted of five dimensions of performance: financial
viability, productivity/efficiency, community responsiveness, comprehensiveness, and
consumer/patient outcomes. The Colorado system was designed for performance contracting.
Recognition of potential pitfalls lead to a stage-wise implementation and the use of
safeguards, such as the development of baseline data and avoidance of sanctions for a
predetermined period.
Anderson (1991) described PIs as a shared "vocabulary of performance" for
internal management and continuous quality improvement (CQI), benchmarking via comparisons
with other service providers and accountability to consumers. He emphasized that the
choice of PIs should be determined by the mission of the organization and the suitability
of each measure to gauge whether the organization carries out its own mission.
Rosen, Miller and Parker (1989) promoted the use of PIs for the development of standards
of care. Leff and Natkins (1985) described models of funds allocation, where indicators
reflected performance in terms of equity and need for service. Russell and Cole (1987)
promoted the need to assess outcomes, impacts, and effectiveness.
Kamis-Gould (1987a) stressed two issues: one, the importance of the process of the
design, development, and implementation of PIs and, two, likely tradeoffs between some
facets of performance and others. In the system she described, decision rules about
desirable and undesirable performance always involved more than one dimension of
performance, more than one reporting period, and a confidence level of two standard
deviations. Wholey and Hatry (1992) also suggested the need for the development of
recommendations on the process for implementing effective systems of performance,
monitoring, and assessment.
Kimmel (1983) as well as Wholey and Hatry (1992) stressed that there is no "right
set" of PIs, that performance measurement and monitoring in mental health focuses
attention on some behaviors and outputs (and not on others), and that multiple factors
contributed to the selection process. Kimmel saw the selective attention and tailored
Systems as a weakness and a factor in the dynamics where agencies try to "game"
the system and distort data to appear favorably. Wholey and Hatry suggested that
"creaming" (serving "nice consumers and those who are more likely to
improve) and "gaming" could be minimized by the creation of realistic
expectations, participatory development of P's, implementation of a balanced system of
PIs, and using PIs for comparisons of only comparable programs and consumers. Skinner et
al. (1988), described another risk of using PIs by documenting amplification of errors
through the use of indicators. The authors acknowledged, however, that audits could
alleviate this problem.
What can be learned about PIs from either the literature, or from the collective
experience of members of the task force? Most likely, PIs are here to stay. The quest for
performance assessment and accountability to all constituents is gaining momentum, as
exemplified by federal legislation, e.g., The State Comprehensive Mental Health Services
Plan Act of 1986 (PL99-660) and Senator Roth's bill "Federal Program Performance
Standards and Goals Act of 1991" (S. 20), and by various state initiatives. National
agencies insist on monitoring and evaluating the public use of federal funds and
compliance with legislative requirements, while consumers and their advocates make their
own assessments.
Some systems of PIs, e.g., the one in New Jersey, were implemented and then discontinued.
However, both ongoing and discontinued systems, e.g., the one in Pennsylvania, have shown
multiple uses of PIs and their success as change agents and as instruments for shaping the
performance of mental health service programs. Evidently, all management, whether short-
or long term, consumer-focused, value-added, or market-driven, needs accurate and timely
information to communicate performance, stimulate improvement, increase confidence, gain
and justify resources.
There are several lessons to be learned from past experiences.
1. PIs are important as operational measures of key policies, policy implementation, and
successive approximations toward the desired system of care, within a particular context.
2. PIs should always be dynamic, valuative ratios to be used in cross sectional or
longitudinal comparisons. In other words, PIs should be measures of assessment, rather
than description, with dear meanings of whether high, or low values are desirable. A
corollary of this second point is the importance of longitudinal data and PIs and their
usefulness in steering change and development.
3. Performance is multi-faceted and systems of PIs should include a balanced reflection of
performance--neither only strengths, nor only weaknesses.
4. The process of choosing and implementing PIs is very important. The choice should be
based on a consensus about how the system of care should be changed. Implementation should
be through participatory development and collaboration among all stakeholders. This is
especially important because mental health programs usually operate within a political
system and because data, PIs, and information on performance are often exercised in
struggles for influence.
5. PIs are powerful tools for policy implementation and shaping the behavior of service
programs. To prevent misuse of PIs, reliability and validity of measures must be
demonstrated, the burden of data collection should be minimized, and resulting information
should be used judiciously.
This report departs from and adds to the existing literature in two ways: one, it offers a
synthesis and (at least partial) resolution of issues raised in the literature, and two,
it promotes a generic model for the development of a sound system of PIs, including
examples. The proposed model is generic and applicable to diverse situations. Its content,
however, is likely to vary in response to different situations (context, policy issues,
etc.), as illustrated in the examples. The development and implementation of systems of
PIs that follow the proposed model should advance the implementation of MHSIP and foster
data use in management and policy development.
About Performance Indicators
There are four underlying assumptions about PIs that guide this report.
First, PIs should be developed as a reflection of specific values and policy concerns of
the developers. It follows that the optimal choice of PIs often changes as policies and
knowledge bases change. Because P's are developed for managers to be able to monitor
policy implementation, relevant specific measures are likely to be unique to the context
and policy arena in which they are created. In most cases, PIs should only be viewed in
the organizational and policy context from which they are derived, and interpreted
contextually, not in isolation.
Second, PIs can be used by different audiences for a variety of purposes. They can be used
to self-manage, to increase quality, or to improve productivity. PIs can be used by state,
county, or private insurers to allocate and manage resources in light of the policies to
which they subscribe. PIs can also be used as diagnostic tools in the evaluation of
Systems or agencies.
Third, PIs are always comparative in nature. They provide for comparisons of similar
organizations or consumer populations, comparisons of the same organization or consumer
group over time, comparisons of organizations against requirements, goals set by the
agency, or by an external environment. Thus, PIs are analytic and evaluative, rather than
descriptive.
Finally, it must be kept in mind that PIs are inter-related and that one aspect of
performance (e.g., efficiency) is not independent of others (e.g., effectiveness). The
reading and interpretation of PIs should, therefore, be treated as a system of related
measures and never in isolation.
In sum, the primary way to further the implementation of MHSIP and foster data-based
decision-support systems is through the implementation and use of PIs. This report is the
culmination of the work of the Task Force on the Design of Performance Indicators Derived
from the MHSIP Content, convened by the MHSIP Ad Hoc Advisory Group and supported by NIMH.
This document offers a discussion of key issues related to the design, implementation, and
use of PIs and promotes a generic, multi-dimensional model of PIs for mental health
service systems.
Overview of the Report
Chapter II articulates the purpose of developing and operating a PI system. A conceptual
model for such a system is presented in chapter III and detailed in the paradigm that is
presented in chapter IV. Chapter V outlines everything involved in creating a PI system
and its use, including presentation of results. This is followed by a discussion in
chapter VI of practical and technical considerations. Chapter VII consists of seven
scenarios of PIs that represent a broad range of perspectives, concerns, management
questions, and PI measures. The last chapter is a brief discussion of the congruence of
PIs and MHSIP data standards.
II. PURPOSE
The purpose of PIs is to communicate meaningful, important, data-based
performance information in concise terms. The information communicated as ratios or rates
can reflect processes (e.g., staff productivity), outcome (e.g., average functional
improvement per consumer discharged), or resources (e.g., full time equivalent direct
service staff per 100,000 population). The most common goals of PIs are the following:
o Assess general performance.
o Assist and support management in allocating resources, monitoring
services, and evaluating impacts.
o Account for and assess responsiveness to service needs or legislative
mandates.
Monitoring and assessment functions can be performed by several groups:
o managers
o professional peers
o consumers
o advocates on behalf of consumers and their families
o quality assurance organizations
o funding authorities
PIs can be thought of as a funnel that transforms several sources and types of data into
concise, useful assessment information.
Performance Indicators and Values
PIs should reflect the performance of the mental health service system hi the areas that
are most valued by the different constituencies who feel ownership or have a stake in it.
Some stakeholders may be interested in whether the system relieves human suffering; others
may value whether the system turns a profit while it operates; still others may care
whether it serves a reasonable number of people with the resources it has available; and
most stakeholders will have a multiple set of concerns about the service system. Systems
of PIs should consider this range of stakeholders, what they value about the operation of
the mental health service system and what they want to know about it. The PIs should be as
responsive as possible to these concerns.
The purpose of PIs is to reflect policy formation and implementation in three general
dusters of values and concerns shared by most stakeholders:
o responsiveness to the needs of persons with mental disorders,
o the use of available resources, and
o the consequences and impacts of policy.
Performance Indicators and Policy
Policy is typically an articulation of what ought to be, what is or is not desirable. In
this sense, policy codifies the values in human services by guiding the actions of the
service delivery system. Since PIs, by definition, reflect actions, they are intimately
related to policy. This relation is one of reciprocal influence. Policies define what is
valued and thus what should be measured; PIs measure how well a policy is working. Both
should stem from a single perspective. The examples at the end of this report illustrate
policies that drive the generation of sets of PIs from different perspectives. Data
gathered over time will confirm or question whether each policy is having a desirable or
acceptable effect.
Sometimes, values outface technology and have no ready measures, information systems, or
specific pieces of information that can generate a numerical indicator for an area that is
valued. For example, consumers may value the extent to which they have been accorded
fundamental respect and dignity by the service system. Although a value of genuine
significance, neither the mental health statistical community, nor consumer groups are yet
able to articulate a formula or algorithm to measure it. In 1992, MHSIP began to
collaborate with consumers to address this problem and begin to internalize consumer-based
perspectives into the data standards, which are a fundamental component of MHSIP. The
paradigm proposed in this report will be able to incorporate such measures, once
available.
Performance Indicators and Responsiveness to the Needs of PSMD
PIs help determine how well PSMD, their families, and their communities are being served.
while policies are needed to structure and direct the activities of mental health services
(e.g., establish priority consumer groups to which to target specific services), the
"bottom line" is the people. PIs are a practical and valuable tool for
addressing the bottom line. They deal ultimately with the people by ascertaining whether:
o the service system can adequately meet their needs
o the right persons have been reached by the system
o the priority consumers are served
o services are appropriately tailored to the needs of consumers.
Performance Indicators and Resources
As policy is shaped, articulated, and operationalized, it takes the form of regulations,
procedures, statements of goals and objectives, the establishment of priorities,
governance and advisory structures, and prescriptions about program behaviors. virtually
every translation of a policy into action requires the use of a resource, e.g., money,
physical property, or staff time.
One of the most fruitful applications of PIs is to reflect the volume and efficiency in
the use of these resources. PIs become an empirical shorthand that accounts for how well
resources were used in translating policy into action.
Performance Indicators and Impacts
In nearly every instance in which policy or resource consumption is considered, there is
an accompanying concern about impact. That is, did the policy or the use of resources
produce the desired effects? Was the effect of sufficient benefit? PIs are invaluable
mechanisms for answering these questions and handling a complex array of considerations.
The policy (an operationalized value statement) sets the stage for what is supposed to
happen and thus suggests the types of indicators that should be examined. The indicators
show whether appropriate services were available and the volume of resources that was
consumed. By adding an outcome, impact, or quality expectation, a stakeholder gains some
insight about four impacts:
o whether the policy is sufficient, reasonable, and is being implemented
o the resources consumed to make it happen
o whether the effect was intended and of acceptable magnitude
o whether the service recipients have benefited as intended
Performance Indicators and Decision Support
It should be apparent from the preceding section that PIs can reflect a great deal about
the operation of a particular service program and the service system. Perhaps the best way
to summarize this is to say that PIs are powerful tools for decision support. They are a
robust and parsimonious way of reducing and presenting a large volume of data in a way
that assists in making decisions.
As emphasized above, PIs need to be grounded in values and policies. PIs will help
decision makers feel comfortable with a policy and realize what is not working and what
deserves to be reconsidered or managed better. They can even expose an area in which no
policy has been articulated.
One major responsibility of decision makers is to assure that the service system does what
it is designed to do, e.g., whether it is responsive to the needs of PSMD and of other
consumers. PIs enable managers to monitor and evaluate performance in this domain and make
decisions accordingly.
Another key responsibility for decision makers is the allocation and monitoring of
resources. PIs shed light on the use of resources and, through periodic accounting for the
consumption of resources, become a de facto monitoring tool.
One of the most telling pieces of information for decision making is whether the impacts
produced by operations are ones that are desired and of acceptable quality. if so, the
decision maker has a knowledge base for repeating the impact and possibly improving It. if
not, the PI prompts further questions, examination of other data, etc., to determine what
needs to be done differently.
Finally, PIs should not be viewed as ad hoc or unique to each service program. If
the data items are grounded in MHSIP or other data that are comparable across other
providers, the decision maker can compare performance not only within his or her own
program, but with other similar programs. Such comparisons often shake the complacency of
a manager who realizes that the program's performance looks much different when compared
to others. On a more systemic level, the comparisons can provide the best holistic views
of the system and its components. They can reveal, for example, that many providers are
having similar problems, that some providers are efficiently producing high quality
outcomes, or that some programs need intervention to preclude embarrassment or legal
actions.
This iterative process of empirical measures, policies, and management activities is a
uniquely connected system -- a nexus - in mental health services. If done well, everyone
benefits, resulting in the ability to develop sound policies, select useful indicators,
and use indicators effectively.
Effective Performance Indicator Systems
As PIs are developed, it is useful if the following principles are forefront.
1. Explicit relationship to values and policy
As discussed above, PIs will be of greatest credibility and usefulness if they are
developed in a manner that is sensitive to the values of the stakeholders who will use
them and if they can be explicitly related back to policy. The process by which this
occurs is the focus of the remaining principles for effective PI systems.
2. Sensitivity to context and environment
The mental health field, as a whole, now has more automated information Systems in place
and more data in use than ever before. This is manifested by recent activity around
legislative issues such as comprehensive community mental health planning PL99-660 and
102-321) and the entire MHSIP effort.
Over the past decade, stakeholders in this field have become increasingly proactive in
using data to advocate for and oversee mental health services. Examples include surveys
used by the National Alliance for the Mentally Ill (NAMI) to advocate increased research
and improved services, use of data by providers to improve clinical practices, and use of
data by managers to ensure better accountability and justify budgets. Two fundamental
insights unite all these efforts:
First, there is a concern to serve better the PSMD. Three factors have combined to focus
attention on interventions, treatment, and supports that are less restrictive and more
affirmative of human potential:
o a major shift away from inpatient care,
o dissatisfaction with what was accomplished in the twenty-year community
mental health center era,
o a pervasive belief in the potential of psychosocial rehabilitation for
persons with mental illnesses.
Second, resources are scarce and accountability for their allocation is stressed. Closer
scrutiny is paid to resource use and efficiency, of course, but there is also a greater
openness to a wider variety of constituent perspectives, of family members and consumers,
participating in decisions.
PI systems that key into these contextual concerns and local circumstances will determine
the developmental agenda. For example, the first of the above emphasizes indicators that
track inpatient use. A relevant set of indicators would include the following:
o reduced hospital stay
o increased portal-of-entry controls/reviews
o reduced instances of recidivism
o inpatient care that is limited to only the most severe, dysfunctional
and grave psychiatric disabilities
3. Approach and process
No one "right set" of PIs covers all situations or satisfies all perspectives.
Effective selection and use of PIs depends on multiple perspectives of key stakeholders.
In addition, the development of PI systems should accommodate the evolving nature of
policy, the sometimes precipitous alteration of priorities and developments that evolve
from scientific knowledge.
As different stakeholders are engaged in the process of developing PI systems, two things
will occur. The first, the manifest function, is that their perspectives will be
solicited. The second, the latent function, is that an education objective will be
attained. They will come to understand and be willing to use the indicators. Furthermore,
this understanding will cover not just those to which they have had input; each
stakeholder group will come to appreciate aspects of the inputs and concerns of other
groups. The achievement of both the manifest and latent functions will result in highly
effective use of the resulting system.
Another gain that accrues from a collaborative approach to PI system development comes
from the literature on total quality management (TQM). Systems that are used improve,
self-correct, acquire credibility and develop advocates. With a broad array of users,
there will be continuous monitoring of the results and of the PIs. Such continuous use of
the system leads to sharpened policies, greater efficiency, and improved quality/impact at
the level of service delivery.
Finally, although the input of many stakeholders has been emphasized, the ultimate burden
of producing the data falls to the providers of mental health services. They should always
be involved in the process of PI system development. Aside from the benefits of their
input and the sense of ownership they will develop from the process, a data quality goal
will be facilitated. Specifically, as providers participate and develop feelings of
partnership, they will place a high value on generating data that are of sound quality to
the Pi system and come to recognize that the data they release to the system are treated
with confidentiality (when appropriate), with professionalism and for the accomplishment
of system benefits. In short, they will see a salutary value rather than punitive burden.
Thus, the purpose of PIs is to integrate extensive and detailed data and to transform them
into meaningful and useful information. The resulting information should assist all
stakeholders in assessing whether policies and mandates have been implemented and whether
they had the intended impacts. The most frequent assessments will be concerned with
responsiveness to service needs, use of resources, and the impacts of services. The
findings from these assessments should shape future policies and management activities.
The next chapter presents a conceptual framework for PIs and leads to a discussion of a
generic model, its implementation, and its use.
III. A CONCEPTUAL FRAMEWORK FOR
PERFORMANCE INDICATORS
PIs are the vehicle for capturing and reflecting important characteristics
and "vital signs" of mental health service delivery in a minimal amount of data.
As such, PIs can be a portentous and useful management tool. Because PIs are potentially
powerful, it is important to recognize and describe their key features. This chapter lists
and expounds upon key principles concerning PIs and identifies several types of
determinants that shape the design of a system of indicators. This chapter thus provides a
conceptual framework for a generic model of indicators.
Principles
PIs concisely compare consumers, organizations, and their attributes. Embedded in
the comparisons are implicit values of who should be receiving what services, when, where,
at what costs, and with what effects and policies that are expected to bring about desired
performance. The values may be implicit or explicit, shared or unique, and be varied in
degree of subjectivity and loyalty to specific ideologies.
In using PIs, managers want to assure that their organizations do what they are
supposed to do and do it well. Doing well includes both objective values, such as the
desire for staff productivity and organizational efficiency, and more subjective, or
ideologically-based propensities, such as preference for services in the least restrictive
environment. The policies are likely to reflect the values of key decision makers and
emerging issues that are important to their organizations. Since the importance of
different issues and concerns shifts over time, the choice and employment of specific
indicators are likely to change, as well. The process of selecting PIs should ensure
flexibility and the ability to shift to PIs that reflect the most important issues and
policies at a given time.
There is a connection and a logical sequence that ties shared values and the
derivation of PIs. Values and principles determine priorities and the selection of goals
to be achieved by the system of care. Policies and procedure are established to facilitate
the attainment of the goals in a manner that is consistent with the values and principles.
Therefore, similar values (in comparable environments) are likely to produce similar goals
and policies, which in turn result in a similar set of selected PIs.
Often the sequence and process of selecting PIs consists of four phases:
1. identifying the "need to know", which is determined by policies,
objectives, and political dynamics,
2. incorporating a conceptual framework for PIs,
3. formulating related management and stakeholders' questions and concerns, and
4. selecting and deploying an agreed upon set of corresponding PIs.
The relevant policies, in this approach, determine and influence the "need to
know", the questions asked and the development of consensus about the indicators to
be selected for the set. Findings produced by the PIs should be used in process and
outcome evaluations and the results fed back into possible new management questions.
In all these cases, PIs should not be merely descriptive statistics, but rather help show
the degree to which service systems perform as intended. For example, the number of
elderly who are consumers is a descriptive statistic; the proportion of elderly among
consumers in relation to proportion of elderly in the general population is an indicator
of proportional participation in the services offered and an indirect measure of service
accessibility. Thus, PIs should facilitate change and serve as a proactive management tool
for discerning whether wanted changes have taken place and whether unwanted phenomena have
been eradicated.
PIs, whether measures that reflect a policy of meeting 'minimum standards or of
maintaining market share, should be developed and implemented in response to the most
important concerns and policies. At times, measures of the implementation of key policies
can override the values and belief of individual managers. Unlike quality measures (e.g.,
low infection rate in inpatient setting, timely and comprehensive treatment plan, etc.),
there is no universal agreement about value systems. Values change and new ones emerge,
which is the reason for the importance of the process by which PIs are selected. Some of
the differences in the selection of PIs can be attributed, for example, to the values of
politically- vs. outcome-driven systems.
The ideal process of developing PIs requires that management make policies explicit, so
that desired policy implementation can be operationalized and measured. This, however, may
be problematic because some managers have difficulty formulating their own policies;
others may prefer to keep their policies implicit, in order to maintain the flexibility to
shift emphasis. While some managers may prefer to be presented with a set of rules (rather
than go through the process of articulating their own policies), there is much value and
merit to going through the complete process of articulating intent and translating it into
sound operational measures, i.e., PIs.
PIs are largely organization-based. Organizations are the source of data and the resulting
ratios are primarily a reflection of the organizations that capture and relay the data.
This "reflection" can consist, for example, of both attributes of consumers
served by the organization and of the treating staff. This focus on organizations should
not diminish in any way the importance of consumer-centered systems. Measures that focus
on organizations are important because they provide Systems managers with leverage for
shaping the performance of individual organizations (e.g., through performance contracts)
and, thus shaping the total service system.
PIs provide richness of information for multiple audiences and should be viewed within
their relevant context. The context could be a matter of the audience's perspective,
informational needs, etc., which would also determine the degree of required details. An
analogy could be drawn to information presentation via maps, where a general map (e.g., of
a state) is analogous to a total set of generic PIs, indicators produced on state or
national level, and a detailed insert (e.g., of a city) is analogous to PIs that pertain
to a tailored informational needs, e.g., about compliance with contractual commitments of
a single organization.
PIs, whether directly or indirectly rooted in values and policies, always
"indicate" or reflect a level (or degree). They should always be in terms of
ratios and rates, not raw numbers. For example, staffing levels could be indicated in
terms of direct service FTE, per 100,000 population, or per 100 consumers. It should not
be raw numbers. Interestingly and frequently, PIs raise questions, e.g., concerning causes
for the level shown; they rarely provide answers.
A combination of related indicators could suggest an explanation for the level revealed
(and therefore guide decisions). For example, a measure of low proportion of minority
consumers might be due to a low proportion of minorities in the service area. Services in
a rural area, where clinicians have to travel a great deal to reach consumers, might
explain a high cost per unit of service, etc. A desirable set of PIs can be used to teach
people how to use them. The set should be small, but sufficient to reflect inter-related
aspects of performance. if, for example, the cost per unit of service is high, the set
should contain the measures that could be related and might explain the level of cost.
A particular type of decision, e.g., about allocation of funds, can be based on indicators
reflecting different aspects of performance, whichever is to be reinforced. Thus, a state
mental health agency (SMHA) may allocate funds based on high performance in compliance
with a legislative mandate, in efficiency, in effectiveness, or a combination of the
three. It is important, however, to keep in mind that facets of performance are not
independent of each other and that, because of possible trade-offs, an emphasis on one
aspect of performance (e.g., efficiency) could be at the expense of another aspect (e.g.,
effectiveness). Again, the ideal system should consist of a set of measures that is small
enough to be meaningful and manageable, but reflect all essential and inter-related
aspects of performance.
While performance standards are best developed via scientific data, client satisfaction
must also be included. Measures of satisfaction are applicable to all scenarios of
performance assessment, although they might originate in different intents. For example, a
private corporation might monitor its quality and satisfaction standards in order to
maintain its market share. The Joint Commission might monitor satisfaction because it
views consumers' feedback as an essential outcome measure. The presentation of findings
should be graphic and the interpretation thought provoking.
Determinants of Systems of Performance Indicators
PIs often derive from on-going information systems. They are anchored in specific
points in time and are often used within the context of continuous monitoring. Examples of
indicators range from simple ratios, e.g., unemployment rates, to complex ones, such as
the Gross National Product. A similar range could be expected in mental health, where many
aspects of performance are related to values, policy and compliance with mandates.
The selection of specific PIs is influenced by the perspective of the reviewing
body, i.e., the entity conducting the analysis.
o Managers of a provider organization may review the performance
of their own organizations, for many possible uses, using any type of comparison
identified below.
o Administrators of a state mental health agency
(SMHA) may assess the performance of mental health organizations in light of state
regulations and funding agreements.
o Executives of a corporate entity may review the performance of
their corporate agencies.
o External parties (e.g., JCAHO, NAMI, consumers, etc.) that
transcend the management of an organization may have strong interest related to the
service delivery of an organization, or a system of service, and use PIs to review
providers' performance.
In the context of a decision support system, certain primary uses may be
made of PIs. Three broad categories have been identified, while a number of subcategories
and specific applications are possible within each of these.
o Shaping behavior/performance; revealing levels of
performance, diagnosing possible reasons and identifying leverages for changing and
improving performance. This use is also referred to as Continuous Quality Improvement
(CQl).
o Gauging performance in terms of compliance with laws and
regulations, or against either standards or contractual agreements. This use of PIs
usually involves the application of sanctions, rewards, or punishments, as consequences of
the degree to which performance conformed to expectations.
o Analytic, as in research, the purpose of which is to describe,
predict and explain performance and contribute to generalizable knowledge.
Performance is always measured from one of three basic comparison points:
1. Changes may be assessed over time, e.g., comparison of the same unit of analysis from
one year to the next.
2. Comparison may be across analytic units; consumer groups, organizations,
organizational components, or a group of organizations, through the use of statistical, or
normative base.
3. Performance may be assessed via comparisons with an a priori value, such as a
goal, standard, or a norm (which in turn, can be based on best practice, average, or
minimal acceptable level of performance).
As mentioned above, the specific PIs to be selected must be congruent with articulated
values and policies and promote analyses in response to particular questions, issues, or
concerns. The questions, or concerns, could center around certain broad subjects of
analysis. For example,
o target populations - the recipients of services, e.g.,
whether, or not, intended consumers are being served, differential needs of consumer
cohorts are being served, etc.
o services being provided - their appropriateness and effect
o quality of services provided
o viability of organizations providing the services
PIs appear in a matrix of dimensions of performance and units of
analysis. The dimensions reflect mutually exclusive, major categories of what
decision makers want to know, e.g., whether organizations serve the high priority
populations, whether staff members are productive, whether costs are contained, whether
consumers' level of functioning improve with services, etc. The units of analysis are the
subject of the description and assessment, i.e., whether information compares consumer
groups, or organizations. The content of the PIs, i.e., the types of data used in the
numerators and denominators, will vary according to the combinations of and applicability
to dimension of performance and unit of analysis. The next chapter delineates a generic
model of PIs arrayed by dimension and unit of analysis. The content of the matrix cells
varies according to specific contextual policies and issues, of which examples are
provided in chapter VII.
IV. PROPOSED PARADIGM
Management Concerns
Managers use PIs to monitor the implementation of specific policies (concerns,
goals, objectives, etc.) in those areas of the mental health system for which they have
responsibility. It is critical that measures be developed specifically to provide
information on concerns and policy agendas of a particular management entity or
policy-making body. These concerns are translated into questions, and the questions are in
turn operationalized into ratios of data, i.e., PIs. Despite the necessary specificity of
an indicator to its policy or management context, there are general categories of
indicators. Combining some of these categories in a conceptual framework of system
responsibility and performance yields a tool for developing performance measures.
Mental health system managers typically focus concern on each of two levels of
measurement, or units of analysis: consumer characteristics and behavior; organizational
characteristics and behavior. In addition, the interaction of consumers and organizations,
as well as of both with their respective environments, are critical concerns. An important
example of such interaction pertains to system integration, which is addressed below. For
heuristic purposes, however, these other types of concern can be subsumed under a primary
focus on either the consumer or organizational level. This triangle of concern can be
visualized as in figure 1 (see next page):
The Paradigm
PI measures can also be grouped into three dimensions, or categories of
performance; one, responsiveness to need for services, two, efficiency, and, three,
effectiveness. In all cases such performance measures are expressed as ratios in order to
permit comparison. They are ratios of such things as use, prevalence, resource
consumption, or outcomes. Comparison is made across categories of consumer or of
organization.
Integrating type of performance with level of concern results in a two dimensional matrix
of PIs, i.e., of the possible comparisons across consumer type or across organizations. It
is generally feasible to develop measures appropriate to each cell, but for any particular
policy issue only certain cells may be relevant. In fact, designers of systems of PIs
should not feel compelled to fill every cell of the matrix. The two-dimensional matrix is
described in figure 2.
Figure 2. A Two-Dimensional Paradigm for
Performance Indicators
| Unit of Analysis |
Dimension of Performance |
||
| Responsiveness | Efficiency | Effectiveness | |
Consumer Cohort |
|||
Organization Cohort |
|||
V. DEVELOPMENT AND UTILIZATION OF PERFORMANCE INDICATORS
o
There is a culture of respect for and constructive use of data.o
Changes are accomplished through participatory development.o
Resistance is reduced through disclosure of fears and implementation of safeguards that address those fears.Pareto chart: a prioritized bar graph, most suitable for display of
the order of prevalence of disorders, magnitude of different presenting problems, etc.
Control chart: a trend chart with statistically determined limits that predict
bow much variation in the data is to be expected and when a reaction is warranted. Control
charts are tools used to analyze and monitor process and outcomes. These charts graph
trends in the data over time and include control limits (e.g., standard deviations) that
delimit the extent of variation expected under normal conditions and indicate what would
be considered a significant deviation. A control chart would be a good choice for
displaying admission rates to a state hospital from the various catchment areas around it
and two standard deviations above and below the mean of those rates, because it would
identify rates that are statistically significant from the average and expected rate.
Scatter diagram: these five graphical tools are illustrated in figure 4 (see next page).
Figure 4. Graphical Tools
Use of graphs is increasingly advocated and practiced, but
graphical presentation carries its own matters of technique, frequently ignored. Tufte
(1983) provides a thorough analysis of the strengths and weaknesses of various graphical
techniques in particular applications. Among numerous recommendations, Tufte cautions, for
example, that small sets of numbers may be better presented as numbers. Instances of
graphical excellence are almost always multivariate and give "to the viewer the
greatest number of ideas in the shortest time with the least ink in the smallest
space." Distances and dimensions on graphs should be proportional. Data should not be
presented out of context - e.g., omitting the space representing low values from a graph.
Extraneous decoration should be minimized, and unrelated dimensionality should be avoided
altogether - e.g., three dimensional presentations of two-dimensional data. Although not
all of Tufte's recommendations are pressing concerns for presenters of PI data, the goal
of graphical presentations should be clear, efficient display, free of distraction or
distortion.
Whatever the audience, presentation of data using techniques that are visually pleasing
and helpful will enhance the utility of the information. Over all, the presentation of
results should facilitate interpretation and provoke thought. This is because service
systems are complex and performance levels could be interpreted in more than one way. For
example, funds expended on staff, per 10,000 population, might reveal that more had been
spent on white, than on minority staff. The dollar figures could represent insufficient
minority human resources. They could also result from having newly recruited, younger
minority staff who are paid lower wages than the "long timer" whites. Further
examination of the data would be needed, in order to find the explanation of the
differences. In all cases, to assure accurate interpretation and broad impacts, PI results
must be disseminated to all stakeholders. Wrong interpretation of findings are also likely
to be picked up and disproved if all constituencies review the results.
Dissemination of Findings
Much of what needs to be incorporated in presentation of the data is implicit in
the foregoing discussion. If policies, concerns, data sources, algorithms, caveats, and
results are presented or available along with interpretations or hypotheses, productive
discussion can be encouraged and much unproductive reaction can be averted. If the
processes of planning, development and implementation have incorporated the participation
and investment of relevant stakeholder groups, and if the resulting indicator set is
indeed linked to critical policy issues, the most important audiences will be ready-made.
For particular PIs, such as indicators of client outcome, accompanying case illustrations,
properly identified, can improve understanding of embedded concepts.
Different audiences may require different presentation strategies. Disseminators may want
to think in terms of two key message types, each with its own objective (Joint Commission,
1992). Knowledge communication assumes that the consequences of actions taken by
either individuals or the organization are unknown and that imparting factual evidence of
these consequences would instill appropriate reaction and encourage appropriate action on
either an individual or an organizational level. For example, an organization unaware that
its indigent care rate was lower than would be expected might use this information to
increase accessibility. Persuasive communication assumes that the audience is
skeptical, and must be convinced to take action. Approaches to changing attitudes include
sharing information about expectations or about linkages between behavior and positive or
negative outcomes (Fishbein, Ajzen and McArdle, 1980). Effective persuasive communication
is constructed to provide a set of arguments, appropriately matched with the belief system
of the targeted audience, along with factual evidence designed to support the arguments.
A third type of impact on behavior and performance involves public disclosure of
performance levels. Embarrassment from publicly-known low performance or pressure created
by revealing deviation from the performance levels of other, similar entities creates
strong motivation to shape future behavior in the desired direction. Thus, data and
disclosure of performance information are powerful tools and can be used as leverage in
shaping the behavior of service organizations. This step in the implementation of PI
systems is one of the most important and, providing that previous tasks have been
carefully executed, could serve as an aid to change agents, with relatively little
investment of resources.
Decision Rules and Utilization of Findings
Carefully crafted decision rules should be developed in advance and applied
uniformly in the application and utilization of PI findings. For example, in using a PI
model that consists of three dimensions - responsiveness to need, efficiency and
effectiveness -- an advanced agreement might stipulate the following decision rules:
o High and low performance on any one indicator is defined as two standard
deviations above and below the mean, respectively.
o Designation of high performance is defined as high results (two standard
deviations above the mean) on at least 2 indicators, in at least 2 of the 3 dimensions of
performance, for at least 2 consecutive periods of assessment, and no low (two standard
deviations below the mean) on any 2 indicators within any dimension.
In this example, an organization might have the absolutely highest indicator
values in the area of efficiency, but not count as a high performer unless it is also high
in either responsiveness, or effectiveness and does not perform poorly in any area. This
kind of a rule is designed to underscore the inter-relationships among indicators and the
need for reliable and stable findings. Allowing variability up to two standard deviations,
this decision rule illustrates a somewhat less stringent approach to identifying high and
low levels on any specific indicator. It is more stringent, however, in its designation of
high performance. The example also illustrates that decision rules are a matter of
agreement among all parties concerned. The rules could vary from one system to another and
could change over time. The same system that permitted variability up to two standard
deviations at the time of implementation, might tighten its rules after several rounds of
assessment and limit variability to one standard deviation.
Ultimate Uses of Performance Indicator Findings
There are four categories of the application of PI findings.
o internal use as feedback in the process of continuous improvement
o use in services research in an attempt to understand the determinants of high
performance
o input into modification of existing policies and development of new ones
o organizational and administrative sanctions
The fourth application represents major leverage in influencing and shaping the
behavior of service-providing organizations. It could be in terms of new service contracts
with a high performer, or discontinuation of such contracts with a poor performer.
Applications could also be more thoughtful and complex. For example, a SMHA might want to
integrate PI findings with data on assessed needs for mental health services in catchment
areas served by the assessed organizations. In this case SMHA managers might decide that
high performing agencies in high need areas will receive additional funds and will be
contracted to provided expanded services. Low performing agencies in low need areas will
be de-funded and their service contracts not renewed. Low performing agencies in high need
areas will have to develop corrective action plans, and will be provided with technical
assistance in order to help them improve. High performing agencies in low need areas might
be recognized for their high performance and contracted to provide technical assistance to
deserving low performing agencies. PI findings, in this example, provide managers with a
major tool for both allocation of resources and the shaping of the service system.
Perhaps the ultimate test of the usefulness of a performance indicator system is its
impact on the organization and its policies. Periodic review of the performance indicators
within a policy framework can ensure the continuing relevance of individual indicators and
of the PI system itself. Changes in the political environment, in specific policies, or in
organizational performance or context may require the development of new or revised
performance indicators as old ones achieve their ends, diminish in relevance, or even
produce unintended, negative consequences. The principles and issues discussed in this
chapter, critical to successful development and use of PIs, are equally important in the
ongoing adaptation of the information system to the policy and service system
environments. Continued sensitivity to these key practical and technical matters will help
ensure that the PI system remains useful.
This chapter emphasized and detailed three topics: one, that the total design and
implementation of a PI system should be oriented toward the utilization of resulting
information, two, the importance of the process for a successful and lasting PI system,
and, three, the components and sequence of the steps in the development and implementation
of a PI system. Some details were also provided about the presentation, interpretation and
utilization of results of any performance assessment data. Also mentioned were the utility
of graphic display of data and agreement about decision rules in designation of high and
low performance. The next chapter addresses important practical and technical
considerations for a sound system of PIs.
VI. PRACTICAL AND TECHNICAL ISSUES
Previous chapters defined PIs, offering a structure for conceptualizing
them. This chapter highlights several important practical and technical issues in planning
and implementing PI systems as well as in using the results they produce. The development
process contains many potential pitfalls at various steps. The savvy practitioner will be
aware of and anticipate these pitfalls by heeding the caveats in this chapter.
Planning and Implementing
An important set of issues concerns the practice of planning and implementing
PIs. These include both political and organizational considerations.
In the process of development of a system of performance indicators, there is a prior and
usually tacit question: What is the proper role of quantitative measurement in a given
management system? Although increasing credence and priority are given to objective
demonstration of performance, top managers vary in the degree to which they are
comfortable with routine, widespread dissemination of data indicating organizational
performance in potentially sensitive areas.
This report has repeatedly offered the view that effective indicators are policy-driven,
and thus that they are factors in a political context. In the kinds of complex systems
where PIs may be most relevant, information is an instrument of power in decisions about
appropriate allocation of societal resources. Management style, strategic purpose,
organizational context, and political environment all affect the way PIs can be introduced
into a system, on the degree of penetration of a PI system into the organizational
hierarchy, and on the type of content that might be measured.
Typically, as with implementation of TQM approaches, effective development of PI systems
requires the joint commitment and collaboration of top management with a political vision
and staff with a more technical, even technocratic orientation. Often the more technical
subordinate will contribute some leadership in proposing and developing PIs and PI
systems. This report emphasizes the critical linkage between PIs and policy, and cautions
potential developers about the risk of basing systems too heavily on simplistic
assumptions of rational planning and decision-making. A PI system must be adapted to its
political context, notwithstanding the fact that this adaptation is necessarily mutual,
since a successful PI system will also influence its context. From the point of view of a
subordinate staff member with reporting system responsibility, for example, the political
context includes both the management style and the political contingencies of the top
manager.
The need for broad participation, discussed earlier, has much relevance here. Involvement
of the appropriate range of stakeholders early in the planning stages provides not only
opportunity to clarify attitudes and interests, i.e., relative to data on specific issues,
but also to shape them toward a more productive consensus.
Cost, Burden, and System Capacity
Generation and use of data in a reporting system entails consumption of
resources; part of the planning process is consideration of these anticipated costs in
relation to anticipated benefits.
The use of PIs as a management strategy represents a significant commitment of system
resources. System-wide introduction of new data collection to support PIs requires startup
time for planning, training, and implementation, as well as additional ongoing effort.
Even if a majority of PIs can be constructed from data generated for other purposes, such
as financial accounting, program management, or clinical care, and if much of the cost is
thus "buried," there may be significant effort in transmitting and
analyzing the data, in communicating results, and in dealing with the effects of this
information on the system. Much of this cost may be incurred from the PI set as a whole;
the marginal cost of a single indicator is typically minimal. Nonetheless, the
cost-effectiveness of each PI should ideally be estimated in advance, at least informally,
and alternatives considered.
If a new indicator, however valid, requires substantial new training and data collection
across a large organizational system, or, if a new indicator requires substantial
programming or other data manipulation to achieve integration across the appropriate
universe of organizations, implementers should evaluate the probable costs of these
activities in relation to the probable benefits of the hypothesized changes to be brought
about in the system through use of the data. In many instances, these costs will be
worthwhile investments to achieve valuable ends; in other instances, less expensive
indicators may be warranted.
Two other issues related to costs are redundancy and frequency. Developers considering
inclusion of two measures of very similar phenomena need to weigh the advantages of mutual
validation against the marginal cost of additional information gained; highly correlated
measures may add little in analysis of results. Frequency of data collection should be
relevant to the rate and importance of expected change; too short an interval provides
redundant information, and too long an interval may prevent timely feedback for effective
policy implementation.
Many costs will have tangible, financial value; others may take the form of intangible
burden on staff or clients. A new evaluation or rating process, e.g., a measure of client
need, eligibility or outcome, may not only take staff time and therefore represent
consumption of a tangible resource, but the new activity may also impact staff morale.
Such changes may have positive impact. If a measure provides improved focus or enhanced
justification to clinical activity, it may improve morale along with services. Whenever
possible, developers should pilot-test new data-collection methods to determine their
effects in advance of full-scale implementation.
The issue of cost of measurement is increasingly salient as diverse groups become
increasingly interested in questions of service outcome and system design and integration.
Most current data systems still focus on intra-organizational client and service data and
have not evolved to be able to address these more current concerns. Questions that
conceptually, at least, seem relatively simple, such as the characteristics or even counts
of people who are clients of both mental health and chemical dependency treatment systems,
can become quite costly to answer. Developers may have to be satisfied with modest proxies
while processes for data system enhancements and integration are still in the future. At
the same time, identification of pressing policy and service planning questions can drive
future data system development.
Multiple Indicators
It has been emphasized that no indicator can stand alone. In order for an
individual PI to have validity and thus utility, it needs to focus on a specific concern.
But different aspects of performance in complex systems are interrelated, and measurement
of this complexity requires multiple indicators simultaneously tapping key dimensions. The
framework offered earlier, along with suggested applications and the scenarios to follow,
can serve as a device for planning a comprehensive set.
Participation of a range of stakeholders in the planning process may help to ensure that
an appropriate range of indicators is developed, including relevant data sources other
than providers and provider organizations. Despite the need to minimize bias and therefore
to maximize the objectivity of the data, subjective information (e.g., client
satisfaction) is sometimes important to include in a PI system. These data can only be
validly and reliably provided from the perspective of clients themselves.
Participation of appropriate stakeholders in development will increase the chances for
widespread understanding and consensus around results. Ciarlo et al. (1986) have suggested
that measures that allow parallel measurement from several alternative perspectives are to
be preferred over singl~perspective measures. Given the political context of a PI system,
this argument seems applicable to a PI system as a whole; credible measurement of system
performance may need to take into account the view from multiple perspectives.
Concern with inclusion of stakeholders formerly given too little attention
(e.g., consumers and families) should not obscure the importance of groups that continue
to be central stakeholders for PIs: the direct service providers. They are typically the
source of a high percentage of the data used in constructing indicators. Their commitment
to data quality may be crucial to its ultimate utility. The inclusion of PIs with direct
relevance to the clinical enterprise (e.g., integrated measures of need, treatment
process, and service outcome as aspects of responsiveness, efficiency, and effectiveness)
will help both to keep data quality high and to maintain the linkage between policy and
performance.
Inclusion of multiple indicators does also force consideration of another issue:
prioritization of PIs. Since PIs will typically be used, even if indirectly, in a process
of decision-making about allocation of resources, there will be explicitly or implicitly
some algorithm or other procedure for weighting and integrating findings across multiple
indicators. This, too, is an aspect of policy; the more explicit it is, the more impact it
can have through a PI system.
Technical Measurement Issues
A second area to which developers of PI systems should be alert concerns issues
around the techniques of measurement.
Performance indicators are intended to serve as measures of key aspects of a complex
system. All measurements inherently contain some degree of error; performance indicators
are no exception to this rule, and they are vulnerable to several types and sources of
error. By being attuned to potential pitfalls, by anticipating and avoiding them,
practitioners can ultimately use PIs much more effectively and efficiently.
Many of the kinds of issues involved in the use of performance indicators mirror those
discussed in great detail in the literature on methodology in research and evaluation.
Although particulars of technical discussion typically derive from concerns identified in
the development and use of complex, standardized tests, the concepts are applicable even
to the apparently simple, unitary ratios used in many PIs. And even if the driving force
behind PIs is performance accountability rather than knowledge generation, the essential
activity of using information to draw valid inferences is equivalent in the two domains.
In addition, there are other issues particular to using data interactively in real-world
settings, as use of performance indicators requires. In this section, these technical
concerns will be identified and reviewed briefly. References to relevant works on research
methodology are provided in the bibliography.
Validity
By their very nature, performance indicators are intended to capture and measure
efficiently the most critical dimensions of a system. The first and most important
question is therefore whether an indicator validly represents the behavior or
characteristic of interest. In designing measures, researchers distinguish several types
of validity that are quite relevant to performance indicator system design (Nunnally,
1978; Campbell & Stanley, 1963; Cook & Campbell, 1979).
Predictive validity refers to the degree to which performance on a measure
correlates with later actual performance in a domain of interest, i.e., the degree to
which a measure predicts real-world performance. This aspect of validity is relevant to
indicators designed to measure, for example, capacity, fiscal viability, or need - to
those indicators which, rather than showing current performance, are of interest because
of their presumed relationship to possible future performance or risk. Choice of
indicators with a predictive purpose should be based on their validity for a specific
concern as demonstrated in empirically based research literature.
Content validity concerns the degree to which the sources of data for the measure
are sufficiently representative of the domain being measured. This type of validity may
seem more appropriate to a test of a characteristic such as mathematical ability, in which
a specifiable range of information and skills is to be assessed, than to a performance
indicator, which is typically a ratio of simple, single variables. However, PI system
designers can use the concept of content validity to assess two important questions.
First, is a proposed indicator as central as possible to the content area it purports to
represent? Since the number of indicators must be limited, each PI should do the best
possible job of focusing on the content area of concern. Second, does a subset of
indicators, taken as a whole, represent the range of content within the broad range of
concern? An important feature of PI development, discussed elsewhere in this report, is
that PI systems should if at all possible sample broadly from the universe of possible
behavior. Since PIs will typically be used in a context of sanctions and rewards, a
broadly representative set of indicators will minimize the risk that organizations may
distort performance counterproductively in order to maxi