Informed Decisions: Research Based Practice Notes 360 Degree Feedback:
Methodological Advice from Multiple Sources
Matthew E. Sederburg and Steven G. Rogelberg
Bowling Green State University
Implementing an effective 360 feedback program can be a complicated
process. Many methodological issues (e.g., how many raters are needed from each source,
how to interpret discrepant data between sources, etc.) must be considered. Further, the
manner in which you deal with such issues can have a profound impact on the quality of
your intervention. Much basic research has been done on 360 feedback (such as
selfother comparisons), but unfortunately little or none has examined the optimal
means of implementing and carrying out a successful 360 program. From a methodological
standpoint, practice has been far ahead of research in this area.
Given the current state of affairs, a review of the existing literature
seems ill suited to finding answers to the types of questions we wish to address in this
column. The "state of the art" in 360 feedback lies with those who practice it
regularly and in the knowledge they have gained from years of experience. In order to meet
our goal of providing the best advice possible, we have interviewed four seasoned
practitioners in the area, and present their responses to our questions in this paper.
Wed like to thank Allan H. Church, Manny London, Walt Tornow,
and Fran Yammarino for their participation in this project, and hope that their
words of wisdom enable you to design better 360 programs in the future.
What would you say are the keys to success when implementing a 360
feedback program?
London: "One is getting people involved in the process right
from the start. Then you focus on the survey instrument itselfgetting them involved
in determining what the items should be, writing the items, making sure they understand
what is being measured. 360 is a vehicle for communicating to people what is important,
and so you want to be sure that the performance dimensions being measured are ones that
you buy into as a corporation or as an executive. Facilitation of the results is important
as well, and that might take the form of workshops where people get the results and you
help them work through it individually. Remember though, it is not just a matter of them
getting the results, but encouraging people to share the results with others to make it a
more commonplace experience to talk about performance management issues within the
organization. Finally, there is follow-up. You look at whether people have improved, have
they accomplished the goals they have set for themselves, has their behavior changed, and
has their performance improved over time."
Tornow: (a) Consider the power of purposewhy the organization
wants to use a 360 feedback program. For example, is the feedback going to be used for
performance appraisal or development? Treat development as a system and look at 360 as
one part of that overall system and as a process, not an event. (b) Be longitudinal in
perspective. It takes time to develop. Emphasizing follow-up can also create impact. (c)
Link individual with organization-level changehow organizational change gets
translated at the individual level, and how individual-level change plans get integrated
into organizational business plans. (d) Focus on both raters and rateesinvolve both
in the design and use of the 360 system.
Church: "One key is to make sure that youve determined
the desired outcome of the 360 process, paying particular attention to how the feedback
system is going to be linked to existing and ongoing objectives in the organization. If an
organization decides to do 360 because its a fadwithout knowing whats
going to happen to the data, who is going to have access, and how its going to be
implementedits probably going to flop. Another key is ensuring the
confidentiality of the process. The third is making sure the instrumentation is solid. It
doesnt necessarily have to be developed based on a job analysis or even similarly
rigorous research methods, but it should at least makes sense to the user. At a bare
minimum the instrument should not break the cardinal rules of item writing (e.g., no
quadruple barreled items, no mismatched scales, and no jargony or culturally biased
items). Also, the items need to be linked to some organizational concept that is
meaningful to people. Finally, the delivery of the feedback itself has to be in some kind
of system that is effective and makes an impact. It should not be just a
deskdrop. It is critical to provide the recipient with opportunities for
focused attention on the results. Specifically, feedback can be most effective when it is
delivered in the context of a developmental program with professionals on hand who can
provide interpretation and coaching. Ideally, these coaches should be people who can make
on ongoing commitment to work with the feedback recipients to help them work on the things
they are trying to change, and to provide them with resources to do so. Sometimes, the
relationship can be almost like therapy. You need to have a more extended connection to
the recipient than just a single discussion."
Yammarino: "The number one key is youve really got to
have top managements commitment to do this. It cant be just window dressing;
they really have to be behind it. The second key is that the people who are going to be
involved in the program, both raters as well as the people receiving the feedback, have to
understand that this is being done for developmental purposes rather than performance
appraisal. The answers can get kind of weird from some sources when people know that
its linked to performance evaluations or selection. So, I think if you get top
management commitment to the process, and use it for developmental purposes, these can
really go a long way to making the initiative successful."
How do you feed back to the individual the meaning of his/her data when
large discrepancies exist among sources?
Church: "When the data are not the same, you have to get the
individual to think about his/her current situation, how he/she relates to different
groups of people, and what might be driving the differential set of ratings. Its
really an issue of looking at the specific items, and asking the individual to explore
possibilities as to why the ratings differ among groups. But you have to pull it out of
the personyou cant second guess the whole thing. It can be useful to get the
individual to think of recent, specific behaviors or interactions that may have
contributed to these ratings. Sometimes as a coach you can observe behaviors that your
client cannot see in him or herself. Getting your client to discover these tendencies
through his or her own feedback and experiences will be more effective and valid than
making pronouncements about what you have observed. Again, it can be a little like
therapy; you have to get that person to tell you what the problem is."
Yammarino: "The rating sources dont always see the focal
individual in the same way. And my own personal view is that thats ok. These are
different people, they have different perspectives on that individual and the role he or
she plays, and thats all valid information. So the last thing you want to do is
combine rating sources, or average across them. What you need to do is feed back to the
focal individual what the different sources say. And you present them as independent,
unique pieces of information."
Tornow: "One of the key learnings about 360 feedback is that
there are multiple realities, or perspectives, that different people have. Therefore,
differences are to be valued and understood in order to do a better job in relating to the
different constituencies."
London: "This is part of the process; it is not necessarily
the case that these differences are an indication of unreliability. It could indeed be
that there are distinctly different perspectives. You need to go through the data and see
what it means. It may depend on the questions asked. It might depend on the situation that
the manager is in. The manager may be behaving differently with different constituent
groups. So those are the things that need to be processed as the person gets feedback, and
the recipient needs to be encouraged to think about the meaning of those differences and
try to understand them. He/she needs to determine if they are a valid indication of actual
performance, and then incorporate that into the meaning of the results and the goals that
are set."
How do you interpret the data when large discrepancies exist within a
data source? Do you provide ratees with an index of within source variability?
Yammarino: "Thats a tougher issue. The standard practice
has been to average the ratings. The assumption that youre making in this case is
either one of two things: the raters are all seeing the individual the same, or the
variability in their ratings is really error. The reality is that neither one of those
assumptions may be correct. What we do is report frequency counts, or category responses
on the feedback forms. In this way individuals can see the dispersion of their scores. One
of the interesting things weve found is that you get these splits. You might have a
couple of people clustered on the high end of the response format, and a couple on the low
end. People can then start to get the sense that even within this group, people see me
somewhat differently. And thats useful information."
London: "There is probably some general thinking that within
sources at least there ought to be agreement, but that is not necessarily the case. A
person can behave differently with different subordinates, and the subordinates will have
different views. What all this says is that processing of 360 is a difficult cognitive
and emotional process, and one that needs to be gone through very carefully. You really
need to have a broad understanding as a recipient as to how the results were derived and
to be able to think about what they mean. So you have to look at the items within each
dimension, and understand that there may be disagreement there too because the internal
consistency in the dimension might not be that strong. People need to be educated
recipients of feedback. In particular they need to be educated as to how to process it
cognitively."
Church: "Most of our feedback systems dont focus on this
aspect. Its our belief that we shouldnt provide dispersion information on
direct reports or peers as this will likely jeopardize the confidentiality of the data.
Its particularly dangerous with direct report data because we dont want
somebody assuming that the lowest ratings are all from a single particular person they can
identify. Having said that, we have provided this information for external client ratings,
as the nature of the power in this relationship is different. In a client scenario, if you
see discrepancies, whether its within or between, you have to ask why this is
happening."
What kind of information do you present in the written feedback report,
and how do you lay out that information?
Tornow: "Data should be displayed so that it is easy for the
ratee to quickly see how different rater sources see him or her. These should also be
compared with self-ratings. Norms should be presented as well."
Church: "We use a combination of graphics and text. Typically
this consists of bar charts and tables. And its basically displayed on a summary
level, across the key competencies of the organization. Usually, we provide an overall
summary score followed by individual item means. The feedback is always displayed
separately for each constituency (i.e., self- and each of the various rating
perspectives). Alongside that, we typically provide importance ratings and some type of
norm, which is usually the average for everybody in that organization."
London: "Reports vary across the board. Ive seen some
very complex ones and then I have seen some very simple ones. Generally, something in
between is best. But it is certainly something to be tested within the organization, and
it depends on how you are going to provide the feedback, and how much info and help you
are going to be giving to people in using it. If you are not going to be giving them much
help, it had better be self-explanatory and easily useable. I also think it is good to
have norms presented so they can see how other managers at their level in the organization
did, so they can compare themselves with others. I also like to present some measure of
dispersion, such as a range. Usually the mean is presented, by source. It is important not
to aggregate it across sources, but to report each source separately. The norm might be
presented as the average of all other ratings across sources, or it might be presented for
each source separately; it probably should be for each source separately. It is a
difficult issue and there is no one right way to do it, but presenting something that is
understandable and where people can get help in understanding it is really
necessary."
Yammarino: "Well give the self scores, the average of
the other scores by source, then well give a breakout or frequency count of the
other scores, also by source. What we dont do is report things in terms of variances
or standard deviations, or some of these other indicators, because the reality is that the
managers dont understand it or dont care about it. You want to compute those
things for yourself, but you dont necessarily want to report that information to the
focal individuals who are getting the feedback.
Do you ever use open-ended items on the rating forms? If so, how do you
use this information?
London: "Often we have asked for open-ended measures or
responses at the end of the survey. That information is generally transcribed so that
identifying material is taken out and those verbatim responses are presented as part of
the feedback report."
Yammarino: "What we are doing is asking a general, open-ended
question at the end. We ask a question such as "What are the three best things that
this person does relevant to the survey items, what are the three worst things, and is
there anything else that you want to tell us about this individual?" When we report
back this information it is verbatim, unedited. We literally pipe from the surveys onto
the feedback form. And all we do is identify it by source. We scramble the order, so that
they cant tell that good comment #2 comes from the same person as bad comment
#2."
Tornow: "Yes, open-ended comments can be very useful in
providing some flesh to the quantitative data."
Church: "Yes, but they are more difficult to keep clean. What
I mean is, you have to make sure theyre carefully content coded and analyzed, and
cleaned of all identifying information. As an aside, we prefer not to use verbatim
comments (without content coding) even if cleaned for the same reasons. Actually,
weve found that managers and executives love write-ins. It gives them some
additional context that they wouldnt otherwise have. Weve done various 360
programs with and without write-in comments, and it seems that participants get much more
out of the standard numeric feedback when write-ins are included in their reports. It also
seems easier for them to create developmental action plans for the future. But not
everyone offers their comments on the rating forms, so this type of data can be more
skewed in one direction or another."
What is the best way to select raters?
Church: "Typically, we let the participants choose who will
rate them because the participants are the ones who will either use or not use the data
they receive. While they may not get everything back from everyone using this method, and
they wont know which individuals responded, at least participants will know the pool
of people theyve picked. We recommend strongly that they choose people of different
types in order to get wider representation. If you do it the other way (i.e., someone else
selects the raters), participants wont necessarily know who is in the pool at
allit could be any of a number of people. This can cause credibility and validity
issues with the data. They dont know why there might be discrepancies among the
sources because they didnt pick the raters. Youre left with data without a
tag. Of course, you certainly do run the risk of individuals picking out raters who like
them the best, but if you can establish the credibility of the process and its
confidentiality in the larger system, weve found that you tend to get pretty honest
ratings anyway."
Tornow: "Raters should be selected on the basis of
representing the key constituencies defining job success, and being knowledgeable about
the ratees performance."
Yammarino: "Weve done it a variety of
waysweve had the focal person pick, weve had HR departments pick,
weve done random selection kinds of things, and weve also done this thing
where we get everybody that we can whos in that source group. My experience is that
its the latter two that work best, either some kind of random selection of people or
everybody. With the other methods, people pick raters for a variety of reasons, and not
necessarily to give the most accurate and comprehensive feedback. And so weve not
put a lot of faith in data coming from those types of selection systems."
London: "I think that having the focal individuals choose the
raters makes most sense, particularly initially when you are introducing the process.
Because once again, it reduces defensiveness and gives them a chance to give it to people
who they value, people whose input they care about. As a result they will pay more
attention to the results than they would otherwise. So I think if you want to enhance that
credibility and attention to the results, you can have the ratees give it out to people
they have confidence in."
How many raters from each source do you feel are needed? Is obtaining
adequate rater data ever a problem? How do you deal with it?
Yammarino: "My feeling is that you want to get all the people,
or a reasonable random sample of the people, in that source. The other rule of thumb we
use is that if we get fewer than 3 or 4 responses from any particular source, we
wont present the breakout data. We might even be reluctant to report mean data,
because at that point the data can be skewed or colored in some way. The anonymity of the
rater also goes out the window. If youre going to do a 360, and you cant
identify a reasonable customer pool, you might want to consider not presenting data from
that source. Likewise, if a person only has one direct report, you dont want to
report that data because you can no longer protect anonymity. In the end we use reasonable
judgment. You want to be able to get accurate information and protect the anonymity of
sources, but you dont want to force the issue just to have a complete set of 360
ratings."
London: "I think I have seen three, four, and five used in
various companies. Four is probably a good number. Three is too small, I suppose,
particularly if measures of disbursement are used, if youre presenting the frequency
of responses per item. Four is probably a reasonable number to protect the anonymity of
the rater, as well as getting a fairly reliable data point."
Church: "At least threethree peers, three direct
reports, and three clients. This can often be a problem. The worst case is you dont
give them the data from that group. We do lots of follow-up, and try to get as much data
as possible, but if you dont have enough responses, you simply cant give them
the data. One client of ours has actually forced us to collapse ratings together on peers
and direct reports when one of them is incomplete, but we typically dont advocate
that because then youre mixing up perspectives, making interpretation even more
difficult. On the flip side we have had clients request as many as 15 direct reports and
peers each to assess them. This can also be problematic though because of central tendency
effects."
Tornow: "A minimum of three, preferably five, to assure
anonymity."
What techniques, if any, do you use to improve your rater response
rates?
London: "Response rates have been pretty good in the programs
that I am familiar with, 405060%, some as high as 90%. So you do the normal
things by explaining what this process is for, encouraging people to participate, and
follow up if necessary. Increasingly, 360 is being administered electronically, and that
makes it easier for people to respond."
Church: "In our work, we have found that 360 rater response
rates are not nearly as problematic as survey response rates. In fact, its not
really a problem at all. Nevertheless, we do lots of follow-up, stressing confidentiality.
Basically its by telephone, fax and/or e-mail. Some organizations are culturally
very good at responding, and some are consistently poor. The general level of response
depends on a few factors, such as (a) the developmental orientation of the organization,
(b) how often the organization does other types of data collection efforts (i.e., people
can get simply sick of responding), and (c) whether or not raters are comfortable with the
process."
Yammarino: "Weve had tremendous luck with response
rates, even with customers when they have been included in the process. If you get top
managements commitment, and people know that the feedback is for developmental
purposes, people seem much more willing to participate and respond. Even before the 360
process begins it is important to get the word out that top management values and endorses
the feedback process. Then, you need to let people know that the 360 process is coming.
When the feedback materials arrive, they should include a cover letter that describes the
feedback process in detail. This letter should lay out the purpose and importance of the
360 process, as well as specific instructions for completing the questionnaires.
Moreover, the letter should firmly state top managements commitment to the process
and reconfirm that the feedback will be used solely for development. Finally, appeals,
sponsorship and support are also important to include in the cover letter. When it comes
time to follow up, postcards, memos, and phone calls are all effective techniques."
Though our experts did not agree on every issue discussed here, several
common themes seemed to emerge. Commitment and buy-in from top management and the
participants in the 360 program is crucial. The decision to make use of the feedback
ultimately lies with the recipient, so gaining his/her confidence and establishing
credibility is vital to the success of the project. The rating form itself should be
designed to reflect organizational goals and relevant performance dimensions in order to
communicate what the organization holds important. Furthermore, the use of open-ended
items in the form can be useful, as they provide valuable information that would not
otherwise be available. When selecting raters, choosing people the feedback recipient
views as credible and who can accurately report on the relevant performance dimensions are
the most important qualities to look for. Concerning interpretation, each source, as well
as each rater, has a story to tell. Effective coaches, in turn, facilitate the process
whereby the participant uncovers and understands the underlying themes and messages from
each data source. Although aggregating data within a particular source is necessary to
preserve anonymity, between-source information should be kept separate to reflect the
different perspectives accurately. Also, remember that when you have fewer than 3 or 4
raters within a given source (excluding supervisor and self-ratings), it is probably best
not to present that data to the recipient. Finally, 360 programs should promote a
process orientation to development rather than an event orientation. Specifically, a long
term developmental perspective should be encouraged, with participants working
interactively with their coaches during and after the 360 program occurs.
In conclusion, 360 feedback is a potentially powerful intervention
that can positively impact individual and organizational effectiveness if educated and
careful methodological decisions are made and implemented. We hope that the information
contained in this article will help you in your quest to provide high quality services to
organizations. Although each organization is in a sense unique, and operates in its own
contextual environment, these important issues identified by our experts will certainly
present some relevance to your efforts. Again, we would like to thank Allan Church, Manny
London, Walt Tornow, and Fran Yammarino for their time and great insights. Feel free to
e-mail us at sederbu@bgnet.bgsu.edu if you
have any questions or comments about this article, or about 360 feedback in general.
Biographies
Allan H. Church is a Principal of W. Warner Burke Associates, Inc.
where he specializes in the design, analysis and implementation of customized multirater
feedback systems for executive, leadership, and management development, and large-scale
organizational surveys for diagnosis, development and change. He is also currently an
Adjunct Assistant Professor at Columbia University Teachers College. Dr. Church received
his Ph.D. and M.A. in Organizational Psychology from Columbia University. He has published
over 70 articles in academic and practitioner journals. In 1997 he co-edited a special
issue of Group & Organization Management with David W. Bracken on 360
feedback systems. Currently he is serving as Editor of both The
Industrial-Organizational Psychologist (TIP) and the Organization
Development Journal, as well as Associate Editor of the International Journal of
Organizational Analysis, and O.D. Forum Field Editor for ASTDs Performance in
Practice.
Manuel London is Professor and Director of the Center for Human
Resource Management in the Harriman School for Management and Policy at the State
University of New York at Stony Brook, where he is also Acting Associate Provost for
Enrollment and Retention Management. He received his A.B. degree in 1971 from Case Western
Reserve University in philosophy and psychology, and M.A. in 1972 and Ph.D. in 1974 from
the Ohio State University in industrial and organizational psychology. Before joining
Stony Brook in 1989, Dr. London worked for AT&T for almost 12 years in a series of
human resources and training department assignments. Dr. Londons main research
activities are in employee development, career programs, human resources forecasting and
planning, performance appraisal, and change management. He is a consultant for business
and government organizations, and is the author of more than 50 papers and 15 books
including several on job feedback.
Walt Tornow is a Senior Fellow at the Center for Creative
Leadership. Prior to joining the Center, Walt had extensive applied research, consulting,
and teaching experience. He also has held executive positions in major organizations with
responsibilities for leadership development research and publications, human resource
research and planning, advanced product and business development, and organizational
consulting services. He has a Ph.D. in I-O psychology from the University of Minnesota.
His 1993 edited special issue on 360 feedback for the Human Resource Management
Journal was one of the first publications to bring together extant research and
practice on 360 feedback. He co-authored with Manuel London and CCL Associates a
recently published book by Jossey-Bass, Maximizing the Value of 360-Degree Feedback: A
Process for Successful Individual and Organizational Development.
Francis J. Yammarino is a Professor of Management, Director of the
Doctoral Program, and a Fellow at the Center for Leadership Studies at the State
University of New York at Binghamton. He received his Ph.D. from the State University of
New York at Buffalo in organizational behavior (management) with minors in social
psychology and educational psychology-statistics. Dr. Yammarino has extensive experience
in basic and applied research in the areas of superior-subordinate relationships,
leadership, self-other agreement processes, and multiple levels of analysis issues. He is
one of the developers of Within and Between Analysis (WABA), a multiple-level
data-analyatic technique. Dr. Yammarino is the Senior Editor of the Leadership
Quarterly and is a Fellow of the American Psychological Society. He is author/editor
of four books, has published about 70 journal articles or book chapters, delivered
approximately 50 conference presentations, and authored about 60 technical reports or
working papers. Dr. Yammarino has served as a consultant to numerous organizations
including IBM, Textron, TRW, Medtronic, Loral, Lockheed Martin, United Way, and the U.S.
Navy, Air Force, and Department of Education.
TIP
Vol. 36/No. 2 October, 1998
Return to Table of Contents