October
30, 2006
Speech
Recognition: Technology vs. Solution
By Todd R. Charest, MBA, and Lynn J. Kosegi, PMP
For The Record
Vol. 18 No. 22 P. 6
Amazing claims are made about speech-recognition technology.
Some of them are true. There is no doubt that speech recognition can
potentially boost transcription efficiency, reduce errors, increase
turnaround times, and save money. However, not every hospital and healthcare
organization is positioned to take advantage of this powerful tool.
The first thing healthcare facilities need to understand
about speech recognition is that it is a useful technology and not a
solution in and of itself. As a technology, it can be integrated into
the transcription process, often with dramatic results. However, facilities
expecting speech-recognition technology to be the solution for all their
transcription needs are likely to be sorely disappointed. In general,
speech recognition is most effective when it is used to accelerate the
medical report creation and review process, not replace the human element.
What It Is
There are two basic types of speech recognition available today: front-end
(or active) and back-end (or passive). Front-end speech recognition
requires physicians to make significant changes to their dictation habits
and edit their own files as they dictate. Often this is met with resistance
by doctors who believe they are being burdened with additional work.
Back-end speech recognition requires minimal changes
in physician behavior, relying instead on a medical language editor
to edit the document after the speech-recognition software has produced
a draft report.
Front-end speech recognition has shown promise in a
small number of specialties with high turnaround time demands and limited
medical terminology. These specialties include radiology, pathology,
and cardiology. However, back-end speech recognition is much more widely
accepted and successful in the healthcare industry. The reason is simple:
Back-end speech recognition typically allows healthcare facilities to
recognize the greatest gains in efficiency and quality because it integrates
almost seamlessly into the traditional transcription or report creation
process.
What It Isn’t
Successful deployment of back-end speech recognition begins with a realistic
understanding of the technology and its capabilities. If you are evaluating
the benefits of speech recognition for your facility, there are a number
of common misconceptions you need to avoid. Speech recognition will
not:
• eliminate the need for a knowledgeable medical
transcription staff;
• correct poor dictation habits;
• remedy a poor dictation environment;
• typically produce 100% accuracy; and
• alert transcription staff to errors in recognition
to within a high degree of accuracy.
No matter which form of speech recognition you are considering,
there are some trade-offs your facility should be aware of if you are
going to adopt and maximize the technology. Ask yourself whether your
facility and your physicians are willing to accept each of the following:
• fewer customized instructions for document creation;
• more verbatim reports;
• compliance with appropriate dictation habits;
• potential turnaround time restraints during
initial ramping period;
• limiting “normals” to group account
standards; and
• tighter control of environmental factors that
affect the quality of voice files.
These factors are among the most significant reasons
why some facilities achieve great success with speech recognition while
others minimize the technology’s potential benefits.
Achieving Success
To achieve the most significant gains with speech recognition, it is
important to carefully select which physicians will be using the new
technology. Although it is possible to realize some benefits from speech
recognition even if clinicians do not modify dictation habits, the best
results will be achieved if you limit participation to physicians who
are willing to modify their dictation habits, who generally obey dictation
protocols and produce high-quality files, and who are willing to accept
more verbatim transcription.
Also, if transcription is performed in house, close
attention should be paid to the selection of which staff members will
be editing speech recognition files. Although the jobs are closely related,
an editor working with speech recognition uses different skills than
a traditional medical transcriptionist (MT). The medical language editor
must be detail-oriented and tech savvy to adapt to the new technology.
Also, a healthcare facility may wish to keep its top-performing MTs
assigned to their current tasks, while allowing average performers to
train for editor positions since they will generally see the highest
gains in productivity when switching to speech-recognition technology.
A facility should not forget MTs who may have experienced
health issues related to repetitive stress injuries. Speech recognition
may be a way to retain valuable staff for whom traditional transcription
and typing have become too difficult.
Speech recognition may also allow an MT to extend his
or her career by years. Shifting from traditional typing to editing
speech recognition may alleviate some physical stress that can be caused
by long hours at the computer.
Measuring Results
When evaluating the success of speech-recognition technology, the most
important measurement is the production ratio. In speech recognition,
the production ratio is the number of minutes it takes a medical language
editor to edit one minute of dictation. In traditional transcription,
the production ratio is the number of minutes it takes an MT to transcribe
and then edit one minute of dictation.
Using this as the key measure of success instead of
lines per hour decreases the risk of inaccurate results due to individual
work habits, long pauses, differences in how lines are counted, and
other variables associated with measuring productivity by line counts.
In addition to increasing productivity, speech-recognition
technology can provide a number of other benefits:
• improved turnaround time;
• reduction in minor errors;
• reduced need for proofreaders and quality assurance
staff; and
• improved quality.
Because the speed and quality of medical transcription
has a direct impact on the quality of patient care, the deployment of
speech-recognition technology has the potential to improve a facility’s
overall performance. Once a speech-recognition system is selected and
deployed, the rollout period for fully implementing the new technology
is typically between 10 and 24 weeks.
Be aware that it is not unusual to see a decrease in
transcription productivity during the initial few weeks of the rollout.
Productivity should begin to exceed previous performance between weeks
three and six if adequate training and feedback is provided to the medical
language editors.
Speech recognition can be an important tool in the creation
of quality medical documents. It is, however, just one tool in an overall
process improvement effort. Understanding the differences between the
hype and the reality of speech recognition is necessary to take full
advantage of the benefits of this advancing technology.
A speech recognition readiness assessment should be
undertaken to evaluate whether the facility and its clinicians are ripe
for change.
If you plan accordingly, involve the end users in the
process, and are realistic about what this technology can do, success
with speech recognition can be achieved.
— Todd R. Charest, MBA, is director of business
development for Spheris.
— Lynn J. Kosegi, PMP, is project manager
of business development for Spheris.
Subscribe
to For the Record Magazine! |