Skip to main content
Top
Published in: Journal of Digital Imaging 4/2008

01-12-2008

Improving the Utility of Speech Recognition Through Error Detection

Authors: Kimberly Voll, Ph. D., Stella Atkins, Ph. D., Bruce Forster, M. D.,

Published in: Journal of Imaging Informatics in Medicine | Issue 4/2008

Login to get access

Abstract

Despite the potential to dominate radiology reporting, current speech recognition technology is thus far a weak and inconsistent alternative to traditional human transcription. This is attributable to poor accuracy rates, in spite of vendor claims, and the wasted resources that go into correcting erroneous reports. A solution to this problem is post-speech-recognition error detection that will assist the radiologist in proofreading more efficiently. In this paper, we present a statistical method for error detection that can be applied after transcription. The results are encouraging, showing an error detection rate as high as 96% in some cases.
Literature
1.
go back to reference Horii SC, Redfern R, Kundel H, et al: PACS technologies and reliability: are we making things better or worse? Proc SPIE 4685:16–24, 2002CrossRef Horii SC, Redfern R, Kundel H, et al: PACS technologies and reliability: are we making things better or worse? Proc SPIE 4685:16–24, 2002CrossRef
2.
go back to reference Mehta A, Dreyer K, Schweitzer A, et al: Voice recognition—an emerging necessity within radiology: Experiences of the Massachusetts general hospital. J Digit Imaging 11:20–23, 1998PubMedCrossRef Mehta A, Dreyer K, Schweitzer A, et al: Voice recognition—an emerging necessity within radiology: Experiences of the Massachusetts general hospital. J Digit Imaging 11:20–23, 1998PubMedCrossRef
3.
go back to reference Al-Aynati NM, Chorneyko KA: Comparison of voice-automated transcription and human transcription in generating pathology reports. Arch Pathol Lab Med 127(6):721–725, 2003PubMed Al-Aynati NM, Chorneyko KA: Comparison of voice-automated transcription and human transcription in generating pathology reports. Arch Pathol Lab Med 127(6):721–725, 2003PubMed
4.
go back to reference Ranaa DS, Hurst G, Shepstone L, et al: Voice recognition for radiology reports: is it good enough? Clin Radiol 60(11):1205–1212, 2005CrossRef Ranaa DS, Hurst G, Shepstone L, et al: Voice recognition for radiology reports: is it good enough? Clin Radiol 60(11):1205–1212, 2005CrossRef
6.
go back to reference Gale B, Safriel Y, Lukban A: Radiology report production times: voice recognition vs. transcription. Radiol Manage 23:18–22, 2001PubMed Gale B, Safriel Y, Lukban A: Radiology report production times: voice recognition vs. transcription. Radiol Manage 23:18–22, 2001PubMed
7.
go back to reference Jeong M, Kim B, Lee G: Using higher-level linguistic knowledge for speech recognition error correction in a spoken Q/A dialog. In: Proceedings of the HLT-NAACL special workshop on Higher-Level Linguistic Information for Speech Processing, Boston, USA, 2004, pp 48–55 Jeong M, Kim B, Lee G: Using higher-level linguistic knowledge for speech recognition error correction in a spoken Q/A dialog. In: Proceedings of the HLT-NAACL special workshop on Higher-Level Linguistic Information for Speech Processing, Boston, USA, 2004, pp 48–55
8.
go back to reference Jurafsky D, Martin J: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Englewood Cliffs: Prentice-Hall, 2000 Jurafsky D, Martin J: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Englewood Cliffs: Prentice-Hall, 2000
9.
go back to reference Allen JF, Miller BW, Ringger EK, et al: A robust system for natural spoken dialogue. In: Proceedings of the 34th Annual Meeting of the ACL, Santa Cruz, USA, 1996, pp 62–70 Allen JF, Miller BW, Ringger EK, et al: A robust system for natural spoken dialogue. In: Proceedings of the 34th Annual Meeting of the ACL, Santa Cruz, USA, 1996, pp 62–70
10.
go back to reference Kaki S, Sumita E, Iida H: A method for correcting errors in speech recognition using the statistical features of character co-occurrence. In: ACL-COLING, Montreal, Canada, 1998, pp 653–657 Kaki S, Sumita E, Iida H: A method for correcting errors in speech recognition using the statistical features of character co-occurrence. In: ACL-COLING, Montreal, Canada, 1998, pp 653–657
11.
go back to reference Sarma A, Palmer D: Context-based speech recognition error detection and correction. In: Proceedings of the HLT-NAACL, Boston, USA, 2004, pp 85–88 Sarma A, Palmer D: Context-based speech recognition error detection and correction. In: Proceedings of the HLT-NAACL, Boston, USA, 2004, pp 85–88
13.
go back to reference Manning CD, Schütze H: Foundations of Statistical Natural Language Processing. Cambridge: MIT Press, 2002 Manning CD, Schütze H: Foundations of Statistical Natural Language Processing. Cambridge: MIT Press, 2002
14.
go back to reference Voll K: A Methodology of Error Detection: Improving Speech Recognition in Radiology. Ph.D. thesis, Simon Fraser University, 2006 Voll K: A Methodology of Error Detection: Improving Speech Recognition in Radiology. Ph.D. thesis, Simon Fraser University, 2006
15.
go back to reference Sistrom C: Conceptual approach for the design of radiology reporting interfaces: the talking template. J Digit Imaging 18(3):176–187, 2005PubMedCrossRef Sistrom C: Conceptual approach for the design of radiology reporting interfaces: the talking template. J Digit Imaging 18(3):176–187, 2005PubMedCrossRef
16.
go back to reference Caviedes JE, Cimino JJ: Towards the development of a conceptual distance metric for the UMLS. J Biomed Inform 37:77–85, 2004PubMedCrossRef Caviedes JE, Cimino JJ: Towards the development of a conceptual distance metric for the UMLS. J Biomed Inform 37:77–85, 2004PubMedCrossRef
17.
go back to reference Shiffman S, Detmer WMS, Lane, CD, et al: A continuous-speech interface to a decision support system: I. Techniques to accommodate misrecognized input. J Am Med Inform Assoc 2:36–45, 1995PubMed Shiffman S, Detmer WMS, Lane, CD, et al: A continuous-speech interface to a decision support system: I. Techniques to accommodate misrecognized input. J Am Med Inform Assoc 2:36–45, 1995PubMed
Metadata
Title
Improving the Utility of Speech Recognition Through Error Detection
Authors
Kimberly Voll, Ph. D.
Stella Atkins, Ph. D.
Bruce Forster, M. D.,
Publication date
01-12-2008
Publisher
Springer-Verlag
Published in
Journal of Imaging Informatics in Medicine / Issue 4/2008
Print ISSN: 2948-2925
Electronic ISSN: 2948-2933
DOI
https://doi.org/10.1007/s10278-007-9034-7

Other articles of this Issue 4/2008

Journal of Digital Imaging 4/2008 Go to the issue