Published in: BMC Medical Informatics and Decision Making 1/2007

Open Access 01-12-2007 | Software

Relemed: sentence-level search engine with relevance score for the MEDLINE database of biomedical articles

Authors: Mir S Siadaty, Jianfen Shu, William A Knaus


Abstract

Background

Queries submitted to MEDLINE/PubMed commonly return extraneous articles. For a multi-word query (the majority of queries submitted), the presence of all query words within an article may be a necessary condition for retrieving relevant articles, but it is not sufficient. Ideally, the article should also describe a relationship between the query words. We propose that if two words occur within an article, the probability that the article explains a relation between them is higher when the words occur within adjacent sentences than when they occur in remote sentences. Therefore, sentence-level concurrence can be used as a surrogate for the existence of a relationship between the words.
One way to avoid irrelevant articles is to increase the search specificity. Another is to estimate a relevance score and use it to sort the retrieved articles. However, among the >30 retrieval services available for MEDLINE, only a few estimate a relevance score, and none detects the relation between the query words and incorporates it into the relevance score.

Results

We have developed "Relemed", a search engine for MEDLINE. Relemed increases the specificity and precision of retrieval by searching for query words within sentences rather than within the whole article. It uses sentence-level concurrence as a statistical surrogate for the existence of a relationship between the words. It also estimates a relevance score and sorts the results by it, thus shifting irrelevant articles lower down the list.
In two case studies, we demonstrate that the most relevant articles appear at the top of the Relemed results, while this is not necessarily the case with a PubMed search. We also show that a Relemed search includes not only all the articles retrieved by PubMed, but potentially additional relevant articles, due to the extended 'automatic term mapping' and text-word searching features implemented in Relemed.
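
The idea of sentence-level concurrence can be illustrated with a minimal Python sketch. It is not Relemed's implementation: the naive sentence splitter, the function names, and the score 1 / (1 + span) are all assumptions made for illustration. The sketch finds the smallest number of sentences separating the query words in an article, scores articles higher when that span is smaller, and drops articles that lack a query word.

```python
import re
from itertools import product


def split_sentences(text):
    """Naive splitter (assumption): break after '.', '!' or '?' plus whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def sentence_positions(sentences, word):
    """Indices of the sentences that contain `word` as a whole word."""
    pattern = re.compile(r"\b" + re.escape(word) + r"\b", re.IGNORECASE)
    return [i for i, s in enumerate(sentences) if pattern.search(s)]


def min_sentence_span(text, query_words):
    """Smallest sentence span covering every query word.

    Returns 0 when all words share one sentence, 1 for adjacent
    sentences, and None when some query word is absent.
    """
    sentences = split_sentences(text)
    positions = [sentence_positions(sentences, w) for w in query_words]
    if any(not p for p in positions):
        return None
    # Try every combination of one occurrence per word; fine for short abstracts.
    return min(max(combo) - min(combo) for combo in product(*positions))


def relevance(text, query_words):
    """Hypothetical score: 1.0 for same-sentence matches, decaying with distance."""
    span = min_sentence_span(text, query_words)
    return 0.0 if span is None else 1.0 / (1 + span)


def rank(articles, query_words):
    """Return article keys sorted by descending score, omitting non-matches."""
    scored = [(relevance(text, query_words), key) for key, text in articles.items()]
    return [key for score, key in sorted(scored, key=lambda t: -t[0]) if score > 0]


articles = {
    "a": "Aspirin reduces the risk of stroke. Further trials are needed.",
    "b": "Aspirin was given to all patients. Headache was common. "
         "Stroke occurred in two cases.",
    "c": "Aspirin is an old drug.",
}
ranking = rank(articles, ["aspirin", "stroke"])
```

In this toy data, article "a" mentions both words in one sentence and ranks first, article "b" separates them by two sentences and ranks second, and article "c" is excluded because it lacks one of the query words.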

Conclusion

By using sentence-level matching, Relemed can deliver higher specificity, thus eliminating more false-positive articles. By introducing an appropriate relevance metric, the most relevant articles on which the user wishes to focus are listed first. Relemed also shrinks the displayed text, and hence the time spent scanning the articles.
Metadata
Title
Relemed: sentence-level search engine with relevance score for the MEDLINE database of biomedical articles
Authors
Mir S Siadaty
Jianfen Shu
William A Knaus
Publication date
01-12-2007
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2007
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/1472-6947-7-1
