EPSRC logo

Details of Grant 

EPSRC Reference: EP/J008427/1
Title: Language Processing for Literature Based Discovery in Medicine
Principal Investigator: Stevenson, Dr M
Other Investigators:
Danson, Dr S Bandmann, Professor O Gaizauskas, Professor R
Researcher Co-Investigators:
Project Partners:
Department: Computer Science
Organisation: University of Sheffield
Scheme: Standard Research
Starts: 01 June 2012 Ends: 31 May 2015 Value (£): 293,127
EPSRC Research Topic Classifications:
Artificial Intelligence Bioinformatics
Comput./Corpus Linguistics Information & Knowledge Mgmt
EPSRC Industrial Sector Classifications:
Healthcare
Related Grants:
Panel History:
Panel DatePanel NameOutcome
26 Oct 2011 EPSRC ICT Responsive Mode - Oct 2011 Announced
06 Sep 2011 EPSRC ICT Responsive Mode - Sep 2011 Deferred
Summary on Grant Application Form
The amount of published material in biomedicine has been growing exponentially in recent years, particularly in very productive areas, such as genomics. The knowledge it contains is now so vast and fragmented that it is no longer possible for any individual or research group to keep up with the advances relevant to their area. The research literature is also fragmented and researchers naturally concentrate their attention on their own area of expertise, meaning they may not identify research that is relevant to their own if it does not appear within the literature of their scientific discipline. However, medical research is becoming increasingly interdisciplinary with progress being made by combining outputs from various fields.

Hidden knowledge occurs when a connection can be inferred by combining information from multiple documents, but that connection has not been noticed. Literature Based Discovery (LBD) provides tools that analyse the research literature to identify hidden knowledge automatically. Connections it has been used to identify include treatments for diseases (e.g. that fish oil can be used to treat Raynaud's syndrome) and cases of diseases (e.g. that migraines can be related to magnesium deficiency). Despite these successes, the knowledge that has been discovered has been limited by the relatively simple techniques used to analyse the research literature.

This project will develop new approaches to LBD by applying recent advances in the automatic processing of biomedical literature. This analysis will provide a LBD system with more detailed and accurate information about this literature than has previously been possible. In particular, the project will make use of two language processing technologies, Information Extraction and Word Sense Disambiguation, which can now be applied to the biomedical literature on a large scale. Information Extraction will be used to identify connections between items mentioned in documents and will provide more accurate analysis than the simple techniques used by previous LBD systems. Word Sense Disambiguation will be used to avoid the problems caused by polysemy and synonymy (the suggestion of spurious connections and connections being missed) which can adversely effect LBD performance.

The project will implement a LBD system and test it on two domains: oncology and neuroscience. The effectiveness of the system will be judged by researchers working in these areas with interests in melanoma and Parkinson's disease.
Key Findings
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
Potential use in non-academic contexts
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
Impacts
Description This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
Summary
Date Materialised
Sectors submitted by the Researcher
This information can now be found on Gateway to Research (GtR) http://gtr.rcuk.ac.uk
Project URL:  
Further Information:  
Organisation Website: http://www.shef.ac.uk