Motivation: Recent analyses
in systems biology pursue the discovery of functional
modules within the cell. Recognition of modules in genome-wide
information
requires the integrative analysis of experimental data together with
available
functional schemes. Since
the literature covers all aspects of biology, chemistry, and medicine,
there is
almost no limit to the types of information that may be recovered
through
careful and exhaustive mining [1]. The
scientific literature contains the information to
bridge the gap between the abstract definitions of biological processes
in
current schemes and the interlinked nature of biological networks. |
|
Supplementary data files Datasets: (GOP/PMIDs files) Results:
|
Objective:
To construct pair-wise similarities between biological processes through the analysis of the biomedical literature. Abstract: In this
work we explore the use of the scientific literature to establish
potential relationships among biological processes. To this end we have
used a document based similarity method (latent semantic analysis [2]),
to compute pair-wise
similarities of GO biological process categories. The method has
been applied to the biological processes annotated for Saccharomyces
cerevisiae genome (www.yeastgenome.org).
SGD annotation file (01/25/2006), GO database (01/27/2006).
References: [1] Shatkay, H. and R. Feldman, Mining the biomedical literature in the
genomic era: An overview. Journal of Computational Biology, 2003. 10(6): p. 821-855. [2] Deerwester
S, Dumais S, Landauer T, Furnas G and Beck L, Improving Information-Retrieval with
Latent Semantic Indexing. P Asis Annu Meet, 1988. 25: p. 36-40. [3] Chagoyen, M., et al., Discovering semantic features in the literature: a foundation for building functional associations. BMC Bioinformatics, 2006 7(1):p.41. [PubMed] [Full article] |