|
BMC Bioinformatics 2008
GenCLiP: a software program for clustering gene lists by literature profiling and constructing gene co-occurrence networks related to custom keywordsAbstract: We present our software program, GenCLiP (Gene Cluster with Literature Profiles), which is based on the methods presented by Chaussabel and Sher (Genome Biol 2002, 3(10):RESEARCH0055) that search gene lists to identify functional clusters of genes based on up-to-date literature profiling. Four features were added to this previously described method: the ability to 1) manually curate keywords extracted from the literature, 2) search genes and gene co-occurrence networks related to custom keywords, 3) compare analyzed gene results with negative and positive controls generated by GenCLiP, and 4) calculate probabilities that the resulting genes and gene networks are randomly related. In this paper, we show with a set of differentially expressed genes between keloids and normal control, how implementation of functions in GenCLiP successfully identified keywords related to the pathogenesis of keloids and unknown gene pathways involved in the pathogenesis of keloids.With regard to the identification of disease-susceptibility genes, GenCLiP allows one to quickly acquire a primary pathogenesis profile and identify pathways involving abnormally expressed genes not previously associated with the disease.Biomedical researchers often want to explore pathogenesis and pathways regulated by abnormally expressed genes, such as those identified by microarray analyses. Literature mining is an important way to assist in this task [1]. Many literature mining tools or methods have been developed [2-14] that can extract keywords or gene networks from the literature to functionally group genes and visualize their relationships. However, to our knowledge none of these tools are capable of automatically and easily constructing gene networks among the analyzed genes based on specified keywords, an aspect that would be helpful for investigating disease-associated signaling pathways. Here we report our software program, GenCLiP (Gene Cluster with Literature Profiles), which is based on the two
|