%0 Journal Article %T Algorithms for effective querying of compound graph-based pathway databases %A Ugur Dogrusoz %A Ahmet Cetintas %A Emek Demir %A Ozgun Babur %J BMC Bioinformatics %D 2009 %I BioMed Central %R 10.1186/1471-2105-10-376 %X Towards this goal, we developed a querying framework, along with a number of graph-theoretic algorithms from simple neighborhood queries to shortest paths to feedback loops, that is applicable to all sorts of graph-based pathway databases, from PPIs (protein-protein interactions) to metabolic and signaling pathways. The framework is unique in that it can account for compound or nested structures and ubiquitous entities present in the pathway data. In addition, the queries may be related to each other through "AND" and "OR" operators, and can be recursively organized into a tree, in which the result of one query might be a source and/or target for another, to form more complex queries. The algorithms were implemented within the querying component of a new version of the software tool PATIKAweb (Pathway Analysis Tool for Integration and Knowledge Acquisition) and have proven useful for answering a number of biologically significant questions for large graph-based pathway databases.The PATIKA Project Web site is http://www.patika.org webcite. PATIKAweb version 2.1 is available at http://web.patika.org webcite.Especially with the help of novel large-scale analysis methods, a massive amount of data is now being gathered on cellular processes [1-3]. Unfortunately, most of these data are fragmented and incomplete. One of the biggest challenges of bioinformatics today is to represent and integrate this type of knowledge effectively to construct a knowledge base that can act as a blueprint for simulations and other analysis methods, enabling us to better understand and predict the behavior of a cell [4].Even though traditional way of representing cellular pathways with still images often yields very pretty pictures, such drawings are mostly not reusable. In addition, underlying ontology and notation are often far from being uniform or consistent, mostly dependent on implicit conventions rather than explicit, formal rules [4]. Recently, these problems have resulted in a major %U http://www.biomedcentral.com/1471-2105/10/376