KEGG pathway database is a collection of manually drawn pathway maps representing current knowledge on molecular interaction and reaction networks, accompanied with KGML (KEGG pathway xml format) files for automatic computational analyses and modeling of metabolic and signaling networks. In a KGML file the pathway is represented as a graph object with entry elements (gene products, compounds, pathways) as its nodes, and relations between elements as edges. However, in most cases there is a lack of correspondence between static images of pathways and accompanying KGML files, so preprocessing of information contained in a KGML file is needed before it can be used in automatic analysis. Several KGML parsers have been developed recently, both standalone or integrated in different bioinformatics packages.
We introduce KEGGParser (pathway parser/editor), which is based on Matlab biograph class.
Features:
1. Retrieval, parsing and automatic correction for protein-compound-protein interactions, group nodes and binding directions;
2. Pathway graph editing (edge and node manipulations) based on "Graph manipulation" (http://www.mathworks.com/matlabcentral/fileexchange/37475-graph-manipulation);
3. Analysis of parsed pathways can be performed using Matlab built-in graph-based calculations. |