Home
This Title All WIREs
WIREs RSS Feed
How to cite this WIREs title:
WIREs Syst Biol Med
Impact Factor: 2.941

Recent approaches to the prioritization of candidate disease genes

Full article on Wiley Online Library:   HTML PDF

Can't access this content? Tell your librarian.

Many efforts are still devoted to the discovery of genes involved with specific phenotypes, in particular, diseases. High‐throughput techniques are thus applied frequently to detect dozens or even hundreds of candidate genes. However, the experimental validation of many candidates is often an expensive and time‐consuming task. Therefore, a great variety of computational approaches has been developed to support the identification of the most promising candidates for follow‐up studies. The biomedical knowledge already available about the disease of interest and related genes is commonly exploited to find new gene–disease associations and to prioritize candidates. In this review, we highlight recent methodological advances in this research field of candidate gene prioritization. We focus on approaches that use network information and integrate heterogeneous data sources. Furthermore, we discuss current benchmarking procedures for evaluating and comparing different prioritization methods. WIREs Syst Biol Med 2012. doi: 10.1002/wsbm.1177

This WIREs title offers downloadable PowerPoint presentations of figures for non-profit, educational use, provided the content is not modified and full credit is given to the author and publication.

Download a PowerPoint presentation of all images


Figure 1.

Exemplary molecular network of candidate genes and known disease genes. Red nodes represent known disease genes, and green nodes correspond to candidate genes. For candidate genes C1 and C2, the table lists the node degree, the 1N and 2N indices, and the average network distance to disease genes (see also Box 1).

[ Normal View 48K | Magnified View 85K ]
Figure 2.

Integrative approaches to disease gene prioritization. The typical workflow of integrative prioritization approaches based on multiple data sources consists of three major steps. The first step involves preparing the input data consisting of two different sets of genes, the known disease genes and the candidate genes. For each gene, further biomedical knowledge is retrieved from various data sources such as functional annotations from the Gene Ontology and molecular pathways from the KEGG database. In the second step, the collected information is integrated using a network representation (top) or evaluated individually for each data source, resulting in different ranking lists (bottom). The third step computes a final ranking list of candidate genes based on network measures or rank aggregation. The candidate genes are thus prioritized by their relevance to the disease of interest.

[ Normal View 79K | Magnified View 134K ]

Related Articles

Systems biology and the future of medicine
Protein–protein interaction networks and subnetworks in the biology of disease
Role of systems pharmacology in understanding drug adverse events
Layers of epistasis: genome‐wide regulatory networks and network approaches to genome‐wide association studies
9th International Conference on Pathways, Networks, and Systems Medicine

Browse by Topic

Laboratory Methods and Technologies > Genetic/Genomic Methods
Translational, Genomic and Systems Medicine > Translational Medicine
Analytical and Computational Methods > Computational Methods
blog comments powered by Disqus

Access to this WIREs title is by subscription only.

Recommend to Your
Librarian Now!

The latest WIREs articles in your inbox

Sign Up for Article Alerts

In the Spotlight

Philip Benfey

Philip Benfey

Is intrigued by one of the key questions in developmental biology: how cells acquire their identities. This is an important question in human development, where stem cells divide and differentiate into skin, muscle, fat etc. It is equally central to plant development, where most organs and cells are formed from stem cell populations known as meristems. The Benfey lab addresses this question using a combination of genetics, molecular biology, and genomics to identify and characterize the genes that regulate formation of the root in the plant model system, Arabidopsis thaliana. The choice of the root as a model was based on the simplicity of its organization and its stereotyped developmental program.

Learn More

Twitter: WIREsSysBioMed Follow us on Twitter