This Title All WIREs
How to cite this WIREs title:
WIREs Syst Biol Med
Impact Factor: 3.709

Integrating molecular networks with genetic variant interpretation for precision medicine

Full article on Wiley Online Library:   HTML PDF

Can't access this content? Tell your librarian.

More reliable and cheaper sequencing technologies have revealed the vast mutational landscapes characteristic of many phenotypes. The analysis of such genetic variants has led to successful identification of altered proteins underlying many Mendelian disorders. Nevertheless the simple one‐variant one‐phenotype model valid for many monogenic diseases does not capture the complexity of polygenic traits and disorders. Although experimental and computational approaches have improved detection of functionally deleterious variants and important interactions between gene products, the development of comprehensive models relating genotype and phenotypes remains a challenge in the field of genomic medicine. In this context, a new view of the pathologic state as significant perturbation of the network of interactions between biomolecules is crucial for the identification of biochemical pathways associated with complex phenotypes. Seminal studies in systems biology combined the analysis of genetic variation with protein–protein interaction networks to demonstrate that even as biological systems evolve to be robust to genetic variation, their topologies create disease vulnerabilities. More recent analyses model the impact of genetic variants as changes to the “wiring” of the interactome to better capture heterogeneity in genotype–phenotype relationships. These studies lay the foundation for using networks to predict variant effects at scale using machine‐learning or algorithmic approaches. A wealth of databases and resources for the annotation of genotype–phenotype relationships have been developed to support developments in this area. This overview describes how study of the molecular interactome has generated insights linking the organization of biological systems to disease mechanism, and how this information can enable precision medicine. This article is categorized under: Translational, Genomic, and Systems Medicine > Translational Medicine Biological Mechanisms > Cell Signaling Models of Systems Properties and Processes > Mechanistic Models Analytical and Computational Methods > Computational Methods
Network analysis measures. PPI network of the NTRK2 activation pathway through FRS2/FRS3. (a) The degree (k) is the number of edges of a node. The degree of FRS2 is 8. The edges are highlighted in red. (b) The clustering coefficient C of a node is calculated as the ratio between the connected triangles (delimited by the solid red and gray lines) and the total number of possible triangles k × (k − 1). The dashed lines represent the unbound triangles. For FRS2, C is 0.75 (21/28). (c) The degree centrality (CD) of a node is the number of edges divided by the total number of possible edges. The CD of FRS2 is 0.8 (8/10). Red dotted lines represent the missing edges. (d) The betweenness centrality (B) is the sum over all the possible pairs of the fraction of shortest path passing through a node (red) divided by the total number of shortest paths. B of FRS2 is 9.167 (18 × 0.5 + 0.167). Gray edges are part of the shortest paths not passing through FRS2. In this example, edge length is determined by the layout algorithm and does not have a quantitative interpretation
[ Normal View | Magnified View ]
Propagating variant effects on networks. Variants can be used as signal sources for network propagation in order to identify network neighborhoods affected by variants. Edgetic effects can be used to constrain network propagation according to the effects of variants on specific protein interactions. On the left side of this schematic, two variants to the purple node affect interactions with different subsets of partners (indicated by blue and pink nodes respectively). Network propagation can be used to implicate network regions likely to be affected by each variant, and these can be contrasted to identify regions perturbed by both variants that could explain shared phenotypes (right network, circled purple shaded nodes), or regions affected specifically by each variant (right network, blue and pink shaded regions) which could help explain pleiotropic effects
[ Normal View | Magnified View ]
Modeling the edgetic effects of genetic variants supports exploration of disease mechanisms. (a) Pleiotropy can result when different variants in the same gene affect different interactions in which a protein participates. (b) Variants at reciprocal interfaces of interacting proteins can contribute to locus heterogeneity
[ Normal View | Magnified View ]
Mapping amino acid position to potential to interfere with protein interactions. Protein structures of FGF2 and FGFR1 are shown on the left and right respectively, and as a complex in the center (protein data Bank structure ID: 1CVS). In the complex, residues are colored according to location in the protein core (purple and green), at the interface (pink and blue) or at the surface outside of the interface (transparent pink and blue) on the two proteins respectively
[ Normal View | Magnified View ]
Conceptual framework of edgetics. Location of variants within a protein has implications for their phenotypic consequences. Variants that map to the core of the protein are more likely to destabilize it, resulting in a loss of all interactions in which the protein participates. In contrast, mutations at protein interaction interfaces are more likely to perturb specific interactions. Variants mapping to the protein surfaces outside of binding interfaces are less likely to create a phenotype than core or interface variants
[ Normal View | Magnified View ]
Exploring network topology as a determinant of gene–phenotype relationships. Topological location within the network has implications for biological function. (a) Nodes can be described with respect to particular characteristics in the network, including high degree hubs (red), nodes at the periphery (yellow) and nodes with the highest centrality according to four popular measures of centrality. We calculated network measures including (b) degree and (c) betweenness centrality for four groups of genes: 1,371 essential genes (Hart et al., ), 125 cancer genes (Vogelstein et al., ), 2,921 Mendelian disease genes (Stenson et al., ), and 7,099 other genes based on the latest release of STRING (Szklarczyk et al., ) to illustrate the types of observation that have been revealed by systematic studies of genes with respect to location in the interactome
[ Normal View | Magnified View ]

Browse by Topic

Analytical and Computational Methods > Computational Methods
Biological Mechanisms > Cell Signaling
Models of Systems Properties and Processes > Mechanistic Models
Translational, Genomic, and Systems Medicine > Translational Medicine

Access to this WIREs title is by subscription only.

Recommend to Your
Librarian Now!

The latest WIREs articles in your inbox

Sign Up for Article Alerts