<?xml version="1.0" encoding="UTF-8"?><xml><records><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Bañó-Polo, Manuel</style></author><author><style face="normal" font="default" size="100%">Baldin, Francesca</style></author><author><style face="normal" font="default" size="100%">Tamborero, Silvia</style></author><author><style face="normal" font="default" size="100%">Marti-Renom, Marc A</style></author><author><style face="normal" font="default" size="100%">Mingarro, Ismael</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">N-glycosylation efficiency is determined by the distance to the C-terminus and the amino acid preceding an Asn-Ser-Thr sequon.</style></title><secondary-title><style face="normal" font="default" size="100%">Protein science : a publication of the Protein Society</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2011</style></year><pub-dates><date><style  face="normal" font="default" size="100%">2011 Jan</style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">20</style></volume><pages><style face="normal" font="default" size="100%">179-86</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;N-glycosylation is the most common and versatile protein modification. In eukaryotic cells, this modification is catalyzed cotranslationally by the enzyme oligosaccharyltransferase, which targets the &amp;beta;-amide of the asparagine in an Asn-Xaa-Ser/Thr consensus sequon (where Xaa is any amino acid but proline) in nascent proteins as they enter the endoplasmic reticulum. Because modification of the glycosylation acceptor site on membrane proteins occurs in a compartment-specific manner, the presence of glycosylation is used to indicate membrane protein topology. Moreover, glycosylation sites can be added to gain topological information. In this study, we explored the determinants of N-glycosylation with the in vitro transcription/translation of a truncated model protein in the presence of microsomes and surveyed 25,488 glycoproteins, of which 2,533 glycosylation sites had been experimentally validated. We found that glycosylation efficiency was dependent on both the distance to the C-terminus and the nature of the amino acid that preceded the consensus sequon. These findings establish a broadly applicable method for membrane protein tagging in topological studies.&lt;/p&gt;</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Baù, Davide</style></author><author><style face="normal" font="default" size="100%">Marti-Renom, Marc A</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Structure determination of genomic domains by satisfaction of spatial restraints.</style></title><secondary-title><style face="normal" font="default" size="100%">Chromosome research : an international journal on the molecular, supramolecular and evolutionary aspects of chromosome biology</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2011</style></year><pub-dates><date><style  face="normal" font="default" size="100%">2011 Jan</style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">19</style></volume><pages><style face="normal" font="default" size="100%">25-35</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;The three-dimensional (3D) architecture of a genome is non-random and known to facilitate the spatial colocalization of regulatory elements with the genes they regulate. Determining the 3D structure of a genome may therefore probe an essential step in characterizing how genes are regulated. Currently, there are several experimental and theoretical approaches that aim at determining the 3D structure of genomes and genomic domains; however, approaches integrating experiments and computation to identify the most likely 3D folding of a genome at medium to high resolutions have not been widely explored. Here, we review existing methodologies and propose that the integrative modeling platform ( http://www.integrativemodeling.org ), a computational package developed for structurally characterizing protein assemblies, could be used for integrating diverse experimental data towards the determination of the 3D architecture of genomic domains and entire genomes at unprecedented resolution. Our approach, through the visualization of looping interactions between distal regulatory elements, will allow for the characterization of global chromatin features and their relation to gene expression. We illustrate our work by outlining the recent determination of the 3D architecture of the &amp;alpha;-globin domain in the human genome.&lt;/p&gt;</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Baù, Davide</style></author><author><style face="normal" font="default" size="100%">Sanyal, Amartya</style></author><author><style face="normal" font="default" size="100%">Lajoie, Bryan R</style></author><author><style face="normal" font="default" size="100%">Capriotti, Emidio</style></author><author><style face="normal" font="default" size="100%">Byron, Meg</style></author><author><style face="normal" font="default" size="100%">Lawrence, Jeanne B</style></author><author><style face="normal" font="default" size="100%">Dekker, Job</style></author><author><style face="normal" font="default" size="100%">Marti-Renom, Marc A</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">The three-dimensional folding of the α-globin gene domain reveals formation of chromatin globules.</style></title><secondary-title><style face="normal" font="default" size="100%">Nature structural &amp; molecular biology</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2011</style></year><pub-dates><date><style  face="normal" font="default" size="100%">2011 Jan</style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">18</style></volume><pages><style face="normal" font="default" size="100%">107-14</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;We developed a general approach that combines chromosome conformation capture carbon copy (5C) with the Integrated Modeling Platform (IMP) to generate high-resolution three-dimensional models of chromatin at the megabase scale. We applied this approach to the ENm008 domain on human chromosome 16, containing the &amp;alpha;-globin locus, which is expressed in K562 cells and silenced in lymphoblastoid cells (GM12878). The models accurately reproduce the known looping interactions between the &amp;alpha;-globin genes and their distal regulatory elements. Further, we find using our approach that the domain folds into a single globular conformation in GM12878 cells, whereas two globules are formed in K562 cells. The central cores of these globules are enriched for transcribed genes, whereas nontranscribed chromatin is more peripheral. We propose that globule formation represents a higher-order folding state related to clustering of transcribed genes around shared transcription machineries, as previously observed by microscopy.&lt;/p&gt;</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Madhusudhan, M. S.</style></author><author><style face="normal" font="default" size="100%">Webb, Benjamin M</style></author><author><style face="normal" font="default" size="100%">Marti-Renom, Marc A</style></author><author><style face="normal" font="default" size="100%">Eswar, Narayanan</style></author><author><style face="normal" font="default" size="100%">Sali, Andrej</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Alignment of multiple protein structures based on sequence and structure features.</style></title><secondary-title><style face="normal" font="default" size="100%">Protein engineering, design &amp; selection : PEDS</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2009</style></year><pub-dates><date><style  face="normal" font="default" size="100%">2009 Sep</style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">22</style></volume><pages><style face="normal" font="default" size="100%">569-74</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;Comparing the structures of proteins is crucial to gaining insight into protein evolution and function. Here, we align the sequences of multiple protein structures by a dynamic programming optimization of a scoring function that is a sum of an affine gap penalty and terms dependent on various sequence and structure features (SALIGN). The features include amino acid residue type, residue position, residue accessible surface area, residue secondary structure state and the conformation of a short segment centered on the residue. The multiple alignment is built by following the ’guide’ tree constructed from the matrix of all pairwise protein alignment scores. Importantly, the method does not depend on the exact values of various parameters, such as feature weights and gap penalties, because the optimal alignment across a range of parameter values is found. Using multiple structure alignments in the HOMSTRAD database, SALIGN was benchmarked against MUSTANG for multiple alignments as well as against TM-align and CE for pairwise alignments. On the average, SALIGN produces a 15% improvement in structural overlap over HOMSTRAD and 14% over MUSTANG, and yields more equivalent structural positions than TM-align and CE in 90% and 95% of cases, respectively. The utility of accurate multiple structure alignment is illustrated by its application to comparative protein structure modeling.&lt;/p&gt;</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Capriotti, Emidio</style></author><author><style face="normal" font="default" size="100%">Arbiza, Leonardo</style></author><author><style face="normal" font="default" size="100%">Casadio, Rita</style></author><author><style face="normal" font="default" size="100%">Dopazo, Joaquin</style></author><author><style face="normal" font="default" size="100%">Dopazo, Hernán</style></author><author><style face="normal" font="default" size="100%">Marti-Renom, Marc A</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Use of estimated evolutionary strength at the codon level improves the prediction of disease-related protein mutations in humans.</style></title><secondary-title><style face="normal" font="default" size="100%">Hum Mutat</style></secondary-title><alt-title><style face="normal" font="default" size="100%">Hum Mutat</style></alt-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Algorithms</style></keyword><keyword><style  face="normal" font="default" size="100%">Codon</style></keyword><keyword><style  face="normal" font="default" size="100%">Computational Biology</style></keyword><keyword><style  face="normal" font="default" size="100%">Databases, Protein</style></keyword><keyword><style  face="normal" font="default" size="100%">DNA Mutational Analysis</style></keyword><keyword><style  face="normal" font="default" size="100%">Evolution, Molecular</style></keyword><keyword><style  face="normal" font="default" size="100%">Genetic Predisposition to Disease</style></keyword><keyword><style  face="normal" font="default" size="100%">Genetic Variation</style></keyword><keyword><style  face="normal" font="default" size="100%">Genome, Human</style></keyword><keyword><style  face="normal" font="default" size="100%">Humans</style></keyword><keyword><style  face="normal" font="default" size="100%">Iduronic Acid</style></keyword><keyword><style  face="normal" font="default" size="100%">Point Mutation</style></keyword><keyword><style  face="normal" font="default" size="100%">Polymorphism, Single Nucleotide</style></keyword><keyword><style  face="normal" font="default" size="100%">Proteins</style></keyword><keyword><style  face="normal" font="default" size="100%">Tumor Suppressor Protein p53</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2008</style></year><pub-dates><date><style  face="normal" font="default" size="100%">2008 Jan</style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">29</style></volume><pages><style face="normal" font="default" size="100%">198-204</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;Predicting the functional impact of protein variation is one of the most challenging problems in bioinformatics. A rapidly growing number of genome-scale studies provide large amounts of experimental data, allowing the application of rigorous statistical approaches for predicting whether a given single point mutation has an impact on human health. Up until now, existing methods have limited their source data to either protein or gene information. Novel in this work, we take advantage of both and focus on protein evolutionary information by using estimated selective pressures at the codon level. Here we introduce a new method (SeqProfCod) to predict the likelihood that a given protein variant is associated with human disease or not. Our method relies on a support vector machine (SVM) classifier trained using three sources of information: protein sequence, multiple protein sequence alignments, and the estimation of selective pressure at the codon level. SeqProfCod has been benchmarked with a large dataset of 8,987 single point mutations from 1,434 human proteins from SWISS-PROT. It achieves 82% overall accuracy and a correlation coefficient of 0.59, indicating that the estimation of the selective pressure helps in predicting the functional impact of single-point mutations. Moreover, this study demonstrates the synergic effect of combining two sources of information for predicting the functional effects of protein variants: protein sequence/profile-based information and the evolutionary estimation of the selective pressures at the codon level. The results of large-scale application of SeqProfCod over all annotated point mutations in SWISS-PROT (available for download at http://sgu.bioinfo.cipf.es/services/Omidios/; last accessed: 24 August 2007), could be used to support clinical studies.&lt;/p&gt;</style></abstract><issue><style face="normal" font="default" size="100%">1</style></issue><custom1><style face="normal" font="default" size="100%">https://www.ncbi.nlm.nih.gov/pubmed/17935148?dopt=Abstract</style></custom1></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Marti-Renom, Marc A</style></author><author><style face="normal" font="default" size="100%">Pieper, Ursula</style></author><author><style face="normal" font="default" size="100%">Madhusudhan, M S</style></author><author><style face="normal" font="default" size="100%">Rossi, Andrea</style></author><author><style face="normal" font="default" size="100%">Eswar, Narayanan</style></author><author><style face="normal" font="default" size="100%">Davis, Fred P</style></author><author><style face="normal" font="default" size="100%">Al-Shahrour, Fátima</style></author><author><style face="normal" font="default" size="100%">Dopazo, Joaquin</style></author><author><style face="normal" font="default" size="100%">Sali, Andrej</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">DBAli tools: mining the protein structure space.</style></title><secondary-title><style face="normal" font="default" size="100%">Nucleic Acids Res</style></secondary-title><alt-title><style face="normal" font="default" size="100%">Nucleic Acids Res</style></alt-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Algorithms</style></keyword><keyword><style  face="normal" font="default" size="100%">Amino Acid Sequence</style></keyword><keyword><style  face="normal" font="default" size="100%">Computational Biology</style></keyword><keyword><style  face="normal" font="default" size="100%">Data Interpretation, Statistical</style></keyword><keyword><style  face="normal" font="default" size="100%">Databases, Protein</style></keyword><keyword><style  face="normal" font="default" size="100%">Internet</style></keyword><keyword><style  face="normal" font="default" size="100%">Molecular Sequence Data</style></keyword><keyword><style  face="normal" font="default" size="100%">Protein Conformation</style></keyword><keyword><style  face="normal" font="default" size="100%">Proteins</style></keyword><keyword><style  face="normal" font="default" size="100%">Pseudomonas aeruginosa</style></keyword><keyword><style  face="normal" font="default" size="100%">Sequence Alignment</style></keyword><keyword><style  face="normal" font="default" size="100%">Sequence Analysis, Protein</style></keyword><keyword><style  face="normal" font="default" size="100%">Sequence Homology, Amino Acid</style></keyword><keyword><style  face="normal" font="default" size="100%">Software</style></keyword><keyword><style  face="normal" font="default" size="100%">Structure-Activity Relationship</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2007</style></year><pub-dates><date><style  face="normal" font="default" size="100%">2007 Jul</style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">35</style></volume><pages><style face="normal" font="default" size="100%">W393-7</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;The DBAli tools use a comprehensive set of structural alignments in the DBAli database to leverage the structural information deposited in the Protein Data Bank (PDB). These tools include (i) the DBAlit program that allows users to input the 3D coordinates of a protein structure for comparison by MAMMOTH against all chains in the PDB; (ii) the AnnoLite and AnnoLyze programs that annotate a target structure based on its stored relationships to other structures; (iii) the ModClus program that clusters structures by sequence and structure similarities; (iv) the ModDom program that identifies domains as recurrent structural fragments and (v) an implementation of the COMPARER method in the SALIGN command in MODELLER that creates a multiple structure alignment for a set of related protein structures. Thus, the DBAli tools, which are freely accessible via the World Wide Web at http://salilab.org/DBAli/, allow users to mine the protein structure space by establishing relationships between protein structures and their functions.&lt;/p&gt;</style></abstract><issue><style face="normal" font="default" size="100%">Web Server issue</style></issue><custom1><style face="normal" font="default" size="100%">https://www.ncbi.nlm.nih.gov/pubmed/17478513?dopt=Abstract</style></custom1></record></records></xml>