<?xml version="1.0" encoding="UTF-8"?><xml><records><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Madhusudhan, M. S.</style></author><author><style face="normal" font="default" size="100%">Webb, Benjamin M</style></author><author><style face="normal" font="default" size="100%">Marti-Renom, Marc A</style></author><author><style face="normal" font="default" size="100%">Eswar, Narayanan</style></author><author><style face="normal" font="default" size="100%">Sali, Andrej</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Alignment of multiple protein structures based on sequence and structure features.</style></title><secondary-title><style face="normal" font="default" size="100%">Protein engineering, design &amp; selection : PEDS</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2009</style></year><pub-dates><date><style  face="normal" font="default" size="100%">2009 Sep</style></date></pub-dates></dates><volume><style face="normal" font="default" size="100%">22</style></volume><pages><style face="normal" font="default" size="100%">569-74</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;Comparing the structures of proteins is crucial to gaining insight into protein evolution and function. Here, we align the sequences of multiple protein structures by a dynamic programming optimization of a scoring function that is a sum of an affine gap penalty and terms dependent on various sequence and structure features (SALIGN). The features include amino acid residue type, residue position, residue accessible surface area, residue secondary structure state and the conformation of a short segment centered on the residue. The multiple alignment is built by following the ’guide’ tree constructed from the matrix of all pairwise protein alignment scores. Importantly, the method does not depend on the exact values of various parameters, such as feature weights and gap penalties, because the optimal alignment across a range of parameter values is found. Using multiple structure alignments in the HOMSTRAD database, SALIGN was benchmarked against MUSTANG for multiple alignments as well as against TM-align and CE for pairwise alignments. On the average, SALIGN produces a 15% improvement in structural overlap over HOMSTRAD and 14% over MUSTANG, and yields more equivalent structural positions than TM-align and CE in 90% and 95% of cases, respectively. The utility of accurate multiple structure alignment is illustrated by its application to comparative protein structure modeling.&lt;/p&gt;</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">M. A. Marti-Renom</style></author><author><style face="normal" font="default" size="100%">Pieper, U.</style></author><author><style face="normal" font="default" size="100%">Madhusudhan, M. S.</style></author><author><style face="normal" font="default" size="100%">Rossi, A.</style></author><author><style face="normal" font="default" size="100%">Eswar, N.</style></author><author><style face="normal" font="default" size="100%">Davis, F. P.</style></author><author><style face="normal" font="default" size="100%">Fatima Al-Shahrour</style></author><author><style face="normal" font="default" size="100%">Dopazo, J.</style></author><author><style face="normal" font="default" size="100%">Sali, A.</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">DBAli tools: mining the protein structure space</style></title><secondary-title><style face="normal" font="default" size="100%">Nucleic Acids Res</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">*Algorithms Amino Acid Sequence Computational Biology/*methods Data Interpretation</style></keyword><keyword><style  face="normal" font="default" size="100%">Amino Acid *Software Structure-Activity Relationship</style></keyword><keyword><style  face="normal" font="default" size="100%">Protein Internet Molecular Sequence Data Protein Conformation Proteins/*chemistry/classification/*metabolism Pseudomonas aeruginosa/*metabolism Sequence Alignment/*methods Sequence Analysis</style></keyword><keyword><style  face="normal" font="default" size="100%">Protein/*methods Sequence Homology</style></keyword><keyword><style  face="normal" font="default" size="100%">Statistical *Databases</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2007</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&amp;db=PubMed&amp;dopt=Citation&amp;list_uids=17478513</style></url></web-urls></urls><number><style face="normal" font="default" size="100%">Web Server issue</style></number><volume><style face="normal" font="default" size="100%">35</style></volume><pages><style face="normal" font="default" size="100%">W393-7</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">The DBAli tools use a comprehensive set of structural alignments in the DBAli database to leverage the structural information deposited in the Protein Data Bank (PDB). These tools include (i) the DBAlit program that allows users to input the 3D coordinates of a protein structure for comparison by MAMMOTH against all chains in the PDB; (ii) the AnnoLite and AnnoLyze programs that annotate a target structure based on its stored relationships to other structures; (iii) the ModClus program that clusters structures by sequence and structure similarities; (iv) the ModDom program that identifies domains as recurrent structural fragments and (v) an implementation of the COMPARER method in the SALIGN command in MODELLER that creates a multiple structure alignment for a set of related protein structures. Thus, the DBAli tools, which are freely accessible via the World Wide Web at http://salilab.org/DBAli/, allow users to mine the protein structure space by establishing relationships between protein structures and their functions.</style></abstract><notes><style face="normal" font="default" size="100%">Marti-Renom, Marc A Pieper, Ursula Madhusudhan, M S Rossi, Andrea Eswar, Narayanan Davis, Fred P Al-Shahrour, Fatima Dopazo, Joaquin Sali, Andrej GM 62529/GM/NIGMS NIH HHS/United States GM074929/GM/NIGMS NIH HHS/United States GM54762/GM/NIGMS NIH HHS/United States GM71790/GM/NIGMS NIH HHS/United States Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov’t England Nucleic acids research Nucleic Acids Res. 2007 Jul;35(Web Server issue):W393-7. Epub 2007 May 3.</style></notes></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Eswar, N.</style></author><author><style face="normal" font="default" size="100%">Webb, B.</style></author><author><style face="normal" font="default" size="100%">M. A. Marti-Renom</style></author><author><style face="normal" font="default" size="100%">Madhusudhan, M. S.</style></author><author><style face="normal" font="default" size="100%">Eramian, D.</style></author><author><style face="normal" font="default" size="100%">Shen, M. Y.</style></author><author><style face="normal" font="default" size="100%">Pieper, U.</style></author><author><style face="normal" font="default" size="100%">Sali, A.</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Comparative protein structure modeling using Modeller</style></title><secondary-title><style face="normal" font="default" size="100%">Curr Protoc Bioinformatics</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Algorithms Amino Acid Sequence Computer Simulation Crystallography/*methods *Models</style></keyword><keyword><style  face="normal" font="default" size="100%">Chemical *Models</style></keyword><keyword><style  face="normal" font="default" size="100%">Molecular Molecular Sequence Data Protein Conformation Protein Folding Proteins/*chemistry/*ultrastructure Sequence Analysis</style></keyword><keyword><style  face="normal" font="default" size="100%">Protein/*methods *Software</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2006</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&amp;db=PubMed&amp;dopt=Citation&amp;list_uids=18428767</style></url></web-urls></urls><volume><style face="normal" font="default" size="100%">Chapter 5</style></volume><pages><style face="normal" font="default" size="100%">Unit 5 6</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Functional characterization of a protein sequence is one of the most frequent problems in biology. This task is usually facilitated by accurate three-dimensional (3-D) structure of the studied protein. In the absence of an experimentally determined structure, comparative or homology modeling can sometimes provide a useful 3-D model for a protein that is related to at least one known protein structure. Comparative modeling predicts the 3-D structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. This unit describes how to calculate comparative models using the program MODELLER and discusses all four steps of comparative modeling, frequently observed errors, and some applications. Modeling lactate dehydrogenase from Trichomonas vaginalis (TvLDH) is described as an example. The download and installation of the MODELLER software is also described.</style></abstract><notes><style face="normal" font="default" size="100%">Eswar, Narayanan Webb, Ben Marti-Renom, Marc A Madhusudhan, M S Eramian, David Shen, Min-Yi Pieper, Ursula Sali, Andrej P01 A135707/PHS HHS/United States P01 GM71790/GM/NIGMS NIH HHS/United States R01 GM54762/GM/NIGMS NIH HHS/United States U54 GM62529/GM/NIGMS NIH HHS/United States Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov’t United States Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.] Curr Protoc Bioinformatics. 2006 Oct;Chapter 5:Unit 5.6.</style></notes></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Pieper, U.</style></author><author><style face="normal" font="default" size="100%">Eswar, N.</style></author><author><style face="normal" font="default" size="100%">Davis, F. P.</style></author><author><style face="normal" font="default" size="100%">Braberg, H.</style></author><author><style face="normal" font="default" size="100%">Madhusudhan, M. S.</style></author><author><style face="normal" font="default" size="100%">Rossi, A.</style></author><author><style face="normal" font="default" size="100%">M. A. Marti-Renom</style></author><author><style face="normal" font="default" size="100%">Karchin, R.</style></author><author><style face="normal" font="default" size="100%">Webb, B. M.</style></author><author><style face="normal" font="default" size="100%">Eramian, D.</style></author><author><style face="normal" font="default" size="100%">Shen, M. Y.</style></author><author><style face="normal" font="default" size="100%">Kelly, L.</style></author><author><style face="normal" font="default" size="100%">Melo, F.</style></author><author><style face="normal" font="default" size="100%">Sali, A.</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">MODBASE: a database of annotated comparative protein structure models and associated resources</style></title><secondary-title><style face="normal" font="default" size="100%">Nucleic Acids Res</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Binding Sites *Databases</style></keyword><keyword><style  face="normal" font="default" size="100%">Molecular Polymorphism</style></keyword><keyword><style  face="normal" font="default" size="100%">Protein Humans Internet Ligands *Models</style></keyword><keyword><style  face="normal" font="default" size="100%">Protein Systems Integration User-Computer Interface</style></keyword><keyword><style  face="normal" font="default" size="100%">Single Nucleotide Protein Structure</style></keyword><keyword><style  face="normal" font="default" size="100%">Tertiary Proteins/*chemistry/genetics/metabolism Software *Structural Homology</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2006</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&amp;db=PubMed&amp;dopt=Citation&amp;list_uids=16381869</style></url></web-urls></urls><number><style face="normal" font="default" size="100%">Database issue</style></number><volume><style face="normal" font="default" size="100%">34</style></volume><pages><style face="normal" font="default" size="100%">D291-5</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">MODBASE (http://salilab.org/modbase) is a database of annotated comparative protein structure models for all available protein sequences that can be matched to at least one known protein structure. The models are calculated by MODPIPE, an automated modeling pipeline that relies on MODELLER for fold assignment, sequence-structure alignment, model building and model assessment (http:/salilab.org/modeller). MODBASE is updated regularly to reflect the growth in protein sequence and structure databases, and improvements in the software for calculating the models. MODBASE currently contains 3 094 524 reliable models for domains in 1 094 750 out of 1 817 889 unique protein sequences in the UniProt database (July 5, 2005); only models based on statistically significant alignments and models assessed to have the correct fold despite insignificant alignments are included. MODBASE also allows users to generate comparative models for proteins of interest with the automated modeling server MODWEB (http://salilab.org/modweb). Our other resources integrated with MODBASE include comprehensive databases of multiple protein structure alignments (DBAli, http://salilab.org/dbali), structurally defined ligand binding sites and structurally defined binary domain interfaces (PIBASE, http://salilab.org/pibase) as well as predictions of ligand binding sites, interactions between yeast proteins, and functional consequences of human nsSNPs (LS-SNP, http://salilab.org/LS-SNP).</style></abstract><notes><style face="normal" font="default" size="100%">Pieper, Ursula Eswar, Narayanan Davis, Fred P Braberg, Hannes Madhusudhan, M S Rossi, Andrea Marti-Renom, Marc Karchin, Rachel Webb, Ben M Eramian, David Shen, Min-Yi Kelly, Libusha Melo, Francisco Sali, Andrej GM 08284/GM/NIGMS NIH HHS/United States P50 GM62529/GM/NIGMS NIH HHS/United States R01 GM 54762/GM/NIGMS NIH HHS/United States R33 CA84699/CA/NCI NIH HHS/United States U54 GM074945/GM/NIGMS NIH HHS/United States Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov’t England Nucleic acids research Nucleic Acids Res. 2006 Jan 1;34(Database issue):D291-5.</style></notes></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Madhusudhan, M. S.</style></author><author><style face="normal" font="default" size="100%">M. A. Marti-Renom</style></author><author><style face="normal" font="default" size="100%">Sanchez, R.</style></author><author><style face="normal" font="default" size="100%">Sali, A.</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Variable gap penalty for protein sequence-structure alignment</style></title><secondary-title><style face="normal" font="default" size="100%">Protein Eng Des Sel</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Algorithms Amino Acid Sequence Models</style></keyword><keyword><style  face="normal" font="default" size="100%">Amino Acid *Software</style></keyword><keyword><style  face="normal" font="default" size="100%">Molecular Molecular Sequence Data Proteins/*chemistry Sequence Alignment/*methods Sequence Analysis</style></keyword><keyword><style  face="normal" font="default" size="100%">Protein/*methods *Sequence Homology</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2006</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&amp;db=PubMed&amp;dopt=Citation&amp;list_uids=16423846</style></url></web-urls></urls><number><style face="normal" font="default" size="100%">3</style></number><volume><style face="normal" font="default" size="100%">19</style></volume><pages><style face="normal" font="default" size="100%">129-33</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">The penalty for inserting gaps into an alignment between two protein sequences is a major determinant of the alignment accuracy. Here, we present an algorithm for finding a globally optimal alignment by dynamic programming that can use a variable gap penalty (VGP) function of any form. We also describe a specific function that depends on the structural context of an insertion or deletion. It penalizes gaps that are introduced within regions of regular secondary structure, buried regions, straight segments and also between two spatially distant residues. The parameters of the penalty function were optimized on a set of 240 sequence pairs of known structure, spanning the sequence identity range of 20-40%. We then tested the algorithm on another set of 238 sequence pairs of known structures. The use of the VGP function increases the number of correctly aligned residues from 81.0 to 84.5% in comparison with the optimized affine gap penalty function; this difference is statistically significant according to Student’s t-test. We estimate that the new algorithm allows us to produce comparative models with an additional approximately 7 million accurately modeled residues in the approximately 1.1 million proteins that are detectably related to a known structure.</style></abstract><notes><style face="normal" font="default" size="100%">Madhusudhan, M S Marti-Renom, Marc A Sanchez, Roberto Sali, Andrej DE016274/DE/NIDCR NIH HHS/United States GM54762/GM/NIGMS NIH HHS/United States GM62529/GM/NIGMS NIH HHS/United States Comparative Study Research Support, N.I.H., Extramural England Protein engineering, design &amp; selection : PEDS Protein Eng Des Sel. 2006 Mar;19(3):129-33. Epub 2006 Jan 19.</style></notes></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">M. A. Marti-Renom</style></author><author><style face="normal" font="default" size="100%">Madhusudhan, M. S.</style></author><author><style face="normal" font="default" size="100%">Sali, A.</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Alignment of protein sequences by their profiles</style></title><secondary-title><style face="normal" font="default" size="100%">Protein Sci</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">*Algorithms Amino Acid Sequence Computational Biology Databases</style></keyword><keyword><style  face="normal" font="default" size="100%">Protein Markov Chains Molecular Sequence Data *Protein Folding Protein Structure</style></keyword><keyword><style  face="normal" font="default" size="100%">Tertiary Proteins/*chemistry *Sequence Alignment Sequence Homology *Software</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2004</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&amp;db=PubMed&amp;dopt=Citation&amp;list_uids=15044736</style></url></web-urls></urls><number><style face="normal" font="default" size="100%">4</style></number><volume><style face="normal" font="default" size="100%">13</style></volume><pages><style face="normal" font="default" size="100%">1071-87</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">The accuracy of an alignment between two protein sequences can be improved by including other detectably related sequences in the comparison. We optimize and benchmark such an approach that relies on aligning two multiple sequence alignments, each one including one of the two protein sequences. Thirteen different protocols for creating and comparing profiles corresponding to the multiple sequence alignments are implemented in the SALIGN command of MODELLER. A test set of 200 pairwise, structure-based alignments with sequence identities below 40% is used to benchmark the 13 protocols as well as a number of previously described sequence alignment methods, including heuristic pairwise sequence alignment by BLAST, pairwise sequence alignment by global dynamic programming with an affine gap penalty function by the ALIGN command of MODELLER, sequence-profile alignment by PSI-BLAST, Hidden Markov Model methods implemented in SAM and LOBSTER, pairwise sequence alignment relying on predicted local structure by SEA, and multiple sequence alignment by CLUSTALW and COMPASS. The alignment accuracies of the best new protocols were significantly better than those of the other tested methods. For example, the fraction of the correctly aligned residues relative to the structure-based alignment by the best protocol is 56%, which can be compared with the accuracies of 26%, 42%, 43%, 48%, 50%, 49%, 43%, and 43% for the other methods, respectively. The new method is currently applied to large-scale comparative protein structure modeling of all known sequences.</style></abstract><notes><style face="normal" font="default" size="100%">Marti-Renom, Marc A Madhusudhan, M S Sali, Andrej P50 GM62529/GM/NIGMS NIH HHS/United States R01 GM54762/GM/NIGMS NIH HHS/United States Research Support, Non-U.S. Gov’t Research Support, U.S. Gov’t, P.H.S. United States Protein science : a publication of the Protein Society Protein Sci. 2004 Apr;13(4):1071-87.</style></notes></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Pieper, U.</style></author><author><style face="normal" font="default" size="100%">Eswar, N.</style></author><author><style face="normal" font="default" size="100%">Braberg, H.</style></author><author><style face="normal" font="default" size="100%">Madhusudhan, M. S.</style></author><author><style face="normal" font="default" size="100%">Davis, F. P.</style></author><author><style face="normal" font="default" size="100%">Stuart, A. C.</style></author><author><style face="normal" font="default" size="100%">Mirkovic, N.</style></author><author><style face="normal" font="default" size="100%">Rossi, A.</style></author><author><style face="normal" font="default" size="100%">M. A. Marti-Renom</style></author><author><style face="normal" font="default" size="100%">Fiser, A.</style></author><author><style face="normal" font="default" size="100%">Webb, B.</style></author><author><style face="normal" font="default" size="100%">Greenblatt, D.</style></author><author><style face="normal" font="default" size="100%">Huang, C. C.</style></author><author><style face="normal" font="default" size="100%">Ferrin, T. E.</style></author><author><style face="normal" font="default" size="100%">Sali, A.</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">MODBASE, a database of annotated comparative protein structure models, and associated resources</style></title><secondary-title><style face="normal" font="default" size="100%">Nucleic Acids Res</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Amino Acid Sequence Animals Binding Sites *Computational Biology *Databases</style></keyword><keyword><style  face="normal" font="default" size="100%">Molecular Molecular Sequence Data Polymorphism</style></keyword><keyword><style  face="normal" font="default" size="100%">Protein Genomics Humans Internet Ligands Models</style></keyword><keyword><style  face="normal" font="default" size="100%">Single Nucleotide Protein Binding Protein Conformation Proteins/*chemistry/genetics Sequence Alignment Software User-Computer Interface</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2004</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&amp;db=PubMed&amp;dopt=Citation&amp;list_uids=14681398</style></url></web-urls></urls><number><style face="normal" font="default" size="100%">Database issue</style></number><volume><style face="normal" font="default" size="100%">32</style></volume><pages><style face="normal" font="default" size="100%">D217-22</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">MODBASE (http://salilab.org/modbase) is a relational database of annotated comparative protein structure models for all available protein sequences matched to at least one known protein structure. The models are calculated by MODPIPE, an automated modeling pipeline that relies on the MODELLER package for fold assignment, sequence-structure alignment, model building and model assessment (http:/salilab.org/modeller). MODBASE uses the MySQL relational database management system for flexible querying and CHIMERA for viewing the sequences and structures (http://www.cgl.ucsf.edu/chimera/). MODBASE is updated regularly to reflect the growth in protein sequence and structure databases, as well as improvements in the software for calculating the models. For ease of access, MODBASE is organized into different data sets. The largest data set contains 1,26,629 models for domains in 659,495 out of 1,182,126 unique protein sequences in the complete Swiss-Prot/TrEMBL database (August 25, 2003); only models based on alignments with significant similarity scores and models assessed to have the correct fold despite insignificant alignments are included. Another model data set supports target selection and structure-based annotation by the New York Structural Genomics Research Consortium; e.g. the 53 new structures produced by the consortium allowed us to characterize structurally 24,113 sequences. MODBASE also contains binding site predictions for small ligands and a set of predicted interactions between pairs of modeled sequences from the same genome. Our other resources associated with MODBASE include a comprehensive database of multiple protein structure alignments (DBALI, http://salilab.org/dbali) as well as web servers for automated comparative modeling with MODPIPE (MODWEB, http://salilab. org/modweb), modeling of loops in protein structures (MODLOOP, http://salilab.org/modloop) and predicting functional consequences of single nucleotide polymorphisms (SNPWEB, http://salilab. org/snpweb).</style></abstract><notes><style face="normal" font="default" size="100%">Pieper, Ursula Eswar, Narayanan Braberg, Hannes Madhusudhan, M S Davis, Fred P Stuart, Ashley C Mirkovic, Nebojsa Rossi, Andrea Marti-Renom, Marc A Fiser, Andras Webb, Ben Greenblatt, Daniel Huang, Conrad C Ferrin, Thomas E Sali, Andrej P41 RR01081/RR/NCRR NIH HHS/United States P50 GM62529/GM/NIGMS NIH HHS/United States R01 GM 54762/GM/NIGMS NIH HHS/United States R33 CA84699/CA/NCI NIH HHS/United States Research Support, Non-U.S. Gov’t Research Support, U.S. Gov’t, P.H.S. England Nucleic acids research Nucleic Acids Res. 2004 Jan 1;32(Database issue):D217-22.</style></notes></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Koh, I. Y.</style></author><author><style face="normal" font="default" size="100%">Eyrich, V. A.</style></author><author><style face="normal" font="default" size="100%">M. A. Marti-Renom</style></author><author><style face="normal" font="default" size="100%">Przybylski, D.</style></author><author><style face="normal" font="default" size="100%">Madhusudhan, M. S.</style></author><author><style face="normal" font="default" size="100%">Eswar, N.</style></author><author><style face="normal" font="default" size="100%">Grana, O.</style></author><author><style face="normal" font="default" size="100%">Pazos, F.</style></author><author><style face="normal" font="default" size="100%">Valencia, A.</style></author><author><style face="normal" font="default" size="100%">Sali, A.</style></author><author><style face="normal" font="default" size="100%">Rost, B.</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">EVA: Evaluation of protein structure prediction servers</style></title><secondary-title><style face="normal" font="default" size="100%">Nucleic Acids Res</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Automation Databases</style></keyword><keyword><style  face="normal" font="default" size="100%">Protein</style></keyword><keyword><style  face="normal" font="default" size="100%">Protein Internet *Protein Conformation Protein Folding Protein Structure</style></keyword><keyword><style  face="normal" font="default" size="100%">Protein Structural Homology</style></keyword><keyword><style  face="normal" font="default" size="100%">Secondary Proteins/chemistry Reproducibility of Results *Sequence Analysis</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2003</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&amp;db=PubMed&amp;dopt=Citation&amp;list_uids=12824315</style></url></web-urls></urls><number><style face="normal" font="default" size="100%">13</style></number><volume><style face="normal" font="default" size="100%">31</style></volume><pages><style face="normal" font="default" size="100%">3311-5</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">EVA (http://cubic.bioc.columbia.edu/eva/) is a web server for evaluation of the accuracy of automated protein structure prediction methods. The evaluation is updated automatically each week, to cope with the large number of existing prediction servers and the constant changes in the prediction methods. EVA currently assesses servers for secondary structure prediction, contact prediction, comparative protein structure modelling and threading/fold recognition. Every day, sequences of newly available protein structures in the Protein Data Bank (PDB) are sent to the servers and their predictions are collected. The predictions are then compared to the experimental structures once a week; the results are published on the EVA web pages. Over time, EVA has accumulated prediction results for a large number of proteins, ranging from hundreds to thousands, depending on the prediction method. This large sample assures that methods are compared reliably. As a result, EVA provides useful information to developers as well as users of prediction methods.</style></abstract><notes><style face="normal" font="default" size="100%">Koh, Ingrid Y Y Eyrich, Volker A Marti-Renom, Marc A Przybylski, Dariusz Madhusudhan, Mallur S Eswar, Narayanan Grana, Osvaldo Pazos, Florencio Valencia, Alfonso Sali, Andrej Rost, Burkhard 1-P50-GM62413-01/GM/NIGMS NIH HHS/United States 5-P20-LM7276/LM/NLM NIH HHS/United States P50 GM62529/GM/NIGMS NIH HHS/United States R01 GM54762/GM/NIGMS NIH HHS/United States R01-GM63029-01/GM/NIGMS NIH HHS/United States Research Support, Non-U.S. Gov’t Research Support, U.S. Gov’t, Non-P.H.S. Research Support, U.S. Gov’t, P.H.S. England Nucleic acids research Nucleic Acids Res. 2003 Jul 1;31(13):3311-5.</style></notes></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Eswar, N.</style></author><author><style face="normal" font="default" size="100%">John, B.</style></author><author><style face="normal" font="default" size="100%">Mirkovic, N.</style></author><author><style face="normal" font="default" size="100%">Fiser, A.</style></author><author><style face="normal" font="default" size="100%">Ilyin, V. A.</style></author><author><style face="normal" font="default" size="100%">Pieper, U.</style></author><author><style face="normal" font="default" size="100%">Stuart, A. C.</style></author><author><style face="normal" font="default" size="100%">M. A. Marti-Renom</style></author><author><style face="normal" font="default" size="100%">Madhusudhan, M. S.</style></author><author><style face="normal" font="default" size="100%">Yerkovich, B.</style></author><author><style face="normal" font="default" size="100%">Sali, A.</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Tools for comparative protein structure modeling and analysis</style></title><secondary-title><style face="normal" font="default" size="100%">Nucleic Acids Res</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Amino Acid *Software *Structural Homology</style></keyword><keyword><style  face="normal" font="default" size="100%">Internet Models</style></keyword><keyword><style  face="normal" font="default" size="100%">Molecular Protein Folding Proteins/chemistry Reproducibility of Results Sequence Alignment Sequence Homology</style></keyword><keyword><style  face="normal" font="default" size="100%">Protein Systems Integration</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2003</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&amp;db=PubMed&amp;dopt=Citation&amp;list_uids=12824331</style></url></web-urls></urls><number><style face="normal" font="default" size="100%">13</style></number><volume><style face="normal" font="default" size="100%">31</style></volume><pages><style face="normal" font="default" size="100%">3375-80</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">The following resources for comparative protein structure modeling and analysis are described (http://salilab.org): MODELLER, a program for comparative modeling by satisfaction of spatial restraints; MODWEB, a web server for automated comparative modeling that relies on PSI-BLAST, IMPALA and MODELLER; MODLOOP, a web server for automated loop modeling that relies on MODELLER; MOULDER, a CPU intensive protocol of MODWEB for building comparative models based on distant known structures; MODBASE, a comprehensive database of annotated comparative models for all sequences detectably related to a known structure; MODVIEW, a Netscape plugin for Linux that integrates viewing of multiple sequences and structures; and SNPWEB, a web server for structure-based prediction of the functional impact of a single amino acid substitution.</style></abstract><notes><style face="normal" font="default" size="100%">Eswar, Narayanan John, Bino Mirkovic, Nebojsa Fiser, Andras Ilyin, Valentin A Pieper, Ursula Stuart, Ashley C Marti-Renom, Marc A Madhusudhan, M S Yerkovich, Bozidar Sali, Andrej P50 GM62529/GM/NIGMS NIH HHS/United States R01 GM 54762/GM/NIGMS NIH HHS/United States R33 CA84699/CA/NCI NIH HHS/United States Research Support, Non-U.S. Gov’t Research Support, U.S. Gov’t, P.H.S. England Nucleic acids research Nucleic Acids Res. 2003 Jul 1;31(13):3375-80.</style></notes></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">M. A. Marti-Renom</style></author><author><style face="normal" font="default" size="100%">Madhusudhan, M. S.</style></author><author><style face="normal" font="default" size="100%">Fiser, A.</style></author><author><style face="normal" font="default" size="100%">Rost, B.</style></author><author><style face="normal" font="default" size="100%">Sali, A.</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Reliability of assessment of protein structure prediction methods</style></title><secondary-title><style face="normal" font="default" size="100%">Structure</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">*Computer Simulation Humans *Models</style></keyword><keyword><style  face="normal" font="default" size="100%">Molecular *Protein Conformation Proteins/*chemistry Reproducibility of Results</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2002</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&amp;db=PubMed&amp;dopt=Citation&amp;list_uids=12005441</style></url></web-urls></urls><number><style face="normal" font="default" size="100%">3</style></number><volume><style face="normal" font="default" size="100%">10</style></volume><pages><style face="normal" font="default" size="100%">435-40</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;The reliability of ranking of protein structure modeling methods is assessed. The assessment is based on the parametric Student’s t test and the nonparametric Wilcox signed rank test of statistical significance of the difference between paired samples. The approach is applied to the ranking of the comparative modeling methods tested at the fourth meeting on Critical Assessment of Techniques for Protein Structure Prediction (CASP). It is shown that the 14 CASP4 test sequences may not be sufficient to reliably distinguish between the top eight methods, given the model quality differences and their standard deviations. We suggest that CASP needs to be supplemented by an assessment of protein structure prediction methods that is automated, continuous in time, based on several criteria applied to a large number of models, and with quantitative statistical reliability assigned to each characterization.&lt;/p&gt;</style></abstract><notes><style face="normal" font="default" size="100%">&lt;p&gt;Marti-Renom, Marc A Madhusudhan, M S Fiser, Andras Rost, Burkhard Sali, Andrej GM 54762/GM/NIGMS NIH HHS/United States GM62413/GM/NIGMS NIH HHS/United States Comparative Study Research Support, Non-U.S. Gov’t Research Support, U.S. Gov’t, P.H.S. United States Structure (London, England : 1993) Structure. 2002 Mar;10(3):435-40.&lt;/p&gt;</style></notes></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Eyrich, V. A.</style></author><author><style face="normal" font="default" size="100%">M. A. Marti-Renom</style></author><author><style face="normal" font="default" size="100%">Przybylski, D.</style></author><author><style face="normal" font="default" size="100%">Madhusudhan, M. S.</style></author><author><style face="normal" font="default" size="100%">Fiser, A.</style></author><author><style face="normal" font="default" size="100%">Pazos, F.</style></author><author><style face="normal" font="default" size="100%">Valencia, A.</style></author><author><style face="normal" font="default" size="100%">Sali, A.</style></author><author><style face="normal" font="default" size="100%">Rost, B.</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">EVA: continuous automatic evaluation of protein structure prediction servers</style></title><secondary-title><style face="normal" font="default" size="100%">Bioinformatics</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Automation Internet *Protein Conformation Proteins/*analysis *Software</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2001</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&amp;db=PubMed&amp;dopt=Citation&amp;list_uids=11751240</style></url></web-urls></urls><number><style face="normal" font="default" size="100%">12</style></number><volume><style face="normal" font="default" size="100%">17</style></volume><pages><style face="normal" font="default" size="100%">1242-3</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Evaluation of protein structure prediction methods is difficult and time-consuming. Here, we describe EVA, a web server for assessing protein structure prediction methods, in an automated, continuous and large-scale fashion. Currently, EVA evaluates the performance of a variety of prediction methods available through the internet. Every week, the sequences of the latest experimentally determined protein structures are sent to prediction servers, results are collected, performance is evaluated, and a summary is published on the web. EVA has so far collected data for more than 3000 protein chains. These results may provide valuable insight to both developers and users of prediction methods. AVAILABILITY: http://cubic.bioc.columbia.edu/eva. CONTACT: eva@cubic.bioc.columbia.edu</style></abstract><notes><style face="normal" font="default" size="100%">Eyrich, V A Marti-Renom, M A Przybylski, D Madhusudhan, M S Fiser, A Pazos, F Valencia, A Sali, A Rost, B England Bioinformatics (Oxford, England) Bioinformatics. 2001 Dec;17(12):1242-3.</style></notes></record></records></xml>