%0 Journal Article %J Nucleic acids research %D 2015 %T Babelomics 5.0: functional interpretation for new generations of genomic data. %A Alonso, Roberto %A Salavert, Francisco %A Garcia-Garcia, Francisco %A Carbonell-Caballero, José %A Bleda, Marta %A García-Alonso, Luz %A Sanchis-Juan, Alba %A Perez-Gil, Daniel %A Marin-Garcia, Pablo %A Sánchez, Rubén %A Cubuk, Cankut %A Hidalgo, Marta R %A Amadoz, Alicia %A Hernansaiz-Ballesteros, Rosa D %A Alemán, Alejandro %A Tárraga, Joaquín %A Montaner, David %A Medina, Ignacio %A Dopazo, Joaquin %K babelomics %K data integration %K gene set analysis %K interactome %K network analysis %K NGS %K RNA-seq %K Systems biology %K transcriptomics %X Babelomics has been running for more than a decade, offering a user-friendly interface for the functional analysis of gene expression and genomic data. Here we present its fifth release, which includes support for Next Generation Sequencing data including gene expression (RNA-seq), exome or genome resequencing. The Babelomics interface has been simplified and is now more intuitive. Improved visualization options, such as a genome viewer as well as an interactive network viewer, have been implemented. New technical enhancements on both the client and server sides make the user experience faster and more dynamic. Babelomics offers user-friendly access to a full range of methods that cover: (i) primary data analysis, (ii) a variety of tests for different experimental designs and (iii) different enrichment and network analysis algorithms for the interpretation of the results of such tests in the proper functional context. In addition to the public server, local copies of Babelomics can be downloaded and installed. Babelomics is freely available at: http://www.babelomics.org.
%B Nucleic acids research %V 43 %P W117-W121 %8 2015 Apr 20 %G eng %U http://nar.oxfordjournals.org/content/43/W1/W117 %R 10.1093/nar/gkv384 %0 Journal Article %J Scientific reports %D 2015 %T Exome sequencing reveals a high genetic heterogeneity on familial Hirschsprung disease. %A Luzón-Toro, Berta %A Gui, Hongsheng %A Ruiz-Ferrer, Macarena %A Sze-Man Tang, Clara %A Fernández, Raquel M %A Sham, Pak-Chung %A Torroglosa, Ana %A Kwong-Hang Tam, Paul %A Espino-Paisán, Laura %A Cherny, Stacey S %A Bleda, Marta %A Enguix-Riego, María Del Valle %A Joaquín Dopazo %A Antiñolo, Guillermo %A Garcia-Barceló, Maria-Mercè %A Borrego, Salud %K babelomics %K Hirschsprung %K NGS %K prioritization %X Hirschsprung disease (HSCR; OMIM 142623) is a developmental disorder characterized by aganglionosis along variable lengths of the distal gastrointestinal tract, which results in intestinal obstruction. Interactions among known HSCR genes and/or unknown disease susceptibility loci lead to variable severity of phenotype. Neither linkage nor genome-wide association studies have efficiently contributed to completely dissect the genetic pathways underlying this complex genetic disorder. We have performed whole exome sequencing of 16 HSCR patients from 8 unrelated families with the SOLiD platform. Variants shared by affected relatives were validated by Sanger sequencing. We searched for genes recurrently mutated across families. Only variations in the FAT3 gene were significantly enriched in five families. Within-family analysis identified compound heterozygotes for AHNAK and several genes (N = 23) with heterozygous variants that co-segregated with the phenotype. Network and pathway analyses facilitated the discovery of polygenic inheritance involving FAT3, HSCR known genes and their gene partners. Altogether, our approach has facilitated the detection of more than one damaging variant in biologically plausible genes that could jointly contribute to the phenotype.
Our data may contribute to the understanding of the complex interactions that occur during enteric nervous system development and the etiopathology of familial HSCR. %B Scientific reports %V 5 %P 16473 %8 2015 %G eng %U http://www.nature.com/articles/srep16473 %R 10.1038/srep16473 %0 Journal Article %J Nucleic Acids Research %D 2010 %T Babelomics: an integrative platform for the analysis of transcriptomics, proteomics and genomic data with advanced functional profiling. %A Medina, Ignacio %A Carbonell, José %A Pulido, Luis %A Madeira, Sara C %A Goetz, Stefan %A Ana Conesa %A Tárraga, Joaquín %A Pascual-Montano, Alberto %A Nogales-Cadenas, Ruben %A Santoyo, Javier %A García, Francisco %A Marbà, Martina %A Montaner, David %A Joaquín Dopazo %K babelomics %K gene expression %K genotyping %K gepas %K GSA %K GWAS %X

Babelomics is a response to the growing necessity of integrating and analyzing different types of genomic data in an environment that allows an easy functional interpretation of the results. Babelomics includes a complete suite of methods for the analysis of gene expression data, including normalization (covering most commercial platforms), pre-processing, differential gene expression (case-control, multiclass, survival or continuous values), predictors and clustering, as well as methods for large-scale genotyping assays (case-control and TDT designs, with population stratification analysis and correction). All these genomic data analysis facilities are integrated and connected to multiple options for the functional interpretation of the experiments. Different methods of functional enrichment or gene set enrichment can be used to understand the functional basis of the experiment analyzed. Many sources of biological information, which include functional (GO, KEGG, Biocarta, Reactome, etc.), regulatory (Transfac, Jaspar, ORegAnno, miRNAs, etc.), text-mining or protein-protein interaction modules, can be used for this purpose. Finally, a tool for the de novo functional annotation of sequences has been included in the system. This provides support for the functional analysis of non-model species. Mirrors of Babelomics or command line execution of its individual components are now possible. Babelomics is available at http://www.babelomics.org.

%B Nucleic Acids Research %V 38 %P W210-W213 %8 2010 May 16 %G eng %U http://nar.oxfordjournals.org/content/38/suppl_2/W210.full %& Featured in NAR %0 Journal Article %J Artif Intell Med %D 2009 %T Formulating and testing hypotheses in functional genomics %A Dopazo, J. %K babelomics %K gene set analysis %X

OBJECTIVE: The ultimate goal of any genome-scale experiment is to provide a functional interpretation of the results, relating the available genomic information to the hypotheses that originated the experiment. METHODS AND RESULTS: Initially, this interpretation was made on a pre-selection of relevant genes, based on the experimental values, followed by the study of their enrichment in some functional properties. Nevertheless, functional enrichment methods were shown to have a flaw: the first step of gene selection was too stringent, given that the cooperation among genes was ignored. The assumption that modules of genes related by relevant biological properties (functionality, co-regulation, chromosomal location, etc.) are the real actors of cell biology led to the development of new procedures, inspired by systems biology criteria, generically known as gene-set methods. These methods have been successfully used to analyze transcriptomic and large-scale genotyping experiments, as well as to test other genome-scale hypotheses in fields such as phylogenomics.

%B Artif Intell Med %V 45 %P 97-107 %G eng %U http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=18789659 %0 Journal Article %J Nucl. Acids Res. %D 2009 %T Gene set-based analysis of polymorphisms: finding pathways or biological processes associated to traits in genome-wide association studies %A Medina, Ignacio %A Montaner, David %A Bonifaci, Núria %A Pujana, Miguel Angel %A Carbonell, José %A Tárraga, Joaquín %A Fatima Al-Shahrour %A Dopazo, Joaquin %K babelomics %K gene set %K GESBAP %K pathway-based analysis %K SNP %X

Genome-wide association studies have become a popular strategy to find associations of genes to traits of interest. Despite the high resolution available today for genotyping studies, their success in practice has been limited by the testing strategy used. As an alternative to brute force solutions involving the use of very large cohorts, we propose the use of Gene Set Analysis (GSA), a different analysis strategy based on testing the association of modules of functionally related genes. We show here how the Gene Set-based Analysis of Polymorphisms (GeSBAP), which is a simple implementation of the GSA strategy for the analysis of genome-wide association studies, provides a significant increase in testing power for this type of study. GeSBAP is freely available at http://bioinfo.cipf.es/gesbap/

%B Nucl. Acids Res. %V 37 %P W340-344 %G eng %U http://nar.oxfordjournals.org/cgi/content/abstract/37/suppl_2/W340 %R 10.1093/nar/gkp481 %0 Journal Article %J Nucleic Acids Res %D 2008 %T Babelomics: advanced functional profiling of transcriptomics, proteomics and genomics experiments %A Fatima Al-Shahrour %A Carbonell, J. %A Minguez, P. %A Goetz, S. %A A. Conesa %A Tarraga, J. %A Medina, Ignacio %A Alloza, E. %A Montaner, D. %A Dopazo, J. %K babelomics %K functional profiling %X

We present a new version of Babelomics, a complete suite of web tools for the functional profiling of genome scale experiments, with new and improved methods as well as more types of functional definitions. Babelomics includes different flavours of conventional functional enrichment methods as well as more advanced gene set analysis methods, which make it a unique tool among the similar resources available. In addition to the well-known functional definitions (GO, KEGG), Babelomics includes new ones such as Biocarta pathways or text mining-derived functional terms. Regulatory modules implemented include transcriptional control (Transfac, CisRed) and other levels of regulation such as miRNA-mediated interference. Moreover, Babelomics allows for sub-selection of terms in order to test more focused hypotheses. Gene annotation correspondence tables can also be imported, which allows testing with user-defined functional modules. Finally, a tool for the de novo functional annotation of sequences has been included in the system. This allows as-yet unannotated organisms to be used in the program. Babelomics has been extensively re-engineered and now includes the use of web services and Web 2.0 technology features, a new user interface with persistent sessions and a new extended database of gene identifiers. Babelomics is available at http://www.babelomics.org.

%B Nucleic Acids Res %V 36 %P W341-6 %G eng %U http://nar.oxfordjournals.org/content/36/suppl_2/W341.long %0 Journal Article %J Nucleic Acids Res %D 2007 %T FatiGO +: a functional profiling tool for genomic data. Integration of functional annotation, regulatory motifs and interaction data with microarray experiments %A Fatima Al-Shahrour %A Minguez, P. %A Tarraga, J. %A Medina, Ignacio %A Alloza, E. %A Montaner, D. %A Dopazo, J. %K babelomics %K functional enrichment analysis %X

The ultimate goal of any genome-scale experiment is to provide a functional interpretation of the data, relating the available information with the hypotheses that originated the experiment. Thus, functional profiling methods have become essential in diverse scenarios such as microarray experiments, proteomics, etc. We present FatiGO+, a web-based tool for the functional profiling of genome-scale experiments, especially oriented to the interpretation of microarray experiments. In addition to different functional annotations (gene ontology, KEGG pathways, Interpro motifs, Swissprot keywords and text-mining based bioentities related to diseases and chemical compounds), FatiGO+ includes, as a novelty, regulatory and structural information. The regulatory information used includes predictions of targets for distinct regulatory elements (obtained from the Transfac and CisRed databases). Additionally, FatiGO+ uses predictions of target motifs of miRNA to infer which of these can be activated or deactivated in the sample of genes studied. Finally, properties of gene products related to their relative location and connections in the interactome have also been used. Also, enrichment of any of these functional terms can be directly analysed on chromosomal coordinates. FatiGO+ can be found at: http://www.fatigoplus.org and within the Babelomics environment http://www.babelomics.org.

%B Nucleic Acids Res %V 35 %P W91-6 %G eng %U http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17478504 %0 Journal Article %J BMC Bioinformatics %D 2007 %T From genes to functional classes in the study of biological systems %A Fatima Al-Shahrour %A Arbiza, L. %A H. Dopazo %A Huerta-Cepas, J. %A Minguez, P. %A Montaner, D. %A Dopazo, J. %K Algorithms Chromosome Mapping/*methods Computer Simulation Gene Expression Profiling/methods *Models %K babelomics %K Biological Multigene Family/*physiology Signal Transduction/*physiology *Software Systems Biology/*methods *User-Computer Interface %X

BACKGROUND: With the popularization of high-throughput techniques, the need for procedures that help in the biological interpretation of results has increased enormously. Recently, new procedures inspired by systems biology criteria have started to be developed. RESULTS: Here we present FatiScan, a web-based program which implements a threshold-independent test for the functional interpretation of large-scale experiments that does not depend on the pre-selection of genes based on the multiple application of independent tests to each gene. The test implemented aims to directly test the behaviour of blocks of functionally related genes, instead of focusing on single genes. In addition, the test does not depend on the type of the data used for obtaining significance values, and consequently different types of biologically informative terms (gene ontology, pathways, functional motifs, transcription factor binding sites or regulatory sites from CisRed) can be applied to different classes of genome-scale studies. We exemplify its application in microarray gene expression, evolution and interactomics. CONCLUSION: Methods for gene set enrichment which, in addition, are independent of the original data and experimental design constitute a promising alternative for the functional profiling of genome-scale experiments. A web server that performs the test described and other similar ones can be found at: http://www.babelomics.org.

%B BMC Bioinformatics %V 8 %P 114 %G eng %U http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17407596 %0 Journal Article %J Bioinformation %D 2007 %T Functional profiling and gene expression analysis of chromosomal copy number alterations %A L. Conde %A Montaner, D. %A Burguet-Castell, J. %A Tarraga, J. %A Fatima Al-Shahrour %A Dopazo, J. %K babelomics %X

Contrary to the traditional view in which only one or a few key genes were supposed to be the causative factors of diseases, we discuss the importance of considering groups of functionally related genes in the study of pathologies characterised by chromosomal copy number alterations. Recent observations have reported the existence of regions in higher eukaryotic chromosomes (including humans) containing genes of related function that show a high degree of coregulation. Copy number alterations will consequently affect clusters of functionally related genes, which will, in many cases, be the final causative agents of the diseased phenotype. Therefore, we propose that the functional profiling of the regions affected by copy number alterations must be an important aspect to take into account in the understanding of this type of pathology. To illustrate this, we present an integrated study of DNA copy number variations and gene expression, along with the functional profiling of chromosomal regions, in a case of multiple myeloma.

%B Bioinformation %V 1 %P 432-5 %G eng %U http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17597935 %0 Journal Article %J Bioinformatics %D 2007 %T Functional profiling of microarray experiments using text-mining derived bioentities %A Minguez, P. %A Fatima Al-Shahrour %A Montaner, D. %A Dopazo, J. %K Artificial Intelligence *Databases %K babelomics %K Protein Gene Expression Profiling/*methods Information Storage and Retrieval/*methods *Natural Language Processing Proteins/*classification/*metabolism Research/*methods Systems Integration %X

MOTIVATION: The increasing use of microarray technologies brought about a parallel demand for methods for the functional interpretation of the results. Beyond the conventional functional annotations for genes, such as gene ontology, pathways, etc., other sources of information are still to be exploited. Text-mining methods allow extracting informative terms (bioentities) with different functional, chemical, clinical, etc. meanings that can be associated with genes. We show how to use these associations within an appropriate statistical framework and how to apply them through easy-to-use, web-based environments to the functional interpretation of microarray experiments. Functional enrichment and gene set enrichment tests using bioentities are presented.

%B Bioinformatics %V 23 %P 3098-9 %G eng %U http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17855415 %0 Book Section %B Microarray Technology Through Applications %D 2007 %T Microarray Technology in Agricultural Research %A A. Conesa %A J. Forment %A J. Gadea %A van Dijk, J. %K babelomics %B Microarray Technology Through Applications %I F. Falciani. Publisher: Taylor and Francis Group %P 173-209 %G eng %0 Journal Article %J Bioinformatics %D 2007 %T Prophet, a web-based tool for class prediction using microarray data %A Medina, Ignacio %A Montaner, D. %A Tarraga, J. %A Dopazo, J. %K babelomics %K gepas %K predictors %X

Sample classification and class prediction are the aims of many gene expression studies. We present a web-based application, Prophet, which builds prediction rules and allows using them for further sample classification. Prophet automatically chooses the best classifier, along with the optimal selection of genes, using a strategy that renders unbiased cross-validated errors. Prophet is linked to different microarray data analysis modules, and includes a unique feature: the possibility of performing the functional interpretation of the molecular signature found. Availability: Prophet can be found at the URL http://prophet.bioinfo.cipf.es/ or within the GEPAS package at http://www.gepas.org/ Supplementary information: http://gepas.bioinfo.cipf.es/tutorial/prophet.html.

%B Bioinformatics %V 23 %P 390-1 %G eng %U http://bioinformatics.oxfordjournals.org/cgi/content/full/23/3/390?view=long&pmid=17138587 %0 Journal Article %J Nucleic Acids Res %D 2006 %T BABELOMICS: a systems biology perspective in the functional annotation of genome-scale experiments %A Fatima Al-Shahrour %A Minguez, P. %A Tarraga, J. %A Montaner, D. %A Alloza, E. %A Vaquerizas, J. M. %A L. Conde %A Blaschke, C. %A Vera, J. %A Dopazo, J. %K babelomics %K functional profiling %X

We present a new version of Babelomics, a complete suite of web tools for functional analysis of genome-scale experiments, with new and improved tools. New functionally relevant terms have been included such as CisRed motifs or bioentities obtained by text-mining procedures. An improved indexing has considerably speeded up several of the modules. An improved version of the FatiScan method for studying the coordinate behaviour of groups of functionally related genes is presented, along with a similar tool, the Gene Set Enrichment Analysis. Babelomics is now more oriented to test systems biology inspired hypotheses. Babelomics can be found at http://www.babelomics.org.

%B Nucleic Acids Res %V 34 %P W472-6 %G eng %U http://nar.oxfordjournals.org/content/34/suppl_2/W472.long %0 Journal Article %J Stud Health Technol Inform %D 2006 %T Blast2GO goes grid: developing a grid-enabled prototype for functional genomics analysis %A Aparicio, G. %A Gotz, S. %A A. Conesa %A Segrelles, D. %A Blanquer, I. %A Garcia, J. M. %A Hernandez, V. %A Robles, M. %A Talon, M. %K babelomics %X

The vast amount and complexity of data generated in Genomic Research implies that new dedicated and powerful computational tools need to be developed to meet their analysis requirements. Blast2GO (B2G) is a bioinformatics tool for Gene Ontology-based DNA or protein sequence annotation and function-based data mining. The application has been developed with the aim of offering an easy-to-use tool for functional genomics research. Typical B2G users are medium-sized genomics labs carrying out sequencing, EST and microarray projects, handling datasets of up to several thousand sequences. In the current version of B2G, the power and analytical potential of both annotation and function data-mining is somewhat restricted by the computational power behind each particular installation. In order to offer enhanced computational capacity within this bioinformatics application, a Grid component is being developed. A prototype has been conceived for the particular problem of speeding up the Blast searches to obtain fast results for large datasets. Many efforts to speed up Blast searches have been reported in the literature, but few of them deal with the use of large heterogeneous production Grid Infrastructures. These are the infrastructures that could reach the largest number of resources and the best load balancing for data access. The Grid Service under development will analyse requests based on the number of sequences, splitting them according to the available resources. Lower-level computation will be performed through MPIBLAST. The software architecture is based on the WSRF standard.

%B Stud Health Technol Inform %V 120 %P 194-204 %G eng %U http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=16823138 %0 Journal Article %J OMICS %D 2006 %T Functional interpretation of microarray experiments %A Dopazo, J. %K babelomics %K Diabetes Mellitus %K microarray data analysis %X

Over the past few years, due to the popularisation of high-throughput methodologies such as DNA microarrays, the possibility of obtaining experimental data has increased significantly. Nevertheless, the interpretation of the results, which involves translating these data into useful biological knowledge, still remains a challenge. The methods and strategies used for this interpretation are in continuous evolution and new proposals are constantly arising. Initially, a two-step approach was used in which genes of interest were first selected, based on thresholds that consider only experimental values, and then, in a second, independent step, the enrichment of these genes in biologically relevant terms was analysed. For different reasons, these methods are relatively poor in terms of performance, and a new generation of procedures, which draw inspiration from systems biology criteria, is currently under development. Such procedures aim to directly test the behaviour of blocks of functionally related genes, instead of focusing on single genes.

%B OMICS %V 10 %P 398-410 %G eng %U http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=17069516 %0 Journal Article %J Genome Inform %D 2006 %T A function-centric approach to the biological interpretation of microarray time-series %A Minguez, P. %A Fatima Al-Shahrour %A Dopazo, J. %K babelomics %X

The interpretation of microarray experiments is commonly addressed by means of a two-step approach in which the relevant genes are first selected solely on the basis of their experimental values (ignoring their coordinate behaviors) and, in a second step, their functional properties are studied to hypothesize about the biological roles they are fulfilling in the cell. Recently, different methods (e.g. GSEA or FatiScan) have been proposed to study the coordinate behavior of blocks of functionally-related genes. These methods study the distribution of functional information across lists of genes ranked according to their different experimental values in a static situation, such as the comparison between two classes (e.g. healthy controls versus diseased cases). Nevertheless, there is no equivalent way of studying a dynamic situation from a functional point of view. We present a method for the functional analysis of microarray series in which the experiments display autocorrelation between successive points (e.g. time series, dose-response experiments, etc.). The method allows recovering the dynamics of the molecular roles fulfilled by the genes along the series, which provides a novel approach to the functional interpretation of such experiments. The method finds blocks of functionally-related genes which are significantly and coordinately over-expressed at different points of the series. This method draws inspiration from systems biology, given that the analysis does not focus on individual properties of genes but on collectively behaving blocks of functionally-related genes. The FatiScan algorithm used in the method proposed is available at: http://fatiscan.bioinfo.cipf.es, or within the Babelomics suite: http://www.babelomics.org. Additional material is available at: http://bioinfo.cipf.es/data/plasmodium.

%B Genome Inform %V 17 %P 57-66 %G eng %0 Journal Article %J Methods Mol Biol %D 2006 %T Ontology-driven approaches to analyzing data in functional genomics %A F. Azuaje %A Fatima Al-Shahrour %A Dopazo, J. %K babelomics %K Cluster Analysis %K Cluster Analysis Computational Biology/*methods *Data Interpretation %K Computational Biology %K Statistical Gene Expression Profiling %K Statistical Gene Expression Profiling *Genomics Humans %X

Ontologies are fundamental knowledge representations that provide not only standards for annotating and indexing biological information, but also the basis for implementing functional classification and interpretation models. This chapter discusses the application of gene ontology (GO) for predictive tasks in functional genomics. It focuses on the problem of analyzing functional patterns associated with gene products. This chapter is divided into two main parts. The first part overviews GO and its applications for the development of functional classification models. The second part presents two methods for the characterization of genomic information using GO. It discusses methods for measuring functional similarity of gene products, and a tool for supporting gene expression clustering analysis and validation.

%B Methods Mol Biol %V 316 %P 67-86 %G eng %U http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=16671401 %0 Journal Article %J Nucleic Acids Res %D 2005 %T BABELOMICS: a suite of web tools for functional annotation and analysis of groups of genes in high-throughput experiments %A Fatima Al-Shahrour %A Minguez, P. %A Vaquerizas, J. M. %A L. Conde %A Dopazo, J. %K babelomics %K functional profiling %X

We present Babelomics, a complete suite of web tools for the functional analysis of groups of genes in high-throughput experiments, which includes the use of information on Gene Ontology terms, interpro motifs, KEGG pathways, Swiss-Prot keywords, analysis of predicted transcription factor binding sites, chromosomal positions and presence in tissues with determined histological characteristics, through five integrated modules: FatiGO (fast assignment and transference of information), FatiWise, transcription factor association test, GenomeGO and tissues mining tool, respectively. Additionally, another module, FatiScan, provides a new procedure that integrates biological information in combination with experimental results in order to find groups of genes with modest but coordinate significant differential behaviour. FatiScan is highly sensitive and is capable of finding significant asymmetries in the distribution of genes of common function across a list of ordered genes even if these asymmetries are not extreme. The strong multiple-testing nature of the contrasts made by the tools is taken into account. All the tools are integrated in the gene expression analysis package GEPAS. Babelomics is the natural evolution of our tool FatiGO (which analysed almost 22,000 experiments during the last year) to include more sources of information and new modes of using them. Babelomics can be found at http://www.babelomics.org.

%B Nucleic Acids Res %V 33 %P W460-4 %G eng %U http://nar.oxfordjournals.org/content/33/suppl_2/W460.long %0 Journal Article %J Bioinformatics %D 2005 %T Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research %A A. Conesa %A Gotz, S. %A Garcia-Gomez, J. M. %A Terol, J. %A Talon, M. %A Robles, M. %K babelomics %X

SUMMARY: We present here Blast2GO (B2G), a research tool designed with the main purpose of enabling Gene Ontology (GO) based data mining on sequence data for which no GO annotation is yet available. B2G joins in one application GO annotation based on similarity searches with statistical analysis and highlighted visualization on directed acyclic graphs. This tool offers a suitable platform for functional genomics research in non-model species. B2G is an intuitive and interactive desktop application that allows monitoring and comprehension of the whole annotation and analysis process. AVAILABILITY: Blast2GO is freely available via Java Web Start at http://www.blast2go.de. SUPPLEMENTARY MATERIAL: http://www.blast2go.de -> Evaluation.

%B Bioinformatics %V 21 %P 3674-6 %G eng %U http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=16081474 %0 Book Section %D 2005 %T Data analysis and visualisation in genomics and proteomics %A F. Azuaje %A Dopazo, J. %K babelomics %I Wiley, F. Azuaje and J. Dopazo %G eng %0 Journal Article %J Bioinformatics %D 2005 %T Discovering molecular functions significantly related to phenotypes by combining gene expression data and biological information %A Fatima Al-Shahrour %A Diaz-Uriarte, R. %A Dopazo, J. %K babelomics %K Biological Neoplasm Proteins/genetics/*metabolism Phenotype Software Structure-Activity Relationship Systems Integration Tumor Markers %K Biological/genetics/*metabolism %K Breast Neoplasms/genetics/*metabolism Computer Simulation *Database Management Systems *Databases %K Protein Documentation/methods Gene Expression Profiling/*methods Humans *Models %X

MOTIVATION: The analysis of genome-scale data from different high throughput techniques can be used to obtain lists of genes ordered according to their different behaviours under distinct experimental conditions corresponding to different phenotypes (e.g. differential gene expression between diseased samples and controls, different response to a drug, etc.). The order in which the genes appear in the list is a consequence of the biological roles that the genes play within the cell, which account, at molecular scale, for the macroscopic differences observed between the phenotypes studied. Typically, two steps are followed for understanding the biological processes that differentiate phenotypes at molecular level: first, genes with significant differential expression are selected on the basis of their experimental values and subsequently, the functional properties of these genes are analysed. Instead, we present a simple procedure which combines experimental measurements with available biological information in a way that genes are simultaneously tested in groups related by common functional properties. The method proposed constitutes a very sensitive tool for selecting genes with significant differential behaviour in the experimental conditions tested. RESULTS: We propose the use of a method to scan ordered lists of genes. The method allows the understanding of the biological processes operating at molecular level behind the macroscopic experiment from which the list was generated. This procedure can be useful in situations where it is not possible to obtain statistically significant differences based on the experimental measurements (e.g. low prevalence diseases, etc.). Two examples demonstrate its application in two microarray experiments and the type of information that can be extracted.

%B Bioinformatics %V 21 %P 2988-93 %G eng %U http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=PubMed&dopt=Citation&list_uids=15840702 %0 Journal Article %J Bioinformatics %D 2004 %T FatiGO: a web tool for finding significant associations of Gene Ontology terms with groups of genes %A Fatima Al-Shahrour %A Diaz-Uriarte, R. %A Dopazo, J. %K *Algorithms Artificial Intelligence Databases %K babelomics %K DNA/*methods *Software %K Genetic Gene Expression Profiling/*methods *Hypermedia Information Storage and Retrieval/*methods *Internet *Phylogeny Sequence Alignment/methods Sequence Analysis %X

We present a simple but powerful procedure to extract Gene Ontology (GO) terms that are significantly over- or under-represented in sets of genes within the context of a genome-scale experiment (DNA microarray, proteomics, etc.). Said procedure has been implemented as a web application, FatiGO, allowing for easy and interactive querying. FatiGO, which takes the multiple-testing nature of statistical contrast into account, currently includes GO associations for diverse organisms (human, mouse, fly, worm and yeast) and the TrEMBL/Swissprot GOAnnotations@EBI correspondences from the European Bioinformatics Institute.

%B Bioinformatics %V 20 %P 578-80 %G eng %U http://bioinformatics.oxfordjournals.org/content/20/4/578.abstract %0 Book Section %B Neural Networks for Signal Processing XIII %D 2003 %T Using Gene Ontology on genome-scale studies to find significant associations of biologically relevant terms to group of genes %A Fatima Al-Shahrour %A Herrero, J. %A A. Mateos %A J. Santoyo %A Díaz-Uriarte, R %A Dopazo, J. %K babelomics %B Neural Networks for Signal Processing XIII %I IEEE Press %C New York, USA %P 43-52 %G eng