CSVS, a crowdsourcing database of the Spanish population genetic variability.

Title: CSVS, a crowdsourcing database of the Spanish population genetic variability.
Publication Type: Journal Article
Year of Publication: 2021
Authors: Peña-Chilet, M, Roldán, G, Perez-Florido, J, Ortuno, FM, Carmona, R, Aquino, V, López-López, D, Loucera, C, Fernandez-Rueda, JL, Gallego, A, Garcia-Garcia, F, González-Neira, A, Pita, G, Núñez-Torres, R, Santoyo-López, J, Ayuso, C, Minguez, P, Avila-Fernandez, A, Corton, M, Moreno-Pelayo, MÁngel, Morin, M, Gallego-Martinez, A, Lopez-Escamez, JA, Borrego, S, Antiñolo, G, Amigo, J, Salgado-Garrido, J, Pasalodos-Sanchez, S, Morte, B, Carracedo, Á, Alonso, Á, Dopazo, J
Corporate Authors: Spanish Exome Crowdsourcing Consortium
Journal: Nucleic Acids Res
Volume: 49
Issue: D1
Pagination: D1130-D1137
Date Published: 2021-01-08
ISSN: 1362-4962
Keywords: Alleles; Chromosome Mapping; Crowdsourcing; Databases, Genetic; Exome; Gene Frequency; Genetic Variation; Genetics, Population; Genome, Human; Genomics; Humans; Internet; Precision Medicine; Software; Spain
Abstract

Knowledge of the genetic variability of the local population is of utmost importance in personalized medicine and has proven to be a critical factor in the discovery of new disease variants. Here, we present the Collaborative Spanish Variability Server (CSVS), which currently contains more than 2000 genomes and exomes of unrelated Spanish individuals. This database has been generated in a collaborative crowdsourcing effort that collects sequencing data produced by local genomic projects and for other purposes. Sequences have been grouped by ICD10 upper categories. A web interface allows the database to be queried with one or more ICD10 categories removed. In this way, aggregated allele frequencies of a pseudo-control Spanish population can be obtained for diseases belonging to the removed categories. In addition to pseudo-control studies, some population studies are possible, for example on the prevalence of pharmacogenomic variants. These genomic data have also been used to define the first Spanish Genome Reference Panel (SGRP1.0) for imputation. This is the first local repository of variability produced entirely by a crowdsourcing effort and constitutes an example for future initiatives to characterize local variability worldwide. CSVS is also part of the GA4GH Beacon network. CSVS can be accessed at: http://csvs.babelomics.org/.
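
The pseudo-control idea described in the abstract can be illustrated with a minimal Python sketch. It is not the CSVS API or its implementation: the `GenotypeCounts` type, the `pseudo_control_frequency` function, the category labels and all counts below are hypothetical and chosen only for illustration. The sketch assumes a biallelic variant on an autosome in diploid individuals, with genotype counts stored per ICD10 upper category.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GenotypeCounts:
    """Genotype counts for one biallelic variant in one category (hypothetical type)."""
    hom_ref: int  # individuals carrying 0/0
    het: int      # individuals carrying 0/1
    hom_alt: int  # individuals carrying 1/1


def pseudo_control_frequency(counts_by_category, excluded):
    """Alternate-allele frequency after dropping the excluded categories.

    Returns None when no individuals remain. Assumes diploid autosomal genotypes.
    """
    kept = [c for cat, c in counts_by_category.items() if cat not in excluded]
    individuals = sum(c.hom_ref + c.het + c.hom_alt for c in kept)
    if individuals == 0:
        return None
    alt_alleles = sum(c.het + 2 * c.hom_alt for c in kept)
    return alt_alleles / (2 * individuals)


# Illustrative numbers only, not taken from CSVS.
counts = {
    "healthy": GenotypeCounts(hom_ref=90, het=9, hom_alt=1),
    "VII": GenotypeCounts(hom_ref=40, het=8, hom_alt=2),  # hypothetical label
    "IX": GenotypeCounts(hom_ref=48, het=2, hom_alt=0),   # hypothetical label
}

# A study of a disease in category "VII" removes that category from the controls.
print(pseudo_control_frequency(counts, excluded={"VII"}))
```

Removing the category under study keeps affected individuals, who may be enriched for the variant, out of the background frequency, while every other contributed sample still counts toward the pseudo-control population.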

DOI: 10.1093/nar/gkaa794
Alternate Journal: Nucleic Acids Res
PubMed ID: 32990755
PubMed Central ID: PMC7778906