A crowdsourcing database for the copy-number variation of the Spanish population.

TitleA crowdsourcing database for the copy-number variation of the Spanish population.
Publication TypeJournal Article
Year of Publication2023
AuthorsLópez-López, D, Roldán, G, Fernandez-Rueda, JL, Bostelmann, G, Carmona, R, Aquino, V, Perez-Florido, J, Ortuno, F, Pita, G, Núñez-Torres, R, González-Neira, A, Peña-Chilet, M, Dopazo, J
Corporate AuthorsCSVS Crowdsourcing Group
JournalHum Genomics
Volume17
Issue1
Pagination20
Date Published2023 Mar 09
ISSN1479-7364
Abstract

BACKGROUND: Despite being a very common type of genetic variation, the distribution of copy-number variations (CNVs) in the population is still poorly understood. The knowledge of the genetic variability, especially at the level of the local population, is a critical factor for distinguishing pathogenic from non-pathogenic variation in the discovery of new disease variants.RESULTS: Here, we present the SPAnish Copy Number Alterations Collaborative Server (SPACNACS), which currently contains copy number variation profiles obtained from more than 400 genomes and exomes of unrelated Spanish individuals. By means of a collaborative crowdsourcing effort whole genome and whole exome sequencing data, produced by local genomic projects and for other purposes, is continuously collected. Once checked both, the Spanish ancestry and the lack of kinship with other individuals in the SPACNACS, the CNVs are inferred for these sequences and they are used to populate the database. A web interface allows querying the database with different filters that include ICD10 upper categories. This allows discarding samples from the disease under study and obtaining pseudo-control CNV profiles from the local population. We also show here additional studies on the local impact of CNVs in some phenotypes and on pharmacogenomic variants. SPACNACS can be accessed at: http://csvs.clinbioinfosspa.es/spacnacs/ .CONCLUSION: SPACNACS facilitates disease gene discovery by providing detailed information of the local variability of the population and exemplifies how to reuse genomic data produced for other purposes to build a local reference database.

DOI10.1186/s40246-023-00466-8
Alternate JournalHum Genomics
PubMed ID36894999
PubMed Central IDPMC9997023