Prunus sibirica CH320_5 Whole Genome v1.0 Assembly & Annotation
Overview
Publication Background The specific traits that have recently evolved in domesticated organisms under strong and recent human-driven selection provide unique opportunities to understand adaptive processes as it leaves footprints in the genome that are easier to detect than those left by natural selection. Here we studied the evolutionary history and selection footprints in apricots (Armeniaca spp) using a population genomics approach. We obtained the genomes of nearly 600 accessions and four high-quality assemblies of Armeniaca genomes, anchored on genetic maps. Population structure inferences and approximate Bayesian computation using these genomes indicated that Chinese and European apricots formed two differentiated gene pools with relatively high genetic diversity, resulting from independent domestication events from two different wild Central Asian populations, and with gene flow between wild and cultivated populations. We detected genomic footprints of selection (i.e., selective sweeps and recurrent changes in amino-acids) in the two groups of cultivated apricots. Consistent with large effective population sizes and outcrossing in fruit trees, we found a relatively low proportion of the genome affected by selection. Different genomic regions have been affected by selection in European and Chinese cultivated apricots, despite convergent phenotypic traits. Selection footprints appeared more abundant in European apricots, with a hotspot on chromosome 4, while admixture was much more pervasive in Chinese cultivated apricots. In both cultivated groups, the genes affected by selection had predicted functions involved in the perennial life cycle, fruit quality and disease resistance, and some of them colocalized with previously mapped genomic regions associated with these functions. In addition to improving our fundamental knowledge on the processes of adaptation, identified genes under positive selection provide clues to the biology of selected traits and targets for fruit tree research and breeding. Homology
Homology of the Prunus sibirica genome v1.0 proteins was determined by pairwise sequence comparison using the blastp algorithm against various protein databases. An expectation value cutoff less than 1e-9 was used for the NCBI nr (Release 2018-05) and 1e-6 for the Arabidoposis proteins (Araport11), UniProtKB/SwissProt (Release 2019-01), and UniProtKB/TrEMBL (Release 2019-01) databases. The best hit reports are available for download in Excel format.
Protein Homologs
Assembly
The Prunus sibirica Genome v1.0 assembly file is available in FASTA format. Downloads
Gene Predictions
The Prunus sibirica v1.0 genome gene prediction files are available in FASTA and GFF3 formats. Downloads
Functional Analysis
Functional annotation for the Prunus sibirica Genome v1.0 are available for download below. The Prunus sibirica Genome v1.0 proteins were analyzed using InterProScan in order to assign InterPro domains and Gene Ontology (GO) terms. Pathways analysis was performed using the KEGG Automatic Annotation Server (KAAS). Downloads
Transcript Alignments
Transcript alignments were performed by the GDR Team of Main Bioinformatics Lab at WSU. The alignment tool 'BLAT' was used to map transcripts to the Prunus sibirica genome assembly. Alignments with an alignment length of 97% and 97% identify were preserved. The available files are in GFF3 format.
|