GENOAK : Sequencing of the oak genome and identification of genes that matter for forest tree adaptation
Trees are an important component of the global ecosystem. They dominate many regions of the world as natural, extensively or intensively managed plantations. The Fagaceae family comprises about 1,000 woody species distributed throughout the northern hemisphere. About half belong to the Quercus (oak) genus. These oaks have traditionally high cultural and societal values. As other forest tree species, they are also providing important environmental (carbon sequestration, water cycle, reservoir of biodiversity, soil protection...) and economic (carpentry, furniture, cabinet making, veneer, cask industry, fuelwood, hunting and fungus gathering) services.
Finally these long-lived organisms are considered as relevant models to study both short and long term adaptation mechanisms to abiotic and biotic constraints associated with global change, as they grow under a wide range of environments (soil and climate). Due to their biological dominance, their commercial and ecological importance worldwide, an international network was launched recently to develop a coordinated strategy to sequence several Fagaceae genomes. These genomic resources will allow to discover which genes are involved in the adaptation of these organisms to their environments and to study their adaptive potential under current and future climate changes.
In Europe, genomic resources have been developed for sessile and pedunculate oaks in the frame of the EVOLTREE network of excellence. In US, American and Chinese chestnuts (Castanea) were the main targets of a NSF funded project. Their genome is now being sequenced within the Forest Health Initiative . The goal is to provide a high-quality reference sequence for the Castanea mollissima (Chinese Chestnut) genome, with which to identify genes for resistance to Cryphonectria parasitica. In China, the efforts to sequence the Lithocarpus and Castanopsis genomes are also on the right track.
The present project aims at supporting the participation of French Institutions involved in this international effort. It is a collaboration between four INRA research units and the GENOSCOPE: the French National Sequencing Center.
The objective of the GENOAK project are two-fold:
- Establish a high quality reference genome sequence for Quercus robur (740Mb/C), anchored to an ultra high density SNP-based linkage map on which a series of trait-QTLs have been localized. This resource will provide genetic links between traits and sequence variations at a high resolution,
- Identify genes involved in adaptation through the analysis of adaptive divergence (incipient ecological speciation) at the transcriptome and genome-wide levels by respectively i) identifying differentially expressed genes between co-occuring species (sessile and pedunculate oaks) adapted to different edapho-climatic environments, and ii) analyzing the size, distribution and nature of genomic islands of divergence maintained in the face of gene flow of the same sympatric species.
Tasks :
- Obtain a reference genome for a single Pedunculate oak genotype
- Assign genome scaffolds on the genome using a highly saturated SNP-based genetic linkage map
- Annotate structurally and functionally the genome
- Integrate the whole genome with automatic prediction into a Genome Browser
- Study the micro-synteny and colinearity between oak and chestnut, as well as with other eudicots, for which a genome sequence is available
- Reveal genetic variations that are responsible for important adaptive traits in oaks by resequencing the genome of several individuals of two sister species.
Website :
- https://w3.pierroton.inra.fr/QuercusPortal/index.php?p=OAK_GENOME_SEQUENCING
- http://www.oakgenome.fr/
Publications related to the project :
- Oak genome reveals facets of long lifespan.
- Decoding the oak genome: public release of sequence data, assembly, annotation and publication strategies.
- Analysis of BAC end sequences in oak, a keystone forest tree species, providing insight into the composition of its genome.
CNRGV's responsible
- William Marande