Chromosome-scale pseudomolecules refined by optical, physical and genetic maps in flax

Citation

You, F.M., Xiao, J., Li, P., Yao, Z., Jia, G., He, L., Zhu, T., Luo, M.C., Wang, X., Deyholos, M.K., Cloutier, S. (2018). Chromosome-scale pseudomolecules refined by optical, physical and genetic maps in flax. Plant Journal, [online] 95(2), 371-384. http://dx.doi.org/10.1111/tpj.13944

Plain language summary

Flax is a self-pollinated annual species of the Linaceae family that was domesticated about 7000 years ago for its seed oil and stem fibers. Genetic maps of economically important crops, such as flax, are a valuable resource for plant breeders who use molecular breeding strategies and to study evolution and diversification. Today, it is possible to go beyond mapping and to decipher the DNA sequence of an organism, Here, we used a so-called next generation sequencing method to produce a large amount of DNA sequence from the flax cultivar CDC Bethune. The main caveat to this approach is the short length of the sequences produced by this sequencing method which complicate the ascertainment of their positioning onto chromosomes. To solve this problem, we developed an optical map of the CDC Bethune genome that we combined with genetic maps previously obtained from several crosses. Optical maps are physical maps that provide long-range assembly information. Here, large DNA molecules are digested on open glass surfaces and visualized by fluorescence microscopy to generate ordered restriction maps. This single-molecule system provides signature barcodes, especially useful for the alignment and validation of the short-read sequences. Genetic maps reflect the recombination rates along chromosomes and, as such, provide complementary information to assist in properly assign, assemble and orient the sequence. The three typed of data were analyzed using a suite of bioinformatics analyses to generate the first chromosome-scale sequence of the flax genome. Flax was the first plant genome to be sequenced entirely by a Canadian team. It was the 10th plant genome sequenced and this report is the first of its refinement into 15 ordered and oriented chromosomes.

Abstract

Genomes of varying sizes have been sequenced with next-generation sequencing platforms. However, most reference sequences include draft unordered scaffolds containing chimeras caused by mis-scaffolding. A BioNano genome (BNG) optical map was constructed to improve the previously sequenced flax genome (Linum usitatissimum L., 2n = 30, about 373 Mb), which consisted of 3852 scaffolds larger than 1 kb and totalling 300.6 Mb. The high-resolution BNG map of cv. CDC Bethune totalled 317 Mb and consisted of 251 BNG contigs with an N50 of 2.15 Mb. A total of 622 scaffolds (286.6 Mb, 94.9%) aligned to 211 BNG contigs (298.6 Mb, 94.2%). Of those, 99 scaffolds, diagnosed to contain assembly errors, were refined into 225 new scaffolds. Using the newly refined scaffold sequences and the validated bacterial artificial chromosome-based physical map of CDC Bethune, the 211 BNG contigs were scaffolded into 94 super-BNG contigs (N50 of 6.64 Mb) that were further assigned to the 15 flax chromosomes using the genetic map. The pseudomolecules total about 316 Mb, with individual chromosomes of 15.6 to 29.4 Mb, and cover 97% of the annotated genes. Evidence from the chromosome-scale pseudomolecules suggests that flax has undergone palaeopolyploidization and mesopolyploidization events, followed by rearrangements and deletions or fusion of chromosome arms from an ancient progenitor with a haploid chromosome number of eight.

Publication date

2018-07-01