The draft flax genome pseudomolecules

Citation

You FM, Li P, Ragupathy R, Kumar S, Zhu T, Luo M-C, Duguid SD, Rashid K, Booker H, Deyholos M, Fu Y-B, Sharpe A, Cloutier S (2016) The draft flax genome pseudomolecules. 66th Flax Institute of the United States, Fargo, North Dakota, March 31-April 1, 2016.

Plain language summary

Flax (Linum usitatissimum L., 2n=30) is a self-pollinated annual species of the Linaceae family and was domesticated ~7,000 years ago for its seed oil and stem fibres. The genome of cv. CDC Bethune was sequenced in 2012 using a whole genome shotgun strategy (WGS) and the Illumina sequencing platform. The sequences were assembled into 3,852 scaffolds larger than 1 Kb covering a total of 300 Mb. By integrating the BAC-based physical map, the BAC end sequences, the high-density SNP-and SSR-based genetic maps and the BioNano optical genome map, we refined and sorted out the scaffolds onto 15 linkage groups and generated their draft pseudomolecules. A total of 286 Mb of scaffold sequences were anchored to ~360 Mb of fingerprint contigs and, 284 Mb of scaffold sequences were mapped to 316 Mb of the BioNano map, covering 97% of all predicted genes of the WGS assembly. The pseudomolecules contain ~316 Mb of sequences with linkage groups varying in sizes from 15.6 to 29.4 Mb. The use of the draft flax pseudomolecules will enhance genome-wide SNP discovery, haplotype map construction, QTL identification, association mapping and comparative genome analysis.

Abstract

Flax (Linum usitatissimum L., 2n=30) is a self-pollinated annual species of the Linaceae family and was domesticated ~7,000 years ago for its seed oil and stem fibres. The genome of cv. CDC Bethune was sequenced in 2012 using a whole genome shotgun strategy (WGS) and the Illumina sequencing platform. The sequences were assembled into 3,852 scaffolds larger than 1 Kb covering a total of 300 Mb. By integrating the BAC-based physical map, the BAC end sequences, the high-density SNP-and SSR-based genetic maps and the BioNano optical genome map, we refined and sorted out the scaffolds onto 15 linkage groups and generated their draft pseudomolecules. A total of 286 Mb of scaffold sequences were anchored to ~360 Mb of fingerprint contigs and, 284 Mb of scaffold sequences were mapped to 316 Mb of the BioNano map, covering 97% of all predicted genes of the WGS assembly. The pseudomolecules contain ~316 Mb of sequences with linkage groups varying in sizes from 15.6 to 29.4 Mb. The use of the draft flax pseudomolecules will enhance genome-wide SNP discovery, haplotype map construction, QTL identification, association mapping and comparative genome analysis.