Genetic Analysis of High Protein Content in ‘AC Proteus’ Related Soybean Populations Using SSR, SNP, DArT and DArTseq Markers

Citation

Samanfar, B., Cober, E.R., Charette, M., Tan, L.H., Bekele, W.A., Morrison, M.J., Kilian, A., Belzile, F., Molnar, S.J. (2019). Genetic Analysis of High Protein Content in ‘AC Proteus’ Related Soybean Populations Using SSR, SNP, DArT and DArTseq Markers. Scientific Reports, [online] 9(1), http://dx.doi.org/10.1038/s41598-019-55862-9

Plain language summary

Seed protein content is an important trait for soybean, as it crucial for its use as animal feed and for human consumption. Typically, soybean cultivars with high protein content show low yield. The high protein cultivar AC Proteus was developed for short season Canadian conditions and it has become the parent of numerous current varieties with high seed protein. Previous work has indicated that populations developed from AC Proteus may not exhibit the typical negative relationship between seed yield and seed protein. This study investigated the genetic control of high seed protein in AC Proteus. The authors identified several genomic regions which were maintained in high protein cultivars derived from AC Proteus over multiple breeding cycles. These high protein regions may prove useful for further development of high-yielding, high-protein soybean cultivars for short season Canadian conditions.

Abstract

Key message: Several AC Proteus derived genomic regions (QTLs, SNPs) have been identified which may prove useful for further development of high yielding high protein cultivars and allele-specific marker developments. High seed protein content is a trait which is typically difficult to introgress into soybean without an accompanying reduction in seed yield. In a previous study, ‘AC Proteus’ was used as a high protein source and was found to produce populations that did not exhibit the typical association between high protein and low yield. Five high x low protein RIL populations and a high x high protein RIL population were evaluated by either quantitative trait locus (QTL) analysis or bulk segregant analyses (BSA) following phenotyping in the field. QTL analysis in one population using SSR, DArT and DArTseq markers found two QTLs for seed protein content on chromosomes 15 and 20. The BSA analyses suggested multiple genomic regions are involved with high protein content across the five populations, including the two previously mentioned QTLs. In an alternative approach to identify high protein genes, pedigree analysis identified SNPs for which the allele associated with high protein was retained in seven high protein descendants of AC Proteus on chromosomes 2, 17 and 18. Aside from the two identified QTLs (five genomic regions in total considering the two with highly elevated test statistic, but below the statistical threshold and the one with epistatic interactions) which were some distance from Meta-QTL regions and which were also supported by our BSA analysis within five populations. These high protein regions may prove useful for further development of high yielding high protein cultivars.