TY - JOUR
T1 - Optical and physical mapping with local finishing enables megabase-scale resolution of agronomically important regions in the wheat genome
AU - IWGSC
AU - Keeble-Gagnère, Gabriel
AU - Rigault, Philippe
AU - Tibbits, Josquin
AU - Pasam, Raj
AU - Hayden, Matthew
AU - Forrest, Kerrie
AU - Frenkel, Zeev
AU - Korol, Abraham
AU - Huang, B. Emma
AU - Cavanagh, Colin
AU - Taylor, Jen
AU - Abrouk, Michael
AU - Sharpe, Andrew
AU - Konkin, David
AU - Sourdille, Pierre
AU - Darrier, Benoît
AU - Choulet, Frédéric
AU - Bernard, Aurélien
AU - Rochfort, Simone
AU - Dimech, Adam
AU - Watson-Haigh, Nathan
AU - Baumann, Ute
AU - Eckermann, Paul
AU - Fleury, Delphine
AU - Juhasz, Angela
AU - Boisvert, Sébastien
AU - Nolin, Marc Alexandre
AU - Doležel, Jaroslav
AU - Šimková, Hana
AU - Toegelová, Helena
AU - Šafář, Jan
AU - Luo, Ming Cheng
AU - Câmara, Francisco
AU - Pfeifer, Matthias
AU - Isdale, Don
AU - Nyström-Persson, Johan
AU - Koo, Dal Hoe
AU - Tinning, Matthew
AU - Cui, Dangqun
AU - Ru, Zhengang
AU - Appels, Rudi
N1 - Publisher Copyright:
© 2018 The Author(s).
PY - 2018/8/17
Y1 - 2018/8/17
N2 - Background: Numerous scaffold-level sequences for wheat are now being released and, in this context, we report on a strategy for improving the overall assembly to a level comparable to that of the human genome. Results: Using chromosome 7A of wheat as a model, sequence-finished megabase-scale sections of this chromosome were established by combining a new independent assembly using a bacterial artificial chromosome (BAC)-based physical map, BAC pool paired-end sequencing, chromosome-arm-specific mate-pair sequencing and Bionano optical mapping with the International Wheat Genome Sequencing Consortium RefSeq v1.0 sequence and its underlying raw data. The combined assembly results in 18 super-scaffolds across the chromosome. The value of finished genome regions is demonstrated for two approximately 2.5 Mb regions associated with yield and the grain quality phenotype of fructan carbohydrate grain levels. In addition, the 50 Mb centromere region analysis incorporates cytological data highlighting the importance of non-sequence data in the assembly of this complex genome region. Conclusions: Sufficient genome sequence information is shown to now be available for the wheat community to produce sequence-finished releases of each chromosome of the reference genome. The high-level completion identified that an array of seven fructosyl transferase genes underpins grain quality and that yield attributes are affected by five F-box-only-protein-ubiquitin ligase domain and four root-specific lipid transfer domain genes. The completed sequence also includes the centromere.
AB - Background: Numerous scaffold-level sequences for wheat are now being released and, in this context, we report on a strategy for improving the overall assembly to a level comparable to that of the human genome. Results: Using chromosome 7A of wheat as a model, sequence-finished megabase-scale sections of this chromosome were established by combining a new independent assembly using a bacterial artificial chromosome (BAC)-based physical map, BAC pool paired-end sequencing, chromosome-arm-specific mate-pair sequencing and Bionano optical mapping with the International Wheat Genome Sequencing Consortium RefSeq v1.0 sequence and its underlying raw data. The combined assembly results in 18 super-scaffolds across the chromosome. The value of finished genome regions is demonstrated for two approximately 2.5 Mb regions associated with yield and the grain quality phenotype of fructan carbohydrate grain levels. In addition, the 50 Mb centromere region analysis incorporates cytological data highlighting the importance of non-sequence data in the assembly of this complex genome region. Conclusions: Sufficient genome sequence information is shown to now be available for the wheat community to produce sequence-finished releases of each chromosome of the reference genome. The high-level completion identified that an array of seven fructosyl transferase genes underpins grain quality and that yield attributes are affected by five F-box-only-protein-ubiquitin ligase domain and four root-specific lipid transfer domain genes. The completed sequence also includes the centromere.
KW - Megabase-scale integration
KW - Optical/physical maps Grain quality
KW - Wheat sequence finishing
KW - Yield
UR - http://www.scopus.com/inward/record.url?scp=85051741528&partnerID=8YFLogxK
U2 - 10.1186/s13059-018-1475-4
DO - 10.1186/s13059-018-1475-4
M3 - Article
C2 - 30115128
AN - SCOPUS:85051741528
SN - 1474-7596
VL - 19
JO - Genome Biology
JF - Genome Biology
IS - 1
M1 - 112
ER -