2020-2021 field seasons of Maize GxE project within the Genomes to Fields Initiative
BMC Research Notes volume 16, Article number: 219 (2023)
This release note describes the Maize GxE project datasets within the Genomes to Fields (G2F) Initiative. The Maize GxE project aims to understand genotype by environment (GxE) interactions and use the information collected to improve resource allocation efficiency and increase genotype predictability and stability, particularly in scenarios of variable environmental patterns. Hybrids and inbreds are evaluated across multiple environments and phenotypic, genotypic, environmental, and metadata information are made publicly available.
The datasets include phenotypic data of the hybrids and inbreds evaluated in 30 locations across the US and one location in Germany in 2020 and 2021, soil and climatic measurements and metadata information for all environments (combination of year and location), ReadMe, and description files for each data type. A set of common hybrids is present in each environment to connect with previous evaluations. Each environment had a collaborator responsible for collecting and submitting the data, the GxE coordination team combined all the collected information and removed obvious erroneous data. Collaborators received the combined data to use, verify and declare that the data generated in their own environments was accurate. Combined data is released to the public with minimal filtering to maintain fidelity to the original data.
The release of this data provides a unique resource to understand and dissect genotype-by-environment interactions in maize (Zea mays subsp. mays L.). Collaborators generate phenotypic, environmental, and metadata datasets to support a more comprehensive understanding of the opportunities and challenges associated with maize production in various environments. The Maize GxE project data is made available to the public in its original form, with minimum filtering to remove erroneous data or as specified by collaborators and in the description files. This approach ensures that the publicly available data contains the maximum amount of information collected by project collaborators and empowers users to define their quality controls based on their specific goals.
A set of 1184 publicly available hybrids were evaluated in the 2020 and 2021 seasons across 30 different locations. The main group of hybrids was produced by the cross of doubled-haploid (DH) inbred lines from the Wisconsin Stiff Stalk MAGIC population (WI-SS-MAGIC), crossed with three ex-PVP inbred testers, PHZ51, PHP02, and PHK76 . The WI-SS-MAGIC population involves the inbreds B73, B84, NKH8431, LH145, PHB47, and PHJ40 as parents in the initial crosses, and a detailed description of the population creation and DH production is in Michel et al. (2022) . The testers were selected to allow adaptation of materials to the wide array of maturities sampled across the project. Inbred tester PHZ51 was used in southern locations (DEH1, GAH1, GAH2, IAH1, IAH2, IAH3, IAH4, MOH1, NCH1, NEH1, NEH2, NEH3, NYH3, SCH1, TXH1, TXH2, TXH3, WIH2), PHK76 in the Midwest and intermediate locations (DEH1, IAH1, IAH2, IAH3, IAH4, ILH1, INH1, MOH1, NCH1, NYH3, WIH2), and PHP02 in the northern locations (DEH1, GEH1, IAH2, IAH3, IAH4, MIH1, MNH1, MOH1, NYH2, OHH1, WIH1, WIH2, WIH3). Six locations (mega-locations) had hybrids created using all three testers (DEH1, IAH2, IAH3, IAH4, MOH1, WIH2). Additional smaller-scale experiments were conducted alongside the main experiment for additional phenotyping and/or deployment of novel phenotyping methods and tools across approximately 83% of the locations. These experiments included the external Yellow Stripes (YS), same set of check hybrids know as ‘Yellow Stripe’ used to connect location and years, but evaluated in a different experiment; the High-Intensity Phenotyping Site (HIPS), which tested 22 hybrids (HIP_Hybrid) and 22 inbreds (HIP_Inbred). HIPS was introduced in 2020 as a more comprehensive phenotyping set for aerial high-throughput phenotyping platforms. The choice of hybrids and inbreds was based on their historical importance and relevance to other connected projects.
The 2020 and 2021 datasets are publicly available via CyVerse/iPlant. These datasets contain phenotypic, environmental, soil, and supplemental data, and have been structured according to the specifications outlined in Table 1.
Phenotypic data: Phenotypic measurements that follow a standard set of instructions, available at genomes2fields.org. Standard traits include days to anthesis, days to silking, ear height, plant height, stand count, stalk lodging, root lodging, grain moisture, test weight, plot weight, and grain yield. Both raw data and minimally quality-controlled (clean) data are reported separately. Out of range observations were set to missing values following the rules described in the readMe files.
Environmental dataset: WatchDog 2700 weather stations (Spectrum Technologies) were placed at each field site. Data were collected at 30-min intervals, or according with collaborator set up, from planting through harvest at each location. The geographic locations of the experiments are not identical across years due to crop rotation management practices; thus, the locations of the weather stations vary across years. Each station measured wind speed, direction, and gust; air temperature, dewpoint, relative humidity; soil temperature and moisture; rainfall and solar radiation. Additional measurements taken at selected sites included soil electrical conductivity, ultra-violet light, carbon dioxide, and photosynthetically active radiation. Instructions for weather station maintenance activities including pre-season tasks, field setup, maintenance throughout the growing season, and removal are available on the G2F webpage .
Soil dataset: Each field location collected soil samples that represent the experiment field according to the instructions available on the G2F webpage.
Supplemental dataset: Supplemental information consists of metadata (any field-level data collected at planting, in season, and/or at harvest), agronomic information (list of products, nutrients, and irrigation applied), and cooperator list (collaborators responsible for the field locations in 2020 and 2021).
These datasets contain missing data. Missing data includes data not reported by collaborators or erroneous data as specified on the readMe and description files. Genotypic data is not included in this release. Locations that did not collect data due to the complete loss of the experiment are listed on the cooperator list.
Genomes to Fields
Genotype by environment
Michel KJ, Lima DC, Hundley H, Singan V, Yoshinaga Y, Daum C et al. Genetic mapping and prediction of flowering time and plant height in a maize Stiff Stalk MAGIC population. Genetics. 2022;221.
Genomes to Fields resources. 2023. https://www.genomes2fields.org/resources/.
G2F Consortium. Genomes to Fields 2020 dataset. CyVerse Data Commons. 2020. https://doi.org/10.25739/hzzs-a865.
G2F Consortium. Genomes to Fields 2021 dataset. CyVerse Data Commons. 2021. https://doi.org/10.25739/5ae3-sw62.
We gratefully acknowledge contributions from many field managers, crop research coordinators, staff, graduate students, student interns and data collectors including: Dustin Eilert, Marina Borsecnik, Renata Barcelos and Ben Fischer (de Leon/Kaeppler labs, University of Wisconsin - Madison). Naomi Rodman, Hadden Duling, Nathan Moore, Evonne Pinto, Marielle Quinn, & Kai Sanders (Wallace lab). Jason Brewer (USDA-ARS, Raleigh, NC). Trevor Perla, Paige Coffee, Will Deems, and Amy Deariso (ARS-Tifton, GA). Brian Hearn, Gunnar Isaacs, Buddy Willey, Dheeraj Danthuluri, and Raphael Kim (University of Delaware). Tom Siler (Michigan State University). Katherine Guill, Brady Blanton, Mia Ruppel, Grace Sidberry, Daniel Kick (Washburn lab, USDA-ARS). Amanda Gilbert, Peter Hermanson, and Thomas Hoverstad (University of Minnesota).
We gratefully acknowledge support from: National Corn Growers Association, Iowa Corn Promotion Board, Georgia Corn Commission, Nebraska Corn Board, Ohio Corn Marketing Program, Corn Marketing Program of Michigan, Texas Corn Producers Board, University of Göttingen startup funds, USDA-ARS, and USDA Germplasm Enhancement of Maize program.
Ethics approval and consent to participate
Consent for publication
The authors declare no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Lima, D.C., Aviles, A.C., Alpers, R.T. et al. 2020-2021 field seasons of Maize GxE project within the Genomes to Fields Initiative. BMC Res Notes 16, 219 (2023). https://doi.org/10.1186/s13104-023-06430-y
- Genotype by Environment
- Grain Yield