Introduction

Soybean [Glycine max (L.) Merr.] is among the most widely grown crops in the world and is valuable because of its high oil and protein concentrations. The United States is the leading soybean-producing country, followed by Brazil, Argentina, and China (USDA-FAS 2016). Over the last 60 years, there has been an increasing trend in soybean yield in the US due to improvement in genetic and cultural practices (Irwin and Good 2015). Fox et al. (2013) reported that soybean breeding produced more than 500 cultivars over the last 60 years in North America that contributed to a 25% increase in yield.

Drought occurrence represents a severe abiotic stress and causes a reduction in soybean productivity in rain-fed areas. Drought adversely affects soybean yield to some degree at most developmental stages, particularly, during reproductive development (Oya et al. 2004). Drought is a major cause in the variation of soybean yield from year to year (Zipper et al. 2016) and is projected to be more intense with global climate change. Genetic improvement of soybean for drought tolerance is a cost-effective approach to stabilize yield for rain-fed areas.

Slow canopy wilting is a promising trait for improving drought tolerance. Screening of exotic germplasm for drought tolerance in North Carolina identified several slow-wilting genotypes, including PI 416937 and PI 471938 (Carter et al. 1999, 2006). Several mechanisms are likely to be responsible for slower canopy wilting. Sinclair et al. (2008) reported that slow-wilting in PI416937 was mainly associated with lower leaf hydraulic conductance for transpiration rate under high vapor pressure deficit (VPD). Devi and Sinclair (2013) reported that slow-wilting in PI 471938 was associated with a lower osmotic potential that helped to maintain a greater leaf turgor pressure. Slow-wilting may also be associated with maintaining greater leaf turgor, transpiration, and CO2 exchange rates during drought conditions (Carter et al. 2006; Fletcher et al. 2007; Sadok and Sinclair 2009; Sloane et al. 1990). King et al. (2009) and Ries et al. (2012) determined that slow-wilting was due to the conservation of soil moisture when soil moisture was plentiful. The conserved soil moisture could then be used when soil moisture in fast-wilting genotypes had been depleted.

Drought tolerance is complex and is controlled by a number of genes throughout the genome, each with minor effects and interacting with the environment (Blum 2011; Tuberosa et al. 2007). Under water-limited environments, crop performance can be improved by selecting and pyramiding favorable alleles associated with drought-tolerant related traits into elite cultivars (Blum, 2005). Various genomic approaches have been used to dissect genetic control of drought stress tolerance (Tuberosa et al. 2007). Quantitative trait loci (QTL) identification using molecular markers is one way to dissect the traits associated with drought tolerance (Dixit et al. 2014). Several different mapping populations have been used for QTL mapping of canopy wilting, which identified several genomic regions associated with canopy wilting variation. Charlson et al. (2009) identified four QTLs for canopy wilting on chromosomes Gm08, Gm13, Gm14, and Gm17 using a mapping population of 93 RILs (KS4895 and Jackson). Du et al. (2009) used a mapping population of 184 RILs from the cross of Kefeng1 and Nannong1138-2 to identify two QTLs for canopy wilting that were present on Gm8 and Gm20. Abdel-Haleem et al. (2012) identified seven QTLs for canopy wilting on Gm02, Gm04, Gm05, Gm12, Gm14, Gm17, and Gm19 using a mapping population of 150 RILs (Benning and PI 416937). Hwang et al. (2015) used the results of QTLs for wilting from five mapping populations to identify clusters of eight QTLs that were present in at least two populations, and a meta-analysis of these eight clusters identified nine meta-QTLs in eight chromosomal regions (Hwang et al. 2016).

Genome-wide association studies (GWAS) have emerged as a powerful tool to map and unravel complex trait variation down to the sequence level and to identify the genes associated with those traits (Nordborg et al. 2002; Zhu et al. 2008). The USDA soybean germplasm collection has been genotyped with the SoySNP50K iSelect Beadchip, which has allowed characterization of soybean genetic diversity, linkage disequilibrium, and the construction of high resolution linkage maps (Song et al. 2013). Recently, several GWAS in soybean identified significant SNP markers associated with seed protein and oil concentrations (Hwang et al. 2014), carotenoids (Dhanapal et al. 2015a), δ13C ratio (Dhanapal et al. 2015b), agronomic traits (Wen et al. 2014), and ureide concentrations (Ray et al. 2015).

In the present research, 42,509 SNP markers (www.soybase.org) were utilized for GWAS of canopy wilting on a panel of 373 diverse MG IV accessions. The objectives of this study were to explore the genetic variation of canopy wilting present within this panel, to identify significant SNPs associated with canopy wilting, and to confirm those SNPs with previously reported chromosomal regions associated with canopy wilting variations.

Materials and methods

Field experiments

Field experiments were conducted in four environments including the Pine Tree Research Station, AR (35°7′N, 90°55′W) in 2016 (PT-16) on a Calloway silt loam, Rohwer Research Station, AR (33°48′N, 91°17′W) in 2016 (RO-16) on a Sharkey silty clay, Salina, KS (38°480′N, 97°36′W) in 2015 (SA-15) and 2016 (SA-16) on a Hord silt loam. The experiments were sown on 2 June 2016 (PT-16), 23 May 2016 (RO-16), 9 June 2015 (SA-15), and 15 June 2016 (SA-16). The seeding density for all experiments was 32 m−2. For PT-16 and RO-16, plots were sown with a drill (19-cm row spacing) and were 1.52-m wide and 4.57-m long. At Salina for both years, plots consisted of two rows, 0.76 m apart that were 3.65 m in length.

The experimental design at each environment was a randomized complete block with two replications. Along with 373 maturity group (MG) 4 accessions, two check genotypes, slow-wilting (PI 416937) and fast-wilting (A5959), were evaluated in each environment. PI 416937 and A5959 were confirmed as slow and fast-wilting in the previous research (Hwang et al. 2015, 2016; King et al. 2009). These 373 accessions were obtained from the Soybean Germplasm Collection, USDA-ARS based on GRIN (Germplasm Resources Information Network, www.ars-grin.gov) data as reported by Dhanapal et al. (2015b). These accessions originated from 11 different nations, which increased the genetic diversity.

Phenotypic evaluations and descriptive statistics

Phenotypic evaluation of canopy wilting was scored using a visual rating based on a scale from 0 (no wilting) to 100 (plant death) (King et al. 2009). Canopy wilting was rated two times at PT-16, and three times each at RO-16, SA-15, and SA-16 environments within 2 h of solar noon under a clear sky. For any given rating date, all MG 4 accessions were at approximately the same growth stage, while the two slow-wilting check entries were at a slightly earlier stage. The previous research found that genotypic ranking for wilting was similar between years over a wide range of growth stages and over stress conditions that were mild to severe (King et al. 2009) and that there was little or no discernible differences in wilting due to growth stages prior to R6. Because of a dynamic plant/environment interaction, canopy wilting scores must be taken when conditions are favorable to differentiate genotypes for wilting. For the four environments in this study, conditions favorable for canopy wilting occurred between late vegetative development to R4 across all rating dates. For each environment, the average of all the canopy wilting scores per plot was determined for further analysis. Genotype was treated as a fixed effect and replication within the environment was considered as a random effect. Descriptive statistics and Pearson correlation analysis, for the average canopy wilting scores for each environment were computed using the PROC UNIVARIATE and PROC CORR procedures (α = 0.05) of SAS version 9.4 (SAS, Institute 2013), respectively. For analysis of variance (ANOVA), the PROC MIXED procedure (α = 0.05) of SAS 9.4 was used with a model as suggested by Bondari (2003). The model for the phenotypic trait was \( y_{ijk} = \mu + G_{i} + E_{j} + \left( {\text{GE}} \right)_{ij} + B_{{k\left( {ij} \right)}} + \varepsilon_{ijk} , \) where \( \mu \) is the total mean, \( G_{i} \) is the genotypic effect of the ith genotype, \( E_{j} \) is the effect of the jth environment, \( \left( {\text{GE}} \right)_{ij} \) is the interaction effect between the ith genotype and the jth environment, \( B_{{k\left( {ij} \right)}} \) is the effect of replication within the jth environment, and \( \varepsilon_{ijk} \) is a random error following \( N\left( {0, \sigma_{e}^{2} } \right) \).

Broad sense heritability on an entry-mean basis was calculated as \( H^{2} = \sigma_{G}^{2} / \left( {\sigma_{G}^{2} + \left( {\frac{{\sigma_{\text{GE}}^{2} }}{k}} \right) + \left( {\frac{{\sigma_{\varepsilon }^{2} }}{rk}} \right)} \right), \) where \( \sigma_{G}^{2} \) is the genotypic variance, \( \sigma_{\text{GE}}^{2} \) is the genotype by environment variance, \( \sigma_{\varepsilon }^{2} \) is the residual variance, k is the number of environments, and r is the number of replications. The PROC VARCOMP of SAS 9.4 with the REML method (restricted maximum likelihood estimation) was used to estimate the above variance components. To reduce the environmental variation, the Best Linear Unbiased Prediction (BLUP) values for each independent environment and across all environments were estimated using the PROC MIXED procedure, and these values were then used in GWAS analysis.

Genotyping and quality control

Marker data available from Soybase (www.soybase.org) provided data on 42,509 SNP markers for all 373 genotypes. There were 31,260 polymorphic markers after performing quality control checks by eliminating monomorphic markers, markers with minor allele frequency (MAF) <5%, and markers with a missing rate higher than 10%. Imputation of remaining marker data was applied using an LD-kNNi method, which is based on a k-nearest-neighbor genotype (Money et al. 2015). After filtration and imputation, the remaining 31,260 SNPs were used for association testing to identify significant SNPs.

Linkage disequilibrium estimation

Pairwise linkage disequilibrium (LD) between markers was measured by squared correlation coefficients (r 2) of alleles in the TASSEL 5.0 software (Bradbury et al. 2007). To understand the difference in recombination rate, LD was calculated separately for euchromatic and heterochromatic regions. Information of physical distance of euchromatic and heterochromatic regions for each chromosome was obtained from Soybase (www.soybase.org). Nonlinear regression curves, as described by Hill and Weir (1988), were used to estimate the LD decay with distance. The decay rate of LD was determined as the physical distance between markers where the average r 2 dropped to half its maximum value.

Genome-wide association analysis

Population stratification can induce false-positives in GWAS. There are a number of statistical models that effectively control these false-positives by incorporating population structure and kinship among genotypes. The most commonly used model for association analysis is the mixed linear model (MLM) that accounts for the family relatedness and population structure (Yu et al. 2006; Zhang et al. 2010). Incorporation of population structure and family relatedness in the MLM models adjusts association tests to control false-positives; however, these adjustments also compromise true positives (Liu et al. 2016). Hence, these models can induce false-negatives due to over-fitting of the model to a degree where potentially important associations can be missed. Fixed and random model Circulating Probability Unification (FarmCPU) effectively corrects false-positives without compromising true positives (Liu et al. 2016). Both MLM and FarmCPU models were compared using average canopy wilting across all environments and results were evaluated based on quantile–quantile (Q–Q) plots. Based on results as described subsequently, the FarmCPU model was chosen for the association analysis reported herein.

In FarmCPU, Multiple Loci Linear Mixed Model (MLMM) is divided into two parts: a Fixed Effect Model (FEM) and a Random Effect Model (REM) which are used iteratively (Liu et al. 2016). To avoid model over-fitting, REM estimates the multiple associated markers that are used to obtain kinship. The FEM tests markers, one at a time, and kinship from REM as covariates to control false-positives and false-negatives. At each iteration, P values of testing markers and multiple associated markers are unified.

A threshold value (−Log10 P ≥ 3.5), which is equivalent to a P value ≤ 0.0003, was used to declare a significant association of SNPs with canopy wilting. This threshold value is more stringent than that reported in other soybean GWAS studies (Dhanapal et al. 2015a, b; Hao et al. 2012; Hwang et al. 2014; Zhang et al. 2015). To identify the common significant SNPs present in more than one environment, a threshold value of P ≤ 0.05 was used, but only if the representative SNP had an association of P ≤ 0.0003 in a second environment.

Genetic merit for each accession was determined using genomic best linear unbiased prediction (gBLUP), which utilizes genomic-relationship matrix and phenotype data (Clark and van der Werf 2013; Zhang et al. 2007) and breeding values for genotypes. The gBLUP values were calculated in a GAPIT program (Lipka et al. 2012). The breeding value of each accession was calculated from the allelic effects of all significant SNPs. The allelic effect was calculated by taking a difference in mean canopy wilting between genotypes with the major allele and those with the minor allele. Alleles from either the major class or minor class were considered as favorable if they were associated with a reduction in the canopy wilting. To estimate the breeding value for each accession, the absolute value of the allelic effect of each significant SNP was considered as a negative value if an accession had a favorable allele of a significant SNP at that location, (i.e., if the allelic effect decreased canopy wilting). Otherwise, if the allelic effect was unfavorable (i.e., increased canopy wilting), the allelic value for an SNP was considered as a positive value. All positive and negative allelic values were summed to estimate the breeding value of each accession.

Candidate gene identification

Significant SNPs at level of −Log10 P ≥ 3.5 were used to identify the candidate genes in each environment and across all environments. Candidate genes, their associated functional annotation, and biological function were identified using Glyma1.1, Glyma1.0, and NCBI RefSeq gene models in Soybase (www.soybase.org) with consideration for those candidate genes that may have a direct biological connection with canopy wilting, transpiration, rooting, or water transport.

Results

Phenotype descriptions

A broad range of canopy wilting within each environment indicated wide phenotypic variation. Canopy wilting scores had a range of 25 (PT-16), 25 (RO-16), 30 (SA-15), and 38 (SA-16) (Table 1). The Shapiro–Wilk test of normality was performed, which indicated that canopy wilting data were normally distributed within each environment and skewness and kurtosis also indicated a normal distribution (Table 1). Analysis of variance indicated that genotype, environment, and their interaction had significant effects (P ≤ 0.05) on canopy wilting. A significant positive correlation for canopy wilting between environments ranged from r = 0.40 between PT-16 and SA-15 to r = 0.66 between RO-16 and SA-16. Broad sense heritability of canopy wilting on an entry-mean basis was moderate to high for PT-16 (59%), RO-16 (74%), SA-15 (69%), and SA-16 (84%). When considering all environments, heritability was 80%.

Table 1 Descriptive statistics of canopy wilting score from Pine Tree in 2016 (PT-09), Rohwer in 2016 (RO-16), Salina in 2015 (SA-15), 2016 (SA-16), and the average across environments (AAE)

Genomic best linear unbiased prediction (gBLUP) values were calculated utilizing the genomic-relationship matrix and phenotypic data of 373 accessions to estimate the genomic breeding values. The 373 accessions were ranked from lowest to highest based on the average gBLUP values of canopy wilting across all environments. Based on the average gBLUP-value ranking, the 15 accessions with lowest gBLUP for canopy wilting and 15 accessions with highest gBLUP for canopy wilting were selected (Table 2). Ranking of these 30 accessions was consistent with ranking of average phenotypic data of canopy wilting across all environments (Table 2). PI 592940 had the lowest canopy wilting for both gBLUP and phenotypic data, and PI507407 had the highest canopy wilting scores for both gBLUP and phenotypic data. These genotypes likely represent the most consistent extremes for canopy wilting.

Table 2 15 accessions with the lowest and highest ranking (gRank) for canopy wilting score based on gBLUP values averaged across all four environments, Pine Tree in 2016 (PT-16), Rohwer in 2016 (RO-16), Salina in 2015 (SA-15), 2016 (SA-16), and average across all four environments (AAE)

Slow-wilting extreme PI 592940 had an average wilting score across environments of 11; in comparison, the slow-wilting check (PI 416937) had an average wilting score across environments of 20. Fast-wilting extreme PI 507407 had an average wilting score across environments of 39; in comparison, the fast-wilting check (A5959) had an average wilting score across environments of 33. The 15 accessions with lowest gBLUP values and canopy wilting scores averaged across all environments were from China (8 accessions), South Korea (6 accessions), and Taiwan (1 accession) (Table 2). The 15 accessions with the highest gBLUP values and canopy wilting scores averaged across all environments were from Japan (9 accessions), South Korea (5 accessions), and Georgia (1 accession). The breeding value and number of favorable alleles of these 30 accessions were calculated using allelic effects of significant SNPs. Slow-wilting accessions had large negative breeding values associated with reduced canopy wilting (−73 to −4). In contrast, fast-wilting accessions had large positive breeding values (38–106) associated with increased canopy wilting. Slow-wilting accessions had more favorable alleles (29–41) as compared to fast-canopy wilting accessions (10–24).

Markers distribution and linkage disequilibrium

Of the 31,260 markers used for association analysis, 22% of the markers had an MAF between 0.05 and 0.1, and 15% of the markers had a MAF between 0.10 and 0.15 (Table 3). Markers were more densely distributed in euchromatic (75.4%) than heterochromatic regions (24.6%) across the chromosomes. Out of a total of 950.1 Mb in the soybean genome, SNP density in euchromatic region ranged from 42 SNPs/Mb for Gm19 to 65 SNPs/Mb for Gm09. For the heterochromatic regions, SNP density ranged from 4 SNPs/Mb for Gm20 to 36 SNPs/Mb for Gm18. The average \( r^{2} \) between markers in the euchromatic region declined to 0.25 within 150 kb, whereas the r 2 in the heterochromatic region declined to 0.25 within 5000 kb (data not shown). These LD estimated results were consistent with results of Dhanapal et al. (2015b) and Hwang et al. (2014), indicating considerably greater LD for the euchromatic region than the heterochromatic region.

Table 3 Single-nucleotide polymorphism (SNPs) distribution with minor allele frequency (MAF) in this population panel

Genome-wide association analysis

The FarmCPU and MLM models were compared using the average canopy wilting data across all environments. The Q–Q plot of the FarmCPU model resulted in a sharp deviation from the expected P value distribution in the tail area, indicating that false-positives and negatives were adequately controlled, whereas Q–Q plots from MLM models did not show a sharp deviation (Fig. 1). These results indicated that FarmCPU was a better choice than MLM model for association testing in this study.

Fig. 1
figure 1

Quantile–quantile (QQ) plot of the mixed linear model (MLM) and fixed and random model circulating probability unification (FarmCPU) model using average canopy wilting data across all environments

Association analysis identified 61 significant SNPs in four environments associated with canopy wilting at the level of −Log10 (P) ≥ 3.5; P ≤ 0.0003 (Fig. 2). Out of 61 SNPs, 21 SNPs were present in at least two environments. One significant SNP on Gm20 was present in all four environments (Table 4). Significant SNPs, which were present within the same LD block, were considered as one locus, and out of the 61 significant SNPs identified across environments, there were 51 putative loci. Two putative loci on Gm06 and Gm16 were identified by three closely spaced SNPs, and five putative loci on Gm09, Gm13, Gm18, and Gm20 (2) were identified by two closely spaced SNPs, while the remaining loci were identified by one SNP. The allelic effect (difference in mean canopy wilting between genotypes with major allele and minor allele) for these significantly SNPs ranged from −7.40 to 5.18. A positive value indicates that the minor allele was the favorable allele associated with reduced canopy wilting and a negative value indicates that the major allele was the favorable allele associated with reduced canopy wilting. Information on the 61 significant SNPs, their corresponding MAF, major or minor allele, allelic effect, and common environments is listed in Table 4. These 61 significant SNPs from four environments were used to identify 61 genes within ± 10 kb of the respective SNPs. This distance was chosen, because it approximates the average distance between SNPs (18 kb). A list of these genes and their corresponding functional annotations (www.soybase.com) are provided (Table S1).

Fig. 2
figure 2

Manhattan plots of −Log10 (P) vs. chromosomal position of significant SNP associations and respective Quantile–quantile (QQ) plots of canopy wilting for four environments; a Salina 2015, b Salina 2016, c Pine Tree 2016, d Rohwer 2016, and e average canopy wilting across all environments (AAE) using the FarmCPU model. Red line represents the association threshold (−Log10 (P) ≥ 3.5; P ≤ 0.0003)

Table 4 List of significant SNPs associated with canopy wilting score for four environments, Pine Tree in 2016 (PT-16), Rohwer in 2016 (RO-16), Salina in 2015 (SA-15), and 2016 (SA-16) using FarmCPU model with threshold P value (−Log10 (P) ≥ 3.5; P ≤ 0.0003)

Association analysis of canopy wilting averaged across all environments identified 34 significant SNP associations at −Log10 (P) ≥ 3.5; P ≤ 0.0003 (Fig. 2). Of these 34 SNPs, seven were common to the 61 significant SNPs identified from four environments and among these seven SNPs, five SNPs were present in more than one environment (Table 5). Based on the closely spaced significant SNPs within the LD blocks, the 34 SNPs comprised 23 putative loci. The putative locus 1 on Gm01 was identified by 10 closely spaced SNPs, and locus 2 on Gm01 and locus 21 on Gm20 were identified by two closely spaced SNPs, while the remaining loci were all identified by one SNP. The allelic effect (difference in mean canopy wilting between genotypes with major allele and minor allele) for these significant SNPs ranged from −4.82 to 3.13. Information for these 34 significant SNPs, their corresponding MAF, major or minor allele, allelic effect, and common environments is listed in Table 5. These 34 significant SNPs from four environments were used to identify 34 genes within ±10 kb of the respective SNPs. A list of these genes and their corresponding functional annotations are provided (Table S2).

Table 5 List of significant SNPs associated with average canopy wilting across all environments using FarmCPU model with threshold P value (−Log10 (P) ≥ 3.5; P ≤ 0.0003)

Discussion

This research evaluated canopy wilting in a panel of 373 MG IV soybean accessions in four environments along with slow-wilting (PI 416937) and fast-wilting (A5959) check genotypes. Canopy wilting had a wide range of phenotypic variation within each environment, which is important for dissecting complex traits through association mapping (McCarthy et al. 2008). In the panel of 373 accessions, genotypes were found with canopy wilting scores more extreme than any previous report. Two genotypes, PI 416937 and A5959, which were used in this study as checks, were also confirmed as slow- and fast-wilting in the previous research (Hwang et al. 2015, 2016; King et al. 2009). In the present research, PI 416937 and A5959 had average wilting scores across environments of 20 and 33, respectively (Fig. 3a). In comparison to the two checks, the average wilting score of slow-wilting PI 592940 was 11, and the average wilting score of the fast-wilting extreme PI 507407 was 39 (Table 2). Overall, 185 genotypes had lower average wilting scores across environments than did the slow-wilting check (PI 416937), but only two genotypes (PI507424 and PI507407) had wilting scores higher than the fast-wilting check. The genotypes with lower average wilting scores represent new genetic sources for the slow-wilting trait with potential alternative alleles or different mechanisms to achieve slow-wilting.

Fig. 3
figure 3

Distribution of average canopy wilting score across all environments (a), breeding values of accessions (b), and number of favorable alleles (c)

Carter et al. (2006) and King et al. (2009) reported that the accessions PI 416937 and PI 471938 were slow-wilting. The breeding values of these previously reported slow-wilting genotypes were 24 (PI 416937) and 20 (PI 471938) (Fig. 3b). In addition, the number of favorable alleles for these genotypes was 29 (PI 416937) and 24 (PI 471938) (Fig. 3c). In comparison, the most extreme slow-wilting accessions in this study had breeding values as low as -67 and as many as 41 favorable alleles (Table 2; Fig. 3b, c). The 15 accessions that were considered as slowest wilting in ranking also had considerably lower breeding values and more favorable alleles than PI 416937 and PI 471938. In contrast, the 15 accessions that were considered as fastest wilting in ranking had breeding values as high as 104 with only 10 favorable alleles. Hence, there is considerably greater variation in canopy wilting among genotypes than has been previously reported.

Significant positive correlations for canopy wilting between environments and a moderate-to-high heritability indicated that canopy wilting is a relatively stable trait across environments. Similar results of heritability were reported in several different mapping studies (Abdel-Haleem et al. 2012; Charlson et al. 2009; Hwang et al. 2015).

An advantage of GWAS over the traditional QTL mapping is that it is possible to map complex trait variation down to the nucleotide level. Out of 61 significant SNPs associated with canopy wilting, the minor alleles of 35 of these SNPs were favorable and associated with a decrease in the canopy wilting. One SNP with the minor allele associated with the largest reduction in the canopy wilting (5.18) was present on Gm18. This SNP was present within the coding region of Glyma18g14740, which encodes a protein functioning as a serine-glyoxylate amino-transaminase, and having a biological function involved with water transportation (www.soybase.com) (Table S1). Sinclair et al. (2008) concluded that low leaf hydraulic conductance in PI 416937 contributed to slow-wilting by conserving water under conditions of high evaporative demand. Interestingly, the SNP associated with the minor allele having the second largest reduction in canopy wilting was on Gm06 and was present in the coding region of Glyma06g45120. This region encodes the auxin-responsive GH3 family protein having a biological function associated with response of abscisic acid stimulus (Table S1). Biological functions of water transport and abscisic acid stimulus may directly result in phenotypic variation in canopy wilting. Presuming that slow-wilting is primarily associated with conserving soil moisture when soil moisture is plentiful (King et al. 2009; Ries et al. 2012) or by limiting transpiration when vapor pressure deficit is above a threshold value (Fletcher et al. 2007; Sadok and Sinclair 2009), slow-wilting genotypes expectantly would have lower water transport and greater expression of abscisic acid synthesis compared to fast-wilting genotypes. Up-regulation of Glyma18g14740 and Glyma06g45120 may, therefore, confer slow-wilting in plants under water deficit conditions. Similarly, other genes (Table S1) functioning in water transportation, abscisic acid stimulus, and root developmental, etc. may have roles in decreasing wilting by accessing more water (e.g., deeper rooting) or conserving water.

For 26 out of 61 SNPs associated with canopy wilting, the minor alleles of these SNPs were unfavorable and associated with an increase in canopy wilting (negative value of allelic effect indicates that minor allele was associated with an increase in canopy wilting in Table 4). One SNP on Gm08 had an allelic effect of −7.4, indicating that the major allele was favorable and associated with decreased canopy wilting. This SNP was present in the coding region of acyl-CoA synthetase gene, which functions in long-chain fatty acid metabolism. Table S1 provides information on 61 genes associated with identified SNPs that may have some association with canopy wilting, which can be directly or indirectly related to transpiration or water conservation for improving drought tolerance.

Several QTL mapping studies of canopy wilting have been conducted using different mapping populations to identify the chromosomal regions associated with canopy wilting variation. The studies found four QTLs (on chromosomes Gm08, Gm13, Gm14, and Gm17; Charlson et al. 2009), two QTLs (on chromosomes Gm08 and Gm20; Du et al. 2009), seven QTLs (on chromosomes Gm02, Gm04, Gm05, Gm12, Gm14, Gm17, and Gm19; Abdel-Haleem et al. 2012), and 22 QTLs (on chromosomes Gm02, Gm05, Gm11, Gm17, and Gm19; Hwang et al. 2015). Recently, meta-QTL analysis of canopy wilting by Hwang et al. (2016) reported nine meta-QTLs in eight chromosomal regions. Location of these reported chromosomal regions was compared with significant SNPs associated with canopy wilting in the present research from four environments and from the average across all environments (Fig. 4). Six different putative loci on five chromosomes Gm02 (2), Gm11, Gm17 (2), and Gm19 were located within six chromosomal regions that were identified by meta-QTL analysis of Hwang et al. (2016). These six putative loci consisted of eight significant SNPs, and six SNPs out of these eight had the minor allele associated with a decrease in the canopy wilting (Fig. 4).

Fig. 4
figure 4

Location of SNPs significantly associated with canopy wilting in four environments and across environments with previously identified QTLs for canopy wilting as shown in Soybase (www.soybase.org)

Some of the putative loci, located within previously reported chromosomal regions of meta-QTLs of canopy wilting (Hwang et al. 2016), may be candidates for decreasing wilting based on their biological functions. For example, Glyma02g01910, on Gm02, encodes a glycosyl hydrolase that has a biological function involved with root hair elongation. Up-regulation of Glyma02g01910 may increase root hair elongation in plants, providing more access to soil moisture and slowing wilting. A second gene, Glyma11g20020 on Gm11, encodes an acyl-CoA synthetase with a biological function involved in jasmonic acid biosynthesis. Jasmonic acid is an essential component of the signaling pathway in response to drought stress. Jasmonic acid is needed to induce ABA accumulation in roots of plants (de Ollas et al. 2015). Up-regulation of Glyma11g20020 may increase jasmonic acid biosynthesis, increasing ABA accumulation in roots, closing stomata, and conserving soil moisture.

Conclusions

In this study, high-density marker data of 31,260 SNPs with MAF ≥5% were used in GWAS to map the genomic regions controlling canopy wilting variation. Association analysis identified 61 significant SNPs associated with canopy wilting variation from four environments and 34 significant SNPs associated with average canopy wilting across all environments at a significance level of −Log10 (P) ≥ 3.5. Twenty-one significant associations of SNPs with canopy wilting out of 61 were present in at least two environments. The 61 SNP-canopy wilting associations and 34 SNPs identified from the average of canopy wilting across environments likely tagged 51 and 23 different loci, respectively. Six different putative loci were located within chromosomal regions that were previously reported as meta-QTLs for canopy wilting. Ultimately, significant SNPs that were present in more than one environment and those located within chromosome regions that were reported previously, are potential alleles for improving soybean drought tolerance. The genotypes identified with a large number of favorable slow-wilting alleles represent new genetic sources for crop improvement.

Author contribution statement

ASK and LCP designed the experiments, and ASK performed the statistical analysis. WTS, CAK, ASK, and LCP performed the field experiments and collected phenotypic data. ASK and JDR assembled the genotypic data. ASK co-wrote the manuscript with LCP. WTS, CAK, and JDR critically revised the manuscript. LCP coordinated and supervised the project. All authors read and approved the final manuscript.