Effects of Subsetting by Carbon Content, Soil Order, and Spectral Classification on Prediction of Soil Total Carbon with Diffuse Reflectance Spectroscopy
Subsetting of samples is a promising avenue of research for the continued improvement of prediction models for soil properties with diffuse reflectance spectroscopy. This study examined the effects of subsetting by soil total carbon ( ) content, soil order, and spectral classification with k-means cluster analysis on visible/near-infrared and mid-infrared partial least squares models for prediction. Our sample set was composed of various Hawaiian soils from primarily agricultural lands with contents from <1% to 56%. Slight improvements in the coefficient of determination ( ) and other standard model quality parameters were observed in the models for the subset of the high activity clay soil orders compared to the models of the full sample set. The other subset models explored did not exhibit improvement across all parameters. Models created from subsets consisting of only low samples (e.g., < 10%) showed improvement in the root mean squared error (RMSE) and percent error of prediction for low soil samples. These results provide a basis for future study of practical subsetting strategies for soil prediction. 1. Introduction Diffuse reflectance spectroscopy (DRS) and chemometric analysis have become popular subjects of research for their potential to predict soil carbon and other soil properties. This methodology could be beneficial for monitoring soil quality and temporal variation, as well as helping to facilitate digital soil mapping efforts. Both visible/near-infrared (VNIR) and mid-infrared (MIR) spectra show promise for the prediction of soil total carbon ( ) and organic carbon, as well as organic matter, total N, total P, sand, silt, and clay fractions, cation exchange capacity, and pH (e.g., [1–8]). Particular attention has been given to soil carbon, which is an important indicator of soil fertility and biological activity and is crucial to carbon sequestration endeavors [9–12]. Partial least squares regression (PLSR) appears to be the most widely used chemometric method for developing prediction models from soil diffuse reflectance spectra. A sample set is commonly divided into two groups with the larger used for calibration and the smaller for validation to approximate true independent model validation, but no clear or consistent guidelines have been adopted for this process. Model results are known to vary with different groupings of samples for the calibration and validation sets. To address this issue, some studies have created multiple models, each with different random divisions of the sample set into calibration and validation sets, to
References
[1]
J. B. Reeves III, G. W. McCarty, and V. B. Reeves, “Mid-infrared diffuse reflectance spectroscopy for the quantitative analysis of agricultural soils,” Journal of Agricultural and Food Chemistry, vol. 49, no. 2, pp. 766–772, 2001.
[2]
G. W. McCarty, J. B. Reeves, V. B. Reeves, R. F. Follett, and J. M. Kimble, “Mid-infrared and near-infrared diffuse reflectance spectroscopy for soil carbon measurement,” Soil Science Society of America Journal, vol. 66, no. 2, pp. 640–646, 2002.
[3]
K. D. Shepherd and M. G. Walsh, “Development of reflectance spectral libraries for characterization of soil properties,” Soil Science Society of America Journal, vol. 66, no. 3, pp. 988–998, 2002.
[4]
R. A. V. Rossel, D. J. J. Walvoort, A. B. McBratney, L. J. Janik, and J. O. Skjemstad, “Visible, near infrared, mid infrared or combined diffuse reflectance spectroscopy for simultaneous assessment of various soil properties,” Geoderma, vol. 131, no. 1-2, pp. 59–75, 2006.
[5]
G. M. Vasques, S. Grunwald, and J. O. Sickman, “Comparison of multivariate methods for inferential modeling of soil carbon using visible/near-infrared spectra,” Geoderma, vol. 146, no. 1-2, pp. 14–25, 2008.
[6]
G. M. Vasques, S. Grunwald, and J. O. Sickman, “Modeling of soil organic carbon fractions using visible—near-lnfrared spectroscopy,” Soil Science Society of America Journal, vol. 73, no. 1, pp. 176–184, 2009.
[7]
R. A. V. Rossel and T. Behrens, “Using data mining to model and interpret soil diffuse reflectance spectra,” Geoderma, vol. 158, no. 1-2, pp. 46–54, 2010.
[8]
M. L. McDowell, G. L. Bruland, J. L. Deenik, S. Grunwald, and N. M. Knox, “Soil total carbon analysis in Hawaiian soils with visible, near-infrared and mid-infrared diffuse reflectance spectroscopy,” Geoderma, vol. 189-190, pp. 312–320, 2012.
[9]
K. Paustian, O. Andrén, H. H. Janzen et al., “Agricultural soils as a sink to mitigate CO2 emissions,” Soil Use and Management, vol. 13, no. 4, pp. 230–244, 1997.
[10]
H. Tiessen, E. Cuevas, and P. Chacon, “The role of soil organic matter in sustaining soil fertility,” Nature, vol. 371, no. 6500, pp. 783–785, 1994.
[11]
E. T. Craswell and R. D. B. Lefroy, “The role and function of organic matter in tropical soils,” Nutrient Cycling in Agroecosystems, vol. 61, no. 1-2, pp. 7–18, 2001.
[12]
R. Lal, “Soil carbon sequestration impacts on global climate change and food security,” Science, vol. 304, no. 5677, pp. 1623–1627, 2004.
[13]
D. J. Brown, R. S. Bricklemyer, and P. R. Miller, “Validation requirements for diffuse reflectance soil characterization models with a case study of VNIR soil C prediction in Montana,” Geoderma, vol. 129, no. 3-4, pp. 251–267, 2005.
[14]
A. M. Mouazen, B. Kuang, J. De Baerdemaeker, and H. Ramon, “Comparison among principal component, partial least squares and back propagation neural network analyses for accuracy of measurement of selected soil properties with visible and near infrared spectroscopy,” Geoderma, vol. 158, no. 1-2, pp. 23–31, 2010.
[15]
D. V. Sarkhot, S. Grunwald, Y. Ge, and C. L. S. Morgan, “Comparison and detection of total and available soil carbon fractions using visible/near infrared diffuse reflectance spectroscopy,” Geoderma, vol. 164, no. 1-2, pp. 22–32, 2011.
[16]
B. E. Madari, J. B. Reeves, M. R. Coelho et al., “Mid- and near-infrared spectroscopic determination of carbon in a diverse set of soils from the Brazilian national soil collection,” Spectroscopy Letters, vol. 38, no. 6, pp. 721–740, 2005.
[17]
J. Cierniewski, C. Ka?mierowski, K. Ku?nierek et al., “Unsupervised clustering of soil spectral curves to obtain their stronger correlation with soil properties,” in Proceedings of the 2nd Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing, (WHISPERS '10), Reykjavik, Iceland, June 2010.
[18]
G. M. Vasques, S. Grunwald, and W. G. Harris, “Spectroscopic models of soil organic carbon in Florida, USA,” Journal of Environmental Quality, vol. 39, no. 3, pp. 923–934, 2010.
[19]
AOAC International, Official Methods of Analysis of AOAC International, AOAC International, Arlington, Va, USA, 16th edition, 1997.
[20]
P. C. Williams, “Variables affecting near-infrared reflectance spectroscopic analysis,” in Near-Infrared Technology in the Agricultural and Food Industries, P. Williams and K. Norris, Eds., pp. 143–167, American Association of Cereal Chemists, St. Paul, Minn, USA, 1987.
[21]
V. Bellon-Maurel, E. Fernandez-Ahumada, B. Palagos, J. M. Roger, and A. McBratney, “Critical review of chemometric indicators commonly used for assessing the quality of the prediction of soil attributes by NIR spectroscopy,” Trends in Analytical Chemistry, vol. 29, no. 9, pp. 1073–1081, 2010.
[22]
W. R. B. IUSS Working Group, “World reference base for soil resources,” World Soil Resources report no. 103, FAO, Rome, Italy, 2006.