An official website of the European Union How do you know?      
European Commission logo
Predicting Soil Properties Using Spectral Subsets of LUCAS Visible Near-Infrared Spectroscopy Data
cover
Soil health is critical for sustaining ecosystem functions and addressing environmental challenges. Effective soil health management requires reliable methods for assessing soil properties. Soil spectroscopy may allow resource-effective assessment of soil properties, but more knowledge is needed to transfer knowledge from laboratory-grade spectrometers to in-field data acquisition. This study explores the predictive potential of selected spectral subsets from the full visible and near-infrared (VIS–NIR) range, using various machine learning algorithms (MLAs), as a theoretical exercise to support the design of practical soil sensing tools. Specifically, we evaluated whether narrower spectral ranges can provide predictions comparable to those achieved with the full VIS–NIR spectrum. The ranges are chosen to emulate the spectral coverage and resolution of commercially available sensors which are candidates for widespread and resource-effective data collection. We used the VIS–NIR spectral data (400–2500 nm) alongside laboratory analyses of several soil properties to be predicted from the pan-European LUCAS dataset. We employed four different MLAs for estimating soil properties: support vector regression (SVR), cubist, random forest (RF), and multi-layer perceptron (MLP), which were benchmarked against ordinary least squares regression. Our results showed that spectral subset ranges of 1000–2500 nm and 1350–2500 nm (emulating Trinamix and NeoSpectra sensors, respectively) yielded prediction accuracies similar to the full spectrum. Spectral subsets limited to the visible and early NIR range (350–1000 nm) were less effective. The most informative spectral features were found in wavelengths above approximately 1750 nm. Among MLAs, MLP consistently delivered the best performance, particularly when estimating organic carbon, nitrogen, pH and clay, which were predicted with greater accuracy compared to potassium (K), phosphorus (P) and coarse fragments (CF) which cannot yet be robustly predicted from spectral data alone. This study provides preliminary insight into the spectral regions most relevant for soil property prediction. These findings may inform future development and optimisation of real-world soil sensors. Validation with actual sensor data, both on dried and in situ samples, remains an important next step.
2026-04-22
WILEY
JRC141815
1365-2389 (online),   
https://bsssjournals.onlinelibrary.wiley.com/doi/abs/10.1111/ejss.70242,    https://publications.jrc.ec.europa.eu/repository/handle/JRC141815,   
10.1111/ejss.70242 (online),   
NameCountryCityType
Datasets
IDTitlePublic URL
Dataset collections
IDAcronymTitlePublic URL
Scripts / source codes
DescriptionPublic URL
Additional supporting files
File nameDescriptionFile type 
Show metadata record  Copy citation url to clipboard  Download BibTeX