{"id":13730,"date":"2025-08-26T17:15:00","date_gmt":"2025-08-26T22:15:00","guid":{"rendered":"https:\/\/www.agron.iastate.edu\/glsi\/?p=13730"},"modified":"2025-09-26T22:49:45","modified_gmt":"2025-09-27T03:49:45","slug":"influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy","status":"publish","type":"post","link":"https:\/\/www.agron.iastate.edu\/glsi\/manuscripts\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\/","title":{"rendered":"Influence of Sample Size and Machine Learning Algorithms on Digital Soil Nutrient Mapping Accuracy"},"content":{"rendered":"<div class=\"paragraph-widget paragraph-widget--text-html\"><div class=\"text-content\">\n<p>The objective of this study is to evaluate and compare the performance of different machine learning (ML) algorithms, viz., multi-layer perceptron (MLP), random forest (RF), extra trees regressor (ETR), CatBoost, and gradient boost (GB), considering the impact of variation in sample sizes for the prediction of soil nutrients. In this context, this study evaluates the impact of sample size on the prediction performance of five ML algorithms for mapping 14 soil properties, including key soil physico-chemical properties (soil organic carbon, pH, and electrical conductivity), and multiple macro (available nitrogen, phosphorus, potassium, calcium, magnesium, and sulfur) and micronutrients (available iron, manganese, copper, zinc, and boron) at a geographical extent of 8303 km2 located in eastern India. A total of 1024 surface soil samples were collected, of which 800 were used for model training, while the remaining 224 were reserved for independent validation (IV) of the resultant maps. The original training data set (800 samples) was reduced by random selection into six different sample sizes, i.e., 800, 400, 200, 100, 50, and 25. An exhaustive set of 574 environmental variables, derived from digital terrain derivatives and Sentinel-2 satellite imagery, was used as predictors. Popular statistical indicators, such as Lin\u2019s concordance correlation coefficient (CCC) and root mean squared error (RMSE), were employed to evaluate the predictive capability of the algorithms under different sample size scenarios. The results showed that prediction accuracy and reliability of prediction performance across multiple target variables improved with sample size. However, beyond a certain point, improvement in predictive performance became substantially negligible compared to the efforts in additional sampling. All the ML algorithms performed well (mean IV-CCC varied between 0.26 and 0.64 for different soil properties) to increase in sample size, except MLP, which exhibited a poorer prediction performance (mean IV-CCC varied between 0.14 and 0.29 for different soil properties). Micronutrients in general responded well to the increase in sample sizes. The uncertainty analysis revealed that increasing sample size generally reduced prediction uncertainty, though the extent varied by soil property and ML algorithm. Concisely, the results presented herein showed an effective manner of selecting an appropriate sample size and a suitable ML algorithm to predict multiple soil nutrients accurately, which would be recommended to achieve optimal accuracy for a project.<\/p>\n<\/div><\/div>\n\n<div class=\"paragraph-widget paragraph-widget--text-html\"><div class=\"text-content\">\n<p>Dash, P.K., C. Ferhatoglu, B.A. Miller, N. Panigrahi, A. Mishra. 2025. Influence of sample size and machine learning algorithms on digital soil nutrient mapping accuracy. <a href=\"https:\/\/link.springer.com\/article\/10.1007\/s10661-025-14322-w\">Environmental Monitoring and Assessment 197:996. doi: 10.1007\/s10661-025-14322-w<\/a>.<\/p>\n<\/div><\/div>\n\n<div class=\"paragraph-widget paragraph-widget--text-html\"><div class=\"text-content\">\n<p><\/p>\n<\/div><\/div>","protected":false},"excerpt":{"rendered":"<p>The objective of this study is to evaluate and compare the performance of different machine learning (ML) algorithms, viz., multi-layer perceptron (MLP), random forest (RF), extra trees regressor (ETR), CatBoost, and gradient boost (GB), considering the impact of variation in sample sizes for the prediction of soil nutrients. In this context, this study evaluates the [&hellip;]<\/p>\n","protected":false},"author":3216,"featured_media":13735,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"inline_featured_image":false,"ngg_post_thumbnail":0,"footnotes":""},"categories":[330,5,7],"tags":[34,331,76,118,201],"class_list":["post-13730","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ferhatoglu","category-manuscripts","category-miller","tag-digital-soil-mapping","tag-india","tag-modelling","tag-soil-fertility","tag-soil-properties"],"acf":[],"featured_image_urls_v2":{"full":["https:\/\/www.agron.iastate.edu\/glsi\/files\/2025\/08\/Influence-of-Sample-Size-and-Machine-Learning-Algorithms-on-Digital-Soil-Nutrient-Mapping-Accuracy.png",472,433,false],"thumbnail":["https:\/\/www.agron.iastate.edu\/glsi\/files\/2025\/08\/Influence-of-Sample-Size-and-Machine-Learning-Algorithms-on-Digital-Soil-Nutrient-Mapping-Accuracy-150x150.png",150,150,true],"medium":["https:\/\/www.agron.iastate.edu\/glsi\/files\/2025\/08\/Influence-of-Sample-Size-and-Machine-Learning-Algorithms-on-Digital-Soil-Nutrient-Mapping-Accuracy-300x275.png",300,275,true],"medium_large":["https:\/\/www.agron.iastate.edu\/glsi\/files\/2025\/08\/Influence-of-Sample-Size-and-Machine-Learning-Algorithms-on-Digital-Soil-Nutrient-Mapping-Accuracy.png",472,433,false],"large":["https:\/\/www.agron.iastate.edu\/glsi\/files\/2025\/08\/Influence-of-Sample-Size-and-Machine-Learning-Algorithms-on-Digital-Soil-Nutrient-Mapping-Accuracy.png",472,433,false],"1536x1536":["https:\/\/www.agron.iastate.edu\/glsi\/files\/2025\/08\/Influence-of-Sample-Size-and-Machine-Learning-Algorithms-on-Digital-Soil-Nutrient-Mapping-Accuracy.png",472,433,false],"2048x2048":["https:\/\/www.agron.iastate.edu\/glsi\/files\/2025\/08\/Influence-of-Sample-Size-and-Machine-Learning-Algorithms-on-Digital-Soil-Nutrient-Mapping-Accuracy.png",472,433,false]},"post_excerpt_stackable_v2":"<p>The objective of this study is to evaluate and compare the performance of different machine learning (ML) algorithms, viz., multi-layer perceptron (MLP), random forest (RF), extra trees regressor (ETR), CatBoost, and gradient boost (GB), considering the impact of variation in sample sizes for the prediction of soil nutrients. In this context, this study evaluates the impact of sample size on the prediction performance of five ML algorithms for mapping 14 soil properties, including key soil physico-chemical properties (soil organic carbon, pH, and electrical conductivity), and multiple macro (available nitrogen, phosphorus, potassium, calcium, magnesium, and sulfur) and micronutrients (available iron, manganese,&hellip;<\/p>\n","category_list_v2":"<a href=\"https:\/\/www.agron.iastate.edu\/glsi\/category\/ferhatoglu\/\" rel=\"category tag\">Ferhatoglu<\/a>, <a href=\"https:\/\/www.agron.iastate.edu\/glsi\/category\/manuscripts\/\" rel=\"category tag\">Manuscripts<\/a>, <a href=\"https:\/\/www.agron.iastate.edu\/glsi\/category\/miller\/\" rel=\"category tag\">Miller<\/a>","author_info_v2":{"name":"Bradley Miller","url":"https:\/\/www.agron.iastate.edu\/glsi\/author\/millerba\/"},"comments_num_v2":"0 comments","yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.6 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Influence of Sample Size and Machine Learning Algorithms on Digital Soil Nutrient Mapping Accuracy - Geospatial Laboratory for Soil Informatics<\/title>\n<meta name=\"description\" content=\"Prediction accuracy and reliability of prediction performance across multiple target variables improved with sample size. However, beyond a certain point, improvement in predictive performance became substantially negligible compared to the efforts in additional sampling. All the ML algorithms performed well (mean IV-CCC varied between 0.26 and 0.64 for different soil properties) to increase in sample size, except MLP, which exhibited a poorer prediction performance (mean IV-CCC varied between 0.14 and 0.29 for different soil properties). Micronutrients in general responded well to the increase in sample sizes. The uncertainty analysis revealed that increasing sample size generally reduced prediction uncertainty, though the extent varied by soil property and ML algorithm. Concisely, the results presented herein showed an effective manner of selecting an appropriate sample size and a suitable ML algorithm to predict multiple soil nutrients accurately, which would be recommended to achieve optimal accuracy for a project.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.agron.iastate.edu\/glsi\/manuscripts\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Influence of Sample Size and Machine Learning Algorithms on Digital Soil Nutrient Mapping Accuracy - Geospatial Laboratory for Soil Informatics\" \/>\n<meta property=\"og:description\" content=\"Prediction accuracy and reliability of prediction performance across multiple target variables improved with sample size. However, beyond a certain point, improvement in predictive performance became substantially negligible compared to the efforts in additional sampling. All the ML algorithms performed well (mean IV-CCC varied between 0.26 and 0.64 for different soil properties) to increase in sample size, except MLP, which exhibited a poorer prediction performance (mean IV-CCC varied between 0.14 and 0.29 for different soil properties). Micronutrients in general responded well to the increase in sample sizes. The uncertainty analysis revealed that increasing sample size generally reduced prediction uncertainty, though the extent varied by soil property and ML algorithm. Concisely, the results presented herein showed an effective manner of selecting an appropriate sample size and a suitable ML algorithm to predict multiple soil nutrients accurately, which would be recommended to achieve optimal accuracy for a project.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.agron.iastate.edu\/glsi\/manuscripts\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\/\" \/>\n<meta property=\"og:site_name\" content=\"Geospatial Laboratory for Soil Informatics\" \/>\n<meta property=\"article:published_time\" content=\"2025-08-26T22:15:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-09-27T03:49:45+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.agron.iastate.edu\/glsi\/files\/2025\/08\/Influence-of-Sample-Size-and-Machine-Learning-Algorithms-on-Digital-Soil-Nutrient-Mapping-Accuracy.png\" \/>\n\t<meta property=\"og:image:width\" content=\"472\" \/>\n\t<meta property=\"og:image:height\" content=\"433\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Bradley Miller\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Bradley Miller\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/manuscripts\\\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/manuscripts\\\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\\\/\"},\"author\":{\"name\":\"Bradley Miller\",\"@id\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/#\\\/schema\\\/person\\\/a96fa0c818314fce5f3928c232490277\"},\"headline\":\"Influence of Sample Size and Machine Learning Algorithms on Digital Soil Nutrient Mapping Accuracy\",\"datePublished\":\"2025-08-26T22:15:00+00:00\",\"dateModified\":\"2025-09-27T03:49:45+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/manuscripts\\\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\\\/\"},\"wordCount\":405,\"publisher\":{\"@id\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/manuscripts\\\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/files\\\/2025\\\/08\\\/Influence-of-Sample-Size-and-Machine-Learning-Algorithms-on-Digital-Soil-Nutrient-Mapping-Accuracy.png\",\"keywords\":[\"Digital Soil Mapping\",\"India\",\"modelling\",\"soil fertility\",\"soil properties\"],\"articleSection\":[\"Ferhatoglu\",\"Manuscripts\",\"Miller\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/manuscripts\\\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\\\/\",\"url\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/manuscripts\\\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\\\/\",\"name\":\"Influence of Sample Size and Machine Learning Algorithms on Digital Soil Nutrient Mapping Accuracy - Geospatial Laboratory for Soil Informatics\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/manuscripts\\\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/manuscripts\\\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/files\\\/2025\\\/08\\\/Influence-of-Sample-Size-and-Machine-Learning-Algorithms-on-Digital-Soil-Nutrient-Mapping-Accuracy.png\",\"datePublished\":\"2025-08-26T22:15:00+00:00\",\"dateModified\":\"2025-09-27T03:49:45+00:00\",\"description\":\"Prediction accuracy and reliability of prediction performance across multiple target variables improved with sample size. However, beyond a certain point, improvement in predictive performance became substantially negligible compared to the efforts in additional sampling. All the ML algorithms performed well (mean IV-CCC varied between 0.26 and 0.64 for different soil properties) to increase in sample size, except MLP, which exhibited a poorer prediction performance (mean IV-CCC varied between 0.14 and 0.29 for different soil properties). Micronutrients in general responded well to the increase in sample sizes. The uncertainty analysis revealed that increasing sample size generally reduced prediction uncertainty, though the extent varied by soil property and ML algorithm. Concisely, the results presented herein showed an effective manner of selecting an appropriate sample size and a suitable ML algorithm to predict multiple soil nutrients accurately, which would be recommended to achieve optimal accuracy for a project.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/manuscripts\\\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/manuscripts\\\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/manuscripts\\\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\\\/#primaryimage\",\"url\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/files\\\/2025\\\/08\\\/Influence-of-Sample-Size-and-Machine-Learning-Algorithms-on-Digital-Soil-Nutrient-Mapping-Accuracy.png\",\"contentUrl\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/files\\\/2025\\\/08\\\/Influence-of-Sample-Size-and-Machine-Learning-Algorithms-on-Digital-Soil-Nutrient-Mapping-Accuracy.png\",\"width\":472,\"height\":433},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/manuscripts\\\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Influence of Sample Size and Machine Learning Algorithms on Digital Soil Nutrient Mapping Accuracy\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/#website\",\"url\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/\",\"name\":\"Geospatial Laboratory for Soil Informatics\",\"description\":\"Soil Mapping Research at Iowa State University\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/#organization\",\"name\":\"Geospatial Laboratory of Soil Informatics\",\"url\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/files\\\/2019\\\/06\\\/favicon.ico\",\"contentUrl\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/files\\\/2019\\\/06\\\/favicon.ico\",\"width\":16,\"height\":16,\"caption\":\"Geospatial Laboratory of Soil Informatics\"},\"image\":{\"@id\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/#\\\/schema\\\/logo\\\/image\\\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/#\\\/schema\\\/person\\\/a96fa0c818314fce5f3928c232490277\",\"name\":\"Bradley Miller\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/743e78561251313585ddf27fc4aa4b4d9e076967d63702712d0794a7f92f2418?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/743e78561251313585ddf27fc4aa4b4d9e076967d63702712d0794a7f92f2418?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/743e78561251313585ddf27fc4aa4b4d9e076967d63702712d0794a7f92f2418?s=96&d=mm&r=g\",\"caption\":\"Bradley Miller\"},\"url\":\"https:\\\/\\\/www.agron.iastate.edu\\\/glsi\\\/author\\\/millerba\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Influence of Sample Size and Machine Learning Algorithms on Digital Soil Nutrient Mapping Accuracy - Geospatial Laboratory for Soil Informatics","description":"Prediction accuracy and reliability of prediction performance across multiple target variables improved with sample size. However, beyond a certain point, improvement in predictive performance became substantially negligible compared to the efforts in additional sampling. All the ML algorithms performed well (mean IV-CCC varied between 0.26 and 0.64 for different soil properties) to increase in sample size, except MLP, which exhibited a poorer prediction performance (mean IV-CCC varied between 0.14 and 0.29 for different soil properties). Micronutrients in general responded well to the increase in sample sizes. The uncertainty analysis revealed that increasing sample size generally reduced prediction uncertainty, though the extent varied by soil property and ML algorithm. Concisely, the results presented herein showed an effective manner of selecting an appropriate sample size and a suitable ML algorithm to predict multiple soil nutrients accurately, which would be recommended to achieve optimal accuracy for a project.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.agron.iastate.edu\/glsi\/manuscripts\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\/","og_locale":"en_US","og_type":"article","og_title":"Influence of Sample Size and Machine Learning Algorithms on Digital Soil Nutrient Mapping Accuracy - Geospatial Laboratory for Soil Informatics","og_description":"Prediction accuracy and reliability of prediction performance across multiple target variables improved with sample size. However, beyond a certain point, improvement in predictive performance became substantially negligible compared to the efforts in additional sampling. All the ML algorithms performed well (mean IV-CCC varied between 0.26 and 0.64 for different soil properties) to increase in sample size, except MLP, which exhibited a poorer prediction performance (mean IV-CCC varied between 0.14 and 0.29 for different soil properties). Micronutrients in general responded well to the increase in sample sizes. The uncertainty analysis revealed that increasing sample size generally reduced prediction uncertainty, though the extent varied by soil property and ML algorithm. Concisely, the results presented herein showed an effective manner of selecting an appropriate sample size and a suitable ML algorithm to predict multiple soil nutrients accurately, which would be recommended to achieve optimal accuracy for a project.","og_url":"https:\/\/www.agron.iastate.edu\/glsi\/manuscripts\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\/","og_site_name":"Geospatial Laboratory for Soil Informatics","article_published_time":"2025-08-26T22:15:00+00:00","article_modified_time":"2025-09-27T03:49:45+00:00","og_image":[{"width":472,"height":433,"url":"https:\/\/www.agron.iastate.edu\/glsi\/files\/2025\/08\/Influence-of-Sample-Size-and-Machine-Learning-Algorithms-on-Digital-Soil-Nutrient-Mapping-Accuracy.png","type":"image\/png"}],"author":"Bradley Miller","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Bradley Miller","Est. reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.agron.iastate.edu\/glsi\/manuscripts\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\/#article","isPartOf":{"@id":"https:\/\/www.agron.iastate.edu\/glsi\/manuscripts\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\/"},"author":{"name":"Bradley Miller","@id":"https:\/\/www.agron.iastate.edu\/glsi\/#\/schema\/person\/a96fa0c818314fce5f3928c232490277"},"headline":"Influence of Sample Size and Machine Learning Algorithms on Digital Soil Nutrient Mapping Accuracy","datePublished":"2025-08-26T22:15:00+00:00","dateModified":"2025-09-27T03:49:45+00:00","mainEntityOfPage":{"@id":"https:\/\/www.agron.iastate.edu\/glsi\/manuscripts\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\/"},"wordCount":405,"publisher":{"@id":"https:\/\/www.agron.iastate.edu\/glsi\/#organization"},"image":{"@id":"https:\/\/www.agron.iastate.edu\/glsi\/manuscripts\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\/#primaryimage"},"thumbnailUrl":"https:\/\/www.agron.iastate.edu\/glsi\/files\/2025\/08\/Influence-of-Sample-Size-and-Machine-Learning-Algorithms-on-Digital-Soil-Nutrient-Mapping-Accuracy.png","keywords":["Digital Soil Mapping","India","modelling","soil fertility","soil properties"],"articleSection":["Ferhatoglu","Manuscripts","Miller"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.agron.iastate.edu\/glsi\/manuscripts\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\/","url":"https:\/\/www.agron.iastate.edu\/glsi\/manuscripts\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\/","name":"Influence of Sample Size and Machine Learning Algorithms on Digital Soil Nutrient Mapping Accuracy - Geospatial Laboratory for Soil Informatics","isPartOf":{"@id":"https:\/\/www.agron.iastate.edu\/glsi\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.agron.iastate.edu\/glsi\/manuscripts\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\/#primaryimage"},"image":{"@id":"https:\/\/www.agron.iastate.edu\/glsi\/manuscripts\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\/#primaryimage"},"thumbnailUrl":"https:\/\/www.agron.iastate.edu\/glsi\/files\/2025\/08\/Influence-of-Sample-Size-and-Machine-Learning-Algorithms-on-Digital-Soil-Nutrient-Mapping-Accuracy.png","datePublished":"2025-08-26T22:15:00+00:00","dateModified":"2025-09-27T03:49:45+00:00","description":"Prediction accuracy and reliability of prediction performance across multiple target variables improved with sample size. However, beyond a certain point, improvement in predictive performance became substantially negligible compared to the efforts in additional sampling. All the ML algorithms performed well (mean IV-CCC varied between 0.26 and 0.64 for different soil properties) to increase in sample size, except MLP, which exhibited a poorer prediction performance (mean IV-CCC varied between 0.14 and 0.29 for different soil properties). Micronutrients in general responded well to the increase in sample sizes. The uncertainty analysis revealed that increasing sample size generally reduced prediction uncertainty, though the extent varied by soil property and ML algorithm. Concisely, the results presented herein showed an effective manner of selecting an appropriate sample size and a suitable ML algorithm to predict multiple soil nutrients accurately, which would be recommended to achieve optimal accuracy for a project.","breadcrumb":{"@id":"https:\/\/www.agron.iastate.edu\/glsi\/manuscripts\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.agron.iastate.edu\/glsi\/manuscripts\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.agron.iastate.edu\/glsi\/manuscripts\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\/#primaryimage","url":"https:\/\/www.agron.iastate.edu\/glsi\/files\/2025\/08\/Influence-of-Sample-Size-and-Machine-Learning-Algorithms-on-Digital-Soil-Nutrient-Mapping-Accuracy.png","contentUrl":"https:\/\/www.agron.iastate.edu\/glsi\/files\/2025\/08\/Influence-of-Sample-Size-and-Machine-Learning-Algorithms-on-Digital-Soil-Nutrient-Mapping-Accuracy.png","width":472,"height":433},{"@type":"BreadcrumbList","@id":"https:\/\/www.agron.iastate.edu\/glsi\/manuscripts\/influence-of-sample-size-and-machine-learning-algorithms-on-digital-soil-nutrient-mapping-accuracy\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.agron.iastate.edu\/glsi\/"},{"@type":"ListItem","position":2,"name":"Influence of Sample Size and Machine Learning Algorithms on Digital Soil Nutrient Mapping Accuracy"}]},{"@type":"WebSite","@id":"https:\/\/www.agron.iastate.edu\/glsi\/#website","url":"https:\/\/www.agron.iastate.edu\/glsi\/","name":"Geospatial Laboratory for Soil Informatics","description":"Soil Mapping Research at Iowa State University","publisher":{"@id":"https:\/\/www.agron.iastate.edu\/glsi\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.agron.iastate.edu\/glsi\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.agron.iastate.edu\/glsi\/#organization","name":"Geospatial Laboratory of Soil Informatics","url":"https:\/\/www.agron.iastate.edu\/glsi\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.agron.iastate.edu\/glsi\/#\/schema\/logo\/image\/","url":"https:\/\/www.agron.iastate.edu\/glsi\/files\/2019\/06\/favicon.ico","contentUrl":"https:\/\/www.agron.iastate.edu\/glsi\/files\/2019\/06\/favicon.ico","width":16,"height":16,"caption":"Geospatial Laboratory of Soil Informatics"},"image":{"@id":"https:\/\/www.agron.iastate.edu\/glsi\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/www.agron.iastate.edu\/glsi\/#\/schema\/person\/a96fa0c818314fce5f3928c232490277","name":"Bradley Miller","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/743e78561251313585ddf27fc4aa4b4d9e076967d63702712d0794a7f92f2418?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/743e78561251313585ddf27fc4aa4b4d9e076967d63702712d0794a7f92f2418?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/743e78561251313585ddf27fc4aa4b4d9e076967d63702712d0794a7f92f2418?s=96&d=mm&r=g","caption":"Bradley Miller"},"url":"https:\/\/www.agron.iastate.edu\/glsi\/author\/millerba\/"}]}},"_links":{"self":[{"href":"https:\/\/www.agron.iastate.edu\/glsi\/wp-json\/wp\/v2\/posts\/13730","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.agron.iastate.edu\/glsi\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.agron.iastate.edu\/glsi\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.agron.iastate.edu\/glsi\/wp-json\/wp\/v2\/users\/3216"}],"replies":[{"embeddable":true,"href":"https:\/\/www.agron.iastate.edu\/glsi\/wp-json\/wp\/v2\/comments?post=13730"}],"version-history":[{"count":1,"href":"https:\/\/www.agron.iastate.edu\/glsi\/wp-json\/wp\/v2\/posts\/13730\/revisions"}],"predecessor-version":[{"id":13733,"href":"https:\/\/www.agron.iastate.edu\/glsi\/wp-json\/wp\/v2\/posts\/13730\/revisions\/13733"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.agron.iastate.edu\/glsi\/wp-json\/wp\/v2\/media\/13735"}],"wp:attachment":[{"href":"https:\/\/www.agron.iastate.edu\/glsi\/wp-json\/wp\/v2\/media?parent=13730"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.agron.iastate.edu\/glsi\/wp-json\/wp\/v2\/categories?post=13730"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.agron.iastate.edu\/glsi\/wp-json\/wp\/v2\/tags?post=13730"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}