Article type: Research article
Authors
1 Tarbiat Modares University
2 Agricultural Research, Education and Extension Organization (AREEO), Tehran, Iran
Abstract
Soil salinization is expanding steadily worldwide, and agricultural production declines as a result of this stress. To plan for adaptation to climate change and the growing demand for food, policy and decision makers need continuous, quantitative monitoring of soil salinity. Spectral indices derived from satellite or near-surface sensors are increasingly used to monitor soil salinity, and a large number of such indices have been introduced to date. Various regression methods have been used for modeling and for validating the resulting models; the most important are multiple linear regression (including stepwise regression, forward selection, and backward elimination) and partial least squares regression. To evaluate these two methods for modeling soil salinity variation, this study used laboratory and electromagnetic measurements of soil salinity at 97 points in 1392 (2013) and 225 points in 1393 (2014), in a part of the Sabzevar-Davarzan plain covering about 50,000 ha. Twenty-three spectral indices were extracted from Landsat 8 images corresponding to the sampling dates and used, together with a digital elevation model, as independent variables. Using first-year data for training and second-year data for testing, and vice versa, the various multiple linear regression methods produced coefficients of determination of about 22 to 88 percent, but this correlation did not exceed 29 percent in the validation set. Because of multicollinearity among the independent variables, multiple linear regression could not be applied to all variables. Removing the collinear variables, applying a log transformation, and randomly dividing all the data into training and test sets increased the regression coefficient of the model and its validity to an acceptable degree. Partial least squares regression using the original and log-transformed data of the first and second years as training and test sets, and vice versa, likewise produced coefficients of determination of 39 to 85 percent in the training set but failed to predict in the test set. Randomizing the data and re-dividing them into training and test sets markedly improved the coefficient of determination in the validation set. Repeating the randomization showed that the method is stable enough for estimating the coefficients of the variables.
Keywords
Article title [English]
Statistical Modeling of Soil Salinity on Large Scale
Authors [English]
- Yousef Hasheminejhad 1
- Mehdi Homaee 1
- Ali Akbar Noroozi 2
1 Tarbiat Modares University
2 Agricultural Research, Education and Extension Organization (AREEO), Tehran
Abstract [English]
Introduction: Soil salinization is increasing across the developing world, and agricultural production is declining as a result of this stress. Climate change could worsen the salinization trend through decreased rainfall and increased evapotranspiration in arid regions. Policy and decision makers require continuous, quantitative monitoring of soil salinity to adapt to the adverse effects of climate change and the growing demand for food. Indices derived from near-surface or satellite-based sensors are increasingly applied for this purpose, and a considerable number of such indices have already been introduced. Various regression methods have been used for developing and verifying models; the most important are multiple linear regression (including stepwise regression, forward selection, and backward elimination) and partial least squares regression.
Materials and Methods: To evaluate different approaches for modeling soil salinity from remotely sensed data, an area of about 50,000 ha in the Sabzevar-Davarzan plain was studied during 2013 and 2014. The locations of the sampling points were determined using a Latin Hypercube Sampling (LHS) strategy. There were 97 sampling points in 2013 and 25 in 2014. All points were sampled to a depth of 90 cm in 30 cm increments. In total, 366 soil samples were analyzed in the laboratory for the electrical conductivity (EC) of the saturated extract. An electromagnetic induction (EM) device (EM38) was also used to measure bulk soil electrical conductivity: at each sampling point in the first year, and at each sampling point plus 8 points around it in the second year. This gave 97 and 225 EM measurements for the first and second years, respectively. Mean measured soil EC data were calibrated against the EM measurements. Given the fair correlations found, EM and EC data could be converted into each other. Twenty-three spectral indices derived from Landsat 8 images corresponding to the sampling dates, along with a digital elevation model (DEM), were used as independent variables. Multiple Linear Regression (MLR) and Partial Least Squares Regression (PLSR) were evaluated for their ability to predict soil salinity from the independent variables in different calibration and verification datasets.
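The sample counts reported in this paragraph can be cross-checked with a few lines of arithmetic. The sketch below only restates the numbers given in the abstract; the variable names are illustrative, and the three depth layers (0-30, 30-60, 60-90 cm) are inferred from "90 cm depth in 30 cm increments".

```python
# Counts taken from the abstract; variable names are illustrative only.
points_2013 = 97
points_2014 = 25
depth_layers = 90 // 30  # assumed layers: 0-30, 30-60 and 60-90 cm

# One laboratory EC sample per point and depth layer.
lab_samples = (points_2013 + points_2014) * depth_layers

# 2013: one EM38 reading per sampling point.
em_2013 = points_2013
# 2014: the sampling point itself plus 8 positions around it.
em_2014 = points_2014 * (1 + 8)

print(lab_samples, em_2013, em_2014)  # prints: 366 97 225
```

The totals agree with the 366 laboratory samples and the 97 and 225 EM measurements stated in the text.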
Results and Discussion: Different multiple linear regression approaches were evaluated using the first-year data for training and the second-year data for testing, and vice versa. They produced coefficients of determination of about 22 to 88 percent in the training dataset, but the value did not exceed 29 percent in the test dataset. Because of multicollinearity among the independent variables, multiple linear regression was not applicable to all variables. Excluding the collinear variables, log-transforming the data, and randomly dividing them into training and test datasets improved the coefficient of determination of the model and its validation to an acceptable level. Partial least squares regression using the original and log-transformed data of the first and second years as training and test datasets, and vice versa, yielded coefficients of determination of about 39 to 85 percent in the training dataset but was not able to predict in the test dataset. Randomly dividing all the data into training and test datasets considerably increased the coefficient of determination in the verification dataset. Repeating the randomization showed that the approach is consistent enough for estimating the coefficients of the variables.
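The two ingredients of this comparison, a coefficient of determination evaluated on held-out data and a random split of the pooled data, might be sketched as follows. This is a minimal illustration and not the authors' procedure: the function names `r_squared` and `random_split`, the seed, and the record identifiers are all assumptions, and pooling the 97 and 225 EM readings is only one plausible reading of "all data".

```python
import random


def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot.

    On a held-out set this can be zero or negative, which is one way a
    model can fail to predict outside its training data.
    """
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


def random_split(records, seed=0):
    """Shuffle pooled records; return (calibration 2/3, verification 1/3)."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    n_cal = (2 * len(shuffled)) // 3
    return shuffled[:n_cal], shuffled[n_cal:]


# Hypothetical record ids (year, running number) for the pooled EM readings.
pooled = [(2013, i) for i in range(97)] + [(2014, i) for i in range(225)]
calibration, verification = random_split(pooled, seed=1)
print(len(calibration), len(verification))  # prints: 214 108
```

Because the shuffle mixes both years, each subset spans the salinity range of the whole campaign instead of a single season, and repeating the split with different seeds gives a rough check on how stable the fitted coefficients are.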
Conclusions: A wide range of independent variables can be used for predicting soil salinity from remotely sensed data and indices. However, these variables generally show multicollinearity. The correlation matrix, variance inflation factor, and tolerance can be used to identify it. Removing or scaling variables with high collinearity could improve the regression. Data transformations, including log transformation, could also significantly improve the strength of the regression. In this research, EM data showed more significant correlations with the spectral indices than laboratory-measured EC data did. Since the EM38 device itself operates in a specific range of the electromagnetic spectrum, this higher correlation could be expected. Such models should be calibrated and verified against ground-truth data. Generally, one part of the dataset is used for calibration (building the model) and the remainder for verification (testing the model). Randomly dividing the pooled data of the two years into calibration (2/3 of the data) and verification (1/3 of the data) sets could significantly improve the regression in the verification dataset. This procedure widens the range of variability in the data used for calibration and verification and prevents outlier predictions.
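The collinearity diagnostics named above can be illustrated with a small, dependency-free sketch. It assumes the standard definitions: the variance inflation factor of predictor j is 1 / (1 - R_j^2), which equals the j-th diagonal element of the inverse of the predictor correlation matrix, and tolerance is its reciprocal. The predictor names and values below are hypothetical and are not taken from the study.

```python
import math


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


def invert(matrix):
    """Invert a square matrix by Gauss-Jordan elimination with pivoting."""
    n = len(matrix)
    aug = [list(row) + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("singular matrix: perfect collinearity")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def vif(columns):
    """Return {name: VIF} from the inverse of the correlation matrix."""
    names = list(columns)
    corr = [[pearson(columns[a], columns[b]) for b in names] for a in names]
    inverse = invert(corr)
    return {name: inverse[i][i] for i, name in enumerate(names)}


def tolerance(columns):
    """Tolerance is the reciprocal of the VIF."""
    return {name: 1.0 / value for name, value in vif(columns).items()}


# Hypothetical predictors: index_b is almost a multiple of index_a.
predictors = {
    "index_a": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
    "index_b": [2.1, 3.9, 6.2, 7.8, 10.1, 11.9],
    "dem": [5.0, 1.0, 4.0, 2.0, 6.0, 3.0],
}
for name, value in vif(predictors).items():
    print(name, round(value, 2))
```

Predictors whose VIF is large (equivalently, whose tolerance is close to zero) are candidates for removal or rescaling before a multiple linear regression is fitted; the cut-off to use is a judgment call that the abstract does not specify.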
Keywords [English]
- Multiple linear regression
- Partial least square regression
- Remote sensing
- Spectral indices
- Verification