Search results for articles related to the keyword "support vector machine" in journals of the "Water and Soil" group

Evaluating the accuracy of the Latin hypercube method for selecting study-point locations for digital mapping of soil properties

Zohreh Mosleh Ghahfarokhi*, Abolfazl Azadi

Journal of Water and Soil, Year 38, No. 3 (Serial 95, Mordad and Shahrivar 1403), pp. 367-382

Because the accuracy and precision of all soil survey information depend on the best possible estimate of where soils change, expressed through the sampling design, it is important to choose an efficient method that captures these changes as well as possible. So far, few studies have examined how the randomness of sample selection in the Latin hypercube method affects map accuracy. This study aimed to evaluate the accuracy of the Latin hypercube method in selecting sampling locations for digital soil mapping in part of Borujen County, Chaharmahal and Bakhtiari Province. Because repeating field sampling many times to evaluate a soil sampling method is impractical, simulation methods based on highly accurate maps were used instead. The Bhattacharyya distance was used to quantify the distance between the probability distribution of the original population and those of different runs of the Latin hypercube method. Maps of soil properties (percentage of calcium carbonate equivalent, clay, and organic carbon) for the surface layer (0 to 30 cm) were produced and validated using the support vector machine method. In addition, sampling locations were selected with the Latin hypercube method at a density of 200 points over 500 runs. At each step, the soil property predictions were validated using R2, RMSE, and %RMSE. The results showed that the support vector machine model had acceptable accuracy (%RMSE below 40) for all properties examined. The results also showed that the different outputs of the Latin hypercube method across runs affect modeling accuracy: the model RMSE across runs ranged from 1.1, 1.1, and 0.02 to 3.2, 2, and 0.12 for the percentage of calcium carbonate equivalent, clay, and organic carbon, respectively. This effect, however, also depends on the property examined and on its variability in the study area.

Keywords: Bhattacharyya distance, support vector machine, sample location, digital mapping

Evaluating the Precision of the Conditioned Latin Hypercube Sampling Method for Selecting Soil Samples to Generate Digital Maps of Soil Properties

Z. Mosleh Ghahfarokhi *, A. Azadi

Journal of Water and Soil, Volume 38, Issue 3, 2024, pp. 367-382

Introduction

Soil properties play a crucial role as they determine the soil's suitability for different types of plant growth, ecosystems, and biota functioning. They have a significant impact on nutrient cycling, carbon sequestration, and soil management. Digital Soil Mapping (DSM) is a process aimed at delineating soil properties. Soil sampling for DSM serves as a fundamental step in improving prediction accuracy and is crucial for incorporating variability in terms of environmental covariates. Conditioned Latin Hypercube (CLH) sampling is a technique utilized to generate a sample of points from a multivariate distribution conditioned on one or more covariates. Numerous researchers (Ramirez-Lopez et al., 2014; Adhikari et al., 2017; Zhang et al., 2022) have endorsed this approach in their studies, following its inception by Minasny and McBratney in 2006. However, there has been limited research to date on the impact of the Latin hypercube method's random sample selection process on the accuracy of resulting maps. Hence, the central question remains: Is the Latin hypercube sampling method, which is currently widely adopted, always a dependable approach in this field?

Materials and Methods

The study area lies between longitudes 50°35'47'' and 51°29'' east and latitudes 31°36'31'' and 32°15'48'' north in Borujen County, Chaharmahal and Bakhtiari Province. The region has an average elevation of 2338 meters above sea level, an annual rainfall of 250 millimeters, and an average temperature of 11.5 °C. Legacy data from earlier soil studies were used, consisting of 250 samples distributed across the study area. The soil properties studied were the percentage of calcium carbonate equivalent, clay, and soil organic carbon at a depth of 0 to 30 cm. Terrain attributes were extracted from the ALOS PALSAR digital elevation model with a spatial resolution of 12.5 meters. In the first stage, digital maps of calcium carbonate equivalent, clay, and soil organic carbon were generated using the support vector machine method. Modeling continued until an accurate model was achieved, with a percentage root mean square error (%RMSE) below 40. The Latin hypercube approach was used for sample design, with 500 repetitions. After sampling points were selected in each run of the Latin hypercube method, the points were overlaid on the detailed map and the corresponding property values were extracted. The final map was created from the extracted points. Subsequently, the Latin hypercube approach was employed to generate soil property maps for each selected dataset. Validation used the coefficient of determination (R2), root mean square error (RMSE), and percentage root mean square error (%RMSE) in each iteration to assess the accuracy of the generated maps.
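As a rough illustration of the validation criteria above, the sketch below computes RMSE and a percentage RMSE in plain Python. The abstract does not define %RMSE; normalizing by the mean of the observed values is an assumption, and the function names and sample values are hypothetical.

```python
import math


def rmse(observed, predicted):
    """Root mean square error between two equally long sequences."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def percent_rmse(observed, predicted):
    """RMSE as a percentage of the observed mean (assumed definition)."""
    mean_observed = sum(observed) / len(observed)
    return 100.0 * rmse(observed, predicted) / mean_observed


# Hypothetical clay contents (%) at four validation points.
observed = [20.0, 30.0, 25.0, 35.0]
predicted = [22.0, 27.0, 26.0, 33.0]

print(f"RMSE  = {rmse(observed, predicted):.3f}")
print(f"%RMSE = {percent_rmse(observed, predicted):.2f}")
```

Under this assumed definition, a model would meet the threshold reported in the abstract when `percent_rmse` returns a value below 40.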

Results and Discussion

The results clearly illustrate that the selected sampling positions vary with each run of the Latin hypercube method, although some overlap may occur between runs. Consequently, the primary question arises: Is a single run of the Latin hypercube method sufficient for selecting study points? The findings indicate that the support vector machine model achieves satisfactory accuracy for all the properties examined. In the study area, environmental factors such as slope and elevation were identified as significant predictors of the percentage of calcium carbonate equivalent.

Conclusion

In the present study, the accuracy of the Latin hypercube method was assessed for selecting sampling locations for digital soil mapping in Chaharmahal and Bakhtiari Province. Because collecting field samples many times to evaluate a soil sampling method is impractical, this research used simulation methods based on highly accurate maps. The results indicate that the different outputs of the Latin hypercube method influence modeling accuracy, although this effect also depends on the property under investigation and the extent of its variability within the study area. Because the Latin hypercube method selects samples randomly within each class of the environmental parameters, future studies using this method should account for this randomness. The selection of sampling locations should rely on multiple runs of the method, compared using the Bhattacharyya distance, to ensure robustness and reliability.
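The Bhattacharyya distance mentioned here can be sketched for two binned distributions as follows. This is a minimal illustration, not the authors' implementation: the bin edges, the covariate values, and the two candidate samples are hypothetical, and the discrete (histogram) form of the distance is assumed.

```python
import math


def histogram(values, edges):
    """Relative frequencies of values over bins defined by sorted edges."""
    counts = [0] * (len(edges) - 1)
    for v in values:
        for i in range(len(counts)):
            is_last = i == len(counts) - 1
            # The last bin is closed on the right so the maximum edge is counted.
            if edges[i] <= v < edges[i + 1] or (is_last and v == edges[-1]):
                counts[i] += 1
                break
    total = sum(counts)
    if total == 0:
        raise ValueError("no values fall inside the bin edges")
    return [c / total for c in counts]


def bhattacharyya_distance(p, q):
    """Discrete Bhattacharyya distance: -ln(sum_i sqrt(p_i * q_i))."""
    coefficient = sum(math.sqrt(a * b) for a, b in zip(p, q))
    if coefficient <= 0.0:
        return math.inf  # the distributions do not overlap at all
    return -math.log(min(coefficient, 1.0))


# Hypothetical covariate values for the full population and two sampling runs.
edges = [0, 10, 20, 30, 40]
population = [5, 12, 15, 18, 22, 25, 28, 33, 35, 38]
run_a = [5, 15, 25, 35]    # spread across all bins
run_b = [28, 33, 35, 38]   # concentrated in the upper bins

p = histogram(population, edges)
d_a = bhattacharyya_distance(p, histogram(run_a, edges))
d_b = bhattacharyya_distance(p, histogram(run_b, edges))
print(f"run A: {d_a:.4f}, run B: {d_b:.4f}")
```

A smaller distance means the run reproduces the population distribution more closely, so in this toy case run A would be preferred over run B.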

Keywords: Bhattacharyya Distance, Digital Soil Mapping, Sampling Position, Support Vector Machine

Research/Original Article. Original language: Persian

Estimating air concentration in a chute spillway using metamodel methods

Kiyoumars Roushangar*, Reza Saadatjoo, Hamidreza Abbaszadeh, Aydin Panahi

Iranian Journal of Soil and Water Research, Year 55, No. 4 (Serial 100, Tir 1403), pp. 601-613

One way to prevent negative pressure and cavitation in spillways is to aerate the flow passing over them. Understanding how air concentration varies along the spillway is important for estimating the amount of aeration. This study examined the use of the Gaussian process regression (GPR) and support vector machine (SVM) metamodel methods for predicting air concentration. For this purpose, a set of 2268 laboratory data obtained from hydraulic models of chute spillways was used in the modeling. Various input models were defined from different combinations of the measured parameters. The results show that both methods are highly capable of estimating the air concentration over the spillway. When artificial aeration is provided by an aerator, flow discharge (QW), the ratio of longitudinal distance from the end of the deflector to channel width (L/W), and the ratio of depth (perpendicular to the spillway) to channel width (Y/W) had a strong effect on the estimated air concentration. For this case, the correlation coefficient (R), determination coefficient (DC), and root mean square error (RMSE) were 0.9214, 0.8451, and 0.1008 for GPR and 0.9333, 0.8662, and 0.0937 for SVM. When artificial aeration is not provided by an aerator, the model with input parameters QW, L/W, Y/W, and ΔP (the difference between atmospheric pressure and the pressure under the jet) was selected as the best model, with R = 0.9222, DC = 0.8644, and RMSE = 0.0914 for GPR and 0.87, 0.7543, and 0.123, respectively, for SVM.

Keywords: Gaussian process regression, chute spillway, support vector machine, aeration

Estimation of air concentration in chute spillway using metamodel methods

Kiyoumars Roushangar *, Reza Saadatjoo, Hamidreza Abbaszadeh, Aydin Panahi

Iranian Journal of Soil and Water Research, Volume 55, Issue 4, 2024, pp. 601-613

One way to prevent negative pressure and cavitation in spillways is to introduce air into the flow over them. Understanding how air concentration varies along the spillway is important for estimating the aeration level. This study explores the application of Gaussian process regression (GPR) and support vector machine (SVM) models to predicting air concentration. To this end, a dataset of 2268 laboratory measurements obtained from hydraulic models of chute spillways was used in the modeling process. Various input models were defined from different combinations of the measured parameters. The results demonstrate the high capability of both methods in estimating the air concentration over the spillway. Under artificial aeration conditions, flow discharge (QW), the ratio of longitudinal distance from the end of the deflector to channel width (L/W), and the ratio of depth (perpendicular to the spillway) to channel width (Y/W) significantly influenced the predicted air concentration. The statistical indices R, DC, and RMSE for this case were 0.9214, 0.8451, and 0.1008, respectively, for GPR and 0.9333, 0.8662, and 0.0937 for SVM. For scenarios without artificial aeration, the model with input parameters QW, L/W, Y/W, and ΔP (the difference between atmospheric pressure and the pressure under the jet) performed best with GPR, with R = 0.9222, DC = 0.8644, and RMSE = 0.0914. With SVM, the same model, with values of 0.87, 0.7543, and 0.123 for R, DC, and RMSE, respectively, was selected as the superior model.

Keywords: Aeration, Chute Spillway, Gaussian Process Regression, Support Vector Machine

Research/Original Article. Original language: Persian

Evaluating the performance of the SVM, LS-SVM, and SVM-GOA models in simulating flood peak discharge at the Poldokhtar station

Fatemeh Tavakoli, Hamed Nozari*, Safar Marofi

Iranian Journal of Soil and Water Research, Year 55, No. 4 (Serial 100, Tir 1403), pp. 537-552

Flood modeling or simulation is one of the basic approaches to managing and reducing the destructive effects of floods, and identifying efficient models for this purpose is one of the most important elements of watershed management. This study evaluated the accuracy of the classical support vector machine (SVM), the support vector machine combined with the grasshopper algorithm (SVM-GOA), and the least squares support vector machine (LS-SVM) in simulating flood peak discharge at the Poldokhtar station in the Karkheh basin. Records of 74 flood events between 1388 and 1395 (2009 to 2016) at the Poldokhtar station and daily rainfall from 13 rain gauge stations in the upstream watershed were used. Of these, 52 events were selected for training and 22 for validating the models. The results were compared using four statistical indices, the coefficient of determination (R2), root mean square error (RMSE), standard error (SE), and Nash coefficient (NS), together with an uncertainty analysis based on two indices, the average relative interval length (ARIL) and the percentage of coverage (POC). The results indicate the relative superiority of the LS-SVM model (SE = 0.407, RMSE = 110.16, NS = 0.91, R2 = 0.92) over the SVM model (SE = 0.5, RMSE = 137.70, NS = 0.87, R2 = 0.88) and the SVM-GOA model (SE = 0.519, RMSE = 144.53, NS = 0.83, R2 = 0.9). The average run time of the LS-SVM model is on the order of a few seconds, whereas that of the SVM-GOA model is on the order of a few hours. Manually tuning the parameters of the classical SVM model also takes considerable time. Because it has fewer tunable parameters than the SVM and SVM-GOA models, the LS-SVM model is easier to run. The LS-SVM model can therefore be preferred over the other two models with confidence and by a considerable margin.

Keywords: grasshopper algorithm, Karkheh basin, Poldokhtar, flood modeling, support vector machine

Evaluating the efficiency of SVM, LS-SVM and SVM-GOA models in simulating the Flood peak discharge at the Poldokhtar station

Fatemeh Tavakoli, Hamed Nozari *, Safar Marofi

Iranian Journal of Soil and Water Research, Volume 55, Issue 4, 2024, pp. 537-552

Flood modeling or simulation is a fundamental tool for controlling and minimizing the damaging impacts of floods, and identifying effective models for this purpose is crucial in watershed management. This study evaluates the accuracy of the classical support vector machine (SVM), the support vector machine combined with the grasshopper algorithm (SVM-GOA), and the least squares support vector machine (LS-SVM) in simulating the flood peak discharge at the Poldokhtar station in the Karkheh basin. For this study, 74 flood events from 2009 to 2016 at the Poldokhtar station and daily rainfall data from 13 stations in the upstream area for the same period were used. Of these, 52 events were allocated to training and 22 to validation. The results were compared using four statistical indicators: coefficient of determination (R2), root mean square error (RMSE), Nash efficiency (NS), and standard error (SE). Additionally, uncertainty analysis was performed using two indices: ARIL and POC. The results indicate the relative superiority of the LS-SVM model (SE = 0.407, RMSE = 110.16, NS = 0.91, R2 = 0.92) over the SVM model (SE = 0.5, RMSE = 137.70, NS = 0.87, R2 = 0.88) and the SVM-GOA model (SE = 0.519, RMSE = 144.53, NS = 0.83, R2 = 0.9). The study's overall conclusion is that the LS-SVM model is more accurate, faster, and easier to implement than the SVM and SVM-GOA models, and can therefore be confidently preferred over them. The research emphasizes the importance of precise flood modeling and simulation in watershed management for mitigating the destructive impact of floods.
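For readers unfamiliar with the Nash efficiency (NS) used in this comparison, the following sketch shows the usual Nash-Sutcliffe formula. The discharge values are hypothetical and are not taken from the study.

```python
def nash_sutcliffe(observed, simulated):
    """Nash-Sutcliffe efficiency: 1 - SSE / variance of observations about their mean."""
    if len(observed) != len(simulated) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_observed = sum(observed) / len(observed)
    error_sum = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    spread_sum = sum((o - mean_observed) ** 2 for o in observed)
    return 1.0 - error_sum / spread_sum


# Hypothetical peak discharges (m3/s) for four flood events.
observed = [100.0, 200.0, 300.0, 400.0]
simulated = [110.0, 190.0, 310.0, 390.0]

print(f"NS = {nash_sutcliffe(observed, simulated):.3f}")
```

A value of 1 indicates a perfect match, while a value of 0 means the model is no better than predicting the observed mean for every event.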

Keywords: Flood Modeling, Support Vector Machine, Grasshopper Algorithm, Karkheh Basin, Poldokhtar Station

Research/Original Article. Original language: Persian

Estimating crop water requirement and actual evapotranspiration from satellite images to improve volumetric water delivery in irrigation and drainage networks (case study: Mahabad irrigation and drainage network, West Azerbaijan Province)

Amir Nourjou*, Farid Feizolahpour

Irrigation and Drainage Structures Engineering Research, Year 24, No. 93 (Winter 1402), pp. 23-42

Given the quantitative and qualitative limitations of water, managing and delivering water volumetrically in irrigation and drainage networks is important. To this end, the cropping pattern of the Mahabad irrigation and drainage network was extracted for the 1397-98 crop year (2018-2019) using Sentinel-2 satellite images and support vector machine classification. The net volume of water required by the dominant crops at the volumetric delivery points was also calculated using meteorological data from the Mahabad station and the Penman-Monteith equation. Actual evapotranspiration was determined from Landsat 8 satellite images with the SEBAL algorithm, and spatial maps of actual evapotranspiration and net irrigation requirement were finally produced for the network. According to the results, 64 percent of the cultivated land in the Mahabad network (6786 hectares) was orchards and 36 percent (3808 hectares) was field crops. The net irrigation requirement (calculated evapotranspiration minus effective rainfall) was 71 million cubic meters, and the gross irrigation requirement, assuming an irrigation efficiency of 44 percent, was 161.36 million cubic meters. Total evapotranspiration from the SEBAL algorithm was 79.78 million cubic meters. Water withdrawal in the network was examined using the maps of land use, net irrigation requirement, and actual evapotranspiration; crop water requirements were met in the upstream lands of the network and those adjacent to the Mahabad River, whereas downstream areas suffered irrigation stress because of insufficient access to water.

Keywords: cropping pattern, plant phenology cycle, SEBAL, deficit irrigation, support vector machine

Estimation of crop water requirement and actual evapotranspiration using satellite images to improve volumetric water delivery in irrigation and drainage networks (case study: Mahabad irrigation and drainage network, West Azerbaijan province)

Amir Nourjou *, Farid Feizolahpour

Irrigation and Drainage Structures Engineering Research, Volume 24, Issue 93, 2024, pp. 23-42

Because Iran lies in arid and semi-arid regions and its water resources are limited in both quantity and quality, optimal management and volumetric delivery of water are important in irrigation and drainage networks. It is therefore necessary to estimate crop water requirements accurately and to supply adequate water to farmers. Remote sensing provides a way to obtain different layers of information quickly and at low cost. Accordingly, many researchers have used remote sensing data to monitor vegetation cover, produce land use maps, and estimate crop evapotranspiration, and have found the technology appropriate for such studies. Previous work shows that few studies have examined crop evapotranspiration in relation to crop water requirement. The main objectives of this study are therefore to produce cropping pattern and land use maps from Sentinel-2 satellite images, to determine the water requirement at the delivery points of the irrigation network, to determine the actual evapotranspiration of the crop cover using the SEBAL algorithm and Landsat 8 images, and finally to evaluate water supply and management in the Mahabad irrigation and drainage network. To determine the cropping pattern of the network, Sentinel-2 images from the 2018-2019 crop year were used. The images were screened for coverage of the study region and percentage of cloud cover, and after suitable images were selected, pre-processing operations including radiometric and atmospheric corrections were applied. The NDVI index was then calculated from the selected images. After the classification classes were defined, the phenological cycle of the crops in each class was examined and the spectral pattern of the crops during the growing season was determined.
Training samples for supervised classification were selected using existing maps, Google Earth images, false color composites, and the growth pattern of the crops, and some samples were set aside for validating the classified map. The cropping pattern map was then obtained using the SVM classification algorithm. After the crop classification map was generated, the water requirement of each class was determined at the volumetric water delivery points using the Penman-Monteith evapotranspiration method, crop coefficients, and irrigation application efficiency. Finally, the actual evapotranspiration of the study area was calculated with the SEBAL algorithm and compared with the net water requirement map. The kappa coefficient and overall accuracy of the classified map were 0.953 and 91%, respectively. The cultivated area was 10594 hectares, and 1576 hectares of farmland were uncultivated. Orchards covered 6786 hectares, and sugar beet, wheat, alfalfa, and corn covered 998, 1839, 693, and 278 hectares, respectively. The net irrigation water requirement was 71 million cubic meters, and the gross irrigation water requirement, assuming an irrigation efficiency of 44%, was 161.36 million cubic meters. The SEBAL evapotranspiration maps for the growing season indicated a total evapotranspiration of 79.78 million cubic meters, and this amount was 14% higher than the net irrigation water requirement. Finally, water consumption in the Mahabad irrigation and drainage network was evaluated using the crop classification map and a comparison of the net irrigation water requirement and evapotranspiration maps.
In the upstream farms of the network and those close to the Mahabad River, water consumption exceeded the net water requirement, whereas downstream areas faced deficit irrigation because of insufficient water. The results show that satellite images and remote sensing make it possible to monitor and evaluate the condition of agricultural farms on a large scale with acceptable accuracy. Management of water supply and water use efficiency in irrigation and drainage networks can also be improved by creating up-to-date land use maps, determining net and gross irrigation water requirements, and comparing them with actual evapotranspiration maps.
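The water-requirement and area figures reported in this abstract can be cross-checked with simple arithmetic. The sketch below assumes that the gross requirement is the net requirement divided by the irrigation efficiency, which is the usual convention and reproduces the reported value; the variable names are illustrative.

```python
# Figures taken from the abstract.
net_requirement_mcm = 71.0      # net irrigation requirement, million cubic meters
irrigation_efficiency = 0.44    # 44 percent

# Assumed relation: gross requirement = net requirement / efficiency.
gross_requirement_mcm = net_requirement_mcm / irrigation_efficiency

orchard_ha = 6786
field_crops_ha = {"sugar beet": 998, "wheat": 1839, "alfalfa": 693, "corn": 278}

field_crop_total_ha = sum(field_crops_ha.values())
cultivated_ha = orchard_ha + field_crop_total_ha
orchard_share = 100.0 * orchard_ha / cultivated_ha

print(f"gross requirement: {gross_requirement_mcm:.2f} million m3")
print(f"cultivated area:   {cultivated_ha} ha")
print(f"orchard share:     {orchard_share:.0f}%")
```

The results match the reported 161.36 million cubic meters, the 10594 hectares of cultivated land, and the 64 percent orchard share given in the Persian abstract.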

Keywords: Crop Pattern, Plant Phenology Cycle, SEBAL, Deficit Irrigation, Support Vector Machine

Case Study. Original language: Persian

Uncertainty analysis of artificial neural network (ANN) and support vector machine (SVM) models in predicting monthly river flow (case study: Ghezelozan River)

Majid Mohammadi, Pouya Allahverdipour*

Journal of Water and Soil Management and Modeling, Year 4, No. 2 (Summer 1403), pp. 311-326

Over the last decade, artificial intelligence methods have been the most widely used approach for simulating various processes, including hydrological ones, but their results have always been subject to uncertainty. One approach that can partly address this problem is uncertainty analysis of the predictions. In this study, the uncertainty of the results of artificial neural network (ANN) and support vector machine (SVM) models in predicting monthly river flow was evaluated using Monte Carlo simulation and the 95PPU and d-factor values. The monthly flow time series of the Ghezelozan River over a 39-year period from 1355 to 1393 (1976 to 2014) at the Bianlu-Yasaul gauging station was used, with 75 percent of the data for training and 25 percent for testing the models. To estimate river flow, six input combinations comprising the flow of the previous one, two, and three months and the month number were used. The models were evaluated with the correlation coefficient (R) and root mean square error (RMSE). The results showed that although the ANN model (R = 0.757, RMSE = 9.45) performs well relative to the SVM model (R = 0.729, RMSE = 10.946) in predicting river flow, its results carry high uncertainty. Comparing the uncertainty analyses showed that the SVM model, with d-factor and 95PPU values of 0.155 and 17.241, respectively, has less uncertainty than the ANN model, with d-factor and 95PPU values of 0.993 and 85.470, and is superior to the ANN model in this respect. According to these results, such methods should be applied in risk management and future planning with the understanding that advanced artificial intelligence models also carry uncertainty, in order to obtain the best performance.

Keywords: flow prediction, Ghezelozan River, artificial neural network, uncertainty, support vector machine

Uncertainty analysis of artificial neural network (ANN) and support vector machine (SVM) models in predicting monthly river flow (Case study: Ghezelozan River)

Majid Mohammadi, Pouya Allahverdipour *

Journal of Water and Soil Management and Modeling, Volume 4, Issue 2, 2024, pp. 311-326

Introduction

River flow forecasting has been one of the important challenges of water resources management in recent decades, so many researchers have proposed different methods to improve the performance of forecasting models. In the last decade, artificial intelligence methods have been most widely used in the simulation of various processes, including hydrological processes, due to their flexibility and high accuracy in modeling. However, the results of these methods have always been associated with uncertainty due to several factors such as structure, algorithm, input data, and the type of method chosen for data calibration. One of the methods that can somewhat solve this problem is the uncertainty analysis of the predictions made by these models.

Materials and Methods

In this study, the uncertainty of the results of artificial neural network (ANN) and support vector machine (SVM) models in predicting monthly river flow was evaluated. The monthly flow time series of the Ghezelozan River at the Bianlu-Yasaul stream gauging station over 39 years, from 1976 to 2014, was used, with 75% of the data for training and 25% for testing the models. To estimate the monthly flow of the Ghezelozan River, six input combinations comprising the flow of the previous one, two, and three months and the month number were used. The accuracy and performance of the models were then compared using the correlation coefficient (R) and root mean square error (RMSE). Finally, the uncertainty of the results of the ANN and SVM models was evaluated by the Monte Carlo method using d-factor and 95PPU values.
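The input structure described here (lagged flows plus a month number, followed by a 75/25 split) might be assembled roughly as follows. This is a sketch under stated assumptions: the abstract does not say whether the split was chronological, so a chronological split is assumed, and the function names and synthetic flow series are hypothetical.

```python
def build_lagged_samples(flows, start_month=1, max_lag=3):
    """Pair each month's flow with the flows of the previous max_lag months and its month number."""
    samples = []
    for t in range(max_lag, len(flows)):
        month_number = (start_month - 1 + t) % 12 + 1
        lagged = [flows[t - k] for k in range(1, max_lag + 1)]
        samples.append((lagged + [month_number], flows[t]))
    return samples


def chronological_split(samples, train_fraction=0.75):
    """Keep the earliest samples for training and the rest for testing (assumed scheme)."""
    cut = int(len(samples) * train_fraction)
    return samples[:cut], samples[cut:]


# Synthetic two-year monthly series standing in for the observed flows.
flows = [10.0 + (m % 12) for m in range(24)]

samples = build_lagged_samples(flows)
train, test = chronological_split(samples)

print(len(samples), len(train), len(test))
print(samples[0])
```

Each of the six input combinations in the study would correspond to choosing a subset of these feature columns before fitting the ANN or SVM model.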

Results and Discussion

For the ANN model, the best performance was obtained with the input combination of the flow of the previous two months and the month number, giving R and RMSE values of 0.757 and 9.45, respectively. For the SVM model, the best performance was obtained with the combination of the flow of the previous one, two, and three months and the month number, giving R and RMSE values of 0.729 and 10.946, respectively. The uncertainty analysis showed that the ANN model has more uncertainty in its output values than the SVM model: the d-factor of the ANN model was larger than that of the SVM model in both the training and testing phases. The SVM model, with d-factor and 95PPU values of 0.155 and 17.241, respectively, has less uncertainty in its output values than the ANN model, with d-factor and 95PPU values of 0.993 and 85.470. Accordingly, the number of observations falling within the 95% confidence range (95PPU) was much larger for the ANN model than for the SVM model in both the training and testing phases. Both models also showed more uncertainty in months with large flow volumes, which may be due to the complexity of the process and the involvement of uncertain factors in these months, as well as to factors not considered in the structure of the predictive models.
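One common way to obtain a 95% band and a d-factor from a Monte Carlo ensemble is sketched below. The abstract does not give the exact formulas used, so the definitions here are assumptions: the band is taken between the 2.5th and 97.5th percentiles of the ensemble at each time step, the d-factor is the mean band width divided by the population standard deviation of the observations, and the bracketed percentage is the share of observations inside the band. All data are hypothetical.

```python
import statistics


def percentile(sorted_values, q):
    """Linearly interpolated percentile of an already sorted list, with q in [0, 1]."""
    position = q * (len(sorted_values) - 1)
    lower_index = int(position)
    upper_index = min(lower_index + 1, len(sorted_values) - 1)
    fraction = position - lower_index
    return sorted_values[lower_index] * (1 - fraction) + sorted_values[upper_index] * fraction


def uncertainty_band(ensemble):
    """2.5th and 97.5th percentiles per time step; ensemble is a list of simulation runs."""
    lower, upper = [], []
    for step_values in zip(*ensemble):
        ordered = sorted(step_values)
        lower.append(percentile(ordered, 0.025))
        upper.append(percentile(ordered, 0.975))
    return lower, upper


def d_factor(lower, upper, observed):
    """Mean band width relative to the spread of the observations (assumed definition)."""
    mean_width = sum(u - l for l, u in zip(lower, upper)) / len(observed)
    return mean_width / statistics.pstdev(observed)


def percent_bracketed(lower, upper, observed):
    """Percentage of observations that fall inside the band."""
    inside = sum(1 for l, u, o in zip(lower, upper, observed) if l <= o <= u)
    return 100.0 * inside / len(observed)


# Hypothetical ensemble of five Monte Carlo runs over three months.
ensemble = [
    [9.0, 19.0, 33.0],
    [10.5, 21.0, 34.0],
    [11.0, 20.0, 35.0],
    [9.5, 22.0, 36.0],
    [10.0, 18.0, 37.0],
]
observed = [10.0, 20.0, 30.0]

lower, upper = uncertainty_band(ensemble)
print(f"d-factor: {d_factor(lower, upper, observed):.3f}")
print(f"bracketed: {percent_bracketed(lower, upper, observed):.1f}%")
```

Under these assumed definitions, a narrow band gives a small d-factor but may bracket few observations, which is the trade-off visible in the ANN and SVM values reported above.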

Conclusion

Predictions of the monthly flow of the Ghezelozan River showed that although the ANN model (R = 0.757, RMSE = 9.45) performs well compared with the SVM model (R = 0.729, RMSE = 10.946), its results are associated with high uncertainty. The uncertainty analysis by the Monte Carlo method showed that the SVM model, with d-factor and 95PPU values of 0.155 and 17.241, respectively, has less uncertainty in predicting the monthly flow of the Ghezelozan River than the ANN model, with d-factor and 95PPU values of 0.993 and 85.470, and is better than the ANN model in this respect. According to these results, these methods should be applied in risk management and future planning with the understanding that advanced artificial intelligence models also carry uncertainty, in order to obtain the best performance.

Keywords: Artificial Neural Network, Flow Prediction, Ghezelozan River, Support Vector Machine, Uncertainty

Case Study. Original language: Persian

Landslide susceptibility zoning using machine learning algorithms (study area: part of the Haraz watershed)

Alireza Sepahvand*, Nasrin Beiranvand

Journal of Water and Soil Management and Modeling, Year 4, No. 2 (Summer 1403), pp. 261-278

Landslides are a geological phenomenon found throughout the world that cause many casualties and heavy economic losses every year. This study was therefore carried out to evaluate landslide susceptibility zoning using different machine learning algorithms, namely support vector machine (SVM) and Gaussian process (GP) regression, each with two kernels (PUK and RBF), and random forest (RF), in part of the Haraz watershed, Iran. Nine factors (slope, aspect, elevation, geology, land use, distance from faults, distance from roads, distance from rivers, and precipitation) were used as input parameters, and landslide and non-landslide points as the output parameter, for modeling and zoning landslide susceptibility. Of the 148 landslide and non-landslide points, 70 percent were used for training and 30 percent for testing. Accuracy, F1-score, and AUC were used to evaluate model performance and select the best model, and an elimination method was used for sensitivity analysis. The results showed that the RF model (Accuracy = 0.9, F1-score = 0.957, AUC = 0.999) was the best model for landslide susceptibility zoning in the testing phase compared with the other models. According to the zoning map, 31.86, 32.16, 13.38, 9.73, and 12.84 percent of the area fall in the very low, low, moderate, high, and very high susceptibility classes, respectively. The sensitivity analysis showed that slope aspect is the most sensitive parameter in landslide hazard zoning. Comparison of the model results showed no significant difference between predicted and observed values for the models used. Based on the landslide susceptibility map, stable areas with low susceptibility to mass movements can be prioritized and managed for construction operations.

Keywords: Haraz watershed, Gaussian process regression, landslide, landslide susceptibility index, support vector machine, random forest model

Landslide susceptibility mapping using various soft computing techniques (Case study: A part of Haraz Watershed)

Alireza Sepahvand *, Nasrin Beiranvand

Journal of Water and Soil Management and Modeling, Volume 4, Issue 2, 2024, pp. 261-278

Introduction

A landslide is a mass movement on the earth's surface. Landslides cause notable injury and loss of life and destroy infrastructure and property. They accounted for approximately nine percent of natural disasters worldwide during the 1990s, and studies expect this trend to continue as human development increases. Many studies have sought to determine the factors affecting mass movement. In large parts of Iran, including the mountainous areas, tectonic activity, high seismicity, and diverse geological and climatic conditions make much of the country prone to landslides. Landslides cause wide damage to natural resources, human settlements, and infrastructure, and contribute to mud floods and the filling of reservoirs. The social and environmental impacts of landslides, such as migration and unemployment, should also not be ignored. One strategy for reducing losses from mass movements is to identify and manage unstable slopes, and landslide hazard mapping is used to identify such unstable areas. The main purpose of this research is to assess the parameters that affect landslide occurrence and to compare different machine learning models, including SVM, GP regression, and RF, for landslide susceptibility zoning.

Materials and Methods

The study area is a part of the Haraz Watershed, Mazandaran Province, Iran, where many landslides occur and cause damage after each heavy rain. It was therefore selected as a suitable watershed for evaluating landslide susceptibility mapping (LSM). The land cover consists mainly of rangeland, and the geology consists mainly of Quaternary and Shemshak formations. The first step in assessing landslide susceptibility is to gather the necessary data and prepare the information. The factors were chosen based on the literature, local conditions, and previous studies. Nine parameters (slope angle, slope aspect, elevation, geology, land use, distance from faults, distance from roads, distance from rivers, and precipitation) were identified as key factors for predicting landslide susceptibility. To assess the effectiveness of GP-PUK, GP-RBF, SVM-PUK, SVM-RBF, and RF in estimating the landslide susceptibility map, the study used field data. The dataset contains 148 observations of landslide occurrence and non-occurrence points, randomly separated into a training set (70%; 103 points) and a testing set (30%; 45 points). Three statistical evaluation parameters were used to judge the performance of the soft computing techniques: the correlation coefficient (C.C.), root mean square error (RMSE), and Nash–Sutcliffe model efficiency (NSE).
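The 70/30 random split described above can be sketched as follows. This is a minimal illustration, not the authors' procedure: the function name `split_train_test`, the fixed seed, and the use of integer IDs in place of the real landslide points are all assumptions.

```python
import random

def split_train_test(samples, train_fraction=0.7, seed=0):
    """Shuffle a copy of `samples` and split it into training and testing subsets."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)            # seed is an assumption, for repeatability
    n_train = int(len(shuffled) * train_fraction)    # floor: 148 * 0.7 -> 103
    return shuffled[:n_train], shuffled[n_train:]

# Hypothetical point IDs standing in for the 148 landslide / non-landslide observations.
points = list(range(148))
train, test = split_train_test(points)
print(len(train), len(test))
```

Flooring the training count reproduces the 103/45 partition reported in the abstract; rounding to the nearest integer would give 104/44 instead.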

Results and Discussion

According to the comparison of methods, RF was the best model, and its accuracy was the most suitable for estimating landslide occurrence. RF was therefore used to produce the landslide susceptibility map. A single-factor ANOVA test suggests that the difference between observed and predicted values of landslide occurrence and non-occurrence is insignificant for the GP_PUK, GP_RBF, SVM_PUK, SVM_RBF, and Random Forest approaches. The landslide susceptibility map was divided into five classes, from non-susceptible to very high susceptibility. According to the final map, the “non-susceptible” class covers 35.86 km2, the “low susceptibility” class 36.19 km2, the “moderate susceptibility” class 15.06 km2, the “high susceptibility” class 10.95 km2, and the “very high susceptibility” class 14.46 km2 of the Haraz Watershed. Sensitivity analysis was performed to find the most significant input parameter for predicting landslide occurrence and non-occurrence. The result shows that slope aspect plays the major role compared with the other input parameters.
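As a quick arithmetic check, the class areas listed above can be converted to shares of the total mapped area. The sketch below uses only the five km2 figures from this abstract; the dictionary keys and the function name `area_shares` are illustrative.

```python
# Class areas (km2) as reported in the abstract above.
class_areas_km2 = {
    "non-susceptible": 35.86,
    "low": 36.19,
    "moderate": 15.06,
    "high": 10.95,
    "very high": 14.46,
}

def area_shares(areas):
    """Return each class's share of the total mapped area, in percent."""
    total = sum(areas.values())
    return {name: 100.0 * area / total for name, area in areas.items()}

shares = area_shares(class_areas_km2)
for name, pct in shares.items():
    print(f"{name}: {pct:.2f}%")
```

The resulting shares agree, to within rounding, with the class percentages quoted in the shorter abstract of the same article (about 32% for the low class, for example).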

Conclusion

Based on these results, some zones are potentially dangerous for future habitation and development. There is thus an immediate need to implement mitigation measures in the high-hazard and very high-hazard zones, or to avoid such zones for habitation and future development. Local authorities can use the results of this research to manage and plan development within their areas properly and systematically.

Keywords: Haraz Watershed, Landslide, Landslide Susceptibility Index (LSI), Support Vector Machine, Gaussian Process, Random Forest Method

Abstract View Paper Case Study Original: Persian
Evaluation of soft computing methods for estimating river suspended sediment (Hassan Abad station, Tirah River)

Amir Moradinejad*, Saeid Khosrobeigi, Mahmood Akbari, Seyed Ahmad Hosseini

Journal of Water and Soil Management and Modeling, Year 4, No. 2 (Summer 1403), pp. 241-260

Estimating river sediment load is an important, practical issue in the study and design of water engineering projects such as the design and development of irrigation and drainage networks and river water intakes. Statistical and regression models are the most common analysis methods, but because they treat these phenomena linearly, their results have often carried errors. Hydraulic models cannot always be trusted for simulating sediment, because they require large amounts of data, the required data are sometimes unavailable, and the data can be inaccurate owing to human error. Today, intelligent fuzzy and neural systems, which can handle complex nonlinear phenomena, are widely applied to water engineering problems, including sediment. The present study evaluates and compares four methods, the adaptive neuro-fuzzy inference system (ANFIS), support vector machine (SVM), gene expression programming (GEP), and the group method of data handling (GMDH), for estimating sediment load at the Hassan Abad station on the Tirah River in Markazi Province. The performance of the four model types in simulating river sediment load was examined, and their results were then compared with one another and with the sediment rating curve. The results indicate acceptable performance of the models relative to the rating curve. They also showed the superiority of GMDH, with the highest coefficient of determination (R2) of 0.99 and the lowest root mean square error (RMSE) of 0.0038 tons per day. GEP performed somewhat better than SVM and ANFIS. In the next stage, the best selected input patterns of the ANFIS, SVM, and GEP models were used as inputs to GMDH. The GMDH model performed acceptably, with the highest R2 values of 0.99 and 0.98 and the lowest RMSE values of 0.0038 and 0.0045 tons per day, respectively. All four data mining methods examined gave considerably better results than the sediment rating curve.

Keywords: suspended load, gene expression programming, sediment, neuro-fuzzy network, support vector machine


Assessing soft calculation methods in river suspended sediment estimation (Hassan Abad station of Tirah river)

Amir Moradinejad *, Saeid Khosrobeigi, Mahmood Akbari, Seyed Ahmad Hosseini

Journal of Water and Soil Management and Modeling, Volume:4 Issue: 2, 2024, PP 241 -260

Introduction

Rivers are always subject to erosion and sediment transport. Sediment transport is one of the most complex topics in river engineering and a constant focus of experts and water engineers. It is an important hydrodynamic process that affects many hydraulic systems and water facilities and is considered one of the basic problems in exploiting surface water resources globally. Estimating the sediment load of rivers is an important and practical issue in the study and design of water engineering projects, such as the design and development of irrigation and drainage networks and water extraction from rivers. Sediment concentration can be determined by direct or indirect methods; direct methods are usually expensive and time-consuming. Many factors affect this phenomenon, which makes it difficult to analyze, so statistical and regression models, which treat it linearly, cannot model it with acceptable accuracy. Hydraulic models cannot always be trusted for simulating sediment because they need a lot of data, the required data may be unavailable, and the data may be inaccurate owing to human error. Nowadays, intelligent fuzzy and neural systems, thanks to their ability to handle complex and nonlinear phenomena, have found many applications in water engineering problems, including sedimentation. The purpose of this research is to evaluate and compare the adaptive neuro-fuzzy inference system (ANFIS), support vector machine (SVM), gene expression programming (GEP), and the group method of data handling (GMDH) in estimating the sediment load of the Tirah River, Markazi Province.

Materials and Methods

In this research, the long-term daily records of temperature, rainfall, average flow rate, and sediment concentration were first collected from the Hassan Abad hydrometric and sediment measuring station on the main branch of the Tirah River. Then the data were tested for sufficiency, the correlation of river discharge, precipitation, and temperature with sediment discharge was checked, and the long-term average of suspended sediment in the studied stations was determined. In the next step, a suitable combination of input variables was selected. The input patterns can be designed from the relationships among flow discharge, sediment discharge, rainfall, and temperature. Because these parameters are time series, the input patterns of the soft computing models should be designed with time delays (as in time series analysis and forecasting). After the most appropriate time delays of discharge, sediment, temperature, and rainfall were determined, the structure of each soft computing model was designed. Sediment discharge was then estimated with SVM, GEP, ANFIS, and GMDH, and the data mining methods were compared with one another, with the sediment rating curve, and with observational data. About 70% of the data was used for training and 20 to 30% for validation and testing.

Results and Discussion

Based on the statistical indicators for optimal model selection, both the SVR and ANFIS models performed best with input pattern one. For this pattern, the SVR model gave an R2 of 0.96 and an RMSE of 0.0047, while the test-stage R2 and RMSE in predicting suspended sediment were 0.95 and 0.014 for the ANFIS model and 0.50 and 4.97 for the GEP model. The GEP model performed best with input pattern nine, for which its R2 and RMSE were 0.99 and 0.010. For the same pattern, the test-stage R2 and RMSE in predicting suspended sediment were 0.70 and 0.015 for the ANFIS model and 0.78 and 0.0185 for the SVR model.

Conclusion

It can be seen that the performance of the GEP model was better compared to other models. SVR and ANFIS models are ranked second and third. In the next step, the best-selected pattern of ANFIS, SVM, and GEP models was used as the input of the GMDH model. First, input pattern one, which was selected as the best pattern for ANFIS and SVM models, was introduced as the input of the GMDH model. In the training and test, the values of R2 statistical indices are 0.94 and 0.99, respectively, the RMSE error value is 0.0079 and 0.0038, respectively, the MSE value is 0.000062 and 0.000015, respectively, and the MAPE values are respectively 0.007 and 0.003. In the next step, input pattern nine, which was selected as the best pattern for the GEP model, is introduced as GMDH input. In the training and test steps, the value of R2 is equal to 0.95 and 0.98 respectively, the RMSE error value is equal to 0.0077 and 0.0045 respectively, and the MSE value is equal to 0.0006 and 0.00002 respectively, and MAPE value is equal to 363 and 502. The results showed the acceptable performance of the GMDH model with the highest R2 equal to 0.99 and 0.98 and the lowest RMSE equal to 0.0038 and 0.0045, respectively.

Keywords: Fuzzy Neural Network, Gene Expression Programming, Suspended Load, Sedimentation, Support Vector Machine

Abstract View Paper Research/Original Article Original: Persian
Uncertainty analysis in simulating effective seepage discharge from earth dams with the Monte Carlo algorithm and machine learning

Farhoud Kalateh*, Milad Kheiry

Journal of Water and Soil Management and Modeling, Year 4, No. 1 (Spring 1403), pp. 151-170

Uncertainties arising from the complex nature of soil have broadened the use of probabilistic analysis in the design of earth structures and, in some countries, have changed the design codes for such structures. This study analyzes seepage under uncertainty in soil hydraulic conductivity for different dam geometries. The finite element method, as the numerical solver, was combined with machine learning (ML) to study seepage through an earth dam. The uncertainty analysis was implemented in Fortran with Monte Carlo simulation (MCS), run with 2000 samples for each sub-model, and a frequency distribution function was extracted for each model. The probabilistic results were then analyzed with support vector regression (SVR) and gene expression programming (GEP), and a tree model for seepage was also presented. Seepage flow was studied in dimensionless form through the effective seepage discharge (ESD), which represents the outflow discharge while accounting for the dam geometry and its hydraulic conductivity. The data produced by the Fortran code were modeled with both GEP and SVR. The correlation coefficients of the SVR and GEP models were 0.96 (for the test, training, and full data sets) and 0.91, respectively, and the root mean square error (RMSE) of both models was close to 0.01. Both models can therefore predict the effective discharge with good accuracy, and the SVR results agree more closely than the GEP results with the finite element analysis.

Keywords: probability density function, probabilistic analysis, Fortran programming language, support vector machine, porous medium, soil hydraulic conductivity


Uncertainty analysis in the simulation of effective seepage flow through earth dams with the Monte Carlo algorithm and machine learning

Farhoud Kalateh *, Milad Kheiry

Journal of Water and Soil Management and Modeling, Volume:4 Issue: 1, 2024, PP 151 -170

Introduction

Dams are very costly to build, and their failure can be hazardous; at the same time, they are vital to every country for freshwater storage. Deterministic, traditional algorithms cannot answer the multidimensional and complex problems of dam construction, so hybrid, probability-based methods are needed. Fluid flow problems are inherently complex, and modeling them requires an advanced algorithm that can capture their non-deterministic nature. Earth dams are porous, multiphase, complex media, and the hydraulic and mechanical variables in their different parts are uncertain. For this reason, dam design regulations have in recent years been revised toward using non-deterministic, probabilistic variables in calculations. A probabilistic engineering view leads to a more realistic understanding of a design than deterministic approaches do. In this research, artificial intelligence (AI) methods were used to analyze the data and provide a predictive model of seepage discharge through the earth dam. The research has two purposes: (a) to estimate the effect of uncertainty in the dam's hydraulic conductivity on seepage discharge, and (b) to provide a dimensionless model of seepage discharge using the gene expression programming (GEP) and support vector machine (SVM) methods.

Materials and Methods

Monte Carlo simulation (MCS) with 2000 iterations was used for the stochastic analysis. The first step of MCS is to choose the deterministic performance function. In the second step, the input variables of the performance function and the probability distribution of each variable are defined. Repeating the process n times yields n random solutions of the problem, from which the probability density function (PDF) and cumulative distribution function (CDF) graphs were drawn. To check convergence in the Fortran code, the hydraulic heads obtained in iteration n were compared with those obtained in iteration n-1, and the program stops when the difference is less than the error tolerance. In the next part of the algorithm, the data obtained from the repeated MCS runs are used to build a model of the relationship between the effective seepage discharge (ESD) and the input variables, using gene expression programming (GEP) and support vector machine (SVM). After GEP and support vector regression (SVR) modeling, the predicted and observed results were compared using statistical indexes such as MSE, RMSE, MAE, and the correlation coefficient.
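The Monte Carlo steps above (choose a deterministic performance function, assign a distribution to the uncertain input, repeat n times, summarize the outputs) can be sketched in a few lines. This is not the authors' Fortran finite element code: as a stand-in performance function it uses the textbook Dupuit formula for unconfined seepage per unit width, and the lognormal distribution, its parameters, and the dam dimensions are all assumed values for illustration.

```python
import math
import random
import statistics

def dupuit_discharge(k, h_up, h_down, length):
    """Stand-in performance function (assumption): Dupuit seepage per unit width, m2/s."""
    return k * (h_up ** 2 - h_down ** 2) / (2.0 * length)

def monte_carlo_seepage(n_runs=2000, k_median=1e-6, sigma_ln_k=0.5,
                        h_up=20.0, h_down=2.0, length=60.0, seed=42):
    """Sample hydraulic conductivity from a lognormal law and evaluate the discharge."""
    rng = random.Random(seed)
    mu = math.log(k_median)  # lognormvariate takes the mean of ln(k)
    return [dupuit_discharge(rng.lognormvariate(mu, sigma_ln_k), h_up, h_down, length)
            for _ in range(n_runs)]

def empirical_cdf(values):
    """Return (value, cumulative probability) pairs for plotting a CDF."""
    ordered = sorted(values)
    n = len(ordered)
    return [(v, (i + 1) / n) for i, v in enumerate(ordered)]

discharges = monte_carlo_seepage()
mean_q = statistics.mean(discharges)
std_q = statistics.stdev(discharges)
cdf = empirical_cdf(discharges)
print(f"mean = {mean_q:.3e} m2/s, std = {std_q:.3e} m2/s")
```

In the study itself each sample would require a finite element solve; the closed-form function here only keeps the sketch short and runnable.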

Results and Discussion

The different earth dam models were implemented in the Fortran program, and the mean and standard deviation of the seepage discharge under uncertainty were obtained. To relate the ESD to the inputs, the following dimensionless parameters were defined for the gene expression programming model: Kx/Ky, W/B, Bd/B, Bu/B, Hdam/B, Hu/Hdam, and Hd/Hu. These are the factors that influence the seepage discharge of the earth dam; the discharge itself is likewise expressed in dimensionless form as the effective seepage discharge (ESD). Kx and Ky are the soil permeabilities in the horizontal and vertical directions, respectively (m/s); W is the crest width; B is the base width of the dam; Bd is the horizontal distance from the crest to the downstream tip of the dam; Bu is the horizontal distance from the crest to the upstream tip of the dam; Hdam is the dam height; Hu is the reservoir water level; and Hd is the water height downstream of the dam. All of these lengths are in meters. Increasing the ratio of horizontal to vertical hydraulic conductivity, Kx/Ky, by 49% increases the ESD by 14%. Increasing the horizontal permeability by 25% increases the ESD by 4.56%, whereas increasing the vertical permeability by 25% decreases the ESD by 4.72%.

Conclusion

After the finite element analysis and modeling with gene expression programming (GEP) and support vector regression (SVR), statistical analysis showed that both models predict the ESD well, with correlation coefficients above 0.9. Vertical hydraulic conductivity (Ky) has a greater effect on the ESD than horizontal hydraulic conductivity (Kx). The geometric investigation of the dam also shows that the ratio Hdam/B has a direct impact on the ESD and that a lower downstream slope leads to a lower ESD. Statistical analysis was used to compare the SVR and GEP models against the Fortran output. Overall, the SVR model is closer than the GEP model to the results of the Fortran code, with a low root mean square error (RMSE) and a high correlation coefficient.

Keywords: Fortran programming language, Porous medium, Probabilistic analysis, Probability density function (PDF), Soil hydraulic conductivity, Support Vector Machine (SVR)

Abstract View Paper Research/Original Article Original: Persian
Evaluation of effective parameters for predicting brine potassium grade using support vector machine and random forest algorithms (case study: playa of Khoor and Biabank County, Isfahan Province)

Maryam Iraji*, Seyed Alireza Movahedi Naeini, Chooghi Bayram Komaki, Soheila Ebrahimi, Bamshad Yaghmaei

Iranian Journal of Soil and Water Research, Year 55, No. 1 (Serial No. 97, Farvardin 1403), pp. 145-161

The importance of potassium in raising the quantity and quality of agricultural products has increased demand for potassium fertilizers. Whether potassium can be extracted from underground brines depends on their potassium grade. This study uses the random forest (RF) and support vector machine (SVM) algorithms to prioritize the parameters affecting the potassium grade of underground brine in the Khoor and Biabank playa, Isfahan Province. To this end, 55 parameters were measured in 12 boreholes. The independent variables were core saturation moisture percentage at 15 depths, core bulk density at 15 depths, core porosity at 15 depths, polygon area, groundwater depth, salt layer depth, surface-layer potassium, brine density, and the concentrations of calcium, magnesium, sodium, and chlorine; potassium grade was the dependent variable. In the RF model, permutation feature importance (PFI) and recursive feature elimination (RFE) were used to prioritize the parameters. For the different SVM kernels, to avoid collinearity among the independent parameters, all combinations of the independent variables were examined subject to a variance inflation factor below 8, and the combination with the highest coefficient of determination and the lowest MSE was selected as the best. The effective parameters for predicting brine potassium grade were sp, ap, duw, slp, and SAR in the RF algorithm and n, sp, duw, and SAR in the linear SVM, which gave the best results (high coefficient of determination and low error). The coefficients of determination of the two models were 0.99 and 0.97, respectively, indicating good accuracy for both algorithms.

Keywords: grade prediction, random forest, brine, support vector machine


Evaluation of effective parameters for predicting the potassium grade of saline water by using support vector machine and random forest algorithms (case study: playa of Khoor and Biabank area city, Isfahan province)

Maryam Iraji *, Seyed Alireza Movahedi Naeini, Chooghi Bayram Komaki, Soheila Ebrahimi, Bamshad Yaghmaei

Iranian Journal of Soil and Water Research, Volume:55 Issue: 1, 2024, PP 145 -161

The importance of potassium for agricultural products has increased the demand for potassium fertilizers. The potassium grade of underground brines determines whether extraction is viable. The purpose of this research is to use the RF and SVM algorithms to prioritize the parameters affecting the potassium grade of the underground brine in the Khoor and Biabank playa, Isfahan province. For this purpose, 55 parameters were measured in 12 boreholes. The independent variables were the saturated moisture percentage, bulk density, and porosity of the core at 15 different depths each, the polygon area, the groundwater depth, the depth of the salt layer, the potassium of the surface layer, the density of the brine, and the amounts of calcium, magnesium, sodium, and chlorine; the potassium grade was the dependent variable. In the RF model, permutation feature importance (PFI) and recursive feature elimination (RFE) were used for prioritization. For the different kernels of the SVM algorithm, to prevent collinearity among the independent parameters, all combinations of the independent variables with a variance inflation factor below 8 were examined, and the one with the highest coefficient of determination and the lowest MSE was selected as the best combination. The effective parameters for predicting the potassium grade of the brine were sp, ap, duw, slp, and SAR in the RF algorithm and n, sp, duw, and SAR in the linear function of the SVM algorithm, which led to the best results. The coefficients of determination of the two models were 0.99 and 0.97, respectively, which indicates the good accuracy of both algorithms.
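The collinearity screen mentioned above (keep only variable combinations whose variance inflation factor is below 8) can be illustrated with a small, dependency-free sketch. It relies on the standard identity that the VIF of each predictor equals the corresponding diagonal entry of the inverse correlation matrix. The function names and the toy columns `x1`, `x2`, `x3` are hypothetical, and the abstract does not say how the authors computed their VIFs.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def invert(matrix):
    """Invert a small square matrix by Gauss-Jordan elimination with partial pivoting."""
    n = len(matrix)
    aug = [list(row) + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("correlation matrix is singular (perfect collinearity)")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [a - f * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def variance_inflation_factors(columns):
    """VIF of each predictor: the diagonal of the inverse correlation matrix."""
    k = len(columns)
    corr = [[1.0 if i == j else pearson(columns[i], columns[j]) for j in range(k)]
            for i in range(k)]
    inverse = invert(corr)
    return [inverse[j][j] for j in range(k)]

def passes_vif(columns, threshold=8.0):
    """True when every predictor in the combination has VIF below the threshold."""
    return all(v < threshold for v in variance_inflation_factors(columns))

# Toy predictors (made up): x3 is almost a copy of x1, so the trio is collinear.
x1 = [1, 2, 3, 4, 5, 6, 7, 8]
x2 = [1, -1, 1, -1, 1, -1, 1, -1]
x3 = [1.1, 1.9, 3.2, 3.8, 5.1, 5.9, 7.2, 7.8]
print(passes_vif([x1, x2]), passes_vif([x1, x2, x3]))
```

To screen every combination, one could loop over `itertools.combinations` of the candidate columns and keep those for which `passes_vif` returns true before fitting any model.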

Keywords: grade prediction, Random forest, Saline water, Support vector machine

Abstract View Paper Case Study Original: Persian
Rainfall-runoff modeling of the Khormazard and Bonab hydrometric stations using support vector machine and random forest algorithms

Zeinab Bigdeli, Abolfazl Majnooni-Heris*, Reza Delearhasannia, Sepideh Karimi

Journal of Water and Soil, Year 37, No. 6 (Serial No. 92, Bahman and Esfand 1402), pp. 971-989

Simulating the rainfall-runoff process can play an important role in water resources management and hydrology. In this study, the support vector machine (SVM) and random forest (RF) data mining models were used to model rainfall-runoff at the Bonab and Khormazard stations, located on the Sufichai and Mahpari chai rivers, respectively (Maragheh plain). Data from the region's meteorological and hydrometric stations for 1355 to 1397 (Iranian calendar) were obtained from the Regional Water Company and the Meteorological Organization of East Azerbaijan Province. A change in the runoff trend in 1374 led to the study period being divided into two periods, before and after that year. Rainfall and runoff with a one-month time lag were fed to the models as inputs, and observed monthly runoff was then compared with estimated monthly runoff using error evaluation criteria. In both periods, SVM was more efficient than RF at the Bonab station, whereas RF outperformed SVM at the Khormazard station. On the test set, the cross-correlation for the first and second study periods was 0.85 and 0.84 at the Bonab station and 0.79 and 0.75 at the Khormazard station. According to the Mann-Kendall statistics and the time series for both stations, rainfall showed no clear trend over the period, but the discharge of the Sufichai River at the Bonab station rose, especially after 1374, while the discharge of the Mahpari chai River fell steadily.

Keywords: rainfall-runoff, random forest, Maragheh plain, Sufichai, support vector machine, modeling


Rainfall-Runoff Modeling of Khormazard and Bonab Hydrometric Stations Using Support Vector Machine and Random Forest Algorithms

Z. Bigdeli, A. Majnooni-Heris *, R. Delearhasannia, S. Karimi

Journal of water and soil, Volume:37 Issue: 6, 2024, PP 971 -989

Introduction

Water plays a crucial role in ensuring the sustainable development of any region. Given that our country consists primarily of arid and semi-arid regions, where the majority of rivers are also found, and given the critical state of groundwater extraction and the growing importance of surface water, it is crucial to understand the future condition of water resources within the country's watersheds (Fathollahi et al., 2015). Intelligent models make it feasible to represent inherent relationships in data that conventional mathematical methods cannot resolve. Support vector machine (SVM) and random forest (RF) are two machine learning methods used for repeated and accurate predictions (Kisi & Parmarm, 2016). A recent study by Zarei et al. (2022) evaluated flood risk using the SVM and RF data mining models (case study: Frizi watershed). Its results showed that both the SVM algorithm and the random forest algorithm predicted flood risk with high accuracy, in terms of both the training data and algorithm performance. The purpose of this study is to simulate the rainfall-runoff process at the hydrometric stations at the end of the Maragheh plain (Khormazard station on the Mahpari chai river and Bonab station on the Sufichai river) in East Azerbaijan province using the support vector machine and random forest modeling algorithms. The study covers a period of 43 years, making it one of the few such studies in this area.

Materials and Methods

The Maragheh Sufichai basin is situated east of Lake Urmia, within East Azerbaijan province. It covers an area of 611.89 square kilometers and lies between longitudes 45° 40´ and 46° 25´ east and latitudes 37° 15´ and 37° 55´ north. The average elevation of the basin is 1767 meters above sea level (Sharmod et al., 2015). Because the runoff trend in the data changed substantially after 1994 (with no noticeable change in the precipitation trend), the available data were divided into two periods: 1976 to 1994 and 1995 to 2019. To simulate rainfall-runoff, the average rainfall of the Maragheh plain was first calculated by the Thiessen polygon method. These data were then combined with the discharge from the Bonab and Khormazard stations with a one-month time lag and used as inputs to two models, SVM (with a kernel function) and RF. 70% of the data was used for training and 30% for validation. Rainfall and the previous month's runoff from the training set were the predictor variables, and runoff was the target variable. Several combinations of runoff and rainfall inputs were evaluated for modeling. The inputs are the monthly P and Q values (Pt, Qt-1), and the output is the current runoff (Qt), where the subscript t indicates the time step. Two input combinations were constructed from the Q and P data (Table 3), and the SVM and RF models were run on them to determine the optimal input combination.
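The input structure described above, with current rainfall and lagged runoff (Pt, Qt-1) predicting current runoff Qt, can be sketched as follows. The function names and the six sample values are made up for illustration, and the chronological 70/30 split is an assumption: the text gives the proportions but not how the records were assigned.

```python
def build_lagged_samples(precip, runoff):
    """Pair inputs (P_t, Q_{t-1}) with target Q_t for every step t >= 1."""
    if len(precip) != len(runoff):
        raise ValueError("precip and runoff must have the same length")
    inputs = [(precip[t], runoff[t - 1]) for t in range(1, len(runoff))]
    targets = [runoff[t] for t in range(1, len(runoff))]
    return inputs, targets

def chronological_split(inputs, targets, train_fraction=0.7):
    """Keep the earliest records for training and the latest for validation (assumption)."""
    n_train = int(len(inputs) * train_fraction)
    return (inputs[:n_train], targets[:n_train]), (inputs[n_train:], targets[n_train:])

# Made-up monthly rainfall (mm) and runoff (m3/s), for illustration only.
precip = [30.0, 12.0, 0.0, 45.0, 20.0, 5.0]
runoff = [2.1, 1.8, 1.2, 2.9, 2.4, 1.5]

X, y = build_lagged_samples(precip, runoff)
(train_X, train_y), (valid_X, valid_y) = chronological_split(X, y)
print(X[0], y[0])
```

The first time step is dropped because it has no preceding runoff value; the resulting `X` and `y` could then be passed to any SVM or random forest regressor.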
Calculating average rainfall through the Thiessen polygons method
Thiessen polygons, which are Voronoi cells, define the surface area (Ai) associated with each rain gauge. These areas are used to weight the rainfall measured by each rain gauge (ri). Consequently, the area-weighted rainfall is:
$$\bar{r} = \frac{\sum_{i} A_i \, r_i}{\sum_{i} A_i} \qquad (1)$$
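The area-weighted average described above can be sketched in a few lines. The function name `thiessen_average` and the three gauge values are illustrative, not data from the study.

```python
def thiessen_average(areas, rainfalls):
    """Area-weighted mean rainfall: sum(A_i * r_i) / sum(A_i)."""
    if len(areas) != len(rainfalls):
        raise ValueError("each polygon area needs exactly one gauge reading")
    total_area = sum(areas)
    if total_area <= 0:
        raise ValueError("total polygon area must be positive")
    return sum(a * r for a, r in zip(areas, rainfalls)) / total_area

# Made-up polygon areas (km2) and gauge readings (mm).
areas_km2 = [100.0, 300.0, 200.0]
rain_mm = [40.0, 20.0, 10.0]
mean_rain = thiessen_average(areas_km2, rain_mm)
print(mean_rain)
```

A gauge with a large polygon dominates the result, which is why the largest area here pulls the mean toward its 20 mm reading.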
Random Forest Algorithm
Random forest is a modern tree-based method that comprises a multitude of classification and regression trees. It is one of the most widely used machine learning algorithms because of its simplicity and its usability for both classification and regression tasks.
Support Vector Machine (SVM) algorithm
Like other artificial intelligence methods, the support vector machine is based on a data mining algorithm. Its most important functions are classification and regression of data.
Evaluation Criteria
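To evaluate the models and compare their effectiveness, this research uses the root mean square error (RMSE), correlation coefficient (r), coefficient of determination (R2), and Nash-Sutcliffe efficiency coefficient (NS). In their standard forms, with $O_i$ and $P_i$ the observed and predicted values, $\bar{O}$ and $\bar{P}$ their means, and $n$ the number of data points, these criteria are:
$$\mathrm{RMSE} = \sqrt{\frac{1}{n}\sum_{i=1}^{n}\left(O_i - P_i\right)^2} \qquad (2)$$
$$r = \frac{\sum_{i=1}^{n}\left(O_i - \bar{O}\right)\left(P_i - \bar{P}\right)}{\sqrt{\sum_{i=1}^{n}\left(O_i - \bar{O}\right)^2 \sum_{i=1}^{n}\left(P_i - \bar{P}\right)^2}} \qquad (3)$$
$$R^2 = \left[\frac{\sum_{i=1}^{n}\left(O_i - \bar{O}\right)\left(P_i - \bar{P}\right)}{\sqrt{\sum_{i=1}^{n}\left(O_i - \bar{O}\right)^2 \sum_{i=1}^{n}\left(P_i - \bar{P}\right)^2}}\right]^2 \qquad (4)$$
$$\mathrm{NS} = 1 - \frac{\sum_{i=1}^{n}\left(O_i - P_i\right)^2}{\sum_{i=1}^{n}\left(O_i - \bar{O}\right)^2} \qquad (5)$$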
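The evaluation criteria named above can be computed as in the following sketch, which uses the standard textbook definitions. It assumes R2 is taken as the square of the correlation coefficient; the source does not show which form the authors used, and the sample values are made up.

```python
import math

def rmse(obs, pred):
    """Root mean square error between observed and predicted values."""
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs))

def pearson_r(obs, pred):
    """Pearson correlation coefficient between observed and predicted values."""
    mo, mp = sum(obs) / len(obs), sum(pred) / len(pred)
    cov = sum((o - mo) * (p - mp) for o, p in zip(obs, pred))
    var_o = sum((o - mo) ** 2 for o in obs)
    var_p = sum((p - mp) ** 2 for p in pred)
    return cov / math.sqrt(var_o * var_p)

def r_squared(obs, pred):
    """Coefficient of determination, taken here as r squared (assumption)."""
    return pearson_r(obs, pred) ** 2

def nash_sutcliffe(obs, pred):
    """Nash-Sutcliffe efficiency: 1 minus error variance over observed variance."""
    mo = sum(obs) / len(obs)
    num = sum((o - p) ** 2 for o, p in zip(obs, pred))
    den = sum((o - mo) ** 2 for o in obs)
    return 1.0 - num / den

# Made-up observed and predicted runoff values.
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.1, 1.9, 3.2, 3.8]
print(rmse(observed, predicted), pearson_r(observed, predicted),
      r_squared(observed, predicted), nash_sutcliffe(observed, predicted))
```

Unlike r, the Nash-Sutcliffe coefficient penalizes systematic bias, so a model can have r close to 1 and still score poorly on NS.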

Results and Discussion

Figure 6 displays the rainfall and runoff time series for the two study periods, before and after 1994. For the Bonab station, the value of Kendall's statistic for precipitation was 0.044 and 0.028 in the first and second periods, respectively; for the Khormazard station, it was 0.030 and 0.028. These values are not significant at the 95% level, so the annual rainfall at the two stations showed no significant trend between 1976 and 2019. The variations observed during this period were deemed normal: the rainfall time series fluctuated, with both increases and decreases in certain years. The discharge time series reveal different behavior. The outflow at the Bonab station (a and b) initially fluctuated and then went through periods of both decrease and increase, but in recent years it has increased. The Mann-Kendall test statistic for this station was 0.325 and 0.512 for the two study periods, respectively. These values are significant at the 95% level, indicating that the increasing discharge trend in both periods was statistically significant. The reason for this trend at the Bonab station, compared with other stations at the entrances to Lake Urmia, is the lower agricultural and industrial water demand in the Sufichai basin than in other river basins. To explore the root cause, studies should examine both groundwater and surface water sources, as well as water use in the agricultural and industrial sectors of this region.
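The Mann-Kendall trend test used above can be sketched as follows. This minimal version computes the S statistic, Kendall's tau, and the normal-approximation Z score without a correction for tied values, which is an assumption; the text does not say which form of the statistic the reported values represent, and the sample series is made up.

```python
import math

def mann_kendall(series):
    """Return (S, tau, Z) for a time series; no tie correction (assumption)."""
    n = len(series)
    s = 0
    for i in range(n - 1):
        for j in range(i + 1, n):
            diff = series[j] - series[i]
            s += (diff > 0) - (diff < 0)        # sign of each later-minus-earlier pair
    tau = s / (n * (n - 1) / 2)                 # share of concordant minus discordant pairs
    var_s = n * (n - 1) * (2 * n + 5) / 18      # variance of S when there are no ties
    if s > 0:
        z = (s - 1) / math.sqrt(var_s)
    elif s < 0:
        z = (s + 1) / math.sqrt(var_s)
    else:
        z = 0.0
    return s, tau, z

# Made-up annual discharge values with a steady rise.
discharge = [3.1, 3.4, 3.9, 4.2, 4.8, 5.0, 5.5, 6.1, 6.4, 7.0]
s, tau, z = mann_kendall(discharge)
significant = abs(z) > 1.96                     # two-sided test at the 95% level
print(s, tau, round(z, 3), significant)
```

A positive tau with |Z| above 1.96 corresponds to a significant increasing trend at the 95% level, and a negative tau to a decreasing one.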
On the contrary, the trend observed at Khormazard station (c and d) is different. Unlike Bonab station, the discharge from Khormazard station exhibited a complete downward trend. The Mann-Kendall test statistic for the discharge variable during our two research periods were -0.269 and -0.412, respectively. At the 95% level, the decreasing trend of discharge in this station was found to be significant. On the other hand, it is apparent that the volume of discharge in this hydrometric station has decreased drastically since 1976 (d). Apart from 2007, when there was a sudden increase in discharge volume, the water inflow into lake Urmia has remained at its lowest level throughout the years. To analyze the Bonab and Khormazard stations during two distinct periods, rainfall and runoff statistics (average, minimum, maximum) for the first period (1976-1994) and the second period (1995-2019) are presented in Tables 4 and 5. Based on the data presented in both tables, the Bonab station displays the highest average rainfall and runoff values in the total data column, while the Khormazard station has the lowest average rainfall and runoff values.
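The trend analysis above can be sketched in a few lines. This is a minimal illustration, assuming the reported statistic is Kendall's tau and that significance is judged with the normal approximation (|Z| > 1.96 at the 95% level); the function names are hypothetical, the variance omits the tie correction, and no station data are reproduced here.

```python
import math


def mann_kendall(series):
    """Return (S, tau, Z) for the Mann-Kendall trend test (no tie correction)."""
    n = len(series)
    s = 0
    for i in range(n - 1):
        for j in range(i + 1, n):
            diff = series[j] - series[i]
            s += (diff > 0) - (diff < 0)  # sign of each later-minus-earlier pair
    tau = s / (n * (n - 1) / 2)  # Kendall's tau in [-1, 1]
    var_s = n * (n - 1) * (2 * n + 5) / 18  # variance of S when there are no ties
    if s > 0:
        z = (s - 1) / math.sqrt(var_s)  # continuity-corrected Z score
    elif s < 0:
        z = (s + 1) / math.sqrt(var_s)
    else:
        z = 0.0
    return s, tau, z


def is_significant(z, critical=1.96):
    """Two-sided test at the 95% level (assumed threshold)."""
    return abs(z) > critical
```

A positive tau with a significant Z corresponds to the increasing discharge reported for Bonab, and a negative tau with a significant Z to the decline at Khormazard.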
As mentioned, in order to model rainfall-runoff data using SVM and RF models, a portion of the data was used for training purposes, while another portion was used for validation. Tables 5 and 6 present the values of the calculated statistical indicators associated with the results obtained from the training and validation sections for both SVM and RF models. According to the results of Tables 6 and 7, it is clear that in both study periods, the SVM model outperformed the RF model at the Bonab station. The SVM model demonstrated superior accuracy in simulating both flow rate and monthly rainfall. Conversely, at the Kharmazard station during these periods, the RF model displayed better performance compared to the SVM model. The modeling results in the test set for both stations revealed that the mutual correlation values for the first and second study periods at the Bonab station were 0.85 and 0.84, respectively. For the Kharmazard station, these values were 0.79 and 0.75, respectively.

Conclusion

The results indicate that at Bonab station the SVM model was more efficient than the RF model in both periods, whereas at Khormazard station the RF model outperformed the SVM model in both periods. For the SVM model at Bonab station, the mutual correlation values on the test set were 0.85 and 0.84 for the first and second study periods, respectively; for the RF model at Khormazard station, they were 0.79 and 0.75. Other notable findings concern the rainfall and runoff time series over 43 years. The graphs for both stations, along with the Mann-Kendall statistic for precipitation and flow, revealed no discernible trend in precipitation during the two study periods; instead, precipitation in these areas fluctuated. However, the time series and statistical values for the discharge of the Sofichai and Mahpari chai rivers at the Bonab and Khormazard stations showed different results. At Bonab station, the discharge fluctuated, with an increase observed in the second period. Conversely, at Khormazard station, the discharge trend was downward in both study periods. The outflow volume of the Mahpari chai River has decreased notably in recent years, as evidenced by the decreasing trend in the Mann-Kendall statistic.

Keywords: Maragheh Plain, Modeling, Rainfall-Runoff, Random forest, Sufi Chai, Support vector machine

Research/Original Article. Original language: Persian
ارزیابی مدل های یادگیری ماشین در GIS جهت پیش بینی آب زیرزمینی مناطق نیمه خشک شرق ایران

مبین افتخاری*، علی حاجی الیاسی، سید احمد اسلامی نژاد

مجله آبخوان و قنات، سال چهارم شماره 2 (پیاپی 7، پاییز و زمستان 1402)، صص 49 -66

پیش بینی پتانسیل آب های زیرزمینی جهت توسعه و برنامه ریزی سیستماتیک منابع آب بسیار حیاتی است. هدف اصلی این تحقیق، توسعه مدل های یادگیری ماشینی از جمله جنگل تصادفی (RF)، درخت تصمیم (DT) و ماشین بردار پشتیبان (SVM) برای پیش بینی مناطق پتانسیلی آب زیرزمینی در دشت بیرجند است. بنابراین، برای اجرای این مطالعه، داده های ژئوهیدرولوژیکی مربوط به 37 چاه آب زیرزمینی (شامل تعداد و موقعیت چاه ها و سطح آب زیرزمینی) و 17 معیار هیدرولوژی، توپوگرافی، زمین شناسی و محیطی مورد استفاده قرار گرفت. روش انتخاب ویژگی از طریق کمترین مربعات ماشین بردار پشتیبان جهت تعیین معیارهای موثر برای بهبود عملکرد الگوریتم های یادگیری ماشین به کار گرفته شد. در نهایت، نقشه های پیش بینی پتانسیل آب زیرزمینی با استفاده از مدل های DT، RF و SVM تهیه شدند و عملکرد این مدل ها با استفاده از سطح زیر منحنی (AUC) و سایر شاخص های آماری مورد ارزیابی قرار گرفت. نتایج نشان داد که مدل DT (AUC=0.89) توانایی پیش بینی بسیار بالایی برای پتانسیل آب زیرزمینی در منطقه مورد مطالعه دارد و معیار ارتفاع به عنوان مهم ترین عامل در پیش بینی پتانسیل آب زیرزمینی در این منطقه شناخته شد. نتایج این مطالعه می تواند به عنوان راهنمایی برای تصمیم گیری و برنامه ریزی مناسب در استفاده بهینه از منابع آب زیرزمینی مورد استفاده قرار گیرد.

کلید واژگان: دشت بیرجند, نقشه های پیش بینی, جنگل تصادفی, درخت تصمیم, ماشین بردار پشتیبان

Research/Original Article. Language: Persian

Assessment of machine learning models in GIS for predicting groundwater in semi-arid regions of eastern Iran

Mobin Eftekhari *, Ali Haji Elyasi, Seyed Ahmad Eslaminezhad

Journal of Aquifer and Qanat, Volume:4 Issue: 2, 2024, PP 49 -66

Predicting groundwater potential is crucial for the systematic development and planning of water resources. The main objective of this study is to develop machine learning models, including Random Forest (RF), Decision Tree (DT), and Support Vector Machine (SVM), for predicting groundwater potential zones in the Birjand plain. To this end, geohydrological data from 37 groundwater wells (including the number and location of wells and groundwater levels) and 17 hydrological, topographical, geological, and environmental criteria were used. Feature selection with the least squares support vector machine (LS-SVM) was applied to determine the effective criteria and improve the performance of the machine learning algorithms. Finally, groundwater potential prediction maps were prepared using the DT, RF, and SVM models, and model performance was evaluated using the area under the curve (AUC) and other statistical indicators. The results showed that the DT model (AUC=0.89) has very high predictive capability for groundwater potential in the study area, and elevation was identified as the most important factor in predicting groundwater potential in this area. The findings of this study can serve as a guide for decision-making and appropriate planning in the optimal use of groundwater resources.
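To make the AUC criterion concrete, the following is a minimal sketch that computes the area under the ROC curve as the share of (positive, negative) pairs ranked correctly, with ties counted as one half. The function name and the toy labels and scores are assumptions for illustration only; they are not the study's well data or its software.

```python
def auc_score(labels, scores):
    """AUC as the probability that a positive sample outranks a negative one."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = 0.0
    for p in positives:
        for q in negatives:
            if p > q:
                wins += 1.0  # positive correctly ranked above negative
            elif p == q:
                wins += 0.5  # ties count as half a win
    return wins / (len(positives) * len(negatives))


# Hypothetical example: 1 = well with groundwater potential, 0 = without.
toy_labels = [1, 1, 0, 0]
toy_scores = [0.9, 0.4, 0.6, 0.1]
toy_auc = auc_score(toy_labels, toy_scores)
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which is the scale on which the reported 0.89 should be read.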

Keywords: Birjand Plain, Predictive Maps, Random Forest, Decision Tree, Support Vector Machine

Research/Original Article. Original language: Persian
مقایسه عملکرد مدل های داده کاوی در پیش بینی بارش باران با استفاده از رویکرد دسته بندی (مطالعه موردی: ایستگاه هواشناسی سینوپتیک فرودگاه همدان)

مرتضی صالحی سربیژن*، حمیدرضا دزفولیان

فصلنامه حفاظت منابع آب و خاک، سال سیزدهم شماره 4 (پیاپی 52، تابستان 1403)، صص 113 -126

زمینه و هدف

بارندگی یکی از پدیده های پیچیده طبیعی و از مهم ترین اجزای چرخه آب بوده و در سنجش خصوصیات اقلیمی هر منطقه نقش بسیار مهمی ایفا می کند. شناخت میزان و روند تغییرات بارش به عنوان یکی از عناصر مهم هواشناسی، از یک سو جهت داشتن مدیریت اثربخش و برنامه ریزی دقیق تر برای بخش های کشاورزی، اقتصادی و اجتماعی و از سوی دیگر برای مطالعاتی مانند رواناب ها، خشک سالی ها، وضعیت آب های زیرزمینی و سیلاب ها ضروری است. همچنین پیش بینی بارش در مناطق شهری تاثیر بسیار زیادی بر کنترل ترافیک، جریان فاضلاب ها و فعالیت های ساخت وساز دارد.

روش پژوهش:

هدف این مطالعه مقایسه دقت مدل های کلاس بندی درخت تصمیم (چاید (CHAID)، درخت تصمیم C5، نیو بیزین (NB)، کوئست (Quest) و جنگل تصادفی)، k نزدیک ترین همسایگی (KNN)، ماشین بردار پشتیبان (SVM) و شبکه عصبی مصنوعی (ANN) جهت پیش بینی وقوع بارش باران با استفاده از داده های یک دوره 50 ساله در ایستگاه سینوپتیک فرودگاه همدان است. در این مطالعه از 80 درصد داده ها جهت آموزش و از 20 درصد داده ها جهت صحت سنجی مدل ها استفاده شده و نتایج حاصل از اجرای مدل ها با استفاده از معیارهای ماتریس درهم ریختگی (اغتشاش)، منحنی ROC و شاخص AUC مقایسه شدند. برای ساخت متغیر کلاس بندی داده های بارش و عدم بارش، با توجه به داده های بارش، روزهای سال در دو کلاس روزهای وقوع بارش (y) و روزهای عدم وقوع بارش (n) دسته بندی شدند. در این تحقیق پیش پردازش داده ها با استفاده از پیش پردازش خودکار داده ها (ADP) انجام شده و آنگاه کاهش ابعاد متغیرها از روش PCA استفاده شد.

یافته ها

در این مطالعه با توجه به روش PCA ابعاد متغیرها به 5 بعد کاهش یافت. همچنین از داده های موجود تقریبا 80 درصد، روزها بدون بارش و 20 درصد روزها با بارش هستند. نتایج تحقیق نشان داد که مدل KNN با معیار صحت 9/91 برای داده های آموزشی و مدل SVM، 13/89 درصد برای داده های آزمون بهترین عملکرد را بین مدل های داده کاوی داشتند. شاخص AUC مدل KNN برابر 97/0 در داده های آموزشی و در داده های آزمون مقدار 94/0 برای الگوریتم SVM به دست آمد. همچنین با توجه به منحنی عملکرد سیستم (ROC) برای داده های بارش همدان مدل KNN نسبت به سایر مدل ها عملکرد بهتری را دارا می باشد. توجه به شاخص حساسیت در ماتریس اغتشاش، مدل های KNN و SVM در پیش بینی عدم وقوع بارش برای داده های آموزش بهتر عمل کردند. با توجه به شاخص خاصیت در پیش بینی وقوع بارش مدل های RT و KNN نتایج بهتری داشتند.

نتایج

نتایج تحقیق نشان داد که در داده های آموزش مقدار معیار صحت برای مدل های RT، C5، ANN، SVM، BN،KNN ، CHAID و QUEST به ترتیب 82/86، 78/89، 55/89، 96/89، 06/88، 9/91، 29/88 و 46/87 بدست آمده اند. همچنین این معیار در داده های آزمون برای این مدل ها به ترتیب 2/83، 9/87، 12/88، 13/89، 12/87، 19/88، 93/86 و 76/86 به دست آمد. مقدار شاخص AUC در داده های آموزش برای مدل های RT، C5، ANN، SVM، BN،KNN ، CHAID و QUEST به ترتیب 94/0، 92/0، 94/0، 94/0، 93/0، 97/0، 93/0 و 89/0 به دست آمد. همچنین این معیار در داده های آزمون برای این مدل ها به ترتیب 89/0، 89/0، 93/0، 94/0، 92/0، 90/0، 92/0 و 88/0 برآورد شد. همان طور که مشاهده شد، با توجه به معیارهای صحت و شاخص AUC در داده های آموزش مدل KNN و با توجه به داده های آزمون مدل SVM کارا تر در پیش بینی بارش باران بودند.

کلید واژگان: شبکه عصبی مصنوعی, مدل K نزدیک ترین همسایگی, ماشین بردار پشتیبان, پیش بینی بارش باران, مدل های درخت تصمیم

Case Study. Language: Persian

Comparison of Data Mining Models Performance in Rainfall Prediction Using Classification Approach (Case Study: Hamedan Airport Synoptic Weather Station)

Morteza Salehi Sarbijan *, Hamidreza Dezfoulian

Journal of Water and Soil Resources Conservation, Volume:13 Issue: 4, 2024, PP 113 -126

Background and Aim

Rainfall is a complex natural phenomenon and one of the most crucial components of the water cycle, playing a significant role in assessing the climatic characteristics of each region. Understanding the amount and trends of rainfall changes is essential for effective management and more precise planning in agricultural, economic, and social sectors, as well as for studies related to runoff, droughts, groundwater status, and floods. Additionally, rainfall prediction in urban areas has a significant impact on traffic control, sewage flow, and construction activities.

Method

The objective of this study is to compare the accuracy of classification models in predicting rainfall occurrence, using 50 years of data from the synoptic station at Hamedan Airport. The models compared are Chi-squared Automatic Interaction Detector (CHAID), C5 decision tree, Naive Bayes (NB), QUEST tree, Random Forest, k-Nearest Neighbors (KNN), Support Vector Machine (SVM), and Artificial Neural Network (ANN). In this study, 80% of the data is used for training the models and 20% for validation, and the model results are compared using the confusion matrix, the Receiver Operating Characteristic (ROC) curve, and the Area Under the Curve (AUC) index. To create the classification variable, the days of the year are categorized, based on the rainfall data, into two classes: days with rainfall (y) and days without rainfall (n). Data preprocessing is performed using Automatic Data Preprocessing (ADP). Then, Principal Component Analysis (PCA) is employed to reduce the dimensions of the variables.

Results

In this study, PCA reduced the variables to 5 dimensions. Approximately 80% of the available data correspond to rainless days and 20% to rainy days. The results indicated that the KNN model, with an accuracy of 91.9% on the training data, and the SVM model, with 89.13% on the test data, performed best among the data mining models. The AUC index was 0.967 for the KNN model on the training data and 0.935 for the SVM algorithm on the test data. According to the ROC curve for the Hamedan rainfall data, the KNN model outperforms the other models. Based on the sensitivity index of the confusion matrix, the KNN and SVM models were better at predicting non-rainfall occurrence in the training data; based on the specificity index, the RT and KNN models were better at predicting rainfall occurrence.
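Because roughly 80% of the days are rainless, accuracy alone can be misleading, which is why sensitivity and specificity are reported alongside it. The sketch below derives all three from the confusion matrix; the function name, the choice of "y" (rainy day) as the positive class, and the ten-day toy sequence are assumptions for illustration, not the study's data or its class convention.

```python
def confusion_metrics(y_true, y_pred, positive="y"):
    """Accuracy, sensitivity and specificity from two label sequences."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive and pred == positive:
            tp += 1  # rainy day predicted as rainy
        elif truth == positive:
            fn += 1  # rainy day missed
        elif pred == positive:
            fp += 1  # dry day predicted as rainy
        else:
            tn += 1  # dry day predicted as dry
    return {
        "accuracy": (tp + tn) / len(y_true),
        "sensitivity": tp / (tp + fn),  # needs at least one positive sample
        "specificity": tn / (tn + fp),  # needs at least one negative sample
    }


# Hypothetical 10-day record with the same 80/20 class balance as the study.
truth = ["y", "n", "n", "n", "y", "n", "n", "n", "n", "n"]
model = ["y", "n", "n", "n", "n", "n", "y", "n", "n", "n"]
always_dry = ["n"] * 10

model_metrics = confusion_metrics(truth, model)
baseline_metrics = confusion_metrics(truth, always_dry)
```

In this toy case a classifier that always predicts "n" matches the model's accuracy but has zero sensitivity, which shows why the class-wise indices matter on imbalanced rainfall data.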

Conclusion

The results showed that, for the training data, the accuracy of the RT, C5, ANN, SVM, BN, KNN, CHAID, and QUEST models was 86.82%, 89.78%, 89.55%, 89.96%, 88.06%, 91.9%, 88.29%, and 87.46%, respectively. For the test data, the accuracy of these models was 83.2%, 87.9%, 88.12%, 89.13%, 87.12%, 88.19%, 86.93%, and 86.76%, respectively. The AUC index in the training data for the RT, C5, ANN, SVM, BN, KNN, CHAID, and QUEST models was 0.94, 0.92, 0.94, 0.94, 0.93, 0.97, 0.93, and 0.89, respectively, and in the test data it was 0.89, 0.89, 0.93, 0.94, 0.92, 0.90, 0.92, and 0.88. Thus, judged by accuracy and AUC, the KNN model was the most effective for the training data and the SVM model for the test data in rainfall prediction.

Keywords: Decision tree models, K-nearest neighbors (KNN) model, Rainfall prediction, Artificial Neural Network, Support vector machine (SVM)

Case Study. Original language: Persian
ارزیابی روش های پیش بینی شاخص ترکیبی خشکسالی کشاورزی (CDI) براساس تصاویر ماهواره ای با روشهای یادگیری عمیق و یادگیری ماشین

نازیلا شاملو، محمدتقی ستاری*، خلیل ولیزاده کامران، حالیت آپ آیدین

نشریه آب و خاک، سال سی و هفتم شماره 5 (پیاپی 91، آذر و دی 1402)، صص 787 -807

باتوجه به بحران خشکیدگی دریاچه ارومیه، مطالعه وضعیت پوشش گیاهی و خشکسالی کشاورزی محدوده حوضه آبریز دریاچه ارومیه که یکی از شش حوضه اصلی ایران محسوب می شود، از اهمیت قابل توجهی برخوردار است. در این مطالعه ابتدا یک شاخص ترکیبی خشکسالی CDI (Combined Drought Index) مبتنی بر شاخص های وضعیت پوشش گیاهی (VCI)، وضعیت دمایی گیاهی (TCI) و شاخص تنش آبی محصول (CWSI) با استفاده از داده های سنجنده MODIS قرار گرفته در ماهواره TERRA معرفی و محاسبه گردید. سپس با روش های درخت تصمیم-طبقه بندی و درخت رگرسیون (DT-CART)، ماشین بردار پشتیان (SVM) و حافظه کوتاه مدت، بلند مدت (LSTM) و حافظه کوتاه مدت دو جهته (BiLSTM)، شاخص ترکیبی خشکسالی (CDI) معرفی و تخمین زده شد. در فرآیند مدل سازی شاخص ترکیبی خشکسالی، محصولات شاخص های پوشش گیاهی، تبخیر- تعرق، تبخیر-تعرق پتانسیل، دمای سطح زمین در روز و دمای سطح زمین در شب برگرفته از سنجنده MODIS به عنوان ورودی مدل ها استفاده شد. درنهایت بررسی عملکرد مدل ها براساس ترکیب های متفاوتی از ورودی مدل ها بااستفاده از معیارهای ارزیابی شامل ضریب همبستگی، جذر میانگین مربعات خطا و ضریب ناش ساتکلیف و همچنین به کمک نمودارهای کلوروگرام، تیلور و ویلونی بصورت بصری انجام شد. نتایج نشان داد که متغیر های دمای سطح زمین در روز، دمای سطح زمین در شب و تبخیر-تعرق موثرترین متغیرها برای مدل سازی شاخص ترکیبی خشکسالی (CDI) و مطالعه خشکسالی کشاورزی می باشند. همچنین مدل CART با ضریب همبستگی 96/0، میانگین جذر مربعات خطا برابر با 029/0 و ضریب ناش ساتکلیف 92/0 به عنوان بهترین مدل انتخاب گردید. نتایج بدست آمده نشان داد که روش های یادگیری ماشین و یادگیری عمیق ابزاری توانمند در مدل سازی و پیش بینی شاخص ترکیبی خشکسالی (CDI) بوده و در بررسی و ارزیابی خشکسالی کشاورزی به خصوص در حوضه های فاقد آمار با اطمینان کافی می تواند مورد استفاده قرار گیرد.

کلید واژگان: حافظه کوتاه مدت بلند مدت, درخت تصمیم, سنجش از دور, شاخص خشکسالی, ماشین بردار پشتیبان

Research/Original Article. Language: Persian

Evaluation of Combined Agricultural Drought Index (CDI) Prediction Methods Based on Satellite Images via Deep Learning and Machine Learning Approaches

Nazila Shamloo, Mohammad Taghi Sattari *, Khalil Valizadeh Kamran, Halit Apaydin

Journal of Water and Soil, Volume:37 Issue: 5, 2023, PP 787 -807

Introduction

Drought is one of the greatest challenges of our time due to the dangers it poses to the world. In arid and semi-arid regions, it is necessary to continuously monitor agricultural systems that face water shortages and frequent droughts. Large-scale information about agricultural systems and land use is therefore needed for management and decision-making that sustain food security. Continuous drought monitoring requires a large amount of information to be processed with great speed and accuracy. Because drought is complex and affected by many factors, methods that combine several factors into a comprehensive drought index have received much attention in recent years. Machine learning and deep learning methods can provide a more accurate and efficient tool to predict droughts and can be used in drought risk management. The review of sources shows that until now no studies have been conducted on drought monitoring using a deep learning approach and satellite images in the catchment area of Lake Urmia in Iran. A large part of its economic activities is dedicated to agriculture. The increase in temperature, the increase in evapotranspiration, and the excessive use of water resources for agriculture have caused an upward trend in the frequency of droughts in this basin during consecutive years, one of the harmful effects of which is a significant decrease in the lake level. Therefore, for drought management in this basin, it is very important to identify drought behavior and to determine appropriate and reliable indicators for measuring and predicting the effects of droughts.
According to the investigations, it was observed that most of the studies in the field of drought in this basin have been carried out from the meteorological point of view, or by individual plant indicators, so in this study, using the approach of principal component analysis, we tried to provide a composite drought index for drought modeling and forecasting.

Materials and Methods

In this research, satellite images together with deep learning and machine learning methods were used to predict the Combined Drought Index. Satellite images were first obtained for the study area and the data were pre-processed. All data were then resampled to a spatial resolution of 500 meters. The VCI was calculated from NDVI data, the TCI from the land surface temperature product, and the CWSI from the MODIS evapotranspiration product; the CDI was then calculated from these indices using principal component analysis. Next, the correlation between the CDI and other meteorological variables, including evapotranspiration, potential evapotranspiration, and land surface temperature during the day and at night, was calculated. Finally, the CDI was modeled using deep learning and machine learning methods.
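The abstract does not spell out the component indices, so the sketch below uses the commonly cited definitions as an assumption: VCI and TCI as min-max scalings of NDVI and land surface temperature over the record for a pixel, and CWSI as one minus the ratio of actual to potential evapotranspiration. The function names and sample values are hypothetical, and the PCA weighting that combines the indices into the CDI is not reproduced.

```python
def vci(ndvi, ndvi_min, ndvi_max):
    """Vegetation Condition Index in percent (higher = healthier vegetation)."""
    return 100.0 * (ndvi - ndvi_min) / (ndvi_max - ndvi_min)


def tci(lst, lst_min, lst_max):
    """Temperature Condition Index in percent (higher = cooler, less stress)."""
    return 100.0 * (lst_max - lst) / (lst_max - lst_min)


def cwsi(et, pet):
    """Crop Water Stress Index (0 = no stress, 1 = maximum stress)."""
    return 1.0 - et / pet


# Hypothetical pixel values: NDVI is unitless, LST in kelvin, ET and PET in mm.
sample_vci = vci(0.5, 0.25, 0.75)
sample_tci = tci(300.0, 290.0, 310.0)
sample_cwsi = cwsi(2.0, 8.0)
```

The minimum and maximum in each formula are assumed to be taken per pixel over the multi-year record, so each index expresses the current state relative to that pixel's own historical range.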

Results and Discussion

This study modeled the Combined Drought Index using different combinations of input variables with deep learning and machine learning methods. The results showed that the normalized difference vegetation index, land surface temperature during the day and at night, evapotranspiration, and potential evapotranspiration were the most influential parameters for modeling the CDI, and that all four methods were able to model the index with acceptable accuracy and error. The CART model, with a correlation coefficient of 0.96, an RMSE of 0.029, and a Nash-Sutcliffe coefficient of 0.92, was chosen as the best model among the methods.
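The three evaluation criteria named above can be written out directly. This is a minimal sketch with hypothetical function names and toy observed and simulated series; it is not the study's CDI data.

```python
import math


def rmse(observed, simulated):
    """Root mean square error between two equal-length sequences."""
    squared = [(o - s) ** 2 for o, s in zip(observed, simulated)]
    return math.sqrt(sum(squared) / len(squared))


def nash_sutcliffe(observed, simulated):
    """Nash-Sutcliffe efficiency: 1 is perfect, 0 is no better than the observed mean."""
    mean_obs = sum(observed) / len(observed)
    residual = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    spread = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - residual / spread


def pearson_r(observed, simulated):
    """Pearson correlation coefficient."""
    mean_obs = sum(observed) / len(observed)
    mean_sim = sum(simulated) / len(simulated)
    cov = sum((o - mean_obs) * (s - mean_sim) for o, s in zip(observed, simulated))
    var_obs = sum((o - mean_obs) ** 2 for o in observed)
    var_sim = sum((s - mean_sim) ** 2 for s in simulated)
    return cov / math.sqrt(var_obs * var_sim)


# Toy series: the simulation follows the trend but damps its amplitude.
obs = [1.0, 2.0, 3.0]
sim = [1.5, 2.0, 2.5]
```

In the toy series the correlation is perfect while the Nash-Sutcliffe efficiency is lower, which illustrates why the study reports both: correlation ignores bias and amplitude errors that the efficiency and RMSE penalize.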

Conclusion

In this research, different combinations of input variables extracted from satellite image products were evaluated in 6 independent scenarios to predict the Combined Drought Index. The evaluation criteria, namely the correlation coefficient, Nash-Sutcliffe coefficient, and root mean square error, showed that all four methods can estimate the combined drought index with acceptable accuracy and error. The CART method performed best (R=0.96 and RMSE=0.029) in predicting the time series of the Combined Drought Index. The SVM method also modeled the index with acceptable accuracy (R=0.94 and RMSE=0.034). However, contrary to expectations, the two deep learning methods modeled the combined drought index less accurately than the machine learning methods. In general, the results show that the method presented in this research can accurately predict the CDI time series, forecast drought in different periods of plant growth, and support regional drought management and policy, especially in ungauged basins.

Keywords: Agricultural drought, Combined Drought Index (CDI), Deep learning, Machine learning, Satellite Images

Research/Original Article. Original language: Persian
مقایسه کارایی هیدرولیکی سرریزهای غیر خطی قوسی در پلان با استفاده از شبکه های عصبی GEP و SVM

مهدی ماجدی اصل*، توحید امیدپور علویان، مهدی کوهدرق، وحید شمسی

نشریه علوم آب و خاک (علوم و فنون کشاورزی و منابع طبیعی)، سال بیست و هفتم شماره 3 (پیاپی 105، پاییز 1402)، صص 179 -199

سرریزهای غیرخطی ضمن دارا بودن مزیت های اقتصادی، قابلیت عبوردهی بیشتری را نسبت به سرریزهای خطی دارند. این سرریزها با افزایش طول تاج در یک عرض مشخص، در مقایسه با سرریزهای خطی راندمان دبی بیشتر با ارتفاع آزاد کمتر را در بالادست دارند. الگوریتم های هوشمند به دلیل توانایی زیاد در کشف رابطه های دقیق پیچیده مخفی بین پارامترهای مستقل موثر و پارامتر وابسته و همچنین صرفه جویی مالی و زمانی، جایگاه بسیار ارزشمندی بین پژوهشگران پیدا کرده اند. در این پژوهش عملکرد الگوریتم های ماشین بردار پشتیبان (SVM) و برنامه ریزی بیان ژن (GEP) در پیش بینی ضریب دبی سرریزهای غیرخطی قوسی به کمک 243 سری داده آزمایشگاهی برای سناریو اول و 247 سری داده آزمایشگاهی برای سناریو دوم بررسی شده است. پارامترهای هندسی و هیدرولیکی استفاده شده شامل بار آبی (HT/p)، ارتفاع سرریز (P)، نسبت بار آبی کل ، زاویه سیکل قوسی (Ɵ)، زاویه دیواره سیکل(α) و ضریب دبی (Cd) است. نتایج هوش مصنوعی نشان داد که ترکیب پارامترهای (H_T/p ،α ،Ɵ و Cd) به ترتیب در الگوریتم های GEP و SVM در مرحله آموزش مربوط به سناریو اول (سرریز کنگره ای با زاویه دیواره سیکل 6 درجه) به ترتیب برابر است با (0/9811=R2)، (RMSE=0/02120)، (DC=0/9807)، (R2=0/9896)، (RMSE=0/0189)، (DC=0/9871). (در سناریو دوم (سرریز کنگره ای با زاویه دیواره سیکل 12 درجه) به ترتیب برابراست با (0/9770=R2)،(RMSE=0/0193)، (DC=0/9768) و (9908/0=R2)، (RMSE=0/0128)، (DC=0/9905) که در مقایسه با دیگر ترکیب ها منجر به بهینه ترین خروجی شده است که نشان دهنده دقت بسیار مطلوب هر دو الگوریتم در پیش بینی ضریب دبی سرریز غیرخطی قوسی است. نتایج آنالیز حساسیت نشان داد که پارامتر موثر در تعیین ضریب دبی سرریز غیرخطی قوسی در GEP و هم در SVM پارامتر نسبت بار آبی کل (HT/p) است. مقایسه نتایج این پژوهش با سایر پژوهشگران نشان می دهد که شاخصه های ارزیابی برای الگوریتم های GEP و SVM پژوهش حاضر نسبت به سایر پژوهشگران برآورد بهتری دارند.

کلید واژگان: شبکه های عصبی, سرریز غیرخطی, ضریب دبی, ماشین بردار پشتیبان, برنامه ریزی بیان ژن

Research/Original Article. Language: Persian

Comparison of Hydraulic Efficiency of Arched Non-linear Weirs in Plan Using GEP and SVM Neural Networks

M. Majedi Asl*, T. Omidpour Alavian, M. Kouhdaragh, V. Shamsi

Journal of Hydrology and Soil Science, Volume:27 Issue: 3, 2023, PP 179 -199

Non-linear weirs, in addition to their economic advantages, have a greater flow-passing capacity than linear weirs. By increasing the crest length within a given width, these weirs achieve higher discharge efficiency with less free height upstream than linear weirs. Intelligent algorithms have found a valuable place among researchers because of their great ability to discover complex, hidden relationships between the effective independent parameters and the dependent parameter, and because they save money and time. In this research, the performance of the support vector machine (SVM) and gene expression programming (GEP) algorithms in predicting the discharge coefficient of arched non-linear weirs was investigated using 243 laboratory data series for the first scenario and 247 laboratory data series for the second scenario. The geometric and hydraulic parameters used include the water load (HT), weir height (P), total water load ratio (HT/p), arc cycle angle (Ɵ), cycle wall angle (α), and discharge coefficient (Cd). The results showed that, with the parameter combination (Cd, HT/p, α, Ɵ), the training-phase indices for the first scenario (labyrinth weir with a cycle wall angle of 6 degrees) were R2=0.9811, RMSE=0.02120, and DC=0.9807 for GEP and R2=0.9896, RMSE=0.0189, and DC=0.9871 for SVM. For the second scenario (labyrinth weir with a cycle wall angle of 12 degrees), they were R2=0.9770, RMSE=0.0193, and DC=0.9768 for GEP and R2=0.9908, RMSE=0.0128, and DC=0.9905 for SVM. Compared with other combinations, this combination gave the best output, which shows the very favorable accuracy of both algorithms in predicting the discharge coefficient of arched non-linear weirs. The sensitivity analysis indicated that the effective parameter in determining the discharge coefficient of the arched non-linear weir, in both GEP and SVM, is the total water load ratio (HT/p).
Comparing the results of this research with those of other researchers revealed that the evaluation indices for the GEP and SVM algorithms in this research gave better estimates.

Keywords: Neural networks, Non-linear weirs, Discharge coefficient, Support vector machine, Gene expression programming

Research/Original Article. Original language: Persian
برآورد دبی جریان در فلوم های با تنگ شدگی مثلثی شکل با استفاده از روش های یادگیری ماشین

محمدرضا زایری *

نشریه تحقیقات مهندسی سازه های آبیاری و زهکشی، سال بیست و چهارم شماره 90 (بهار 1402)، صص 55 -70

فلوم های گلو بریده که نوعی پارشال فلوم بدون بخش طولی گلوگاه می باشند، به عنوان ابزارهایی ساده و کارامد نقش بسزایی حهت اندازه گیری دبی جریان در کانال های روباز محسوب می شوند. نصب ساده، هزینه راه اندازی پایین و دقت بسیار مناسب در اندازه گیری میزان دبی جریان از ویژگی های مهم این نوع از سازه هاست. در این پژوهش از نتایج آزمایشگاهی به دست آمده از سازه فلوم گلو بریده که با قرار دادن دو صفحه مثلثی در دو طرف دیواره های کناری یک کانال مستطیلی و تشکیل مقطع مستطیلی و ذوزنقه ای به کار گرفته شد، جهت توسعه مدل های یادگیری ماشین مورد بررسی قرار گرفت. به منظور برآورد دبی جریان در این نوع از کانال ها از مدل های شامل دسته بندی گروهی داده ها (GMDH)، ماشین بردار پشتیبان (SVM) و جنگل تصادفی (RF) استفاده گردید. بدین منظور از پارامترهای هندسی و هیدرولیکی شامل عرض تنگ شدگی در محل سازه، شیب های افقی و عمودی دیواره های مثلثی شکل، عمق نسبی جریان به عنوان متغیر ورودی استفاده و دبی به عنوان متغیر خروجی (پاسخ) در نظر گرفته شد. نتایج نشان داد که مقدار آماره ریشه میانگین مربعات خطا (RMSE) برای مدل های مبتنی بر GMDH، SVM و RF به ترتیب، 033/0، 016/0 و 020/0 و مقدار آماره ضریب تعیین (R2) به ترتیب، 805/0، 951/0 و 900/0 به دست آمد. مقایسه بین تحقیقات گذشته و نتایج حاضر حاکی از برتری عملکرد مدل مبتنی بر SVM نسبت به سایر مدل های توسعه یافته بود. عمق آب به عرض مقطع تنگ شده به عنوان مهم ترین داده ورودی مدل ها توسعه یافته شناسایی شد.

کلید واژگان: جنگل تصادفی, فلوم های گلو بریده, شبکه های آبیاری, دسته بندی گروهی داده ها, ماشین بردار پشتیبان

Research/Original Article. Language: Persian

Discharge Prediction in Flumes with Trapezoidal Contraction by Machine Learning Techniques

Mohamadreza Zayeri

Irrigation and Drainage Structures Engineering Research, Volume:24 Issue: 90, 2023, PP 55 -70

Introduction

The effective use of water for irrigation requires that flow discharge and flow volume be measured carefully. Venturi flumes are widely used structures for monitoring flow discharge in canal networks (Venturi tubes are used in pipelines). Venturi was the first to observe the effect of a local contraction in a conduit on the flow velocity distribution. Venturi flumes have a local constriction. These devices may be built in different shapes and are generally very accurate when operated under free outflow conditions. The longitudinal section of a Venturi flume has either a constant bottom slope or a local sill or hump in the bottom. A Venturi flume with an arc-shaped inlet is referred to as a Khafagi flume, and a Montana flume is a flume without a diverging downstream wall. Polygonal flumes are simpler to construct than curved flumes and are less expensive because their elements are planar. The Parshall flume is characterized by a specific shape with various degrees of convergence and divergence. Since the development of the Parshall flume in 1926, many studies have sought to simplify flumes, reduce their construction costs, and increase their performance and accuracy. A review of the literature shows that the study of hydraulics in flow measurement flumes is mainly based on laboratory research; although numerical modeling is of interest, its accuracy is not reported to be acceptable because of the complexity of the section geometry and the triangular contraction. On the other hand, because of the hydraulic complexity of these flumes, researchers have tried to develop and use soft computing models to estimate flow characteristics in such sections, and according to published reports their accuracy has been suitable in all types of sections.
Therefore, in this research, the group method of data handling (GMDH), the support vector machine (SVM), and the random forest (RF) algorithm were developed to estimate the discharge in flumes with triangular contraction.

Methodology

By investigating the hydraulics of flow in flumes, researchers have found that the discharge depends on the width of the contraction at the location of the structure, the horizontal and vertical slopes of the triangular walls, and the relative depth of the flow. Therefore, in the present study, five dimensionless input parameters were considered for developing the GMDH, SVM, and RF models. The GMDH algorithm has been widely used to solve various hydraulic engineering problems. Among its most important applications are the estimation of scour around bridge piers and downstream of flip-bucket spillways, and of the discharge coefficient of flow measurement structures such as weirs. The SVM model is divided into two main groups: (a) the support vector classification model and (b) the support vector regression model, or SVR for short. The classification model is used to assign data to different classes, and the regression model is used to solve forecasting problems.

Results and Discussion

First, the collected data were divided into two sets, training and testing. The collected data comprise 592 records, of which 80% were assigned to training and the remaining 20% to testing. The training data are used for calibration and the test data for validation. Because the collected data are not a time series, they were assigned randomly to the training and testing sets. The results of the random forest model are presented first, because developing that model identifies the most important parameters for modeling and estimating discharge.
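The random 80/20 assignment described above can be sketched as follows. The function name, the fixed seed, and the rounding-down of the training size are assumptions; the abstract does not state the exact record counts in each set.

```python
import random


def train_test_split_indices(n_samples, train_fraction=0.8, seed=0):
    """Shuffle record indices and split them into training and testing sets."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # seeded so the split is reproducible
    n_train = int(n_samples * train_fraction)  # rounds down (assumed convention)
    return indices[:n_train], indices[n_train:]


# 592 records, as reported in the abstract.
train_idx, test_idx = train_test_split_indices(592)
```

Shuffling before splitting is appropriate here only because the records are not a time series; for sequential data a chronological split would be the usual choice.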

Keywords: Cut-throat flume, Irrigation networks, Support vector machine, GMDH, Random forest

Research/Original Article. Original language: Persian

Assessment of the effects of land-use change and crop type on changes in the volume of water entering Lake Urmia

Hassan Golmohammadi*, Kiomars Roshangar, MohammadTaghi Aalami

Water Management in Agriculture, Volume 10, Issue 1 (Serial No. 19, Spring and Summer 1402), pp. 49-64

The current condition of Lake Urmia is the result of unbalanced and unsustainable development in its catchment and of excessive withdrawal from the basin's renewable water resources, particularly over the last two decades. The aim of this study is to assess the effect of agricultural land-use change on the decline of Lake Urmia's water level and the acceleration of its drying. The data used include Landsat satellite images for 2000 to 2020 and statistics on the water resources entering Lake Urmia. The images were classified with the SVM algorithm and validated with the Kappa coefficient in ENVI 5.3, and the extent of land-use change was then determined in ArcGIS. Finally, after the change in each land use had been determined, the water requirement of each crop type was calculated with the NETWAT model based on the climate type and soil characteristics of each county. The satellite images show that over the study period the shift in cultivation pattern from irrigated farming to orchards was very rapid, with the area rising from 395 km² in 2000 to 688 km² in 2020. The NETWAT outputs also show that, given the change in agricultural land-use area and in cultivation pattern, agricultural water demand nearly doubled over the period, from 1600 million cubic meters in 2000 to 2900 million cubic meters in 2020. This increase in demand has eliminated the surface flow of rivers and lowered the basin's groundwater level, which is itself the main reason for the decline in the volume of water entering Lake Urmia.

Keywords: Cultivation pattern change, Land-use change, Lake Urmia catchment, Support vector machine

Research/Original Article. Language: Persian

Assessment of the effects of land-use changes and cultivation type on changes in the volume of water entering Lake of Urmia

Hassan Golmohammadi *, Kiomars Roshangar, MohammadTaghi Aalami

Water Management in Agriculture, Volume:10 Issue: 1, 2023, PP 49 -64

The purpose of this study is to measure the impact of agricultural land-use change on the decline of Urmia Lake's water level and the acceleration of its drying. The data used in this research include Landsat satellite images for the period 2000 to 2020 and statistics on the water sources entering the lake from the Urmia Lake Rehabilitation Headquarters. The images were classified with the SVM algorithm and validated with the Kappa coefficient in ENVI 5.3, and the extent of land-use change was then determined in ArcGIS. Finally, after the change in each land use had been determined, the amount of water required for each type of cultivation was calculated with the NETWAT model based on the climatic type and soil characteristics of each county. The satellite images show that, over the study period, the cultivation pattern shifted rapidly from irrigated agriculture to orchards, with the area rising from 395 km² in 2000 to 688 km² in 2020. The outputs of the NETWAT model also show that, due to the change in agricultural land-use area and in the cultivation pattern, the water demand of the agricultural sector almost doubled over the period, from 1600 million cubic meters in 2000 to 2900 million cubic meters in 2020. This increase in demand has eliminated the surface flow of rivers and lowered the groundwater level of the basin, which is the main reason for the reduced volume of water entering Urmia Lake.
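The abstract validates its classification with the Kappa coefficient. A minimal sketch of how overall accuracy and Cohen's kappa follow from a confusion matrix is given below; the function name and the 2x2 example matrix are hypothetical and are not taken from the study.

```python
def overall_accuracy_and_kappa(confusion):
    """Return (overall accuracy, Cohen's kappa) for a square confusion matrix.

    confusion[i][j] counts samples of reference class i assigned to class j.
    """
    n_classes = len(confusion)
    total = sum(sum(row) for row in confusion)
    # Observed agreement: share of samples on the diagonal.
    observed = sum(confusion[i][i] for i in range(n_classes)) / total
    # Chance agreement: sum over classes of (row total * column total) / total^2.
    expected = sum(
        sum(confusion[i]) * sum(row[i] for row in confusion)
        for i in range(n_classes)
    ) / (total * total)
    kappa = (observed - expected) / (1 - expected)
    return observed, kappa


# Hypothetical two-class matrix, for illustration only.
example = [[45, 5],
           [10, 40]]
accuracy, kappa = overall_accuracy_and_kappa(example)
print(accuracy, kappa)
```

Kappa discounts the agreement that would be expected by chance, which is why it is usually reported alongside overall accuracy rather than instead of it.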

Keywords: Cultivation type change, Land Use Change, Urmia Lake catchment area, Support vector machine

Research/Original Article. Original language: Persian

Investigation of land-use changes using satellite images in Zarand County, Kerman

Saeid Delgarm, Moein Ganjalikhani, Bahram Bakhtiari*

Journal of Agricultural Meteorology, Volume 11, Issue 1 (Spring and Summer 1402), pp. 64-74

Land-use change, especially of agricultural land, has a considerable effect on microclimate, natural resource management, and the exchange of fluxes between the land surface and the atmosphere. Remote sensing is one of the reliable and accurate methods for producing land-use maps, particularly over large areas. The aim of this study is to produce land-use maps and detect changes in land cover, comprising industrial, agricultural, residential, and barren classes, in Zarand County, Kerman Province, for the years 1366 and 1399 in the Solar Hijri calendar (1987 and 2020), using satellite images. To classify land use into these four groups, three methods were used: maximum likelihood, artificial neural network, and support vector machine; the support vector machine was selected as the preferred method. Overall, the land-use maps for the studied years indicate increases of 64 hectares in the agricultural sector, 17 hectares in the urban sector, and 2 hectares in the industrial sector. The growth of the county's industrial areas and of areas under agricultural use are two very important factors in the likely change of cropping patterns and agroclimate in the region, and they deserve more attention in long-term environmental planning for the region.

Keywords: Agroclimate, Artificial neural network, Land use, Support vector machine, ETM+

Research/Original Article. Language: Persian

Investigation of land-use changes using satellite images in Zarand region, Kerman

S. Delgarm, M. Ganjalikhani, B. Bakhtiari*

Journal of Agricultural Meteorology, Volume:11 Issue: 1, 2023, PP 64 -74

Land-use changes, especially of agricultural lands, have a significant impact on the microclimate of a region, natural resource management, and land-atmosphere interactions. Remote sensing is a reliable and precise technique for generating land-use maps. The aim of this study is to produce land-use maps and detect changes in land cover, including urban, industrial, agricultural, and fallow land, in the Zarand region, Kerman province, south of Iran, between 1987 and 2020. To classify land use into these four types, three methods were used: maximum likelihood, artificial neural network, and support vector machine. The support vector machine was found to be the best-performing method. In general, the generated land-use maps for the studied years showed increases of 64, 17, and 2 hectares in the agricultural, urban, and industrial land uses, respectively. The observed increases in industrial and agricultural land are important for possible changes in the cropping pattern and agroclimatic conditions of the region and need further investigation in long-term environmental management planning.

Keywords: Agro-climate, Artificial Neural Network, ETM+, Land-use, Support Vector Machine

Research/Original Article. Original language: Persian

Evaluating the performance of data mining models in rainfall forecasting and drought analysis at the Bandar Abbas synoptic station

Emad Mahjoobi*, Hamid Abdolabadi, Javad Mahjoobi, Ehsan Ghafoori

Journal of Water and Irrigation Management, Volume 13, Issue 2 (Summer 1402), pp. 429-499

The use of various data mining methods in drought prediction is common. However, the best model is usually chosen on the basis of simulation accuracy, while most studies pay less attention to the structural characteristics of the models. This paper evaluates the performance of a set of the most common data mining models, namely the multilayer perceptron artificial neural network (ANN-MLP), the radial basis function neural network (ANN-RBF), the regression decision tree (CART), the model tree (M5P), and the support vector machine (SVM), for predicting the next year's rainfall at the Bandar Abbas synoptic station, and describes the characteristics of each. The models were calibrated and validated using raw data and the three-year moving average of climatic parameters over the period 1347 to 1396 (Solar Hijri calendar). Model performance was evaluated using several statistical measures and comparative plots. The results showed that the SVM and M5P models, with RMSE values of 7.93 and 8.31 mm, MAE values of 3.66 and 4.69 mm, and correlation coefficients of 0.83 and 0.82, respectively, perform well in rainfall prediction. In addition, with the exception of CART, changing the data mining tool produces a difference of 8 to 11 percent in the accuracy of the estimates; therefore, the choice of the more suitable model should rest on other characteristics of the methods alongside their accuracy. Furthermore, using the three-year moving average increased the correlation coefficient by about 78 percent and reduced the RMSE by about 63 percent on average. Analysis of the long-term drought situation showed that as the period of the standardized precipitation index lengthens, the separation of wet and dry years becomes clearer.

Keywords: Decision tree, Standardized precipitation index, Artificial neural network, Support vector machine

Research/Original Article. Language: Persian

Investigating the Performance of Data Mining Models in Rainfall Forecasting and Drought Analysis of Bandar Abbas Synoptic Station

Emad Mahjoobi *, Hamid Abdolabadi, Javad Mahjoobi, Ehsan Ghafoori

Journal of Water and Irrigation Management, Volume:13 Issue: 2, 2023, PP 429 -499

Data mining methods are commonly used in drought prediction. However, the best model is mainly selected on the basis of simulation accuracy, while most studies pay little attention to the structural features of the models. In this paper, the performance of the most common data mining models, including Multilayer Perceptron Artificial Neural Network (ANN-MLP), Radial Basis Function Neural Network (ANN-RBF), Regression Decision Tree (CART), Model Tree (M5P), and Support Vector Machine (SVM), is evaluated for predicting monthly rainfall one year ahead at Bandar Abbas synoptic station, and the characteristics of each are described. Calibration and validation of the models were done using raw data and a three-year moving average of climatic parameters from 1347 to 1396 in the Solar Hijri calendar (approximately 1968 to 2017). The performance of the models was evaluated using different statistical indices and comparative diagrams. The results showed that the SVM and M5P models have good prediction performance, with RMSE of 7.93 and 8.31 mm, MAE of 3.66 and 4.69 mm, and correlation coefficient (CC) of 0.83 and 0.82, respectively. Also, with the exception of CART, changing the data mining tool makes a difference of 8 to 11 percent in the accuracy of the estimates. Therefore, the most appropriate model should be selected based on other characteristics of the methods besides their accuracy. In addition, using the three-year moving average of the input parameters increased the correlation coefficient by about 78 percent and reduced the RMSE by about 63 percent. The analysis of the long-term drought situation showed that as the period of the standardized precipitation index lengthens, the separation of wet and dry years becomes clearer.
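The evaluation indices named in this abstract (RMSE, MAE, CC) and the three-year moving average can be sketched in a few lines. This is an illustrative sketch only: the function names are hypothetical, the sample values are made up, and the trailing (rather than centered) window is an assumption, since the abstract does not say which form was used.

```python
import math


def rmse(observed, predicted):
    """Root mean squared error."""
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / len(observed))


def mae(observed, predicted):
    """Mean absolute error."""
    return sum(abs(o - p) for o, p in zip(observed, predicted)) / len(observed)


def correlation_coefficient(observed, predicted):
    """Pearson correlation coefficient (CC)."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_o * var_p)


def moving_average(values, window=3):
    """Trailing moving average; the first window-1 positions are dropped."""
    return [
        sum(values[i - window + 1:i + 1]) / window
        for i in range(window - 1, len(values))
    ]


# Made-up rainfall values (mm), for illustration only.
obs = [12.0, 0.0, 30.5, 8.0, 15.0]
pred = [10.0, 2.0, 25.0, 9.5, 14.0]
print(rmse(obs, pred), mae(obs, pred), correlation_coefficient(obs, pred))
print(moving_average(obs))
```

Smoothing the inputs with a moving average removes year-to-year noise, which is one plausible reason the abstract reports a higher correlation coefficient and lower RMSE for the smoothed series.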

Keywords: Artificial Neural Network, Decision Tree, Standard Precipitation Index, Support vector machine

Research/Original Article. Original language: Persian

Simulation of yield and water productivity of cucumber (Cucumis sativus L.) using artificial neural networks

Sanaz Shokri, Abdolrahim Hooshmand, Mona Golabi, Naser Alemzadeansari, Dan Struve

Irrigation & Water Engineering, Serial No. 52 (Summer 1402), pp. 165-182

To simulate the yield and water productivity of cucumber (Cucumis sativus L.), an experiment was carried out in a completely randomized block design with three irrigation levels of 100, 85, and 75 percent of the water requirement in two growing seasons during 1397 and 1398 (Solar Hijri calendar). Perceptron neural networks (MLP) and the support vector machine (SVM) method were used, and the coefficient of determination, mean squared error, and normalized mean squared error were used to select the appropriate, optimal model. Irrigation water amount, number of leaves per plant, temperature, evaporation, and relative humidity were chosen as input data, and 60, 20, and 20 percent of all data were allocated to training, validation, and testing of the model, respectively. The results showed that the MLP neural network with irrigation water amount and leaf number as inputs, with coefficients of determination of 0.92 and 0.86, respectively, was more accurate in simulating fruit yield and water productivity of cucumber. The sensitivity analysis indicated that the irrigation water input, with sensitivity coefficients of 0.9 and 0.86, respectively, is the most important parameter affecting the water productivity and fruit yield models.

Keywords: Perceptron, Support vector machine, Neural network, Sensitivity analysis, Deficit irrigation

Research/Original Article. Language: Persian

Simulation of Yield and Water Productivity of Cucumber Plant Using Artificial Neural Network

Sanaz Shokri, Abdolrahim Hooshmand, Mona Golabi, Naser Alemzadeansari, Dan Struve

Irrigation & Water Engineering, Volume:13 Issue: 52, 2023, PP 165 -182

To simulate the yield and water productivity of cucumber (Cucumis sativus L.), an experiment was conducted in a completely randomized block design with three irrigation levels of 100, 85, and 75% of the water requirement over two growing seasons in 2017 and 2018. Perceptron neural networks (MLP) and the support vector machine (SVM) method were used, and the coefficient of determination, mean squared error, and normalized mean squared error were used to select the appropriate, optimal model. The amount of irrigation water, number of leaves on the plant, temperature, evaporation rate, and relative humidity were selected as input data, and 60%, 20%, and 20% of the total data were allocated to training, validation, and testing of the model, respectively. The results showed that the MLP neural network with irrigation water and number of leaves as inputs was more accurate in simulating fruit yield and water productivity in cucumber, with coefficients of determination of 0.92 and 0.86, respectively. The sensitivity analysis indicated that irrigation water is the most influential input parameter for the water productivity and fruit yield models, with sensitivity coefficients of 0.9 and 0.86, respectively.
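The model-selection indices mentioned above can be computed as in the sketch below. The function names and sample values are hypothetical, and the normalization used for NMSE (dividing the mean squared error by the variance of the observations) is an assumption, because the abstract does not define it and several conventions exist.

```python
def mean_squared_error(observed, predicted):
    """Mean of the squared differences between observed and predicted values."""
    return sum((o - p) ** 2 for o, p in zip(observed, predicted)) / len(observed)


def normalized_mse(observed, predicted):
    """MSE divided by the variance of the observations (assumed convention)."""
    mean_o = sum(observed) / len(observed)
    variance = sum((o - mean_o) ** 2 for o in observed) / len(observed)
    return mean_squared_error(observed, predicted) / variance


def coefficient_of_determination(observed, predicted):
    """R^2 = 1 - SSE / SST."""
    mean_o = sum(observed) / len(observed)
    sse = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    sst = sum((o - mean_o) ** 2 for o in observed)
    return 1 - sse / sst


# Made-up yield values, for illustration only.
obs = [1.0, 2.0, 3.0, 4.0]
pred = [1.5, 2.0, 3.0, 3.5]
print(coefficient_of_determination(obs, pred), normalized_mse(obs, pred))
```

Under this particular normalization the two indices are complementary (R^2 equals 1 minus NMSE), so reporting both mainly serves as a consistency check.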

Keywords: Perceptron, Support Vector Machine, Neural Network, Sensitivity analysis, Deficit irrigation

Research/Original Article. Original language: Persian

Evaluation of Anzali Wetland extents based on Landsat time series data and the MNDWI index

Maryam Haghighi Khomami*, Amir Eslam Bonyad, Mohammad Panahandeh

Iranian Journal of Soil and Water Research, Volume 54, Issue 1 (Serial No. 85, Farvardin 1402), pp. 173-192

Wetland habitats are among the most important natural ecosystems in the world, and examining trends of change in order to manage these valuable ecosystems requires accurate, up-to-date information, which remote sensing makes possible. In this study, changes in the Anzali International Wetland in Gilan Province from 1986 to 2020 were examined using Landsat satellite images and a water index in Google Earth Engine. Using images from the Landsat 5 TM sensor and the Landsat 8 OLI sensor together with the MNDWI water index in Google Earth Engine, the wetland area was classified into two classes, water and non-water. Climate data, including rainfall data from the TRMM satellite, the PDSI index derived from TerraClimate data, and Caspian Sea water level data, were used to examine changes in the wetland's water surface. The maps produced by support vector machine classification had an overall accuracy above 87 percent and a Kappa coefficient above 88 percent, and according to the water index the area of water bodies decreased by 20 percent over this period, falling from 5926 hectares to 954 hectares. These changes showed an initially upward trend (until 2000) and then a downward trend in the area of water bodies. Examination of the climatic factors behind changes in water-body area also indicates that the sea level has the greater influence on the wetland's water level trend. The results show that water indices and Google Earth Engine are efficient tools for identifying increasing and decreasing trends in wetland water area, which can help planners and policymakers protect and manage natural resources in the studied areas.

Keywords: Sea level, MNDWI index, Google Earth Engine, Support vector machine

Research/Original Article. Language: Persian

Anzali Wetland Surface Area Evaluation Based on Landsat Time Series Data and NDWI Indices

Maryam Haghighi Khomami *, Amir Eslam Bonyad, Mohammad Panahandeh

Iranian Journal of Soil and Water Research, Volume:54 Issue: 1, 2023, PP 173 -192

Wetland habitats are among the most important natural ecosystems in the world. Evaluating and managing these valuable ecosystems requires accurate and up-to-date data, which remote sensing makes possible. In this study, changes in the Anzali International Wetland in Gilan province, Iran, from 1986 to 2020 were investigated using Landsat satellite images and the Modified Normalized Difference Water Index (MNDWI) on the Google Earth Engine (GEE) platform. To monitor changes in water bodies, the area was classified into water and non-water classes with the Support Vector Machine (SVM) algorithm, and the MNDWI was used to delineate the water surface areas. In addition, climate data, including TRMM satellite data, the PDSI index from TerraClimate data, and Caspian Sea water level data, were used to determine their effects on water level fluctuation in the wetland. The SVM classification maps had an overall accuracy above 87% and a Kappa coefficient above 88%. According to the MNDWI maps, the wetland's water body area decreased by 20%, reaching 954 hectares from 5926 hectares, with an initially upward trend (until 2000) followed by a downward trend. The water level of Anzali Wetland was also affected more by the sea level than by the climatic factors. The results show that water indices and Google Earth Engine are efficient tools for identifying trends in wetland water level changes and could provide more detailed scientific guidance for protecting and managing natural resources in the studied areas.
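The MNDWI mentioned above is the normalized difference of the green and shortwave-infrared reflectances, (Green - SWIR) / (Green + SWIR). The sketch below shows the index and a simple water/non-water split on small nested lists. It is not the authors' workflow, which ran in Google Earth Engine with SVM classification; the function names, the threshold of 0, and the sample reflectances are assumptions made for illustration.

```python
def mndwi(green, swir):
    """Modified Normalized Difference Water Index for one pixel."""
    denominator = green + swir
    if denominator == 0:
        return 0.0  # avoid division by zero on empty pixels (an assumption)
    return (green - swir) / denominator


def classify_water(green_band, swir_band, threshold=0.0):
    """Label each pixel True (water) when its MNDWI exceeds the threshold.

    The threshold of 0 is a common default, assumed here; it is not a value
    reported in the abstract.
    """
    return [
        [mndwi(g, s) > threshold for g, s in zip(green_row, swir_row)]
        for green_row, swir_row in zip(green_band, swir_band)
    ]


# Made-up surface reflectances for a 2x2 scene, for illustration only.
green_band = [[0.30, 0.10], [0.25, 0.12]]
swir_band = [[0.10, 0.30], [0.05, 0.28]]
water_mask = classify_water(green_band, swir_band)
print(water_mask)
```

Water absorbs strongly in the shortwave infrared, so open water tends to give positive index values while soil and built-up surfaces tend to give negative ones.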

Keywords: Sea Level, MNDWI Index, Google Earth Engine, SVM

Research/Original Article. Original language: Persian

Note

Results are sorted by publication date.
The keyword was searched only in the keyword field of the articles. To exclude unrelated results, the search covered only articles in journals on the same subject as the source journal.
To repeat the search across all subjects or with other conditions, go to the journals' advanced search page.
