mahboobeh farzandi; Seyed Hossein Sanaeinejad; Bijan Ghahraman; Majid Sarmad
Abstract
Introduction: Temperature and precipitation are two of the main variables in meteorology and climatology. These are basic inputs in water resource management. The length of the statistical period plays a pivotal role in the accurate analysis of these variables. Observation data at Iran's first ...
Read More
Introduction: Temperature and precipitation are two of the main variables in meteorology and climatology. These are basic inputs in water resource management. The length of the statistical period plays a pivotal role in the accurate analysis of these variables. Observation data at Iran's first synoptic station from 1330 (1951) is available at the Iranian Meteorological Organization website The historical monthly precipitation and temperature of five stations in Iran is available since 1880 with missing data. These data measured by the Embassy of the United States and Britain from the Qajar period and recorded in World Weather records books. These synoptic stations include Mashhad, Isfahan, Tehran, Bushehr, and Jask. The monthly missing data were predominantly recorded during World War II (1941-1949). Unfortunately, these data have missing. Therefore, the accuracy of simulating these variables is very important. The current research aimed to predict the missing values of monthly temperature and precipitation in Mashhad station. The stations in the neighboring countries were selected due to the distance to Mashhad, relationship, and completeness of data since 1880, as the predictive variables. Monthly precipitation of Ashgabat from Tajikistan and Sarakhs, Kooshkah, Bayram Ali, Kerki and Repetek from Turkmenistan were selected as an independent variable in the making of Missing Rainfall in Mashhad. Also, the temperature of Ashgabat, Bayram Ali, Gudan, Sarakhs, and Tajan were selected to restore the monthly temperature of the Mashhad station. This research has fitted ten multiple regression models to monthly rainfall of Mashhad station and has fitted 6 multiple regression to the monthly temperature of Mashhad. then the parameters of these patterns are optimized by genetic and Ant Colony algorithm. Also, the Artificial Neural Network (MLP) model and Support vector regression have been selected and implemented in order to simulate monthly precipitation and temperature data of Mashhad.
Materials and Methods: In statistical modeling, regression analysis is a set of statistical processes for estimating the relationships among variables. It includes many techniques for modeling and analyzing several variables when the focus is on the relationship between a dependent variable and one or more independent variables (or 'predictors'). Genetic algorithm (GA) is a metaheuristic inspired by the process of natural selection that belongs to the larger class of evolutionary algorithms (EA). Genetic algorithms are commonly used to generate high-quality solutions to optimization and search problems by relying on bio-inspired operators such as mutation, crossover, and selection. Ant colony optimization algorithm (ACO) is a probabilistic technique for solving computational problems which can be reduced to finding good paths through graphs. This algorithm is a member of the ant colony algorithms family, in swarm intelligence methods, and it constitutes some metaheuristic optimizations. Artificial neural networks are one of the main tools used in machine learning. As the “neural” part of their name suggests, they are brain-inspired systems which are intended to replicate the way that we humans learn. Neural networks consist of input and output layers, as well as (in most cases) a hidden layer consisting of units that transform the input into something that the output layer can use. They are excellent tools for finding patterns which are far too complex or numerous for a human programmer to extract and teach the machine to recognize. In machine learning, support vector machines (SVMs, also support vector networks) are supervised learning models with associated learning algorithms that analyze data used for classification and regression analysis. Given a set of training examples, each marked as belonging to one or the other of two categories, an SVM training algorithm builds a model that assigns new examples to one category or the other, making it a non-probabilistic binary linear classifier (although methods such as Platt scaling exist to use SVM in a probabilistic classification setting).
Results and Discussion: At the first stage, several multiple regressions were fitted to monthly precipitation (with coefficients ranging from 0.63 to 0.81) and six patterns for monthly temperature (0.986-0.993). Afterward, GA and ACO were applied to improve the accuracy of the selected regression models by optimizing their parameters. At the next stage, ANN and SVR were used to estimate the monthly missing values separately. Finally, the results of the previous stages were compared using the root mean square error (RMSE), and the optimal models were applied to determine the missing values of monthly temperature and precipitation of Mashhad. The results showed that the Genetic Algorithm and Ant Colony increase the accuracy of the estimation of missing rainfall data significantly more than the previous methods. The lowest error criterion (RMSE) between regression patterns is 9.8 millimeters. By genetic algorithm, this criterion is reduced to 2.56 mm, and by ant colony algorithm to 2.559.
Conclusion: Comparison of the above methods in restoration temperature and precipitation shows that evolutionary methods (GA and ACO) are the best for estimating the missing monthly precipitation and machine learning methods (ANN and SVR) are the best to imputation missing data of monthly temperature.
H. Shamkoeian; B. Ghahraman; K. Davary; M. Sarmad
Abstract
Abstract
Natural disasters threatening and endangering human communities has resulted in the study and research of such disasters through the related sciences and present methods of forecasting their behavior with time and place and also from a qualification and quantity viewpoint. To this end, numerous ...
Read More
Abstract
Natural disasters threatening and endangering human communities has resulted in the study and research of such disasters through the related sciences and present methods of forecasting their behavior with time and place and also from a qualification and quantity viewpoint. To this end, numerous methods for the determination of the maximum flood in various return period has been made available which can be refered to as flood frequency analysis methods. One of these methods is the regional flood frequency analysis in which instead of using the data from a single station, it considers the data and characteristics of a group of similar stations. In the case under the research this method uses L-Moments and Index Flood in North, Razavi and South Khorasan water basins and MATLAB software. Maximum annual flood statistics were used from 68 Hydrometric stations with minimum and maximum statistical periods of 6 and 39 years. Using Cluster analysis the region under study was divided to 7 partitions. Discordance test has conducted and only one station in region C was found as discordance station. Because of knowing the homogeneity of the regions, the parameter of Kappa distribution were estimated and with using the simulation method of Monte Carlo with 500 times, the homogeneity measure was tested in 7 regions. Using homogeneity test all regions was found homogen. Using goodness-of-fit measure z and Kolmogrove-Smirnov the Log normal 3 parameters distribution were selected for two regions of A and B, GEV for C, Generalized Pareto for D and E, Generalized logistic for F and Pearson III for G. Besides, GEV distribution was found appropriate for all of the regions, only their parameters are different in any regions. For estimating of index flood a logarithmic model has found for each region with 4 variables of area, height, average slop and form factor. Using of these models, the index flood can be estimated in each region and it can be used for standardize the statistics of maximum flood values.
Keywords: Regional flood frequency analysis; L-Moments; Index Flood; Cluster analysis; Khorasan
M. Mousavi baygi; M. Erfanian; M. Sarmad
Abstract
Abstract
One of the losses decrease's ways in fields, is the proper irrigation management, which its base is the accurate estimation of crop water requirement. Equations which are used to calculate the reference evapotranspiration (ETo), do not use the same climatic parameters and due to their empirical ...
Read More
Abstract
One of the losses decrease's ways in fields, is the proper irrigation management, which its base is the accurate estimation of crop water requirement. Equations which are used to calculate the reference evapotranspiration (ETo), do not use the same climatic parameters and due to their empirical base, are not match to all climatic situations. So it is needed to clarify proper methods for each region. In this investigation, lysimeteric data, which are taken from the climatologic station of Mashhad University in a 6-month period tests, ware compared with the F.A.O-Penman-Montieth (F.P.M), evaporation pan, monthly and yearly adjusting coefficient. Also the F.P.M's calculated ETo for all the synoptic stations of Razavi Khorasan province were regression with the air temperature, radiation and psychrometeric coefficient to suggest a simple equation. The best results were obtained from the F.P.M equation with the monthly (R2=0.99) and yearly (R2=0.92) adjusting coefficient, respectively. So it is advised to assess the reference situation of the station and use adjusting coefficients for the non-reference ones.
Key words: Reference evapotranspiration, Lysimeter, FAO-Penman-Montieth, Adjusting coefficients