Document Type : Research Article
Authors
1 College of Agricultural Sciences and Natural Resources, Sari University
2 Saeb University
3 Islamic Azad University, Zanjan
Abstract
Introduction: River discharge as one of the most important hydrology factors has a vital role in physical, ecological, social and economic processes. So, accurate and reliable prediction and estimation of river discharge have been widely considered by many researchers in different fields such as surface water management, design of hydraulic structures, flood control and ecological studies in spetialand temporal scale. Therefore, in last decades different techniques for short-term and long-term estimation of hourly, daily, monthly and annual discharge have been developed for many years. However, short-term estimation models are less sophisticated and more accurate.Various global and local algorithms have been widely used to estimate hydrologic variables. The current study effort to use Lazy Learning approach to evaluate the adequacy of input data in order to follow the variation of discharge and also simulate next-day discharge in Talar River in KasilianBasinwhere is located in north of Iran with an area of 66.75 km2. Lazy learning is a local linear modelling approach in which generalization beyond the training data is delayed until a query is made to the system, as opposed to in eager learning, where the system tries to generalize the training data before receiving queries
Materials and Methods: The current study was conducted in Kasilian Basin, where is located in north of Iran with an area of 66.75 km2. The main river of this basin joins to Talar River near Valicbon village and then exit from the watershed. Hydrometric station located near Valicbon village is equipped with Parshall flume and Limnogragh which can record river discharge of about 20 cubic meters per second.In this study, daily data of discharge recorded in Valicbon station related to 2002 to 2012 was used to estimate the discharge of 19 September 2012. The mean annual discharge of considered river was also calculated by using available data about 0.441 cubic meters per second. To estimate the discharge of considered day, three methods of constant, linear and quadratic functionscontrollers based on the local linearization provided by the lazy learning algorithm were considered. Lazy learning is a memory-based linear technique for local modeling approach which is reported as a high-efficient algorithm for simulating variables with low input data.The series of input data was categorized into previous 6, 8, 10, 15 and 20 days, 1 and 2 months, 1, 2 and 3 seasons and also 1 and 2 years to evaluate which series is appropriately enough to predict next-day discharge inthe river. Then, mean absolute error and root-mean square error were calculated for all series and modelsin order to find the best estimator model and the most appropriate series of input data.
Results: Results showed that constant and linear model had the minimum root-mean square error of 0.001 and 0.057 respectivelywith previous 60 days’ data series. Whilethe quadratic model had its best estimation with previous 2 season data series with the minimum root-mean square error of 0.059. The result indicated that the more input data increase, the best quadratic model estimate until 60 days. But after 60 days, estimation error gradually increased. Consequently, not more data but adequate areneeded for accurate estimation. Also, RMSE in linear model had less fluctuation and therefore less sensitivity compared with other models. And quadratic model had less fluctuation and sensitivity to neighborhoods. Also, according to results, the more variation in each period increase, the better estimation is accrued by lazy learning algorithm. Hence, it was expected that next-day discharge prediction in low-water period needs longer data series than high-water period.
Conclusion: Regarding to thousands of prepared training models, constant model with previous 60 days’ data and minimum error of 0.0001 was selected as the most accurate estimatefor next-day river discharge in Talar River. Results showed that despite of some limitation and demerits, the local Lazy Learning algorithm has significant efficiency in time series simulating. Although the accuracy of simulation increase with more input data, but this algorithm can runby at least 5 training data. However we find lazy learning to be the best performing approach on average goodness indicators (such as mean absolute error and Root-mean square error). On the other hand, the lazy learning predictor can be quickly developed and easily kept up-to-date by adding new data to its database. Also, it does not face with overfitting problems which are common in global modeling approaches.According to some noteworthy features of lazy learning noticed in this regards, this approach will have good performance for time-series studies.
Keywords
Send comment about this article