یادگیری تقویتی چهارچوبی به منظور پهنه بندی سیل در مناطق کوهستانی (با استفاده از مدل Deep Q-Learning)

یوسفی., صالح.; مردانیان, سارا; کریمی پور, احمد رضا

doi:10.22092/wmrj.2026.371991.1656

یادگیری تقویتی چهارچوبی به منظور پهنه بندی سیل در مناطق کوهستانی (با استفاده از مدل Deep Q-Learning)

مقالات آماده انتشار

نوع مقاله : پژوهشی

نویسندگان

¹ عضو هیأت علمی مرکز تحقیقات و آموزش کشاورزی و منابع طبیعی استان چهارمحال و بختیاری

² مرکز تحقیقات و آموزش کشاورزی ومنابع طبیعی استان چهارمحال وبختیاری، سازمان تحقیقات، آموزش وترویج کشاورزی،شهرکرد،ایران

10.22092/wmrj.2026.371991.1656

چکیده

چکیدۀ مبسوط
مقدمه و هدف
درک و کاهش ریسک‌های سیلاب از اولویت‌های اصلی پژوهشگران و سیاست‌گذاران است. پهنه‌بندی حساسیت به سیلاب ابزار کلیدی مدیریت ریسک در مناطق کوهستانی به شمار می‌رود، زیرا مناطق مستعد را شناسایی کرده و مبنای برنامه‌ریزی مکانی، آمادگی و اقدامات پیشگیرانه را فراهم می‌کند. رویکردهای سنتی نقشه‌برداری سیلاب مانند مدل‌سازی هیدرولوژیکی و تحلیل داده‌های تاریخی اغلب با محدودیت‌هایی چون کمبود داده، پیچیدگی محاسباتی و پویایی سیستم‌های محیطی مواجه‌اند. ازاین‌رو، روش‌های نوین هوش مصنوعی به‌ویژه یادگیری ماشین و یادگیری تقویتی برای پردازش داده‌های متنوع و شناسایی الگوهای پیچیده به کار گرفته شده‌اند. در میان رویکردهای هوش مصنوعی، یادگیری تقویتی به دلیل قابلیت تعامل با محیط و بهبود تدریجی تصمیم‌گیری، نویدبخش است. مدل Deep Q-Learning (DQL) به‌عنوان یکی از روش‌های پیشرفته یادگیری تقویتی، امکان کار با فضاهای حالت پیوسته و ابعاد بالا را بدون نیاز به گسسته‌سازی فراهم می‌کند. این پژوهش باهدف بهره‌گیری از مدل DQL برای تهیه نقشه حساسیت به سیلاب در استان چهارمحال و بختیاری، منطقه‌ای کوهستانی و نیمه‌خشک مستعد به سیلاب‌های ناگهانی ناشی از ذوب برف و بارش‌های شدید، انجام‌شده است. هدف اصلی ارائه چارچوبی دقیق برای ارزیابی خطر سیلاب و کمک به سیاست‌گذاران و مدیران بحران در کاهش ریسک و تقویت تاب‌آوری به‌ویژه در شهرستان‌های آسیب‌پذیر همچون اردل و لردگان است.

مواد و روش‌ها
استان چهارمحال و بختیاری با مساحت تقریبی ۱۶۵۵۳۰۰ هکتار در جنوب غربی ایران و در رشته‌کوه‌های زاگرس قرار دارد. ویژگی‌های توپوگرافی شامل دامنه‌های تند، دره‌های عمیق و شبکه زهکشی گسترده است که نقش مهمی در پویایی هیدرولوژیکی منطقه ایفا می‌کند. رخدادهای بارشی شدید در اواخر زمستان و اوایل بهار اغلب منجر به سیلاب‌های ناگهانی و رودخانه‌ای می‌شود که خسارات قابل‌توجهی به زیرساخت‌ها و اراضی کشاورزی وارد می‌کند. فهرست رخدادهای سیل شامل ۵۴۵ نقطه مکانی (۳۴۶ نقطه سیلابی و ۱۹۹ نقطه غیرسیلابی) از بازه زمانی ۱۹۸۳ تا ۲۰۲۳ گردآوری شد. داده‌ها به‌طور تصادفی و لایه‌ای به مجموعه آموزش (۳۸۹ نقطه، ۷۱٪) و آزمون (۱۵۶ نقطه، ۲۹٪) تقسیم شدند. ابتدا ۲۱ متغیر محیطی از منابع مختلف شامل مدل رقومی ارتفاع، سری زمانی اقلیمی، تصاویر Landsat 9 و پایگاه SoilGrid استخراج شد. پس از انجام تحلیل هم‌خطی، چهار متغیر حذف و ۱۷ متغیر نهایی (ارتفاع، تجمع جریان، جهت جریان، پیوستگی جریان، جهت جغرافیایی، طول شیب، انحنای افقی /و پروفیل، شاخص رطوبت توپوگرافی، عمق تا بستر سنگی، بیشینه بارش ۲۴ ساعته، دماهای میانگین و بیشینه، عمق برف، NDVI، درصد ماسه سطحی و فاصله از مناطق مسکونی) انتخاب گردید. مدل Deep Q-Learning برای پردازش متغیرهای پیوسته و تولید نقشه حساسیت به پنج کلاس (خیلی کم تا خیلی زیاد) طبقه‌بندی شد. دقت مدل با معیارهایی همچون AUC، Kappa، Recall، Precision و Specificity و LogLoss ارزیابی شد.
نتایج و بحث
پس از مدل سازی بااستفاده از مدل Deep Q-Learning (DQL) نقشه رستری تهیه شده بر اساس روش و الگوریتم Natural break سطح منطقه موردمطالعه (۱٬۶۵۵٬۳۰۰ هکتار) را به پنج کلاس حساسیت به سیلاب ازجمله: خیلی کم (۶٪، معادل ۹۷٬۲۲۱ هکتار)، کم (۲۰٪، ۳۳۶٬۱۹۰ هکتار)، متوسط (۲۶٪، ۴۲۹٬۴۵۳ هکتار)، زیاد (۲۶٪، ۴۳۵٬۲۹۳ هکتار) و خیلی زیاد (۲۲٪، ۳۵۷٬۱۴۰ هکتار) تقسیم کرد. شاخص‌های ارزیابی دقت، نشان‌دهنده عملکرد قوی مدل بود. از نتایج شاخص‌ها می‌توان دید که:AUC برابر93/0 شد که قدرت تفکیک عالی را تأیید می‌کند، Kappa با 72/0 که توافق قابل‌قبول بین پیش‌بینی و واقعیت را نشان می‌دهد،Recall با مقدار 87/0 بیانگر شناسایی بالای نقاط سیلابی است، Precision, برابر با 90/0 ، Specificity برابر با 85/0 و LogLoss معادل 65/0 محاسبه شد. تحلیل اهمیت متغیرها نشان داد که عمق برف بیشترین تأثیر را دارد که نقش کلیدی ذوب برف در سیلاب‌های بهاره را برجسته می‌سازد؛ پس‌ازآن تجمع جریان (ویژگی‌های توپوگرافی)، فاصله از مناطق مسکونی (اثرات انسانی و توسعه شهری)، شاخص NDVI (پوشش گیاهی و نفوذپذیری)، دمای میانگین (شرایط اقلیمی) و طول شیب قرار دارند. مناطق با حساسیت خیلی زیاد عمدتاً در نواحی پست، دره‌ها و امتداد رودخانه‌های اصلی مانند کارون و زاینده‌رود به‌ویژه در شهرستان‌های اردل و لردگان متمرکز هستند. مقایسه با مطالعات مشابه مبتنی بر مدل‌های یادگیری ماشین نظارت‌شده نشان می‌دهد که DQL پیش‌بینی متعادل‌تری در محیط‌های ناهمگن کوهستانی ارائه می‌دهد و پتانسیل بالایی در مدیریت ریسک سیلاب دارد.
نتیجه‌گیری و پیشنهادها
این پژوهش نشان داد که مدل Deep Q-Learning (DQL) به‌عنوان یک روش یادگیری تقویتی عمیق، ظرفیت بالایی در پهنه‌بندی حساسیت به سیلاب در مناطق پیچیده کوهستانی مانند استان چهارمحال و بختیاری دارد. این مدل با پردازش متغیرهای محیطی پیوسته و شناسایی دقیق روابط غیرخطی، نقشه‌های معتبر و کاربردی تولید کرد که نواحی پرخطر را به‌طور مؤثر مشخص می‌سازد. نتایج تأکید می‌کنند که عوامل اصلی سیلاب در منطقه شامل ذوب برف، ویژگی‌های توپوگرافی، توسعه انسانی و کاهش پوشش گیاهی است که این یافته‌ها با واقعیت‌های میدانی منطقه همخوانی دارد. نقشه‌های حاصل می‌تواند مبنایی عملی برای برنامه‌ریزی کاربری زمین، تقویت زیرساخت‌های دفع آب باران، حفاظت از مناطق مسکونی در برابر سیلاب‌های ناگهانی و بازسازی اکوسیستم‌های گیاهی به‌منظور افزایش نفوذپذیری خاک فراهم آورد. برای بهبود دقت مدل در مطالعات آینده، پیشنهاد می‌شود فهرست رخدادهای سیلاب با داده‌های فصلی و زمانی دقیق‌تر گسترش یابد، متغیرهای پویا مانند نوسانات روزانه دما و چرخه‌های ذوب برف اضافه شود و اعتبارسنجی میدانی گسترده‌تری به‌ویژه در ارتفاعات بالا و مناطق شهری انجام گیرد. همچنین، ترکیب DQL با سایر روش‌های یادگیری تقویتی یا مدل‌های هیبریدی می‌تواند کارایی را افزایش دهد. به‌طورکلی، این رویکرد نوین چارچوبی قدرتمند برای مدیریت پایدار ریسک سیلاب در مناطق مشابه ایران و جهان ارائه می‌دهد و می‌تواند به سیاست‌گذاران در تقویت تاب‌آوری جوامع محلی کمک شایان توجهی کند.
واژ‌گان کلیدی
ارتفاعات ایران، ارزیابی ریسک سیل، یادگیری تقویتی، یادگیری ماشین.

کلیدواژه‌ها

موضوعات

ژئومرفولوژی و مخاطرات طبیعی

عنوان مقاله [English]

A Reinforcement Learning Framework for Flood Zoning in Mountainous Areas (Using the Deep Q-Learning Model)

نویسندگان [English]

SARA MARDANIAN ²
AHMADREZA KARIMIPOR ²

² Chaharmahal and Bakhtiari Province Agricultural and Natural Resources Research and Education Center, Agricultural Research, Education and Promotion Organization, Shahrekord, Iran,

چکیده [English]

Extended Abstract
Introduction and Goal
Understanding and reducing flood risk are among the primary priorities of researchers and policymakers. Flood susceptibility mapping is considered a key risk management tool in mountainous regions, as it identifies prone areas and provides a foundation for spatial planning, preparedness, and preventive measures. Traditional flood mapping approaches, such as hydrological modeling and historical data analysis, often face limitations including data scarcity, computational complexity, and the dynamic nature of environmental systems. Therefore, modern artificial intelligence (AI) methods—particularly machine learning (ML) and reinforcement learning (RL)—have been employed to process diverse datasets and identify complex patterns.
Among AI approaches, reinforcement learning is promising due to its capacity to interact with the environment and progressively improve decision-making. The Deep Q-Learning (DQL) model, as an advanced reinforcement learning technique, enables operation in continuous and high-dimensional state spaces without requiring discretization. This study aims to apply the DQL model to produce a flood susceptibility map for Chaharmahal and Bakhtiari Province, a mountainous and semi-arid region prone to flash floods caused by snowmelt and intense rainfall. The main objective is to provide a precise framework for flood hazard assessment and to support policymakers and crisis managers in reducing risk and enhancing resilience, particularly in vulnerable counties such as Ardal and Lordegan.

Materials and Methods
Chaharmahal and Bakhtiari Province, covering approximately 1,655,300 hectares in southwestern Iran within the Zagros mountain range, is characterized by steep slopes, deep valleys, and an extensive drainage network that plays a significant role in the region’s hydrological dynamics. Intense precipitation events in late winter and early spring frequently result in flash and riverine floods, causing considerable damage to infrastructure and agricultural lands.
A flood inventory consisting of 545 spatial points (346 flood and 199 non-flood locations) from 1983 to 2023 was compiled. The dataset was randomly and stratifiedly divided into training (389 points, 71%) and testing (156 points, 29%) subsets. Initially, 21 environmental variables were extracted from multiple sources, including a Digital Elevation Model (DEM), climatic time series, Landsat 9 imagery, and the SoilGrid database. After multicollinearity analysis, four variables were removed, and 17 final predictors were selected: elevation, flow accumulation, flow direction, stream connectivity, aspect, slope length, plan and profile curvature, Topographic Wetness Index (TWI), depth to bedrock, maximum 24-hour precipitation, mean and maximum temperature, snow depth, NDVI, surface sand percentage, and distance from residential areas.
The Deep Q-Learning model was applied to process continuous variables and generate a flood susceptibility map classified into five categories (very low to very high). Model performance was evaluated using AUC, Kappa, Recall, Precision, Specificity, and LogLoss metrics.

Results and Discussion Using the Deep Q-Learning (DQL) model, the resulting raster map—classified using the Natural Breaks algorithm—divided the study area (1,655,300 hectares) into five flood susceptibility classes: very low (6%, 97,221 ha), low (20%, 336,190 ha), moderate (26%, 429,453 ha), high (26%, 435,293 ha), and very high (22%, 357,140 ha).
Accuracy assessment indicated strong model performance. The AUC reached 0.93, confirming excellent discriminative ability. The Kappa coefficient was 0.72, indicating substantial agreement between predictions and observations. Recall was 0.87, demonstrating a high detection rate of flood-prone locations. Precision and Specificity were 0.90 and 0.85, respectively, while LogLoss was 0.65.
Variable importance analysis revealed that snow depth had the greatest influence, highlighting the critical role of snowmelt in spring floods. This was followed by flow accumulation (topographic characteristics), distance from residential areas (human and urban development impacts), NDVI (vegetation cover and soil permeability), mean temperature (climatic conditions), and slope length. Areas classified as very high susceptibility were mainly concentrated in low-lying areas, valleys, and along major rivers such as the Karun and Zayandeh-Rud rivers, particularly within Ardal and Lordegan counties. Comparison with similar studies based on supervised machine learning models suggests that DQL provides more balanced predictions in heterogeneous mountainous environments and demonstrates strong potential for flood risk management.

Conclusion and Suggestions
This study demonstrated that the Deep Q-Learning (DQL) model, as a deep reinforcement learning approach, has high capability for flood susceptibility mapping in complex mountainous regions such as Chaharmahal and Bakhtiari Province. By processing continuous environmental variables and accurately capturing nonlinear relationships, the model produced reliable and practical maps that effectively delineate high-risk areas.
The findings emphasize that the main flood drivers in the region include snowmelt, topographic characteristics, human development, and vegetation degradation—results that are consistent with field observations. The generated maps can serve as a practical basis for land-use planning, strengthening stormwater infrastructure, protecting residential areas against flash floods, and restoring vegetation ecosystems to enhance soil permeability.
For future studies, it is recommended to expand the flood inventory with more precise seasonal and temporal data, incorporate dynamic variables such as daily temperature fluctuations and snowmelt cycles, and conduct broader field validation, particularly in high-altitude and urban areas. Additionally, integrating DQL with other reinforcement learning approaches or hybrid models may further improve predictive performance.
Overall, this innovative framework provides a robust approach for sustainable flood risk management in similar regions of Iran and worldwide and can significantly support policymakers in enhancing community resilience.
Keywords: Iranian Highlands, Flood Risk Assessment, Reinforcement Learning, Machine Learning.