Extracting Natech Reports from Large Databases: Development of a Semi-Intelligent Natech Identification Framework

Xiaolong Luo; Ana Maria Cruz; Dimitrios Tzioutzios

doi:10.1007/s13753-020-00314-6

Volume 11 Issue 6

Dec. 2021

Turn off MathJax

Article Contents

Article Navigation > International Journal of Disaster Risk Science > 2020 > 11(6): 735-750

Xiaolong Luo, Ana Maria Cruz, Dimitrios Tzioutzios. Extracting Natech Reports from Large Databases: Development of a Semi-Intelligent Natech Identification Framework[J]. International Journal of Disaster Risk Science, 2020, 11(6): 735-750. doi: 10.1007/s13753-020-00314-6

Citation:

Xiaolong Luo, Ana Maria Cruz, Dimitrios Tzioutzios. Extracting Natech Reports from Large Databases: Development of a Semi-Intelligent Natech Identification Framework[J]. International Journal of Disaster Risk Science, 2020, 11(6): 735-750. doi: 10.1007/s13753-020-00314-6

Citation:

Xiaolong Luo, Ana Maria Cruz, Dimitrios Tzioutzios. Extracting Natech Reports from Large Databases: Development of a Semi-Intelligent Natech Identification Framework[J]. International Journal of Disaster Risk Science, 2020, 11(6): 735-750. doi: 10.1007/s13753-020-00314-6

PDF

Extracting Natech Reports from Large Databases: Development of a Semi-Intelligent Natech Identification Framework

doi: 10.1007/s13753-020-00314-6

1 Department of Urban Management, Graduate School of Engineering, Kyoto University, Kyoto 615-8540, Japan;
2 Disaster Prevention Research Institute, Kyoto University, Kyoto 611-0011, Japan

Funds:

This research was supported in part by the Japan Society for the Promotion of Science (Kaken Grant 17K01336, April 2017–March 2020). We would also like to acknowledge that Xiaolong Luo holds a Ph.D. fellowship from the China Scholarship Council (CSC No. 201606620007) and the Ministry of Education, Culture, Sports, Science, and Technology of Japan (Monbukagakusho: MEXT Scholarship No. 171572, 2017–2021), and Dimitrios Tzioutzios holds a Ph.D. fellowship from the Ministry of Education, Culture, Sports, Science, and Technology of Japan (Monbukagakusho: MEXT scholarship, 2019–2022).

Available Online: 2021-12-25
Publish Date: 2021-12-25

Abstract

Abstract

Natural hazard-triggered technological accidents (Natechs) refer to accidents involving releases of hazardous materials (hazmat) triggered by natural hazards. Huge economic losses, as well as human health and environmental problems are caused by Natechs. In this regard, learning from previous Natechs is critical for risk management. However, due to data scarcity and high uncertainty concerning such hazards, it becomes a serious challenge for risk managers to detect Natechs from large databases, such as the National Response Center (NRC) database. As the largest database of hazmat release incidents, the NRC database receives hazmat release reports from citizens in the United States. However, callers often have incomplete details about the incidents they are reporting. This results in many records having incomplete information. Consequently, it is quite difficult to identify and extract Natechs accurately and efficiently. In this study, we introduce machine learning theory into the Natech retrieving research, and a Semi-Intelligent Natech Identification Framework (SINIF) is proposed in order to solve the problem. We tested the suitability of two supervised machine learning algorithms, namely the Long ShortTerm Memory (LSTM) and the Convolutional Neural Network (CNN), and selected the former for the development of the SINIF. According to the results, the SINIF is efficient (a total number of 826,078 records were analyzed) and accurate (the accuracy is over 0.90), while 32,841 Natech reports between 1990 and 2017 were extracted from the NRC database. Furthermore, the majority of those Natech reports (97.85%) were related to meteorological phenomena, with hurricanes (24.41%), heavy rains (19.27%), and storms (18.29%) as the main causes of these reported Natechs. Overall, this study suggests that risk managers can benefit immensely from SINIF in analyzing Natech data from large databases efficiently.
- Data extraction method,
- Machine learning,
- Natechs,
- Natural hazards,
- NRC database

FullText(HTML)

References(59)

References

Antonioni, G., S. Bonvicini, G. Spadoni, and V. Cozzani. 2009. Development of a framework for the risk assessment of Na-Tech accidental events. Reliability Engineering and System Safety 94(9): 1442–1450.

Antonioni, G., G. Landucci, A. Necci, D. Gheorghiu, and V. Cozzani. 2015. Quantitative assessment of risk due to NaTech scenarios caused by floods. Reliability Engineering and System Safety 142: 334–345.

Antonioni, G., G. Spadoni, and V. Cozzani. 2007. A methodology for the quantitative risk assessment of major accidents triggered by seismic events. Journal of Hazardous Materials 147(1–2): 48–59.

Bashar, S.S., and A.A. Mahmud. 2019. A machine learning approach for heart rate estimation from PPG signal using random forest regression algorithm. In Proceedings of 2019 International Conference on Electrical, Computer and Communication Engineering (ECCE), 24–25 July 2019, Swat, Pakistan, 1–5.

Bishop, C.M. 2006. Pattern recognition and machine learning. Berlin: Springer.

Blanzieri, E., and A. Bryl. 2008. A survey of learning-based techniques of email spam filtering. Artificial Intelligence Review 29: 63–92.

Bureau for Analysis of Industrial Risk and Pollution. 2019. Analysis, Research and Information on Accidents (ARIA). French Ministry of Ecology and Sustainable Development, Bureau for Analysis of Industrial Risk and Pollution, France. https://www.aria.developpement-durable.gouv.fr. Accessed 10 Jan 2019 (in French).

Campadelli, P., R. Lanzarotti, and G. Lipori. 2004. Face detection in color images of generic scenes. IEEE Transactions on Pattern Analysis and Machine Intelligence 24: 97–103.

Campedel, M., V. Cozzani, A. Garcia-Agreda, and E. Salzano. 2008. Extending the quantitative assessment of industrial risks to earthquake effects. Risk Analysis 28: 1231–1246.

Chowdhury, G.G. 2003. Natural language processing. Annual Review of Information Science and Technology 37: 51–89.

Cireşan, D., U. Meier, and J. Schmidhuber. 2012. Multi-column deep neural networks for image classification. arXiv preprint arXiv:1202.2745.

Clow, J.C. 1980. The Coast Guard’s National Response Center. Environment International 3(2): 151–153.

Cozzani, V., M. Campedel, E. Renni, and E. Krausmann. 2010. Industrial accidents triggered by flood events: Analysis of past accidents. Journal of Hazardous Materials 175: 501–509.

Cruz, A.M., and E. Krausmann. 2009. Hazardous-materials releases from offshore oil and gas facilities and emergency response following Hurricanes Katrina and Rita. Journal of Loss Prevention in the Process Industries 22(1): 59–65.

Cruz, A.M., and N. Okada. 2008. Consideration of natural hazards in the design and risk management of industrial facilities. Natural Hazards 44(2): 213–227.

Cruz, A.M., L.J. Steinberg, and A.L. Vetere-Arellano. 2006. Emerging issues for natech disaster risk management in Europe. Journal of Risk Research 9(5): 483–501.

Diale, M., T. Celik, and C. Van Der Walt. 2019. Unsupervised feature learning for spam email filtering. Computers and Electrical Engineering 74: 89–104.

Emanuel, K. 2007. Environmental factors affecting tropical cyclone power dissipation. Journal of Climate 20(22): 5497–5509.

European Commission. 2019a. Natech accident database. Joint Research Centre, Institute for the Protection and Security of the Citizen, Italy. http://enatech.jrc.ec.europa.eu. Accessed 11 Jan 2019.

European Commission. 2019b. eMars (Major Accident Reporting System) database. European Commission, Joint Research Centre, Institute for the Protection and Security of the Citizen, Italy. http://emars.jrc.ec.europa.eu. Accessed 11 Jan 2019.

Fernández, S., A. Graves, and J. Schmidhuber. 2007. An application of recurrent neural networks to discriminative keyword spotting. In Proceedings of International Conference on Artificial Neural Networks, 9–13 September 2007, Porto, Portugal, 220–229.

Fukushima, K. 1980. Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biological Cybernetics 36(4): 193–202.

Girgin, S., and E. Krausmann. 2014. Analysis of pipeline accidents induced by natural hazards: Final report. JRC88410. Joint Research Centre, European Commission, Italy.

Girgin, S., and E. Krausmann. 2016. Historical analysis of U.S. onshore hazardous liquid pipeline accidents triggered by natural hazards. Journal of Loss Prevention in the Process Industries 40: 578–590.

Graves, A., and J. Schmidhuber. 2009. Offline handwriting recognition with multidimensional recurrent neural networks. In Proceedings of the 21st International Conference on Neural Information Processing Systems, ed. D. Koller, D. Schuurmans, Y. Bengio, and L. Bottou, 545–552. Red Hook, NY: Curran Associates Inc.

Hochreiter, S., and J. Schmidhuber. 1997. Long short-term memory. Neural Computation 9(8): 1735–1780.

Huang, G.-B., H. Zhou, X. Ding, and R. Zhang. 2011. Extreme learning machine for regression and multiclass classification. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 42(2): 513–529.

Ji, S., W. Xu, M. Yang, and K. Yu. 2013. 3D convolutional neural networks for human action recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 35(1): 221–231.

Khakzad, N., and P. Van Gelder. 2017. Fragility assessment of chemical storage tanks subject to floods. Process Safety and Environmental Protection 111: 75–84.

Khakzad, N., and P. Van Gelder. 2018. Vulnerability of industrial plants to flood-induced natechs: A Bayesian network approach. Reliability Engineering and System Safety 169: 403–411.

Khakzad, N., M. Dadashzadeh, and G. Reniers. 2018. Quantitative assessment of wildfire risk in oil facilities. Journal of Environmental Management 223: 433–443.

Knutson, T.R., R.E. Tuleya, S.T. Garner, M.A. Bender, G.A. Vecchi, I.M. Held, and J.J. Sirutis. 2010. Modeled impact of anthropogenic warming on the frequency of intense Atlantic hurricanes. Science 327(5964): 454–458.

Krausmann, E., and A.M. Cruz. 2013. Impact of the 11 March 2011, Great East Japan earthquake and tsunami on the chemical industry. Natural Hazards 67(2): 811–828.

Krausmann, E., and F. Mushtaq. 2008. A qualitative Natech damage scale for the impact of floods on selected industrial facilities. Natural Hazards 46(2): 179–197.

Krausmann, E., and E. Salzano. 2017. Lessons learned from Natech events. In Natech risk assessment and management: Reducing the risk of natural-hazard impact on hazardous installations, ed. E. Krausmann, A.M. Cruz, and E. Salzano, 33–52. Amsterdam: Elsevier.

Krausmann, E., E. Renni, M. Campedel, and V. Cozzani. 2011. Industrial accidents triggered by earthquakes, floods and lightning: Lessons learned from a database analysis. Natural Hazards 59(1): 285–300.

Kumasaki, M., T. Hara, N. Nakajima, Y. Wada, and R. Makino. 2017. The classification of physical effects from natural hazards for Natech risk assessment based on a Japanese database. Journal of Loss Prevention in the Process Industries 50(B): 308–316.

Landsea, C.W., and J.L. Franklin. 2013. Atlantic hurricane database uncertainty and presentation of a new database format. Monthly Weather Review 141(10): 3576–3592.

Le, Q.V., and T. Mikolov. 2014. Distributed representations of sentences and documents. In Proceedings of the 31st International Conference on Machine Learning, 21–26 June 2014, Beijing, China. JMLR:W&CP 32: 1–5.

LeCun, Y., and Y. Bengio. 1997. Convolutional networks for images, speech, and time series. In The handbook of brain theory and neural networks, ed. M.A. Arbib, 276–279. Cambridge, MA: The MIT Press.

LeCun, Y., Y. Bengio, and G. Hinton. 2015. Deep learning. Nature 521(7553): 436–444.

Lu, D., and Q. Weng. 2007. A survey of image classification methods and techniques for improving classification performance. International Journal of Remote Sensing 28(5): 823–870.

Manevitz, L.M., and M. Yousef. 2001. One-class SVMs for document classification. Journal of Machine Learning Research 2: 139–154.

Murphy, K.P. 2012. Machine learning: A probabilistic perspective. Cambridge, MA: The MIT press.

Özgür, A., L. Özgür, and T. Güngör. 2005. Text categorization with class-based and corpus-based keyword selection. In Proceedings of 20th International Symposium on Computer and Information Sciences, 26–28 October 2005, Istanbul, Turkey, 606–615.

Ranjan, R., V.M. Patel, and R. Chellappa. 2019. HyperFace: A deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 41: 121–135.

Rubin, T.N., A. Chambers, P. Smyth, and M. Steyvers. 2012. Statistical topic models for multi-label document classification. Machine Learning 88(1–2): 157–208.

Sboev, A., T. Litvinova, D. Gudovskikh, R. Rybka, and I. Moloshnikov. 2016. Machine learning models of text categorization by author gender using topic-independent features. Procedia Computer Science 101: 135–142.

Sengul, H., N. Santella, L.J. Steinberg, and A.M. Cruz. 2012. Analysis of hazardous material releases due to natural hazards in the United States. Disasters 36(4): 723–743.

Shah, A.A., J. Ye, M. Abid, J. Khan, and S.M. Amir. 2018. Flood hazards: Household vulnerability and resilience in disaster-prone districts of Khyber Pakhtunkhwa Province, Pakistan. Natural Hazards 93(1): 147–165.

Shin, J., Y. Kim, S. Yoon, and K. Jung. 2018a. Contextual-CNN: A novel architecture capturing unified meaning for sentence classification. In Proceedings of 2018 IEEE International Conference on Big Data and Smart Computing, 15–18 January 2018, Shanghai, China, 491–494.

Shih, C.H., B.C. Yan, S.H. Liu, and B. Chen. 2018b. Investigating Siamese LSTM networks for text categorization. In Proceedings of the 9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 12–15 November 2018, Honolulu, Hawaii, USA, 641–646.

Showalter, P.S., and M.F. Myers. 1992. Natural disasters as the cause of technological emergencies: A review of the decade, 1980–1989. Boulder, CO: Univercity of Colorada.

Sokolova, M., and G. Lapalme. 2009. A systematic analysis of performance measures for classification tasks. Information Processing and Management 45(4): 427–437.

Sotthisopha, N., and P. Vateekul. 2018. Improving short text classification using fast semantic expansion on multichannel convolutional neural network. In Proceedings of the 19th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), 27–29 June 2018, Busan, South Korea, 182–187.

Sudharshan, P.J., C. Petitjean, F. Spanhol, L.E. Oliveira, L. Heutte, and P. Honeine. 2019. Multiple instance learning for histopathological breast cancer image classification. Expert Systems with Applications 117: 103–111.

TNO Industrial and External Safety. 2019. Failure and ACcidents Technical information System (FACTS). the Unified Industrial & Harbour Fire Department in Rotterdam-Rozenburg, the Netherlands. http://www.factsonline.nl/. Accessed 14 Jan 2019.

Torn, R.D., and C. Snyder. 2012. Uncertainty of tropical cyclone best-track information. Weather and Forecasting 27(3): 715–729.

United States Coast Guard. 2017. United States National Response Center (NRC) database. Washington, DC: United States Coast Guard. http://www.nrc.uscg.mil/. Accessed 17 Oct 2017.

Relative Articles

Supplements(0)

Cited By

Proportional views