An Application of Natural Language Processing to Classify What Terrorists Say They Want
1. Introduction
- Apply natural language processing (NLP) techniques to extract features from the stated motive narrative of terrorist attacks to identify perpetrator aim categories (PACs).
- Validate the effectiveness of the PAC classification by evaluating the predictive performance of 11 different machine learning (ML) models applied to a different narrative field, the event summary. Evaluating multiple types of ML models matters because no single model type can best represent all datasets, which is an accepted tenet in the practice of ML (Aggarwal 2015).
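The validation step above can be sketched as a cross-validation loop that scores several candidate classifiers on the same labeled event summaries. The sketch is illustrative only: the six summaries, the two toy models (`majority_model`, a null baseline, and `keyword_vote_model`), and the fold scheme are hypothetical stand-ins, not the 11 models or the data used in this study.

```python
from collections import Counter

def majority_model(train):
    """Null baseline: always predict the most common training label."""
    label = Counter(y for _, y in train).most_common(1)[0][0]
    return lambda text: label

def keyword_vote_model(train):
    """Toy model: each known word votes for the labels it appeared with in training."""
    votes = {}
    for text, y in train:
        for word in set(text.lower().split()):
            votes.setdefault(word, Counter())[y] += 1
    fallback = majority_model(train)

    def predict(text):
        tally = Counter()
        for word in set(text.lower().split()):
            if word in votes:
                tally.update(votes[word])
        # Fall back to the majority label when no word was seen in training.
        return tally.most_common(1)[0][0] if tally else fallback(text)

    return predict

def cross_validate(data, make_model, k=3):
    """Mean accuracy over k folds; fold i holds out items i, i + k, i + 2k, ..."""
    scores = []
    for i in range(k):
        test = data[i::k]
        train = [item for j, item in enumerate(data) if j % k != i]
        model = make_model(train)
        scores.append(sum(model(x) == y for x, y in test) / len(test))
    return sum(scores) / k

# Hypothetical event summaries with hypothetical PAC labels.
data = [
    ("assailants attacked polling station during election", "Protest"),
    ("gunmen killed informer in revenge attack", "Retaliate"),
    ("bomb exploded near election rally", "Protest"),
    ("militants shot suspected informer in revenge", "Retaliate"),
    ("arsonists burned election office", "Protest"),
    ("attackers killed villager in revenge", "Retaliate"),
]

candidates = [("null", majority_model), ("keyword_vote", keyword_vote_model)]
results = {name: cross_validate(data, make) for name, make in candidates}
print(results)
```

Scoring every candidate on identical folds, next to a null baseline, is what makes the comparison meaningful: a model is only useful if it clearly beats always guessing the majority class.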
2. Literature Review
2.1. Root Causes
2.2. Planned Impacts
2.3. Terrorist Aims
2.4. ML Applications
3. Methodology
3.1. Data
3.2. Topic Classification
3.3. Machine Learning
4. Results
4.1. Topic Classification
4.2. Machine Learning
5. Discussion
6. Conclusions
Data Availability Statement
Conflicts of Interest
References
- Abrahms, Max. 2008. What Terrorists Really Want: Terrorist Motives and Counterterrorism Strategy. International Security 32: 78–105.
- Abrahms, Max, Nicholas Beauchamp, and Joseph Mroszczyk. 2017. What Terrorist Leaders Want: A Content Analysis of Terrorist Propaganda Videos. Studies in Conflict & Terrorism 40: 899–916.
- Aggarwal, Charu C. 2015. Data Mining. New York: Springer International Publishing, p. 734.
- Araújo, Rafaella de C. R., Magdalena Bobowik, Roosevelt Vilar, James H. Liu, Homero Gil de Zuñiga, Larissa Kus-Harbord, Nadezhda Lebedeva, and Valdiney V. Gouveia. 2020. Human values and ideological beliefs as predictors of attitudes toward immigrants across 20 countries: The country-level moderating role of threat. European Journal of Social Psychology 50: 534–46.
- Arce, Daniel G. 2019. On the human consequences of terrorism. Public Choice 178: 371–96.
- Balestrini, Pierre Philippe. 2021. Counterterrorism Evaluation and Citizens: More Than about Policing? Social Sciences 10: 298.
- Bassetti, Thomas, Raul Caruso, and Friedrich Schneider. 2018. The tree of political violence: A GMERT analysis. Empirical Economics 54: 839–50.
- Bilal, Muhammad, Abdullah Gani, Muhammad Ikram Ullah Lali, Mohsen Marjani, and Nadia Malik. 2019. Social Profiling: A Review, Taxonomy, and Challenges. Cyberpsychology, Behavior, and Social Networking 22: 433–50.
- Björgo, Tore. 1993. Terrorist violence against immigrants and refugees in Scandinavia: Patterns and motives. In Racist Violence in Europe. Edited by Tore Björgo and Rob Witte. London: Palgrave Macmillan, pp. 29–45.
- Breiman, Leo. 2001. Random forests. Machine Learning 45: 5–32.
- Burke, Paul. 2021. Al-Qaeda. In Global Jihadist Terrorism. Edited by P. Burke, D. Elnakhala and S. Miller. Cheltenham: Edward Elgar Publishing, p. 352.
- Burnap, Peter, and Matthew Leighton Williams. 2014. Hate speech, machine classification and statistical modelling of information flows on Twitter: Interpretation and communication for policy decision making. Paper presented at the Internet, Policy & Politics, Oxford, UK, September 26.
- Canetti, Daphna, Joshua Gubler, and Thomas Zeitzoff. 2021. Motives Don’t Matter? Motive Attribution and Counterterrorism Policy. Political Psychology 42: 483–99.
- Canhoto, Ana Isabel. 2021. Leveraging machine learning in the global fight against money laundering and terrorism financing: An affordances perspective. Journal of Business Research 131: 441–52.
- Cassese, Antonio. 2006. The Multifaceted Criminal Notion of Terrorism in International Law. Journal of International Criminal Justice 4: 933–58.
- Chen, Tianqi, and Carlos Guestrin. 2016. XGBoost: A Scalable Tree Boosting System. Paper presented at the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, August 13–17.
- Coaffee, Jon. 2009. Protecting the Urban: The Dangers of Planning for Terrorism. Theory, Culture & Society 26: 343–55.
- Coccia, Mario. 2018. A Theory of General Causes of Terrorism: High Population Growth, Income Inequality and Relative Deprivation. Social Science Research Network 2: 26.
- Cottee, Simon. 2021. Incel (E)motives: Resentment, Shame and Revenge. Studies in Conflict & Terrorism 44: 93–114.
- Cottee, Simon, and Keith J. Hayward. 2011. Terrorist (E)motives: The Existential Attractions of Terrorism. Studies in Conflict & Terrorism 34: 963–86.
- Das, Priyanka, and Asit Kumar Das. 2019. Graph-based clustering of extracted paraphrases for labelling crime reports. Knowledge Based Systems 179: 55–76.
- Enders, Walter, and Xuejuan Su. 2007. Rational Terrorists and Optimal Network Structure. Journal of Conflict Resolution 51: 33–57.
- Fawcett, Tom. 2006. An introduction to ROC analysis. Pattern Recognition Letters 27: 861–74.
- Feng, Yi, Dujuan Wang, Yunqiang Yin, Zhiwu Li, and Zhineng Hu. 2020. An XGBoost-Based Casualty Prediction Method for Terrorist Attacks. Complex & Intelligent Systems 6: 721–40.
- Friedman, Jerome H. 2001. Greedy function approximation: A gradient boosting machine. Annals of Statistics 29: 1189–232.
- Géron, Aurélien. 2017. Hands-On Machine Learning with Scikit-Learn and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems, 2nd ed. Sebastopol: O’Reilly Media, p. 856.
- Guo, Diansheng, Ke Liao, and Michael Morgan. 2007. Visualizing patterns in a global terrorism incident database. Environment and Planning B: Planning and Design 34: 767–84.
- Hao, Mengmeng, Dong Jiang, Fangyu Ding, Jingying Fu, and Shuai Chen. 2019. Simulating Spatio-Temporal Patterns of Terrorism Incidents on the Indochina Peninsula with GIS and the Random Forest Method. ISPRS International Journal of Geo-Information 8: 133.
- Hastie, Trevor, Robert Tibshirani, and Jerome Friedman. 2016. The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd ed. New York: Springer, p. 767.
- Höflinger, Tim. 2021. Modern terrorism: Motives of individual terrorists or the strategies of terrorist groups? Global Change, Peace & Security 33: 77–83.
- Huamaní, Enrique Lee, Alva Mantari, and Avid Roman-Gonzalez. 2020. Machine Learning Techniques to Visualize and Predict Terrorist Attacks Worldwide using the Global Terrorism Database. International Journal of Advanced Computer Science and Applications 11: 562–70.
- Huff, Connor, and Joshua D. Kertzer. 2018. How the Public Defines Terrorism. American Journal of Political Science 62: 55–71.
- James, Gareth, Daniela Witten, Trevor Hastie, and Robert Tibshirani. 2013. An Introduction to Statistical Learning with Applications in R. New York: Springer, vol. 112.
- Jaspersen, Johannes G., and Gilberto Montibeller. 2020. On the Learning Patterns and Adaptive Behavior of Terrorist Organizations. European Journal of Operational Research 282: 221–34.
- Jones, Karen Sparck, and Peter Willett, eds. 1997. Readings in Information Retrieval. Burlington: Morgan Kaufmann, p. 589.
- Khalifa, Nour Eldeen Mahmoud, Mohamed Hamed N. Taha, Sarah Hamed N. Taha, and Aboul Ella Hassanien. 2019. Statistical Insights and Association Mining for Terrorist Attacks in Egypt. In International Conference on Advanced Machine Learning Technologies and Applications. Cham: Springer, pp. 291–300.
- Krieger, Tim, and Daniel Meierrieks. 2011. What causes terrorism? Public Choice 147: 3–27.
- Kurtulus, Ersun N. 2017. Terrorism and fear: Do terrorists really want to scare? Critical Studies on Terrorism 10: 501–22.
- Kydd, Andrew H., and Barbara F. Walter. 2006. The Strategies of Terrorism. International Security 31: 49–79.
- Lane, Hobson, Cole Howard, and Hannes Max Hapke. 2019. Natural Language Processing in Action: Understanding, Analyzing, and Generating Text with Python. Shelter Island: Manning Publications Co.
- Luo, Lanjun, and Chao Qi. 2021. An analysis of the crucial indicators impacting the risk of terrorist attacks: A predictive perspective. Safety Science 144: 105442.
- Mashechkin, Igor V., Mikhail Petrovskiy, Dmitry V. Tsarev, and Maxim N. Chikunov. 2019. Machine Learning Methods for Detecting and Monitoring Extremist Information on the Internet. Programming and Computer Software 45: 99–115.
- Masuku, Mfundo Mandla, Victor H. Mlambo, and Bhekani J. Ngwenya. 2021. The Critical Analyses of Propaganda of the Terrorism Deed. Technium Social Sciences Journal 25: 619–29.
- Maszka, John. 2018. The Perils of Deduction: Limitations of Theoretical Models in Terrorism Studies. Systems Research and Behavioral Science 35: 884–907.
- Mishra, Namrata, Shrabanee Swagatika, and Debabrata Singh. 2020. An Intelligent Framework for Analysing Terrorism Actions Using Cloud. In New Paradigm in Decision Science and Management. Advances in Intelligent Systems and Computing. Edited by Srikanta Patnaik, Andrew W. H. Ip, Madjid Tavana and Vipul Jain. Singapore: Springer, vol. 1005, pp. 225–35.
- Monahan, Tom, and Robin Maria Valeri. 2018. Terrorism and Fear. In Terrorism in America, 1st ed. Edited by Robin Maria Valeri and Kevin Borgeson. New York: Routledge, p. 256.
- Morley, Barry, and Gavin D. Leslie. 2007. Terrorist bombings: Motives, methods and patterns of injuries. Australasian Emergency Nursing Journal 10: 5–12.
- Natekin, Alexey, and Alois Knoll. 2013. Gradient Boosting Machines, a Tutorial. Frontiers in Neurorobotics 7: 21.
- Padmaja, V. R., S. Lakshmi Narayana, and C. H. Divakar. 2018. Probabilistic Topic Modeling and its Variants: A Survey. International Journal of Advanced Research in Computer Science 9: 173–77.
- Reich, Walter. 1990. Understanding terrorist behavior: The limits and opportunities of psychological inquiry. In Origins of Terrorism: Psychologies, Ideologies, Theologies, States of Mind. Edited by Walter Reich. Cambridge: Cambridge University Press, Washington, DC: Woodrow Wilson International Center for Scholars, pp. 261–79.
- Rigterink, Anouk S. 2021. The Wane of Command: Evidence on Drone Strikes and Control within Terrorist Organizations. American Political Science Review 115: 31–50.
- Rothenberger, Liane Phil, and Kathrin M.A. Müller. 2015. Categorizing terrorist entities listed by the European Union according to terrorist groups’ underlying motives. Conflict & Communication 14: 1–14.
- START. 2020. University of Maryland. Available online: (accessed on 13 April 2020).
- Tamborini, Ron, Lindsay Hahn, Melinda Aley, Sujay Prabhu, Joshua Baldwin, Neha Sethi, Eric Novotny, Brian Klebig, and Matthias Hofer. 2020. The Impact of Terrorist Attack News on Moral Intuitions. Communication Studies 71: 511–27.
- Treistman, Jeffrey. 2021. Social Exclusion and Political Violence: Multilevel Analysis of the Justification of Terrorism. Studies in Conflict & Terrorism, 1–24.
- Uddin, M. Irfan, Nazir Zada, Furqan Aziz, Yousaf Saeed, Asim Zeb, Syed Atif Ali Shah, Mahmoud Ahmad Al-Khasawneh, and Marwan Mahmoud. 2020. Prediction of Future Terrorist Activities Using Deep Neural Networks. Complexity 2020: 1373087.
- Van Um, Eric. 2011. Discussing Concepts of Terrorist Rationality: Implications for Counterterrorism Policy. Defence and Peace Economics 22: 161–79.
- Veen, Fjodor Van, and Stefan Leijnen. 2016. The Neural Network Zoo. The Asimov Institute for Artificial Creativity & Constraint. September 14. Available online: (accessed on 3 August 2020).
- Wong, Kevin, Geoffrey Walton, and Gavin Bailey. 2021. Using information science to enhance educational preventing violent extremism programs. Journal of the Association for Information Science and Technology 72: 362–76.
Model | Description | Reference |
Logistic Regression (LR) | Fits the data to a logistic function of the linear combination of attributes to estimate the probability of a binary class. | Aggarwal (2015), Géron (2017) |
Support Vector Machine (SVM) | Finds a hyperplane in multidimensional feature space that maximally separates the classes. | Aggarwal (2015), Géron (2017) |
Stochastic Gradient Descent (SGD) | Fits a linear multivariate function to the data by randomly selecting data instances to calculate parameter updates that minimize a selected loss function. | Géron (2017) |
Decision Tree (DT) | Grows a logic tree by recursively splitting nodes to maximize the purity of child or leaf nodes. | Géron (2017), Hastie et al. (2016) |
Random Forest (RF) | Grows many shallow and partial decision trees, each splitting nodes on a randomly selected subset of attributes and data, and then uses majority vote to predict the class. | Aggarwal (2015), Breiman (2001) |
AdaBoost (ADB) | Sequentially builds shallow decision trees (stumps) that improve on the prediction errors of previous trees, and then uses a weighted majority vote to predict the class. | Hastie et al. (2016) |
Multi-layer Perceptron (MLP) | A feed-forward and fully connected artificial neural network that learns a function with one or more inner layers of neurons. | Géron (2017), Veen and Leijnen (2016) |
Naïve Bayes (NB) | Uses Bayes probability theory to predict a class given the observed set of features, assuming that the features are independent. | Aggarwal (2015), James et al. (2013) |
k-Nearest Neighbors (kNN) | Predicts a class based on the majority vote of its k-nearest neighbors in feature space. | Aggarwal (2015), James et al. (2013) |
Gradient Boosting (GB) | Sequentially builds improved models to predict the errors or residuals of previous models. | Natekin and Knoll (2013), Aggarwal (2015) |
Extreme Gradient Boosting (XGB) | A highly configurable version of gradient boosting that incorporates regularization. | Chen and Guestrin (2016), Feng et al. (2020) |
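As a concrete instance of one row in the table above, the k-nearest neighbors description can be sketched in a few lines. This is a minimal illustration, assuming Euclidean distance and an unweighted vote; the two-dimensional points and the labels "A" and "B" are hypothetical and unrelated to the study's text features.

```python
import math
from collections import Counter

def knn_predict(train, query, k=3):
    """Return the majority label among the k training points closest to query."""
    # Sort training points by Euclidean distance to the query and keep the k nearest.
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    # Unweighted majority vote over the labels of those neighbors.
    return Counter(label for _, label in nearest).most_common(1)[0][0]

# Hypothetical feature vectors forming two well-separated clusters.
train = [
    ((0.0, 0.0), "A"), ((0.0, 1.0), "A"), ((1.0, 0.0), "A"),
    ((5.0, 5.0), "B"), ((5.0, 6.0), "B"), ((6.0, 5.0), "B"),
]

print(knn_predict(train, (0.5, 0.5)))  # A
print(knn_predict(train, (5.5, 5.5)))  # B
```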
PAC | Definition | Corpus | Motive Narrative Keywords and Their Document Frequency |
Protest | Expression of ire without necessarily demanding a change. | F: 2620 D: 1544 | protest (0.242), elect (0.233), resist (0.069), construct (0.064), celebration (0.049), oppose (0.046), drug (0.037), democrat (0.036), demonstrate (0.035), policy (0.029), republican (0.027), trial (0.026), show (0.024), conflict (0.024), express (0.019), trade (0.014), pipeline (0.013), leftist (0.010), amnesty (0.010), instillation (0.005), miner (0.004), unhappy (0.004), animal (0.003), attention (0.001) |
Retaliate | Reaction to an action or situation. | F: 3498 D: 2956 | retaliate (0.335), spy (0.090), NPA (0.078), revenge (0.071), punish (0.048), extort (0.045), avenge (0.035), murder (0.033), crime (0.028), cooperate (0.022), corrupt (0.020), collaborate (0.019), anarchist (0.017), retribution (0.017), traitor (0.014), respond (0.011), mistreat (0.010), reprise (0.010), critic (0.009), sympathetic (0.007), clash (0.006), anniversary (0.004), blame (0.004), react (0.004), defy (0.003) |
Intimidate | Repel people or actions by instilling fear. | F: 866 D: 781 | intimidate (0.800), threaten (0.060), fear (0.036), threat (0.027), prove (0.020), deter (0.019), raid (0.015), away (0.013), discourage (0.010) |
Weaken | Weaken the target or create instability. | F: 617 D: 703 | destabilize (0.498), weaken (0.296), disrupt (0.156), interest (0.048), destroy (0.038), offense (0.031), undermine (0.014), dampen (0.009), chaos (0.007), distract (0.007), insurgency (0.004) |
Force | Force or demand a change or divert attention. | F: 2806 D: 1638 | demand (0.173), liberate (0.090), free (0.066), peace (0.040), prevent (0.038), pay (0.027), land (0.023), sharia (0.022), control (0.020), join (0.017), withdraw (0.016), halt (0.013), remove (0.013), cause (0.012), create (0.011), freedom (0.010), obtain (0.010), taken (0.010), close (0.007), kidnap (0.007), vivisection (0.007), supply (0.006), divert (0.006), preserve (0.006), finance (0.005), incite (0.005), indoctrinate (0.005), autonomy (0.005), revenue (0.004), occupy (0.004), overthrow (0.004), release (0.004), captor (0.002), stop (0.002), surrender (0.002), disavow (0.001) |
Despise | Expression of contempt for a race, gender identity, religion, or ideology. | F: 1341 D: 778 | Islam (0.288), religious (0.109), anti (0.087), Muslim (0.068), Ramadan (0.064), separatist (0.063), white (0.060), Christian (0.044), pro (0.041), Shi’i (0.041), sectarian (0.033), foreign (0.030), Jew (0.028), women (0.019), nationalist (0.017), ideology (0.015), racial (0.014), music (0.010), refuge (0.009), black (0.008), incompetence (0.008), politician (0.006) |
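The keyword table above suggests a simple rule-based labeling step, sketched below. Several things here are assumptions rather than the study's procedure: the keywords are treated as stems matched by plain substring search, document frequency is taken to mean the fraction of narratives containing a stem, only a small subset of the tabulated keywords is used, and the four motive narratives are hypothetical.

```python
# Small subset of the keywords listed in the table, treated as stems (assumption).
PAC_KEYWORDS = {
    "Protest": ["protest", "elect", "oppose"],
    "Retaliate": ["retaliate", "revenge", "punish"],
    "Intimidate": ["intimidate", "threaten", "fear"],
}

def assign_pacs(narrative):
    """Return every PAC whose keyword stems occur in the narrative."""
    text = narrative.lower()
    return [pac for pac, stems in PAC_KEYWORDS.items()
            if any(stem in text for stem in stems)]

def document_frequency(narratives, stem):
    """Fraction of narratives that contain the stem at least once."""
    hits = sum(stem in narrative.lower() for narrative in narratives)
    return hits / len(narratives)

# Hypothetical motive narratives.
narratives = [
    "The attack was carried out to protest the upcoming election.",
    "The group stated the bombing was revenge for the arrest of its members.",
    "The assailants sought to intimidate voters ahead of the election.",
    "The specific motive is unknown.",
]

for narrative in narratives:
    print(assign_pacs(narrative), "<-", narrative)
print(document_frequency(narratives, "elect"))  # 0.5
```

Substring matching is deliberately crude: "elect" also matches "selected", and the third narrative matches two categories at once. A practical pipeline would need proper tokenization and stemming, plus a rule for resolving narratives that match several PACs.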
World Region | Despise | Force | Intimidate | Protest | Retaliate | Weaken |
Australasia and Oceania | 0.26% | 0.06% | 0.26% | 0.19% | 0.24% | 0.14% |
Central America and Caribbean | 0.00% | 0.00% | 0.00% | 0.13% | 0.03% | 0.00% |
Central Asia | 0.51% | 0.37% | 0.26% | 0.58% | 0.07% | 0.85% |
East Asia | 0.39% | 0.12% | 0.13% | 0.84% | 0.37% | 0.00% |
Eastern Europe | 3.86% | 1.71% | 11.40% | 2.01% | 1.73% | 0.71% |
Middle East and North Africa | 28.02% | 16.67% | 40.08% | 20.01% | 30.51% | 66.15% |
North America | 9.51% | 4.52% | 1.92% | 8.23% | 0.98% | 0.71% |
South America | 2.31% | 5.80% | 2.82% | 5.12% | 2.81% | 1.85% |
South Asia | 25.58% | 43.65% | 27.66% | 34.26% | 37.31% | 16.22% |
Southeast Asia | 5.40% | 8.73% | 3.20% | 10.75% | 12.86% | 4.55% |
Sub-Saharan Africa | 13.62% | 14.29% | 9.35% | 7.25% | 8.22% | 6.83% |
Western Europe | 10.54% | 4.09% | 2.94% | 10.62% | 4.87% | 1.99% |
Model | AUC | CA (classification accuracy) | F1 | PR (precision) | RC (recall) | Hyperparameter Settings |
XGB | 0.900 | 0.905 | 0.895 | 0.900 | 0.905 | γ:[0:1,0], Max Depth: [2:8,6], Min Child Weight: [0:2,1], R:[0:10,1], w:[0:2,1], L:[0:1,0.2] |
RF | 0.893 | 0.890 | 0.866 | 0.897 | 0.890 | Trees (N): [20:100,60], Attributes/Split: [2:6,5], Min Subset: [2:10,5] |
LR | 0.887 | 0.898 | 0.893 | 0.892 | 0.898 | R: [L1: L2, L2], C:[0.1:10, 5] |
MLP | 0.867 | 0.889 | 0.885 | 0.883 | 0.889 | Hidden Nodes: [2:200,100], Activation: ReLU, OA: Adam (α:10^-4) |
NB | 0.853 | 0.696 | 0.744 | 0.868 | 0.696 | No parameters to tune. |
GB | 0.850 | 0.893 | 0.875 | 0.891 | 0.893 | LF: LR, Trees (N): [20:200,100], L: 0.2, Min Samples Leaf: [1:5,1] |
SGD | 0.841 | 0.891 | 0.874 | 0.882 | 0.891 | LF: (LR, ε:1), R: E.Net (α:10^-5, 0.15), L: IVS (η0:10^-2, t:0.25) |
kNN | 0.769 | 0.856 | 0.819 | 0.857 | 0.856 | N: [3:100,30], Distance (Euclidean, Weights: Uniform) |
ADB | 0.768 | 0.890 | 0.885 | 0.883 | 0.890 | Trees (N): [20:200,50], LF: Linear, OA: SAMME.R, LR: 1.0 |
DT | 0.735 | 0.866 | 0.838 | 0.842 | 0.866 | Max Depth: [5:20,10], Min Samples Leaf (N): [5:200,90], Min Subset: [5:20,5] |
SVM | 0.624 | 0.262 | 0.236 | 0.785 | 0.262 | Kernel: Sigmoid, R (C:0.2, ε:1.0) |
Null | 0.499 | 0.833 | 0.761 | 0.703 | 0.833 | No parameters to tune. |
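The CA and RC columns in the table above are identical in every row, which is what one would expect if recall is averaged over classes with weights proportional to class size, since that average reduces to overall accuracy. The sketch below illustrates this under that assumption; the function name `weighted_scores` and the toy labels are hypothetical, and AUC is omitted because it needs class probabilities rather than hard predictions.

```python
from collections import Counter

def weighted_scores(y_true, y_pred):
    """Accuracy plus class-weighted precision, recall, and F1."""
    n = len(y_true)
    support = Counter(y_true)  # number of true instances per class
    pairs = list(zip(y_true, y_pred))
    accuracy = sum(t == p for t, p in pairs) / n
    precision = recall = f1 = 0.0
    for label, count in support.items():
        tp = sum(t == label and p == label for t, p in pairs)
        predicted = sum(p == label for p in y_pred)
        pr = tp / predicted if predicted else 0.0  # no predictions: score 0
        rc = tp / count
        f = 2 * pr * rc / (pr + rc) if pr + rc else 0.0
        weight = count / n  # weight each class by its share of the data
        precision += weight * pr
        recall += weight * rc
        f1 += weight * f
    return {"CA": accuracy, "PR": precision, "RC": recall, "F1": f1}

# Hypothetical true and predicted labels for six events.
y_true = ["P", "P", "P", "R", "R", "W"]
y_pred = ["P", "P", "R", "R", "R", "R"]

scores = weighted_scores(y_true, y_pred)
print({name: round(value, 3) for name, value in scores.items()})
```

In this toy case four of six predictions are correct, so both CA and the weighted RC come out at two thirds, while PR and F1 differ because they also depend on how often each class is predicted.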
Category | AUC | CA (classification accuracy) | F1 | PR (precision) | RC (recall) |
Protest | 0.906 | 0.904 | 0.896 | 0.901 | 0.904 |
Retaliate | 0.921 | 0.865 | 0.861 | 0.868 | 0.865 |
Intimidate | 0.914 | 0.940 | 0.933 | 0.935 | 0.940 |
Weaken | 0.927 | 0.943 | 0.940 | 0.938 | 0.943 |
Force | 0.870 | 0.860 | 0.839 | 0.853 | 0.860 |
Despise | 0.864 | 0.920 | 0.899 | 0.906 | 0.920 |
Mean | 0.900 | 0.905 | 0.895 | 0.900 | 0.905 |
STD | 0.027 | 0.036 | 0.039 | 0.035 | 0.036 |
CV | 0.030 | 0.040 | 0.044 | 0.038 | 0.040 |
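The summary rows of the table above can be checked directly from the six per-category values. The short sketch below recomputes them for the AUC column, assuming STD is the sample standard deviation (n − 1 denominator) and CV is the coefficient of variation, STD divided by the mean; under those assumptions the rounded results agree with the tabulated 0.900, 0.027, and 0.030.

```python
import statistics

# AUC values per category, copied from the table (Protest ... Despise).
auc = [0.906, 0.921, 0.914, 0.927, 0.870, 0.864]

mean = statistics.mean(auc)
std = statistics.stdev(auc)   # sample standard deviation (n - 1 denominator)
cv = std / mean               # coefficient of variation

print(round(mean, 3), round(std, 3), round(cv, 3))  # 0.9 0.027 0.03
```

A coefficient of variation near 3% indicates that the AUC varies little across the six PACs.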
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2022 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Bridgelall, Raj. 2022. An Application of Natural Language Processing to Classify What Terrorists Say They Want. Social Sciences 11, no. 1: 23.