Identification of Antimicrobial Peptides and Their Functional Activities Using Ensemble Learning and Amino Acid Encoding Methods

Authors

  • Zhirui Liu

DOI:

https://doi.org/10.54097/qmgn4m95

Keywords:

Antimicrobial Peptides, Functional Activities Identification, Ensemble Learning, Machine Learning, Amino Acid Encoding.

Abstract

Antimicrobial resistance (AMR) has become a significant clinical crisis worldwide, resulting in substantial economic losses and posing a threat to human health. Because antimicrobial peptides (AMPs) can interact with membrane bilayers and inhibit microbial growth, they represent a promising therapeutic candidate to overcome AMR. Despite their potential, AMP development is strongly constrained due to the high costs and inefficiencies in identifying novel and effective AMPs, particularly those with specific functional activities against diverse microbial species. To address these limitations, I developed a comprehensive ensemble learning framework for both AMP identification and functional activity prediction. This approach integrates four amino acid encoding methods (one-hot encoding, BLOSUM62 substitution matrix, AAIndex physicochemical properties, and pseudo amino acid composition) with ensemble learning algorithms to predict 14 different antimicrobial activities, including antibacterial, antifungal, antiviral, anticancer, and other specialized functions. I also evaluated various advanced machine learning algorithms across different encoding approaches and built comprehensive model combinations for each task. The experimental results showed that my ensemble learning-based model achieved the highest performance, with an accuracy of 97.66% for AMR prediction. Additionally, the ensemble approach demonstrated superior performance compared to other individual models. It is able to achieve balanced accuracy across various functional activity tasks. In conclusion, my proposed method offers a practical approach for AMP discovery, helping to address the AMR crisis.

Downloads

Download data is not yet available.

References

[1] C.J.L. Murray, K.S. Ikuta, F. Sharara, L. Swetschinski, G. Robles Aguilar, A. Gray, C. Han, C. Bisignano, P. Rao, E. Wool, S.C. Johnson, A.J. Browne, M.G. Chipeta, F. Fell, S. Hackett, G. Haines-Woodhouse, B.H. Kashef Hamadani, E.A.P. Kumaran, B. McManigal, S. Achalapong, R. Agarwal, S. Akech, S. Albertson, J. Amuasi, J. Andrews, A. Aravkin, E. Ashley, F.-X. Babin, F. Bailey, S. Baker, B. Basnyat, A. Bekker, R. Bender, J.A. Berkley, A. Bethou, J. Bielicki, S. Boonkasidecha, J. Bukosia, C. Carvalheiro, C. Castañeda-Orjuela, V. Chansamouth, S. Chaurasia, S. Chiurchiù, F. Chowdhury, R. Clotaire Donatien, A.J. Cook, B. Cooper, T.R. Cressey, E. Criollo-Mora, M. Cunningham, S. Darboe, N.P.J. Day, M. De Luca, K. Dokova, A. Dramowski, S.J. Dunachie, T. Duong Bich, T. Eckmanns, D. Eibach, A. Emami, N. Feasey, N. Fisher-Pearson, K. Forrest, C. Garcia, D. Garrett, P. Gastmeier, A.Z. Giref, R.C. Greer, V. Gupta, S. Haller, A. Haselbeck, S.I. Hay, M. Holm, S. Hopkins, Y. Hsia, K.C. Iregbu, J. Jacobs, D. Jarovsky, F. Javanmardi, A.W.J. Jenney, M. Khorana, S. Khusuwan, N. Kissoon, E. Kobeissi, T. Kostyanev, F. Krapp, R. Krumkamp, A. Kumar, H.H. Kyu, C. Lim, K. Lim, D. Limmathurotsakul, M.J. Loftus, M. Lunn, J. Ma, A. Manoharan, F. Marks, J. May, M. Mayxay, N. Mturi, T. Munera-Huertas, P. Musicha, L.A. Musila, M.M. Mussi-Pinhata, R.N. Naidu, T. Nakamura, R. Nanavati, S. Nangia, P. Newton, C. Ngoun, A. Novotney, D. Nwakanma, C.W. Obiero, T.J. Ochoa, A. Olivas-Martinez, P. Olliaro, E. Ooko, E. Ortiz-Brizuela, P. Ounchanum, G.D. Pak, J.L. Paredes, A.Y. Peleg, C. Perrone, T. Phe, K. Phommasone, N. Plakkal, A. Ponce-de-Leon, M. Raad, T. Ramdin, S. Rattanavong, A. Riddell, T. Roberts, J.V. Robotham, A. Roca, V.D. Rosenthal, K.E. Rudd, N. Russell, H.S. Sader, W. Saengchan, J. Schnall, J.A.G. Scott, S. Seekaew, M. Sharland, M. Shivamallappa, J. Sifuentes-Osornio, A.J. Simpson, N. Steenkeste, A.J. Stewardson, T. Stoeva, N. Tasak, A. Thaiprakong, G. Thwaites, C. Tigoi, C. Turner, P. Turner, H.R. van Doorn, S. Velaphi, A. Vongpradith, M. Vongsouvath, H. Vu, T. Walsh, J.L. Walson, S. Waner, T. Wangrangsimakul, P. Wannapinij, T. Wozniak, T.E.M.W. Young Sharma, K.C. Yu, P. Zheng, B. Sartorius, A.D. Lopez, A. Stergachis, C. Moore, C. Dolecek, M. Naghavi, Global burden of bacterial antimicrobial resistance in 2019: a systematic analysis, The Lancet 399 (2022) 629–655. https://doi.org/10.1016/S0140-6736(21)02724-0.

[2] C.S. Ho, C.T.H. Wong, T.T. Aung, R. Lakshminarayanan, J.S. Mehta, S. Rauz, A. McNally, B. Kintses, S.J. Peacock, C. de la Fuente-Nunez, R.E.W. Hancock, D.S.J. Ting, Antimicrobial resistance: a concise update, Lancet Microbe 6 (2025) 100947. https://doi.org/10.1016/j.lanmic.2024.07.010.

[3] I.N. Okeke, M.E.A. de Kraker, T.P. Van Boeckel, C.K. Kumar, H. Schmitt, A.C. Gales, S. Bertagnolio, M. Sharland, R. Laxminarayan, The scope of the antimicrobial resistance challenge, The Lancet 403 (2024) 2426–2438. https://doi.org/10.1016/S0140-6736(24)00876-6.

[4] Md.A. Salam, Md.Y. Al-Amin, M.T. Salam, J.S. Pawar, N. Akhter, A.A. Rabaan, M.A.A. Alqumber, Antimicrobial Resistance: A Growing Serious Threat for Global Public Health, Healthcare 11 (2023) 1946. https://doi.org/10.3390/healthcare11131946.

[5] S. Panwar, M. Thapliyal, V. Kuriyal, V. Tripathi, A. Thapliyal, GEU-AMP50: Enhanced antimicrobial peptide prediction using a machine learning approach, Mater Today Proc 73 (2023) 81–87. https://doi.org/10.1016/j.matpr.2022.09.326.

[6] Y. Lin, Y. Cai, J. Liu, C. Lin, X. Liu, an advanced approach to identify antimicrobial peptides and their function types for penaeus through machine learning strategies, BMC Bioinformatics 20 (2019) 291. https://doi.org/10.1186/s12859-019-2766-9.

[7] C.D. Santos-Júnior, M.D.T. Torres, Y. Duan, Á. Rodríguez del Río, T.S.B. Schmidt, H. Chong, A. Fullam, M. Kuhn, C. Zhu, A. Houseman, J. Somborski, A. Vines, X.-M. Zhao, P. Bork, J. Huerta-Cepas, C. de la Fuente-Nunez, L.P. Coelho, Discovery of antimicrobial peptides in the global microbiome with machine learning, Cell 187 (2024) 3761-3778.e16. https://doi.org/10.1016/j.cell.2024.05.013.

[8] S. Zhang, Y. Sun, K. Yin, J. Zhang, L. Du, S. Wang, D. Zheng, R. Li, ML-AMPs designed through machine learning show antifungal activity against C. albicans and therapeutic potential on mice model with candidiasis, Life Sci 366–367 (2025) 123485. https://doi.org/10.1016/j.lfs.2025.123485.

[9] F. Wan, F. Wong, J.J. Collins, C. de la Fuente-Nunez, Machine learning for antimicrobial peptide identification and design, Nature Reviews Bioengineering 2 (2024) 392–407. https://doi.org/10.1038/s44222-024-00152-x.

[10] C. Bournez, M. Riool, L. de Boer, R.A. Cordfunke, L. de Best, R. van Leeuwen, J.W. Drijfhout, S.A.J. Zaat, G.J.P. van Westen, CalcAMP: A New Machine Learning Model for the Accurate Prediction of Antimicrobial Activity of Peptides, Antibiotics 12 (2023) 725. https://doi.org/10.3390/antibiotics12040725.

[11] K. Musin, E. Asyanova, How Machine Learning Helps in Combating Antimicrobial Resistance: A Review of AMP Analysis and Generation Methods, Int J Pept Res Ther 31 (2025) 59. https://doi.org/10.1007/s10989-025-10716-z.

[12] A. Cesaro, M. Bagheri, M. Torres, F. Wan, C. de la Fuente-Nunez, Deep learning tools to accelerate antibiotic discovery, Expert Opin Drug Discov 18 (2023) 1245–1257. https://doi.org/10.1080/17460441.2023.2250721.

[13] J.M. Stokes, K. Yang, K. Swanson, W. Jin, A. Cubillos-Ruiz, N.M. Donghia, C.R. MacNair, S. French, L.A. Carfrae, Z. Bloom-Ackermann, V.M. Tran, A. Chiappino-Pepe, A.H. Badran, I.W. Andrews, E.J. Chory, G.M. Church, E.D. Brown, T.S. Jaakkola, R. Barzilay, J.J. Collins, A Deep Learning Approach to Antibiotic Discovery, Cell 180 (2020) 688-702.e13. https://doi.org/10.1016/j.cell.2020.01.021.

[14] J. Yan, J. Cai, B. Zhang, Y. Wang, D.F. Wong, S.W.I. Siu, Recent Progress in the Discovery and Design of Antimicrobial Peptides Using Traditional Machine Learning and Deep Learning, Antibiotics 11 (2022) 1451. https://doi.org/10.3390/antibiotics11101451.

[15] T.-T. Lin, L.-Y. Yang, I.-H. Lu, W.-C. Cheng, Z.-R. Hsu, S.-H. Chen, C.-Y. Lin, AI4AMP: an Antimicrobial Peptide Predictor Using Physicochemical Property-Based Encoding Method and Deep Learning, MSystems 6 (2021). https://doi.org/10.1128/mSystems.00299-21.

[16] G. Wang, I.I. Vaisman, M.L. van Hoek, Machine Learning Prediction of Antimicrobial Peptides, in: 2022: pp. 1–37. https://doi.org/10.1007/978-1-0716-1855-4_1.

[17] J. Xu, F. Li, A. Leier, D. Xiang, H.-H. Shen, T.T. Marquez Lago, J. Li, D.-J. Yu, J. Song, Comprehensive assessment of machine learning-based methods for predicting antimicrobial peptides, Brief Bioinform 22 (2021). https://doi.org/10.1093/bib/bbab083.

[18] B. Vishnepolsky, M. Grigolava, G. Managadze, A. Gabrielian, A. Rosenthal, D.E. Hurt, M. Tartakovsky, M. Pirtskhalava, Comparative analysis of machine learning algorithms on the microbial strain-specific AMP prediction, Brief Bioinform 23 (2022). https://doi.org/10.1093/bib/bbac233.

[19] B. Olcay, G.D. Ozdemir, M.A. Ozdemir, U.K. Ercan, O. Guren, O. Karaman, Prediction of the synergistic effect of antimicrobial peptides and antimicrobial agents via supervised machine learning, BMC Biomed Eng 6 (2024) 1. https://doi.org/10.1186/s42490-024-00075-z.

[20] Y. Wang, L. Wang, C. Li, Y. Pei, X. Liu, Y. Tian, AMP-EBiLSTM: employing novel deep learning strategies for the accurate prediction of antimicrobial peptides, Front Genet 14 (2023). https://doi.org/10.3389/fgene.2023.1232117.

[21] C.M. Van Oort, J.B. Ferrell, J.M. Remington, S. Wshah, J. Li, AMPGAN v2: Machine Learning-Guided Design of Antimicrobial Peptides, J Chem Inf Model 61 (2021) 2198–2207. https://doi.org/10.1021/acs.jcim.0c01441.

[22] S. Lertampaiporn, T. Vorapreeda, A. Hongsthong, C. Thammarongtham, Ensemble-AMPPred: Robust AMP Prediction and Recognition Using the Ensemble Learning Method with a New Hybrid Feature for Differentiating AMPs, Genes (Basel) 12 (2021) 137. https://doi.org/10.3390/genes12020137.

[23] J. Yan, P. Bhadra, A. Li, P. Sethiya, L. Qin, H.K. Tai, K.H. Wong, S.W.I. Siu, Deep-AmPEP30: Improve Short Antimicrobial Peptides Prediction with Deep Learning, Mol Ther Nucleic Acids 20 (2020) 882–894. https://doi.org/10.1016/j.omtn.2020.05.006.

[24] A. Jan, M. Hayat, M. Wedyan, R. Alturki, F. Gazzawe, H. Ali, F.K. Alarfaj, Target-AMP: Computational prediction of antimicrobial peptides by coupling sequential information with evolutionary profile, Comput Biol Med 151 (2022) 106311. https://doi.org/10.1016/j.compbiomed.2022.106311.

[25] A. Khorshidi, A.A. Peterson, Amp: A modular approach to machine learning in atomistic simulations, Comput Phys Commun 207 (2016) 310–324. https://doi.org/10.1016/j.cpc.2016.05.010.

[26] J. Xu, F. Li, C. Li, X. Guo, C. Landersdorfer, H.-H. Shen, A.Y. Peleg, J. Li, S. Imoto, J. Yao, T. Akutsu, J. Song, iAMPCN: a deep-learning approach for identifying antimicrobial peptides and their functional activities, Brief Bioinform 24 (2023). https://doi.org/10.1093/bib/bbad240.

[27] M.P. Styczynski, K.L. Jensen, I. Rigoutsos, G. Stephanopoulos, BLOSUM62 miscalculations improve search performance, Nat Biotechnol 26 (2008) 274–275. https://doi.org/10.1038/nbt0308-274.

[28] S. Kawashima, AAindex: Amino Acid index database, Nucleic Acids Res 28 (2000) 374–374. https://doi.org/10.1093/nar/28.1.374.

[29] K.-C. Chou, Pseudo Amino Acid Composition and its Applications in Bioinformatics, Proteomics and System Biology, Curr Proteomics 6 (2009) 262–274. https://doi.org/10.2174/157016409789973707.

[30] David, Marian. Analyticity, Carnap, Quine, and Truth. 2022.

[31] Dewey, John. Reconstruction in Philosophy. Enl. ed., 15 pr, Beacon Press, 1972.

[32] Ebbs, G. “Carnap and Quine on Truth by Convention.” Mind, vol. 120, no. 478, Apr. 2011, pp. 193–237. DOI.org (Crossref), https://doi.org/10.1093/mind/fzr020.

[33] Friedman, Michael. Dynamics of Reason: The 1999 Kant Lectures at Stanford University. CSLI Publications, 2001.

[34] Habermas, Jürgen, et al. Reason and the Rationalization of Society. Nachdr., Beacon, 2007.

[35] Husserl, Edmund. The Phenomenology of Internal Time-Consciousness. Edited by Martin Heidegger, Translated by James S. Churchill, Indiana University Press, 2019.

[36] Kant, Immanuel. Critique of Pure Reason. Edited by Marcus Weigelt, Translated by Friedrich Max Müller, Penguin Books, 2007.

[37] Quine, W. V. “Main Trends in Recent Philosophy: Two Dogmas of Empiricism.” The Philosophical Review, vol. 60, no. 1, Jan. 1951, p. 20. DOI.org (Crossref), https://doi.org/10.2307/2181906.

[38] Quine, W. V. Pursuit of Truth. Rev. ed, Harvard University Press, 1992.

[39] Rosser, Barkley. “W. V. Quine. Truth by Convention. Philosophical Essays for Alfred North Whitehead, Longmans, Green and Co., New York1936, Pp. 90–124.” Journal of Symbolic Logic, vol. 1, no. 1, Mar. 1936, pp. 42–42. DOI.org (Crossref), https://doi.org/10.2307/2269329.

[40] Soames, Scott. Analytic Philosophy in America: And Other Historical and Contemporary Essays. Princeton University Press, 2014. DOI.org (Crossref), https://doi.org/10.1515/9781400850464.

[41] Uebel, Thomas E., and Alan W. Richardson, editors. The Cambridge Companion to Logical Empiricism. Cambridge University Press, 2007.

Downloads

Published

17-04-2026

How to Cite

Identification of Antimicrobial Peptides and Their Functional Activities Using Ensemble Learning and Amino Acid Encoding Methods. (2026). Highlights in Science, Engineering and Technology, 162, 28-39. https://doi.org/10.54097/qmgn4m95