Methods of computer modelling of the transcription factor of stress resistance WRKY2 in Triticum Aestivum: A comparative analysis

Iryna Demianenko, Anton Petrovskyi, Igor Levtun
Abstract

The aim of this work was to compare the efficiency of modern computer modelling methods for constructing the three-dimensional structure of the transcription factor TaWRKY2 from Triticum aestivum, which plays a key role in plant stress resistance. The study used traditional homology approaches and machine learning-based methods, including AlphaFold, RoseTTaFold, ESMFold, and OmegaFold. The model obtained with AlphaFold2 (MMseqs2) was the best in terms of Clashscore (99th percentile), MolProbity Score, and ERRAT. The complex with zinc ions and DNA generated by AlphaFold3 showed better results in terms of Z-Score. The AlphaFold method provided biologically relevant structures with the correct location of functional sites. ESMFold provided fast modelling, but showed deviations in the structure, including additional α-helices and lower quality in most metrics. The RoseTTaFold model had an elongated shape with a higher content of α-helices and required additional verification of functional activity. OmegaFold, although it provided the best QMEAN score (0.4), was inferior in other metrics. Additional tools, such as Amber and Chimera, allowed for structure relaxation and analysis of key features, including the spatial arrangement of the zinc fingers (18.7 Å). Evaluation by Verify 3D, ERRAT, Ramachandran Plot, and other metrics revealed the advantages and disadvantages of each approach. The findings confirmed the advantages of machine learning methods for modelling proteins with high functional plasticity. In particular, AlphaFold was recognised as the most effective approach for building models with high accuracy. At the same time, the use of several methods allows considering alternative conformations and interactions, which is important for a deeper understanding of the functional mechanisms of the protein

Keywords

bioinformatics; three-dimensional protein structure; machine learning methods; model evaluation metrics; zinc fingers

Suggested citation
Demianenko, I., Petrovskyi, A., & Levtun, I. (2024). Methods of computer modelling of the transcription factor of stress resistance WRKY2 in Triticum Aestivum: A comparative analysis. Biological Systems: Theory and Innovation, 15(4), 34-50. https://doi.org/10.31548/biologiya/4.2024.34
References

1. Abramson, J., et al. (2024). Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature, 630, 493-500. doi: 10.1038/s41586-024-07487-w.

2. Amoah, J.N., Ko, C.S., Yoon, J.S., & Weon, S.Y. (2019). Effect of drought acclimation on oxidative stress and transcript expression in wheat (Triticum aestivum L.). Journal of Plant Interactions, 14(1), 492-505. doi: 10.1080/17429145.2019.1662098.

3. Baek, M., DiMaio, F., Anishchenko, I., Dauparas, J., Ovchinnikov, S., Lee, G.R., & Baker, D. (2021). Accurate prediction of protein structures and interactions using a three-track neural network. Science, 373(6557), 871-876. doi: 10.1126/science.abj8754.

4. Banerjee, S., Mukherjee, A., Khatun, M., Roy, P., Nandi, S., & Roy, S. (2023). Exploring the critical function and molecular mechanism of WRKY transcription factor family in regulating plant response under abiotic stress conditions. In T. Aftab (Ed.), New frontiers in plant-environment interactions: Innovative technologies and developments (pp. 121-142). Cham: Springer. doi: 10.1007/978-3-031-43729-8_6.

5. Blanc-Mathieu, R., Dumas, R., Turchi, L., Lucas, J., & Parcy, F. (2024). Plant-TFClass: A structural classification for plant transcription factors. Trends in Plant Science, 29(1), 40-51. doi: 10.1016/j.tplants.2023.06.023.

6. Chen, X., Li, C., Wang, H., & Guo, Z. (2019). WRKY transcription factors: Evolution, binding, and action. Phytopathology Research, 1, article number 13. doi: 10.1186/s42483-019-0022-x.

7. DeepMind. (2024). AlphaFold has revealed millions of intricate 3D protein structures, and is helping scientists understand how life’s molecules interact. Retrieved from https://deepmind.google/technologies/alphafold/.

8. Dinusha, C.M.M. (2019). The evolution of reproductive development in angiosperms (Doctoral thesis, University of California, Riverside, USA)

9. Du, Z., Su, H., Wang, W., Ye, L., Wei, H., Peng, Z., & Yang, J. (2021). The trRosetta server for fast and accurate protein structure prediction. Nature Protocols, 16, 5634-5651. doi: 10.1038/s41596-021-00628-9.

10. Hassan, S., Lethin, J., Blomberg, R., Mousavi, H., & Aronsson, H. (2019). In silico based screening of WRKY genes for identifying functional genes regulated by WRKY under salt stress. Computational Biology and Chemistry, 83, article number 107131. doi: 10.1016/j.compbiolchem.2019.107131.

11. He, X., Li, J.-J., Chen, Y., Yang, J.-Q., & Chen, X.-Y. (2019). Genome-wide analysis of the WRKY gene family and its response to abiotic stress in buckwheat (Fagopyrum tataricum). Open Life Sciences, 14(1), 80-96. doi: 10.1515/biol-2019-0010.

12. Hekkelman, M.L., de Vries, I., Joosten, R.P., & Perrakis, A. (2023). AlphaFill: Enriching AlphaFold models with ligands and cofactors. Nature Methods, 20, 205-213. doi: 10.1038/s41592-022-01685-y.

13. Li, N., Liu, T., Guo, F., Yang, J., Shi, Y., Wang, S., & Sun, D. (2022). Identification of long non-coding RNA-microRNA-mRNA regulatory modules and their potential roles in drought stress response in wheat (Triticum aestivum L.). Frontiers in Plant Science, 13, article number 1011064. doi: 10.3389/fpls.2022.1011064.

14. Li, S., Khoso, M.A., Xu, H., Zhang, C., Liu, Z., Wagan, S., Dinislam, K., & Liu, L. (2024). WRKY transcription factors (TFs) as key regulators of plant resilience to environmental stresses: Current perspective. Agronomy, 14(10), article number 2421. doi: 10.3390/agronomy14102421.

15. Lin, Z., Akin, H., Rao, R., Hie, B., Zhu, Z., Lu, W., & Rives, A. (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, 379(6637), 1123-1130. doi: 10.1126/science.ade2574.

16. Long, L., Gu, L., Wang, S., Cai, H., Wu, J., Wang, J., & Yang, M. (2023). Progress in the understanding of WRKY transcription factors in woody plants. International Journal of Biological Macromolecules, 242(Part 1), article number 124379. doi: 10.1016/j.ijbiomac.2023.124379.

17. Lu, C.H., Chen, C.C., Yu, C.S., Liu, Y.Y., Liu, J.J., Wei, S.T., & Lin, Y.F. (2022). MIB2: Metal ion-binding site prediction and modeling server. Bioinformatics, 38(18), 4428-4429. doi: 10.1093/bioinformatics/btac534.

18. National Center for Biotechnology Information (NCBI). (2024). Protein database. Retrieved from https://www.ncbi.nlm.nih.gov/protein/.

19. Pandey, B., Grover, A., & Sharma, P. (2018). Molecular dynamics simulations revealed structural differences among WRKY domain-DNA interaction in barley (Hordeum vulgare). BMC Genomics, 19, article number 132. doi: 10.1186/s12864-018-4506-3.

20. Ranjan, P., Yadav, A., Behera, A.K., Singh, D.K., Singh, P., & Singh, G.P. (2024). Structural, functional and evolutionary analysis of wheat WRKY45 protein: A combined bioinformatics and MD simulation approach. Cereal Research Communications, 52, 489-499. doi: 10.1007/s42976-023-00399-x.

21. Rodríguez-Lumbreras, L.A., Jiménez-García, B., Giménez-Santamarina, S., & Fernández-Recio, J. (2022). pyDockDNA: A new web server for energy-based protein-DNA docking and scoring. Frontiers in Molecular Biosciences, 9, article number 988996. doi: 10.3389/fmolb.2022.988996.

22. Shukla, R., & Tripathi, T. (2020). Molecular dynamics simulation of protein and protein-ligand complexes. In D.B. Singh (Ed.), Computer-aided drug design (pp. 133-161). Singapore: Springer. doi: 10.1007/978-981-15-6815-2_7.

23. Singh, A., Sharma, A.K., Singh, N.K., Sonah, H., Deshmukh, R., & Sharma, T.R. (2019). Understanding the effect of structural diversity in WRKY transcription factors on DNA binding efficiency through molecular dynamics simulation. Biology, 8(4), article number 83. doi: 10.3390/biology8040083.

24. Steinegger, M., & Söding, J. (2017). MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nature Biotechnology, 35(11), 1026-1028. doi: 10.1038/nbt.3988.

25. Theisen, F.F., Prestel, A., Elkjær, S., Leurs, Y.H., Morffy, N., Strader, L.C., & Skriver, K. (2024). Molecular switching in transcription through splicing and proline-isomerization regulates stress responses in plants. Nature Communications, 15, article number 592. doi: 10.1038/s41467-024-44859-2.

26. Tian, J., Zhang, J., & Francis, F. (2023). Large-scale identification and characterization analysis of VQ family genes in plants, especially gymnosperms. International Journal of Molecular Sciences, 24(19), article number 14968. doi: 10.3390/ijms241914968.

27. Tripathi, A., Mondal, R., Lahiri, T., Chaurasiya, D., & Pal, M.K. (2023). TemPred: A novel protein template search engine to improve protein structure prediction. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 20(3), 2112-2121 doi: 10.1109/TCBB.2022.3233846.

28. Upadhyay, D., Budhlakoti, N., Kumari, J., Chaudhary, N., Padaria, J.C., Sareen, S., & Kumar, S. (2024). In-silico characterization of drought stress related WRKY2 transcription factor in wheat crop (Triticum aestivum L.): Study of its physico-chemical properties and structural dynamics. Genetic Resources and Crop Evolution, 71, 1481-1492. doi: 10.1007/s10722-023-01706-5.

29. Wani, S.H., Tripathi, P., Zaid, A., Challa, G.S., Kumar, A., Kumar, V., & Bhatt, M. (2018). Transcriptional regulation of osmotic stress tolerance in wheat (Triticum aestivum L.). Plant Molecular Biology, 97, 469-487. doi: 10.1007/s11103-018-0761-6.

30. Wu, R., Ding, F., Wang, R., Shen, R., Zhang, X., Luo, S., Su, C., Wu, Z., Xie, Q., Berger, B., Ma, J., & Peng, J. (2022). High-resolution de novo structure prediction from primary sequence. doi: 10.1101/2022.07.21.500999.

31. Yang, Z., Zeng, X., Zhao, Y., & Chen, R. (2023). AlphaFold2 and its applications in the fields of biology and medicine. Signal Transduction and Targeted Therapy, 8, article number 115. doi: 10.1038/s41392-023-01381-z.

32. Zaheer, M., Rehman, S.U., Khan, S.H., Shahid, S., Rasheed, A., Naz, R., & Sajjad, M. (2021). Genome-wide identification and transcriptional profiling of new cobra-like genes in bread wheat (Triticum Aestivum). doi: 10.21203/rs.3.rs-796690/v1.