Deep Learning Framework for RNA Inverse Folding with Geometric Structure Potentials

Authors

  • Annabelle Yao

DOI:

https://doi.org/10.54097/xrsjjh84

Keywords:

Geometric Vector Perceptron; Transformer; Deep Learning

Abstract

RNA’s functional diversity arises from its rich three-dimensional structural landscape, yet designing RNA sequences that adopt a target 3D conformation (inverse folding) remains a fundamental challenge. I present a geometric deep learning framework that enables end-to-end RNA inverse folding by integrating Geometric Vector Perceptron (GVP) layers into a Transformer architecture, thereby explicitly encoding the spatial and directional features of RNA structures. A curated dataset of experimentally solved RNA 3D structures was constructed from the BGSU RNA database through rigorous filtering and deduplication. Model performance was evaluated using sequence recovery rate and TM-score to assess sequence accuracy and structural fidelity, respectively. Across standard benchmarks and RNA-Puzzles, the proposed method achieves state-of-the-art performance, improving the recovery rate by 85% and the TM-Score by 40% compared to existing approaches. Masked family-level validation using Rfam annotations demonstrates strong generalization to unseen RNA families, with a 132% improvement in generalization performance. Furthermore, inverse-folded sequences refolded using AlphaFold3 closely recapitulate native structures, underscoring the importance of explicit geometric representations in RNA design. This work establishes a “3D-aware” paradigm for RNA inverse folding, enabling the rapid generation of structurally valid RNA sequences in hours rather than months. By combining geometric learning with sequence modeling, the framework offers a scalable foundation for accelerating mRNA vaccine development, RNA-based therapeutics, and synthetic biology, and positions geometric deep learning as a key technology for next-generation biomolecular design.

Downloads

Download data is not yet available.

References

[1] Dauparas, J. et al. Robust deep learning–based protein sequence design using proteinmpnn. Science 378, 49–56 (2022).

[2] Shen, T. et al. Accurate rna 3d structure prediction using a language model-based deep learning approach. Nature Methods 1–12 (2024).

[3] Wang, W. et al. trrosettarna: automated prediction of rna 3d structure with transformer network. Nature Communications 14, 7266 (2023).

[4] Watkins, A. M., Rangan, R. & Das, R. Farfar2: improved de novo rosetta prediction of complex global rna folds. Structure 28, 963–976 (2020).

[5] Boniecki, M. J. et al. Simrna: a coarse-grained method for rna folding simulations and 3d structure prediction. Nucleic acids research 44, e63–e63 (2016).

[6] Zadeh, J. N. et al. Nupack: Analysis and design of nucleic acid systems. Journal of computational chemistry 32, 170–173 (2011).

[7] Lorenz, R. et al. Viennarna package 2.0. Algorithms for molecular biology 6, 1–14 (2011).

[8] Yang, X., Yoshizoe, K., Taneda, A. & Tsuda, K. Rna inverse folding using monte carlo tree search. BMC bioinformatics 18, 1–12 (2017).

[9] Huang, H., Lin, Z., He, D., Hong, L. & Li, Y. Ribodiffusion: tertiary structure-based rna inverse folding with generative diffusion models. Bioinformatics 40, i347–i356 (2024).

[10] Yao, A., Su, O. & Kumar, S. Enhancing real-time vision systems: Integrating dynamic vision sensors with vision transformers to increase computational efficiency. Applied and Computational Engineering107, 73–82 (2024).

[11] Jing, B., Eismann, S., Suriana, P., Townshend, R. J. & Dror, R. Learning from protein structure with geometric vector perceptrons. arXiv preprint arXiv:2009.01411 (2020).

[12] Perry, Z. R., Pyle, A. M. & Zhang, C. Arena: rapid and accurate reconstruction of full atomic rna structures from coarse-grained models. Journal of Molecular Biology 435, 168210 (2023).

[13] Yao, A. A novel interpretative deep neural network with grad-cam’s heatmap for the early diagnosis of alzheimer’s disease. Theoretical and Natural Science 49, 106–119 (2024).

[14] Zhang, Y. & Skolnick, J. Scoring function for automated assessment of protein structure template quality. Proteins: Structure, Function, and Bioinformatics 57, 702–710 (2004).

[15] Wu, M. J., Andreasson, J. O., Kladwang, W., Greenleaf, W. & Das, R. Automated design of highly diverse riboswitches. bioRxiv 603001 (2019).

[16] Runge, F. & Hutter, F. Machine learning for rna design: Learna. In RNA Design: Methods and Protocols, 63–93 (Springer, 2024).

Downloads

Published

17-04-2026

How to Cite

Deep Learning Framework for RNA Inverse Folding with Geometric Structure Potentials. (2026). Highlights in Science, Engineering and Technology, 162, 333-344. https://doi.org/10.54097/xrsjjh84