siRNADesign: A Graph Neural Network for siRNA Efficacy Prediction via Deep RNA Sequence Analysis
With the growing attention on siRNA silencing efficacy prediction, many methods have been proposed recently ranging from traditional data analysis methods to advanced machine learning models. However, previous works fail to explore complex but vital information, e.g., the RNA sequence interactions and related proteins. To alleviate this issue, we propose siRNADesign, a GNN model that innovatively analyzes both non-empirical and empirical-rules-based features of siRNA and mRNA sequences. This comprehensive approach allows siRNADesign to capture the nuanced dynamics of gene silencing effectively, achieving unprecedented state-of-the-art results across various datasets. Furthermore, we introduce a novel dataset-splitting methodology to mitigate the issues of data leakage and the shortcomings of traditional validation techniques in prior works. By considering siRNA and mRNA sequences as separate entities for dataset segmentation, this method guarantees a more accurate and unbiased evaluati