TY - JOUR
T1 - DTL-NeddSite
T2 - A Deep-Transfer Learning Architecture for Prediction of Lysine Neddylation Sites
AU - Xu, Deli
AU - Zhu, Yafei
AU - Xu, Qiang
AU - Liu, Yuhai
AU - Chen, Yu
AU - Zou, Yang
AU - Li, Lei
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2023
Y1 - 2023
N2 - Neddylation, as a reversible post-translational modification (PTM), plays a role in various cellular processes. Defects in neddylation are related to human diseases. Detecting neddylation sites is necessary for revealing the mechanisms of protein neddylation. As identifying such sites through experimental methods is expensive and time-consuming, it is essential to develop in silico methods to predict neddylation sites. In this study, we constructed a few classifiers integrating various algorithms and encoding features. However, they performed poorly (AUC ≈ 0.767 ), mainly due to the limited number ( ∼ 1000) of identified neddylation sites. The large number (> 100,000) of other lysine PTM sites inspired us to employ a deep transfer learning (DTL) strategy for performance improvement. We constructed a predictor, dubbed DTL-NeddSite, which adopted the DTL-based convolution neural network using the one-hot encoding approach. Specifically, the massive number of lysine PTM sites were used to build the source model, followed by the fine-tuning of the target model using neddylation sites. DTL-NeddSite compared favourably with the corresponding model without the DTL strategy in cross-validation and independent tests. For instance, the AUC value increased to 0.818. Contrary to a general DTL model that combines frozen and unfrozen layers, all the layers in DTL-NeddSite were unfrozen to re-train. We expect the DTL strategy to be widely used in newly discovered modification types with limited known sites. Furthermore, DTL-NeddSite is freely accessible at https://github.com/XuDeli123/DTL-NeddSite.
AB - Neddylation, as a reversible post-translational modification (PTM), plays a role in various cellular processes. Defects in neddylation are related to human diseases. Detecting neddylation sites is necessary for revealing the mechanisms of protein neddylation. As identifying such sites through experimental methods is expensive and time-consuming, it is essential to develop in silico methods to predict neddylation sites. In this study, we constructed a few classifiers integrating various algorithms and encoding features. However, they performed poorly (AUC ≈ 0.767 ), mainly due to the limited number ( ∼ 1000) of identified neddylation sites. The large number (> 100,000) of other lysine PTM sites inspired us to employ a deep transfer learning (DTL) strategy for performance improvement. We constructed a predictor, dubbed DTL-NeddSite, which adopted the DTL-based convolution neural network using the one-hot encoding approach. Specifically, the massive number of lysine PTM sites were used to build the source model, followed by the fine-tuning of the target model using neddylation sites. DTL-NeddSite compared favourably with the corresponding model without the DTL strategy in cross-validation and independent tests. For instance, the AUC value increased to 0.818. Contrary to a general DTL model that combines frozen and unfrozen layers, all the layers in DTL-NeddSite were unfrozen to re-train. We expect the DTL strategy to be widely used in newly discovered modification types with limited known sites. Furthermore, DTL-NeddSite is freely accessible at https://github.com/XuDeli123/DTL-NeddSite.
KW - Neddylation
KW - deep learning
KW - deep transfer learning
KW - modification site prediction
KW - post-translational modification
UR - https://www.scopus.com/pages/publications/85161006685
U2 - 10.1109/ACCESS.2023.3279498
DO - 10.1109/ACCESS.2023.3279498
M3 - 文章
AN - SCOPUS:85161006685
SN - 2169-3536
VL - 11
SP - 51798
EP - 51809
JO - IEEE Access
JF - IEEE Access
ER -