Improving the Prediction of Protein-Protein Interaction Sites Using a Novel Over-Sampling Approach and Predicted Shape Strings

Nguyen, Lan Anh T. and Hirose, Osamu and Dang, Xuan Tho and Le, TuKien T. and Saethang, Thammakorn and Tran, Vu Anh and Kubo, Mamoru and Yamada, Yoichi and Satou, Kenji (2013) Improving the Prediction of Protein-Protein Interaction Sites Using a Novel Over-Sampling Approach and Predicted Shape Strings. Annual Research & Review in Biology, 3 (2). pp. 92-106.

[thumbnail of 24636-Article Text-46184-1-10-20190101.pdf] Text
24636-Article Text-46184-1-10-20190101.pdf - Published Version

Download (2MB)

Abstract

Identification of protein-protein interaction (PPI) sites is one of the most challenging tasks in bioinformatics and many computational methods based on support vector machines have been developed. However, current methods often fail to predict PPI sites mainly because of the severe imbalance between the numbers of interface and non-interface residues. In this study, we propose a novel over-sampling method that relaxes the class-imbalance problem based on local density distributions. We applied the proposed method to a PPI dataset that includes 2,829 interface and 24,616 non-interface residues. The experimental result showed a significant improvement in predictive performance comparing with the other state-of-the-art methods according to the six evaluation measures.

Item Type: Article
Subjects: Institute Archives > Biological Science
Depositing User: Managing Editor
Date Deposited: 18 Sep 2023 08:56
Last Modified: 18 Sep 2023 08:56
URI: http://eprint.subtopublish.com/id/eprint/2821

Actions (login required)

View Item
View Item