Predicting the involvement of polyQ- and polyA in protein-protein interactions by their amino acid context
| dc.contributor.author | Mier, Pablo | |
| dc.contributor.author | Andrade-Navarro, Miguel A. | |
| dc.date.accessioned | 2024-12-12T12:16:30Z | |
| dc.date.available | 2024-12-12T12:16:30Z | |
| dc.date.issued | 2024 | |
| dc.description.abstract | Homorepeats, specifically polyglutamine (polyQ) and polyalanine (polyA), are often implicated in protein-protein interactions (PPIs). So far, a method to predict the participation of homorepeats in protein interactions is lacking. We propose a machine learning approach to identify PPI-involved polyQ and polyA regions within the human proteome based on known interacting regions. Using the dataset of human homorepeats, we identified 157 polyQ and 745 polyA regions potentially involved in PPIs. Machine learning models, trained on amino acid context and homorepeat length, demonstrated high precision (0.90–0.98) but variable recall (0.42–0.85). Random forest outperformed other models (AUC polyQ = 0.686, AUC polyA = 0.732) using the positions surrounding the homorepeat −10 to +10. Integrating paralog information marginally improved predictions but was excluded for model simplicity. Further optimization revealed that for polyQ, using amino acid surrounding positions from −6 to +6 increased AUC to 0.715. For polyA, n | en_GB |
| dc.identifier.doi | http://doi.org/10.25358/openscience-11108 | |
| dc.identifier.uri | https://openscience.ub.uni-mainz.de/handle/20.500.12030/11127 | |
| dc.language.iso | eng | de |
| dc.rights | CC-BY-4.0 | * |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | * |
| dc.subject.ddc | 570 Biowissenschaften | de_DE |
| dc.subject.ddc | 570 Life sciences | en_GB |
| dc.title | Predicting the involvement of polyQ- and polyA in protein-protein interactions by their amino acid context | en_GB |
| dc.type | Zeitschriftenaufsatz | de |
| jgu.journal.title | Heliyon | de |
| jgu.journal.volume | 10 | de |
| jgu.organisation.department | FB 10 Biologie | de |
| jgu.organisation.name | Johannes Gutenberg-Universität Mainz | |
| jgu.organisation.number | 7970 | |
| jgu.organisation.place | Mainz | |
| jgu.organisation.ror | https://ror.org/023b0x485 | |
| jgu.pages.alternative | e37861 | de |
| jgu.publisher.doi | 10.1016/j.heliyon.2024.e37861 | de |
| jgu.publisher.issn | 2405-8440 | de |
| jgu.publisher.name | Elsevier | de |
| jgu.publisher.place | London [u.a.] | de |
| jgu.publisher.year | 2024 | |
| jgu.rights.accessrights | openAccess | |
| jgu.subject.ddccode | 570 | de |
| jgu.subject.dfg | Lebenswissenschaften | de |
| jgu.type.contenttype | Scientific article | de |
| jgu.type.dinitype | Article | en_GB |
| jgu.type.resource | Text | de |
| jgu.type.version | Published version | de |