Please use this identifier to cite or link to this item: http://doi.org/10.25358/openscience-7761
Authors: Hoffmann, Isabell
Sariyar, Murat
Binder, Harald
Title: Combining techniques for screening and evaluating interaction terms on high-dimensional time-to-event data
Online publication date: 14-Sep-2022
Year of first publication: 2014
Language: english
Abstract: BACKGROUND: Molecular data, e.g. arising from microarray technology, is often used for predicting survival probabilities of patients. For multivariate risk prediction models on such high-dimensional data, there are established techniques that combine parameter estimation and variable selection. One big challenge is to incorporate interactions into such prediction models. In this feasibility study, we present building blocks for evaluating and incorporating interactions terms in high-dimensional time-to-event settings, especially for settings in which it is computationally too expensive to check all possible interactions. RESULTS: We use a boosting technique for estimation of effects and the following building blocks for pre-selecting interactions: (1) resampling, (2) random forests and (3) orthogonalization as a data pre-processing step. In a simulation study, the strategy that uses all building blocks is able to detect true main effects and interactions with high sensitivity in different kinds of scenarios. The main challenge are interactions composed of variables that do not represent main effects, but our findings are also promising in this regard. Results on real world data illustrate that effect sizes of interactions frequently may not be large enough to improve prediction performance, even though the interactions are potentially of biological relevance. CONCLUSION: Screening interactions through random forests is feasible and useful, when one is interested in finding relevant two-way interactions. The other building blocks also contribute considerably to an enhanced pre-selection of interactions. We determined the limits of interaction detection in terms of necessary effect sizes. Our study emphasizes the importance of making full use of existing methods in addition to establishing new ones.
DDC: 610 Medizin
610 Medical sciences
Institution: Johannes Gutenberg-Universität Mainz
Department: FB 04 Medizin
Place: Mainz
ROR: https://ror.org/023b0x485
DOI: http://doi.org/10.25358/openscience-7761
Version: Published version
Publication type: Zeitschriftenaufsatz
License: CC BY
Information on rights of use: https://creativecommons.org/licenses/by/2.0/
Journal: BMC bioinformatics
15
Pages or article number: Art. 58
Publisher: BioMed central
Publisher place: London
Issue date: 2014
ISSN: 1471-2105
Publisher URL: http://dx.doi.org/10.1186/1471-2105-15-58
Publisher DOI: 10.1186/1471-2105-15-58
Appears in collections:DFG-OA-Publizieren (2012 - 2017)

Files in This Item:
  File Description SizeFormat
Thumbnail
combining_techniques_for_scre-20220913200022146.pdf1 MBAdobe PDFView/Open