Please use this identifier to cite or link to this item: http://doi.org/10.25358/openscience-8609
Authors: Ludt, Annekathrin
Ustjanzew, Arsenij
Binder, Harald
Strauch, Konstantin
Marini, Federico
Title: Interactive and reproducible workflows for exploring and modeling RNA-seq data with pcaExplorer, ideal, and GeneTonic
Online publication date: 19-Jan-2023
Year of first publication: 2022
Language: english
Abstract: The generation and interpretation of results from transcriptome profiling experiments via RNA sequencing (RNA-seq) can be a complex task. While raw data quality control, alignment, and quantification can be streamlined via efficient algorithms that can deliver the preprocessed expression matrix, a common bottleneck in the analysis of such large datasets is the subsequent in-depth, iterative processes of data exploration, statistical testing, visualization, and interpretation. Specific tools for these workflow steps are available but require a level of technical expertise which might be prohibitive for life and clinical scientists, who are left with essential pieces of information distributed among different tabular and list formats. Our protocols are centered on the joint use of our Bioconductor packages (pcaExplorer, ideal, GeneTonic) for interactive and reproducible workflows. All our packages provide an interactive and accessible experience via Shiny web applications, while still documenting the steps performed with RMarkdown as a framework to guarantee the reproducibility of the analyses, reducing the overall time to generate insights from the data at hand. These protocols guide readers through the essential steps of Exploratory Data Analysis, statistical testing, and functional enrichment analyses, followed by integration and contextualization of results. In our packages, the core elements are linked together in interactive widgets that make drill-down tasks efficient by viewing the data at a level of increased detail. Thanks to their interoperability with essential classes and gold-standard pipelines implemented in the open-source Bioconductor project and community, these protocols will permit complex tasks in RNA-seq data analysis, combining interactivity and reproducibility for following modern best scientific practices and helping to streamline the discovery process for transcriptome data. © 2022 The Authors. Current Protocols published by Wiley Periodicals LLC.
DDC: 610 Medizin
610 Medical sciences
Institution: Johannes Gutenberg-Universität Mainz
Department: FB 04 Medizin
Place: Mainz
ROR: https://ror.org/023b0x485
DOI: http://doi.org/10.25358/openscience-8609
Version: Published version
Publication type: Zeitschriftenaufsatz
License: CC BY-NC
Information on rights of use: https://creativecommons.org/licenses/by-nc/4.0/
Journal: Current protocols
2
4
Pages or article number: e411
Publisher: Wiley
Publisher place: Hoboken, NJ
Issue date: 2022
ISSN: 2691-1299
Publisher DOI: 10.1002/cpz1.411
Appears in collections:DFG-491381577-H

Files in This Item:
  File Description SizeFormat
Thumbnail
interactive_and_reproducible_-20230119111930704.pdf2.28 MBAdobe PDFView/Open