Pattern analysis, dimensionality reduction and hypothesis testing in high-dimensional data from animal studies with small sample sizes

dc.contributor.authorTodorov, Hristo
dc.date.accessioned2020-10-19T07:54:30Z
dc.date.available2020-10-19T07:54:30Z
dc.date.issued2020
dc.description.abstractExperimental animal studies are typically associated with small sample sizes due to ethical and practical limitations. However, such research projects often generate high-dimensional data sets where the number of response variables is much greater than the number of observations. This leads to several challenges with respect to the choice of an appropriate statistical method. The current research project focused on exploratory and inferential analysis of multidimensional data sets from animal experiments with small group sizes. A systematic comparison of univariate and multivariate hypothesis testing methods using Monte Carlo simulations revealed that multivariate techniques offer no real benefit in terms of power compared to univariate statistics. The well-known dimensionality reduction technique, principal component analysis (PCA) was demonstrated to capture dominant patterns in transcriptomic data successfully. However, PCA was outperformed by ordination methods which take group assignment into account in terms of sensitivity to detect treatment effects using simulated data. In contrast, multicollinearity combined with small sample sizes was associated with high false positive rate when not handled correctly by the multivariate statistical method. Additionally, microbiome studies based on amplicon sequencing of the 16S rRNA gene were presented as a special case requiring more flexible ordination and hypothesis testing techniques. Taken together, this thesis demonstrates that harnessing the full potential of multidimensional data is a challenging task which requires applying appropriate statistical methods. A profound understanding of the strengths and limitations of the alternative strategies is necessary in order to model the complex nature of multivariate data and in turn draw correct inferences.en_GB
dc.identifier.doihttp://doi.org/10.25358/openscience-5150
dc.identifier.urihttps://openscience.ub.uni-mainz.de/handle/20.500.12030/5154
dc.identifier.urnurn:nbn:de:hebis:77-openscience-0977132b-0007-4cb0-8c73-6557862489795
dc.language.isoengde
dc.rightsCC-BY-4.0*
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/*
dc.subject.ddc570 Biowissenschaftende_DE
dc.subject.ddc570 Life sciencesen_GB
dc.titlePattern analysis, dimensionality reduction and hypothesis testing in high-dimensional data from animal studies with small sample sizesen_GB
dc.typeDissertationde
jgu.date.accepted2020-09-23
jgu.description.extent113 Seitende
jgu.organisation.departmentFB 10 Biologiede
jgu.organisation.nameJohannes Gutenberg-Universität Mainz
jgu.organisation.number7970
jgu.organisation.placeMainz
jgu.organisation.rorhttps://ror.org/023b0x485
jgu.rights.accessrightsopenAccess
jgu.subject.ddccode570de
jgu.type.dinitypePhDThesisen_GB
jgu.type.resourceTextde
jgu.type.versionOriginal workde

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
todorov_hristo-pattern_analys-20201009103226129.pdf
Size:
15.54 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
3.57 KB
Format:
Item-specific license agreed upon to submission
Description: