Pattern analysis, dimensionality reduction and hypothesis testing in high-dimensional data from animal studies with small sample sizes
dc.contributor.author | Todorov, Hristo | |
dc.date.accessioned | 2020-10-19T07:54:30Z | |
dc.date.available | 2020-10-19T07:54:30Z | |
dc.date.issued | 2020 | |
dc.description.abstract | Experimental animal studies are typically associated with small sample sizes due to ethical and practical limitations. However, such research projects often generate high-dimensional data sets where the number of response variables is much greater than the number of observations. This leads to several challenges with respect to the choice of an appropriate statistical method. The current research project focused on exploratory and inferential analysis of multidimensional data sets from animal experiments with small group sizes. A systematic comparison of univariate and multivariate hypothesis testing methods using Monte Carlo simulations revealed that multivariate techniques offer no real benefit in terms of power compared to univariate statistics. The well-known dimensionality reduction technique, principal component analysis (PCA) was demonstrated to capture dominant patterns in transcriptomic data successfully. However, PCA was outperformed by ordination methods which take group assignment into account in terms of sensitivity to detect treatment effects using simulated data. In contrast, multicollinearity combined with small sample sizes was associated with high false positive rate when not handled correctly by the multivariate statistical method. Additionally, microbiome studies based on amplicon sequencing of the 16S rRNA gene were presented as a special case requiring more flexible ordination and hypothesis testing techniques. Taken together, this thesis demonstrates that harnessing the full potential of multidimensional data is a challenging task which requires applying appropriate statistical methods. A profound understanding of the strengths and limitations of the alternative strategies is necessary in order to model the complex nature of multivariate data and in turn draw correct inferences. | en_GB |
dc.identifier.doi | http://doi.org/10.25358/openscience-5150 | |
dc.identifier.uri | https://openscience.ub.uni-mainz.de/handle/20.500.12030/5154 | |
dc.identifier.urn | urn:nbn:de:hebis:77-openscience-0977132b-0007-4cb0-8c73-6557862489795 | |
dc.language.iso | eng | de |
dc.rights | CC-BY-4.0 | * |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | * |
dc.subject.ddc | 570 Biowissenschaften | de_DE |
dc.subject.ddc | 570 Life sciences | en_GB |
dc.title | Pattern analysis, dimensionality reduction and hypothesis testing in high-dimensional data from animal studies with small sample sizes | en_GB |
dc.type | Dissertation | de |
jgu.date.accepted | 2020-09-23 | |
jgu.description.extent | 113 Seiten | de |
jgu.organisation.department | FB 10 Biologie | de |
jgu.organisation.name | Johannes Gutenberg-Universität Mainz | |
jgu.organisation.number | 7970 | |
jgu.organisation.place | Mainz | |
jgu.organisation.ror | https://ror.org/023b0x485 | |
jgu.rights.accessrights | openAccess | |
jgu.subject.ddccode | 570 | de |
jgu.type.dinitype | PhDThesis | en_GB |
jgu.type.resource | Text | de |
jgu.type.version | Original work | de |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- todorov_hristo-pattern_analys-20201009103226129.pdf
- Size:
- 15.54 MB
- Format:
- Adobe Portable Document Format
- Description:
License bundle
1 - 1 of 1
Loading...
- Name:
- license.txt
- Size:
- 3.57 KB
- Format:
- Item-specific license agreed upon to submission
- Description: