Covariance Clustering: Modelling Covariance in Designed Experiments When the Number of Variables is Greater than Experimental Units
-
Published:2023-10-11
Issue:
Volume:
Page:
-
ISSN:1085-7117
-
Container-title:Journal of Agricultural, Biological and Environmental Statistics
-
language:en
-
Short-container-title:JABES
Author:
Forknall Clayton R.ORCID, Verbyla Arūnas P., Nazarathy Yoni, Yousif Adel, Osama Sarah, Jones Shirley H., Kerr Edward, Schulz Benjamin L., Fox Glen P., Kelly Alison M.
Abstract
AbstractThe size and complexity of datasets resulting from comparative research experiments in the agricultural domain is constantly increasing. Often the number of variables measured in an experiment exceeds the number of experimental units composing the experiment. When there is a necessity to model the covariance relationships that exist between variables in these experiments, estimation difficulties can arise due to the resulting covariance structure being of reduced rank. A statistical method, based in a linear mixed model framework, is presented for the analysis of designed experiments where datasets are characterised by a greater number of variables than experimental units, and for which the modelling of complex covariance structures between variables is desired. Aided by a clustering algorithm, the method enables the estimation of covariance through the introduction of covariance clusters as random effects into the modelling framework, providing an extension of the traditional variance components model for building covariance structures. The method was applied to a multi-phase mass spectrometry-based proteomics experiment, with the aim of exploring changes in the proteome of barley grain over time during the malting process. The modelling approach provides a new linear mixed model-based method for the estimation of covariance structures between variables measured from designed experiments, when there are a small number of experimental units, or observations, informing covariance parameter estimates.
Funder
State of Queensland acting through the Department of Agriculture and Fisheries
Publisher
Springer Science and Business Media LLC
Subject
Applied Mathematics,Statistics, Probability and Uncertainty,General Agricultural and Biological Sciences,Agricultural and Biological Sciences (miscellaneous),General Environmental Science,Statistics and Probability
Reference58 articles.
1. Agrawal GK, Sarkar A, Righetti PG, Pedreschi R, Carpentier S, Wang T, Barkla BJ, Kohli A, Ndimba BK, Bykova NV, Rampitsch C, Zolla L, Rafudeen MS, Cramer R, Bindschedler LV, Tsakirpaloglou N, Ndimba RJ, Farrant JM, Renaut J, Job D, Kikuchi S, Rakwal R (2013) A decade of plant proteomics and mass spectrometry: translation of technical advancements to food security and safety issues. Mass Spectrom Rev 32:335–365 2. Brien CJ, Bailey RA (2006) Multiple randomizations. J R Stat Soc Ser B (Stat Methodol) 68:571–609 3. Brien CJ, Harch BD, Correll RL, Bailey RA (2011) Multiphase experiments with at least one later laboratory phase. I. Orthogonal designs. J Agric Biol Environ Stat 16:422–450 4. Butler DG (2022) ODW: generate optimal experimental designs. (R Package Version 2.1.4) 5. Butler DG, Cullis BR, Gilmour AR, Gogel BJ, Thompson R (2017) ASReml-R reference manual version 4. Report, VSN International Ltd
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|