
2.4 - Basic Multivariate Statistics A multivariate observation is typically represented as a vector. So, if m variables Y1, ...,Ym have been measured on each experimental unit, the m-dimensional observation consisting of m measurements may be represented by Y = (Y1, ..., Ym)T. The entire experiment then typically consists of a large number of such m-dimensional vectors. For example, the shapes of 3D scans of faces may be represented by vectors Si = (xi1, yi1, zi1, xi2, yi2,zi2, ..., xim, yim, zim)T or (pi1, pi2, ..., pim)T where pij = (xij, yij, zij) and m is the number of vertices being used in the representation of the face. ( NB notice that vertices must be in one-to-one correspondence for statistical analysis of the shape (and texture) vectors to make sense). Usually these m-dimensional vectors are considered to be random vectors meaning that their elements, or coordinates, are random variables. An experimental unit given a certain treatment results in a random vector drawn from an m-dimensional distribution representing the (hypothetical) population of experimental units given this treatment. Important characteristics of such a multivariate distribution are the population means, variances and covariances. The mean, E(Y), of the random vector Y is represented by the vector consisting of the means of the individual variables, that is E(Y) = ( E(Y1), ..., E(Ym))T , so that the mean face shape M, given n faces (in 1-1 correspondence) is M = The variance, Var(Yi), of a random variable Yi in a multivariate distribution is defined to be the mean of the square of the deviation of Yi from the mean of ith variables i.e. Var(Yi) = E( ( Yi - E(Yi))2 ) It may therefore be looked upon as a measure of dispersion. Variance of a random vector is defined as for the mean i.e. Var(Y) = ( Var(Y1), ..., Var(Ym)) The covariance, Cov(Yi, Yj), of two random variables Yi and Yj is defined as Cov(Yi, Yj) = E( (Yi - E(Yi))( Yj - E(Yj)) ) Notice that Var(Yi) = Cov(Yi, Yi) The covariance matrix (also known as variance or variance-covariance matrix)of the m-dimensional random vector Y is defined as the m x m matrix whose ijth entry is Cov(Yi, Yj), and is therefore symmetric as Cov(Yi, Yj) = Cov(Yj, Yi). This matrix finds great use in principal components analysis. As covariance is often difficult to interpret, it is often standardised by dividing by the product of the standard deviations of the two variables to give the correlation coefficient Cor(Yi, ,Yj) =
= 


[home]
[scientific research]
[art/photos]
[poems]
[music]
[links]
[email me]
[home]
[frames]
[no frames]