TIBCO Statistica® Canonical Analysis

Last updated:
10:47am Sep 29, 2020

Canonical Analysis investigates the relationship between two sets of variables. This is known as canonical correlation and is a special case of the general linear model.  It may be used by: 

  • medical researcher studying the relationship of various risk factors to the development of a group of symptoms
  • sociologist investigating the relationship between predictors of social mobility 
  • marketing employee exploring customer data to help with campaign "next steps"

Available techniques are eigenvalues, correlation coefficients, significance test of the canonical correlations, canonical weights, canonical scores, factor structure, variance, redundancy, and practical significance.  In equation form, redundancy is:

Redundancyleft = [S(loadingsleft2)/p]*Rc2

Redundancyright = [S(loadingsright2)/q]*Rc2

In these equations, p denotes the number of variables in the first (left) set of variables, and q denotes the number of variables in the second (right) set of variables; Rc2 is the respective squared canonical correlation.

Note that you can compute the redundancy of the first (left) set of variables given the second (right) set, and the redundancy of the second (right) set of variables, given the first (left) set. Because successively extracted canonical roots are uncorrelated, you could sum up the redundancies across all or only the first significant roots. This provides a single index of redundancy as proposed by Stewart and Love, 1968.