Data Standardization

The data standardization algorithm consists of two distinct phases. For each gene of a given expression matrix, the following two steps are performed:

Calculation of standardized expression profile

A recursive aggregation algorithm is applied to obtain the standardized expression profile of a gene by aggregating the expression profiles of the genes in its estimation list. The aggregation model is inspired by work on non-parametric recursive aggregation⁽³⁾, in which a set of aggregation operators is first applied to a vector of input values, then applied again to the results of that aggregation, and so on until a stop condition is met.

The recursive aggregation algorithm is illustrated in the figure below. The expression profiles in the estimation list are first combined in parallel by k different weighted aggregation operators. This produces k new expression profiles (one per aggregation operator), which are then aggregated again, this time with the non-parametric versions of the same operators. This process is repeated until, for each time point, the difference between the aggregated values is small enough to stop further aggregation.
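The procedure described above can be sketched in Python. This is a minimal illustration under stated assumptions, not the original implementation. The text does not name the k aggregation operators, so the sketch uses the arithmetic, geometric and harmonic means (k = 3) as hypothetical stand-ins. The function name `recursive_aggregation`, the parameters `tol` and `max_iter`, and the choice to return the average of the k converged profiles are likewise illustrative assumptions. Expression values are assumed to be strictly positive, which the geometric and harmonic means require.

```python
import math


# Hypothetical operator choices: the text only says "k different weighted
# aggregation operators" without naming them.
def weighted_arithmetic(values, weights):
    return sum(w * v for w, v in zip(weights, values)) / sum(weights)


def weighted_geometric(values, weights):
    # Requires strictly positive values.
    return math.exp(
        sum(w * math.log(v) for w, v in zip(weights, values)) / sum(weights)
    )


def weighted_harmonic(values, weights):
    # Requires strictly positive values.
    return sum(weights) / sum(w / v for w, v in zip(weights, values))


OPERATORS = (weighted_arithmetic, weighted_geometric, weighted_harmonic)


def recursive_aggregation(profiles, weights, tol=1e-6, max_iter=100):
    """Aggregate equally long expression profiles into a single profile.

    profiles: one list of positive values per gene in the estimation list
    weights:  one non-negative weight per profile
    tol, max_iter: assumed stop-condition parameters (not from the text)
    """
    n_points = len(profiles[0])

    # First pass: each weighted operator combines the input profiles,
    # giving k new profiles (one per operator).
    current = [
        [op([p[t] for p in profiles], weights) for t in range(n_points)]
        for op in OPERATORS
    ]

    # Later passes: the non-parametric versions of the operators, modelled
    # here as the same operators with equal weights.
    equal = [1.0] * len(OPERATORS)
    for _ in range(max_iter):
        # Largest gap between the k aggregated values at any time point.
        spread = max(max(col) - min(col) for col in zip(*current))
        if spread <= tol:
            break
        current = [
            [op(col, equal) for col in zip(*current)] for op in OPERATORS
        ]

    # Assumption: the k near-identical profiles are averaged into one.
    return [sum(col) / len(col) for col in zip(*current)]


profiles = [[1.0, 2.0, 4.0], [2.0, 2.0, 8.0], [4.0, 2.0, 16.0]]
standardized = recursive_aggregation(profiles, weights=[1.0, 1.0, 1.0])
```

With these particular operators the iteration contracts at every pass, because each mean lies between the smallest and the largest of its inputs. Other operator sets may need a different stop rule or a stricter iteration cap.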

(1) Sakoe,H. and Chiba,S. (1978) Dynamic programming algorithm optimization for spoken word recognition, IEEE Trans. on Acoust., Speech, and Signal Process, ASSP-26, 43-49.