Correction: Three-parameter lognormal distribution ubiquitously found in cDNA microarray data and its application to parametric data treatment

DOI: 10.1186/1471-2105-5-82

$$f(r_i) = \frac{k}{\sqrt{2\pi}\,\sigma\,(r_i - \gamma)} \exp\left[-\frac{\{\log(r_i - \gamma) - \mu\}^2}{2\sigma^2}\right] \quad \text{for } r_i > \gamma,$$

where $k$ is a compensation constant ($k = \log e = 0.4343$), and $\sigma$ and $\mu$ are the shape and scale parameters for $\log(r_i - \gamma)$, respectively.

The threshold parameter, $\gamma$, was found through trial and improvement calculation processes; in the trial, the distribution of $\log(r_i - \gamma)$ was checked by normal probability plotting, and the value that gave the best fit to the model was selected for $\gamma$. The fitness was evaluated by the sum of absolute differences between the model and $\log(r_i - \gamma)$, within the interquartile range of data. The parameter $\mu$ was found as the median of $\log(r_i - \gamma)$, and the parameter $\sigma$ was found from the interquartile range of $\log(r_i - \gamma)$; these are known as robust alternatives for the arithmetic mean and standard deviation, respectively. Parameters $\mu$ and $\sigma$ were found for each data grid, a group of data for DNA spots that were printed by an identical pin, in order to avoid divergences caused by pin-based differences.

Z-normalization was carried out for each datum as

$$Z_{r_i} = \frac{\log(r_i - \gamma) - \mu}{\sigma}.$$

Intensity data ($r_i$) less than $\gamma$ were treated as "data not detected", since such data might contain negative noise larger than the signal (see Results).
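The procedure described above can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several details are assumptions: the logarithm is taken as base 10 (inferred from k = 0.4343), the plotting position is (i + 0.5)/n, σ is obtained by dividing the interquartile range by 1.349 (the interquartile range of a standard normal distribution), and the candidate values for γ come from a hypothetical user-supplied grid. All function names are illustrative only. The functions operate on one list of intensities, which could be a single data grid.

```python
import math
from statistics import NormalDist, median, quantiles

STD_NORMAL = NormalDist()
IQR_OF_STD_NORMAL = 1.349  # interquartile range of N(0, 1), about 2 * 0.6745


def robust_mu_sigma(logs):
    """Median and IQR-based sigma of log-transformed intensities."""
    q1, _, q3 = quantiles(logs, n=4, method="inclusive")
    return median(logs), (q3 - q1) / IQR_OF_STD_NORMAL


def misfit(intensities, gamma):
    """Sum of |model - data| on a normal probability plot, central 50% only."""
    logs = sorted(math.log10(r - gamma) for r in intensities if r > gamma)
    if len(logs) < 4:
        return math.inf  # too few usable data points for this candidate
    mu, sigma = robust_mu_sigma(logs)
    n = len(logs)
    total = 0.0
    for i, y in enumerate(logs):
        p = (i + 0.5) / n  # plotting position: an assumed convention
        if 0.25 <= p <= 0.75:
            total += abs(mu + sigma * STD_NORMAL.inv_cdf(p) - y)
    return total


def fit_three_parameter_lognormal(intensities, gamma_candidates):
    """Pick gamma by trial over a hypothetical candidate grid, then mu, sigma."""
    gamma = min(gamma_candidates, key=lambda g: misfit(intensities, g))
    logs = [math.log10(r - gamma) for r in intensities if r > gamma]
    mu, sigma = robust_mu_sigma(logs)
    return gamma, mu, sigma


def z_normalize(intensities, gamma, mu, sigma):
    """Z-score each datum; values at or below gamma become NaN (not detected)."""
    return [
        (math.log10(r - gamma) - mu) / sigma if r > gamma else math.nan
        for r in intensities
    ]


# Synthetic, noise-free example: exact quantiles of a known distribution.
n = 1000
true_gamma, true_mu, true_sigma = 30.0, 2.0, 0.3
data = [
    true_gamma + 10 ** (true_mu + true_sigma * STD_NORMAL.inv_cdf((i + 0.5) / n))
    for i in range(n)
]
gamma_hat, mu_hat, sigma_hat = fit_three_parameter_lognormal(data, range(0, 60, 10))
# One extra intensity below the fitted threshold is flagged as not detected.
z = z_normalize(data + [10.0], gamma_hat, mu_hat, sigma_hat)
```

On this synthetic input the grid search should select the candidate equal to the true threshold, and the appended intensity of 10.0 is returned as NaN. With real, noisy data the misfit surface can be much flatter, so the candidate grid and its resolution would need to be chosen with care.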


