Typically extrema filtration techniques are
based on non-parametric properties such as magnitude of prominences and the
widths at half prominence, which cannot be used with data that possess a
dynamic nature. In this work, an extrema identification that is totally
independent of derivative-based approaches and independent of quantitative
attributes is introduced. For three consecutive positive terms arranged in a
line, the ratio (R) of the sum of the maximum and minimum to the sum of the three
terms is always 2/n, where n is the number of terms and 2/3 ≤ R ≤ 1 when n = 3.
R > 2/3 implies that one term is away from the other two terms. Applying
suitable modifications for the above stated hypothesis, the method was
developed and the method is capable of identifying peaks and valleys in any
signal. Furthermore, three techniques were developed for filtering
non-dominating, sharp, gradual, low and high extrema. Especially, all the
developed methods are non-parametric and suitable for analyzing processes that
have dynamic nature such as biogas data. The methods were evaluated using
automatically collected biogas data. Results showed that the extrema
identification method was capable of identifying local extrema with 0% error.
Furthermore, the non-parametric filtering techniques were able to distinguish
dominating, flat, sharp, high, and low extrema in the biogas data with high
robustness.
References
[1]
Mavron, V.C. and Phillips, T.N. (2007) Maxima and Minima. In: Mavron, V.C. and Phillips, T.N., Eds., Elements of Mathematics for Economics and Finance, Springer, London, 137-158.
[2]
Sande, H.V., Henrotte, F. and Hameyer, K. (2004) The Newton-Raphson Method for Solving Non-Linear and Anisotropic Time-Harmonic Problems. The International Journal for Computation and Mathematics in Electrical and Electronic Engineering, 23, 950-958.
http://dx.doi.org/10.1108/03321640410553373
[3]
Chioua, M., Srinivasan, B., Guay, M. and Perrier, M. (2007) Dependence of the Error in the Optimal Solution of Perturbation-Based Extremum Seeking Methods on the Excitation Frequency. The Canadian Journal of Chemical Engineering, 85, 447-453.
http://dx.doi.org/10.1002/cjce.5450850407
[4]
Khan, I.R. and Ohba, R. (1999) Closed-Form Expressions for the Finite Difference Approximations of First and Higher Derivatives Based on Taylor Series. Journal of Computational and Applied Mathematics, 107, 179-193.
http://dx.doi.org/10.1016/S0377-0427(99)00088-6
[5]
Gilgen, H. (2006) Univariate Time Series in Geosciences: Theory and Examples. Springer, Berlin.
[6]
Zou, H.-F., Zhang, Y.-K. and Lu, P.-C. (1991) The Prediction of the Peak Width at Half Height in HPLC. Chinese Journal of Chemistry, 9, 237-244.
http://dx.doi.org/10.1002/cjoc.19910090307
[7]
Antoniadis, A., Bigot, J. and Lambert-Lacroix, S. (2010) Peaks Detection and Alignment for Mass Spectrometry Data. Journal de la Société Fran?aise de Statistique, 151, 17-37.
[8]
Jeffries, N. (2005) Algorithms for Alignment of Mass Spectrometry Proteomic Data. Bioinformatics, 21, 3066-3073. http://dx.doi.org/10.1093/bioinformatics/bti482
[9]
Sauve, A.C. and Speed, T.P. (2004) Normalization, Baseline Correction and Alignment of High-Throughput Mass Spectrometry Data. Proceedings Gensips.
[10]
Mtetwa, N. and Smith, L.S. (2006) Smoothing and Thresholding in Neuronal Spike Detection. Neurocomputing, 69, 1366-1370. http://dx.doi.org/10.1016/j.neucom.2005.12.108
[11]
Tzallas, A.T., Oikonomou, V.P. and Fotiadis, D. (2006) Epileptic Spike Detection Using a Kalman Filter Based Approach. IEEE Engineering in Medicine and Biology Society Conference, 1, 501-504. http://dx.doi.org/10.1109/iembs.2006.260780
[12]
Gelb, A. (1974) Applied Optimal Estimation. MIT Press, Boston.
[13]
Shim, B., Min, H. and Yoon, S. (2009) Nonlinear Preprocessing Method for Detecting Peaks from Gas Chromatograms. BMC Bioinformatics, 10, 378.
http://dx.doi.org/10.1186/1471-2105-10-378
[14]
Scholkmann, F., Boss, J. and Wolf, M. (2012) An Efficient Algorithm for Automatic Peak Detection in Noisy Periodic and Quasi-Periodic Signals. Algorithms, 5, 588-603.
http://dx.doi.org/10.3390/a5040588
[15]
Gy?rfi, L., Kohler, M., Krzyzak, A. and Walk, H. (2006) A Distribution-Free Theory of Nonparametric Regression. Springer, New York.
Wasserman, L. (2006) All of Nonparametric Statistics. Springer, New York.
[18]
Kothari, C.R. (2004) Research Methodology: Methods and Techniques. New Age International (P) Limited, Delhi.
[19]
Li, J., Ray, S. and Lindsay, B.G. (2007) A Nonparametric Statistical Approach to Clustering via Mode Identification. Journal of Machine Learning Research, 8, 1687-1723.
[20]
Adikaram, K.K.L.B., Hussein, M.A., Effenberger, M. and Becker, T. (2014) Outlier Detection Method in Linear Regression Based on Sum of Arithmetic Progression. The Scientific World Journal, 2014, Article ID: 821623. http://dx.doi.org/10.1155/2014/821623
[21]
Adikaram, K.K.L.B., Hussein, M.A., Effenberger, M. and Becker, T. (2015) Universal Linear Fit Identification: A Method Independent of Data, Outliers and Noise Distribution Model and Free of Missing or Removed Data Imputation. PLoS ONE, 10, e0141486.
http://dx.doi.org/10.1371/journal.pone.0141486
[22]
Han, J., Kamber, M. and Pei, J. (2006) Data Mining, Southeast Asia Edition: Concepts and Techniques. Elsevier Science, Amsterdam.
[23]
Shalabi, L.A., Shaaban, Z. and Kasasbeh, B. (2006) Data Mining: A Preprocessing Engine. Journal of Computer Science, 2, 735-739. http://dx.doi.org/10.3844/jcssp.2006.735.739