Document Type : Research Articles
Authors
Yazd University
Abstract
This paper deals with the problem of signal modeling using fractional-order linear prediction. In this research, we obtain the closed-form expression of the optimum sampling frequency of the One-Parameter Fractional-order Linear Prediction (OPFLP) and examine the performance when the fractional order (alpha) is in (0<=alpha<=2). Our investigation focuses on determining optimum alpha within the individual ranges of 0<=alpha<=1 and 1<=alpha<=2 while considering various influential parameters, such as sampling frequency and environmental interferences. We initiate our study by examining the impact of the sampling frequency, a critical parameter that demands meticulous selection, on the optimal value of alpha. Simulation Results demonstrate that if the sampling rate falls within five to six times the maximum frequency of the signal under scrutiny, the optimal range for alpha resides within 1<=alpha<=2. Conversely, when the sampling frequency exceeds six times the maximum signal frequency, the optimal alpha shifts to 0<=alpha<=1. This observation underscores the crucial relationship between sampling frequency and the appropriate selection of the fractional order alpha for effective OPFLP performance. In the next step, we assess the robustness of OPFLP in handling challenging signal processing tasks, particularly in hands-free speech acquisition applications. We evaluate the model's performance and robustness against environmental interferences in three scenarios: noisy environments, reverberant environments, and noisy-reverberant settings. Simulation outcomes highlight OPFLP's superior robustness compared to second-order LP in handling environmental interferences. Furthermore, our investigations elucidate that noise exerts a more detrimental impact on OPFLP performance than reverberation, emphasizing the nuanced effects of these interferences on the model's efficacy.
Keywords
Main Subjects
[2] D. Baleanu, Z. B. Güvenç, J. A. T. Machado, and others, New trends in nanotechnology and fractional calculus applications. Springer, 2010.
[3] S. Das and I. Pan, Fractional order signal processing: introductory concepts and applications. Springer Science & Business Media, 2011.
[4] H. Sheng, Y. Chen, and T. Qiu, Fractional processes and fractional-order signal processing: techniques and applications. Springer Science & Business Media, 2011.
[5] E. Abbaszadeh-Soorami and M. Haddad-Zarif, “FractionalOrder Variable Structure Equations In Robust Control,” Authorea Prepr., 2023.
[6] M. Ghamgosar, S. M. Mirhosseini-Alizamini, and M. Dadkhah, “Design of optimal sliding mode control based on linear matrix inequality for fractional time-varying delay systems,” Int. J. Ind. Electron. Control Optim., vol. 5, no. 4, pp. 317–325, 2022.
[7] Y.-N. Li, H.-R. Sun, and Z. Feng, “Fractional abstract Cauchy problem with order in (1, 2),” Dyn. Partial Differ. Equations, vol. 13, no. 2, pp. 155–177, 2016.
[8] C. Li and M. Li, “Hölder regularity for abstract fractional Cauchy problems with order in (0, 1),” J. Appl. Math. Phys, vol. 6, pp. 310–319, 2018.
[9] J. D. Markel and A. H. J. Gray, Linear prediction of speech, vol. 12. Springer Science & Business Media, 2013.
[10] H. Yang, Y. Ye, D. Wang, and B. Jiang, “A novel fractionalorder signal processing based edge detection method,” in 11th International Conference on Control Automation Robotics & Vision, 2010, pp. 1122–1127.
[11] M. Athineos and D. P. W. Ellis, “Frequency-domain linear prediction for temporal features,” in IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No. 03EX721), 2003, pp. 261–266.
[12] S. Sadhu and H. Hermansky, “Complex Frequency Domain Linear Prediction: A Tool to Compute Modulation Spectrum of Speech,” arXiv Prepr. arXiv2203.13216, 2022.
[13] B. Jo and S. Beack, “Representations of the complex-valued frequency-domain LPC for audio coding,” IEEE Signal Process. Lett., 2024.
[14] V. Despotovic, T. Skovranek, and Z. Peric, “One-parameter fractional linear prediction,” Comput. & Electr. Eng., vol. 69, pp. 158–170, 2018.
[15] T. Skovranek and V. Despotovic, “Audio signal processing using fractional linear prediction,” Mathematics, vol. 7, no. 7, p. 580, 2019.
[16] T. Skovranek, V. Despotovic, and Z. Peric, “Optimal fractional linear prediction with restricted memory,” IEEE Signal Process. Lett., vol. 26, no. 5, pp. 760–764, 2019.
[17] K. Xu and X. Song, “A Current Noise Cancellation Method Based on Fractional Linear Prediction for Bearing Fault Detection,” Sensors, vol. 24, no. 1, p. 52, 2023.
[18] A. A. Navish, M. Priya, and R. Uthayakumar, “A comparative study on estimation of fractal dimension of EMG signal using SWT and FLP,” Comput. Methods Biomech. Biomed. Eng. Imaging ¥& Vis., vol. 11, no. 3, pp. 586–597, 2023.
[19] E. A. P. Habets and J. Benesty, “A two-stage beamforming approach for noise reduction and dereverberation,” IEEE Trans. Audio. Speech. Lang. Process, vol. 21, no. 5, pp. 945–958, 2013.
[20] V. W. Neo, C. Evers, and P. A. Naylor, “Enhancement of noisy reverberant speech using polynomial matrix eigenvalue decomposition,” IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 29, pp. 3255–3266, 2021.
[21] N. Yazdi and K. Todros, “Measure-transformed MVDR beamforming,” IEEE Signal Process. Lett., vol. 27, pp. 1959–1963, 2020.
[22] S. Zhang and X. Li, “Microphone array generalization for multichannel narrowband deep speech enhancement,” arXiv Prepr. arXiv2107.12601, 2021.
[23] J. Benesty, J. Chen, and E. A. P. Habets, Speech enhancement in the STFT domain. Springer Science & Business Media, 2011.
[24] A. W. Rix, J. G. Beerends, M. P. Hollier, and A. P. Hekstra, “Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs,” in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP),, 200, vol. 2, pp. 749–752, May 2001.
[25] J. B. Allen and D. A. Berkley, “Image method for efficiently simulating small-room acoustics,” J. Acoust. Soc. Am., vol. 65, no. 4, pp. 943–950, 1979.
[26] E. A. P. Habets, I. Cohen, and S. Gannot, “Generating nonstationary multisensor signals under a spatial coherence constraint,” J. Acoust. Soc. Am., vol. 124, no. 5, pp. 2911– 2917, 2008.
[27] J. S. Garofolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallet, and N. L. Dahlgren, “TIMIT Acoustic Phonetic Continuous Speech Corpus,” Linguist. Data Consortium, 1993, Accessed: May 24, 2019 [Online].Available:http://ci.nii.ac.jp/naid/20000921365/en/.