Results
- Showing results for:
- Reset all filters
Search results
-
Conference paperCauchi B, Naylor PA, Gerkmann T, et al., 2015,
LATE REVERBERANT SPECTRAL VARIANCE ESTIMATION USING ACOUSTIC CHANNEL EQUALIZATION
, 23rd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 2481-2485, ISSN: 2076-1465 -
Conference paperLim F, Naylor PA, Thomas MRP, et al., 2015,
ACOUSTIC BLUR KERNEL WITH SLIDING WINDOW FOR BLIND ESTIMATION OF REVERBERATION TIME
, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Publisher: IEEE, ISSN: 1931-1168- Author Web Link
- Cite
- Citations: 1
-
Conference paperHu M, Parada PP, Sharma D, et al., 2015,
SINGLE-CHANNEL SPEAKER DIARIZATION BASED ON SPATIAL FEATURES
, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Publisher: IEEE, ISSN: 1931-1168 -
Conference paperSharma D, Poddar A, Manna S, et al., 2015,
THE SAS PROJECT: SPEECH SIGNAL PROCESSING IN HIGH SCHOOL EDUCATION
, 23rd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 1781-1785, ISSN: 2076-1465- Author Web Link
- Cite
- Citations: 2
-
Journal articleLim F, Zhang W, Habets EAP, et al., 2014,
Robust multichannel dereverberation using relaxed multichannel least squares
, IEEE ACM Transactions on Audio, Speech, and Language Processing, Vol: 22, Pages: 1379-1390, ISSN: 1558-7916A novel approach is proposed for robust multichannel dereverberation in the presence of system identification error (SIEs), based on channel shortening. A mathematical link is derived between the well known multiple-input/output inverse theorem (MINT) algorithm and channel shortening. The relaxed multichannel least squares (RMCLS) algorithm is then proposed as an efficient realization within the channel shortening paradigm and is shown through experimental results to outperform MINT in the presence of SIEs. While the RMCLS is robust to SIEs, the coloration of the output cannot be controlled. Two extensions to RMCLS are proposed to control the level of coloration and the performances of both extensions are evaluated comparatively. It is shown that both substantially maintain the dereverberation performance and robustness to SIEs obtained from RMCLS while effectively controlling the level of coloration introduced.
-
Conference paperEvers C, Moore AH, Naylor PA, 2014,
Multiple source localisation in the spherical harmonic domain
-
Conference paperMoore AH, Naylor PA, Skoglund J, 2014,
An Analysis of the Effect of Larynx-Synchronous Averaging on Dereverberation of Voiced Speech
, European Signal Processing Conference, ISSN: 2219-5491 -
Conference paperEaton J, Naylor PA, 2014,
Detection of clipping in coded speech signals
, 21st European Signal Processing Conference (EUSIPCO), Publisher: IEEEIn order to exploit the full dynamic range of communicationsand recording equipment, and to minimise the effects of noiseand interference, input gain to a recording device is typicallyset as high as possible. This often leads to the signal exceedingthe input limit of the equipment resulting in clipping. Com-munications devices typically rely on codecs such as GSM06.10to compress voice signals into lower bitrates. Althoughdetecting clipping in a hard-clipped speech signal is straight-forward due to the characteristic flattening of the peaks of thewaveform, this is not the case for speech that has subsequentlypassed through a codec. We describe a novel clipping detec-tion algorithm based on amplitude histogram analysis and leastsquares residuals which can estimate the clipped samples andthe original signal level in speech even after the clipped speechhas been perceptually coded.
-
Journal articleJarrett DP, Taseska M, Habets EAP, et al., 2014,
Noise Reduction in the Spherical Harmonic Domain Using a Tradeoff Beamformer and Narrowband DOA Estimates
, IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol: 22, Pages: 965-976 -
Conference paperEaton J, Naylor PA, 2014,
Noise-robust detection of peak-clipping in decoded speech
, Pages: 7019-7023Clipping is a commonplace problem in voice telecommunications and detection of clipping is useful in a range of speech processing applications. We analyse and evaluate the performance of three previously presented algorithms for clipping detection in decoded speech in high levels of ambient noise. We identify a baseline method which is well known for clipping detection, determine experimentally the optimized operation parameter for the baseline approach, and use this in our experiments. Our results indicate that the new algorithms outperform the baseline except at extreme levels of clipping and negative signal-to-noise ratios.
-
Conference paperStanton R, Gaubitch N, Naylor P, et al., 2014,
A Differentiable Approximation to Speech Intelligibility Index with Applications to Listening Enhancement
, AES Intl Conf on Audio ForensicsThe Speech Intelligibility Index is a standardised objective measure for estimating the intelligibility of speech in noise. It is, however difficult to use it in the iterative optimisation of speech enhancement algorithms because it is a discontinuous function of its input parameters. In this paper, we derive an approximation for the Speech Intelligibility Index that is both continuous and differentiable, which allows for more efficient optimisation procedures. The use of the approximation is demonstrated in an application to near-end speech enhancement.
-
Conference paperParada PP, Sharma D, Naylor PA, 2014,
Non-intrusive estimation of the level of reverberation in speech
, Pages: 4718-4722, ISSN: 1520-6149We show corroborating evidence that, among a set of common acoustic parameters, the clarity index C50 provides a measure of reverberation that is well correlated with speech recognition accuracy. We also present a data driven method for non-intrusive C50 parameter estimation from a single channel speech signal. The method extracts a number of features from the speech signal and uses a binary regression tree, trained on appropriate training data, to estimate the C50. Evaluation is carried out using speech utterances convolved with real and simulated room impulse responses, and additive babble noise. The new method outperforms a baseline approach in our evaluation. © 2014 IEEE.
-
Conference paperCosta MH, Naylor PA, 2014,
ILD PRESERVATION IN THE MULTICHANNEL WIENER FILTER FOR BINAURAL HEARING AID APPLICATIONS
, 22nd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 636-640, ISSN: 2076-1465- Author Web Link
- Cite
- Citations: 7
-
Conference paperZahedi A, Ostergaard J, Jensen SH, et al., 2014,
Distributed Remote Vector Gaussian Source Coding for Wireless Acoustic Sensor Networks
, Data Compression Conference (DCC), Publisher: IEEE COMPUTER SOC, Pages: 263-272, ISSN: 1068-0314 -
Conference paperAntonello N, van Waterschool T, Moonen M, et al., 2014,
SOURCE LOCALIZATION AND SIGNAL RECONSTRUCTION IN A REVERBERANT FIELD USING THE FDTD METHOD
, 22nd European Signal Processing Conference (EUSIPCO), Publisher: IEEE, Pages: 301-305, ISSN: 2076-1465- Author Web Link
- Cite
- Citations: 7
-
Conference paperSharma D, Meredith L, Lainez J, et al., 2014,
A Non-Intrusive PESQ Measure
, IEEE Global Conference on Signal and Information Processing (GlobalSIP), Publisher: IEEE, Pages: 975-978- Author Web Link
- Cite
- Citations: 10
-
Conference paperParada PP, Sharma D, Naylor PA, et al., 2014,
REVERBERANT SPEECH RECOGNITION: A PHONEME ANALYSIS
, IEEE Global Conference on Signal and Information Processing (GlobalSIP), Publisher: IEEE, Pages: 567-571- Author Web Link
- Cite
- Citations: 4
-
Conference paperZahedi A, Ostergaard J, Jensen SH, et al., 2014,
Distributed Remote Vector Gaussian Source Coding with Covariance Distortion Constraints
, IEEE International Symposium on Information Theory (ISIT), Publisher: IEEE, Pages: 586-590- Author Web Link
- Cite
- Citations: 3
-
Conference paperThomas MRP, Tashev IJ, Lim F, et al., 2014,
OPTIMAL BEAMFORMING AS A TIME DOMAIN EQUALIZATION PROBLEM WITH APPLICATION TO ROOM ACOUSTICS
, 14th International Workshop on Acoustic Signal Enhancement (IWAENC), Publisher: IEEE, Pages: 75-79- Author Web Link
- Cite
- Citations: 3
-
Conference paperLim F, Naylor PA, 2014,
STATISTICAL MODELLING OF MULTICHANNEL BLIND SYSTEM IDENTIFICATION ERRORS
, 14th International Workshop on Acoustic Signal Enhancement (IWAENC), Publisher: IEEE, Pages: 119-123- Author Web Link
- Cite
- Citations: 4
This data is extracted from the Web of Science and reproduced under a licence from Thomson Reuters. You may not copy or re-distribute this data in whole or in part without the written consent of the Science business of Thomson Reuters.
Contact us
Address
Speech and Audio Processing Lab
CSP Group, EEE Department
Imperial College London
Exhibition Road, London, SW7 2AZ, United Kingdom