University of Sussex
Browse

File(s) not publicly available

Defining properties of speech spectrogram images to allow effective pre-processing prior to pattern recognition

journal contribution
posted on 2023-06-08, 21:12 authored by Mohammed Al-Darkazali, Rupert YoungRupert Young, Chris ChatwinChris Chatwin, Phil BirchPhil Birch
The speech signal of a word is a combination of frequencies which can produce specific transition frequency shapes. These can be regarded as a written text in some unknown ‘script’. Before attempting methods to read the speech spectrogram image using image processing techniques we need first to define the properties of the speech spectrogram image as well as the reduction of the clutter of the spectrogram image and the selection of the methods to be employed for image matching. Thus methods to convert the speech signal to a spectrogram image are initially employed, followed by reduction of the noise in the signal by capturing the energy associated with formants of the speech signal. This is followed by the normalisation of the size of the image and its resolution of in both the frequency and time axes. Finally, template matching methods are employed to recognise portions of text and isolated words. The paper describes the pre-processing methods employed and outlines the use of normalised grey-level correlation for the recognition of words. © (2013) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.

History

Publication status

  • Published

Journal

Proceedings of the SPIE Laser Interactions with Materials

ISSN

0277-786X

Publisher

SPIE

Volume

8748

Page range

87480G

Department affiliated with

  • Engineering and Design Publications

Full text available

  • No

Peer reviewed?

  • Yes

Legacy Posted Date

2015-06-22

Usage metrics

    University of Sussex (Publications)

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC