University of Sussex
Browse
6-Cahill_revised2-Check2.pdf (427.66 kB)

What are the “phonemes” in phoneme-grapheme mappings? A perspective on the use of databases for lexicon development

Download (427.66 kB)
journal contribution
posted on 2023-06-09, 12:21 authored by Lynne CahillLynne Cahill
The CELEX lexical database (Baayen, Piepenbrock & van Rijn 1995) was developed in the 1990s, providing a database of the syntactic, morphological, phonological and orthographic forms of between 50,000 and 125,000 words of Dutch, English and German. This database was used as the basis for the development of the PolyLex lexicons, which included syntactic, morphological and phonological information for around 3,000 words of Dutch, English and German. Orthographic information was subsequently added in the PolyOrth project. The PolyOrth project was based on the assumption that the underlying, lexical phonological forms could be used to derive the surface orthographic forms by means of a combination of phoneme-grapheme mappings and sets of autonomous spelling rules for each language. One of the complications encountered during the project was the fact that the phonological forms in CELEX were not always genuinely underlying forms which made deriving the orthographic form tricky. This paper discusses the nature of lexical databases and the implications for using such databases for research on writing systems and their relationship to phonology.

History

Publication status

  • Published

File Version

  • Accepted version

Journal

Journal of Written Language and Literacy

ISSN

1387-6732

Publisher

John Benjamins

Issue

1

Volume

20

Page range

104-127

Department affiliated with

  • English Publications

Research groups affiliated with

  • Centre for Cognitive Science Publications

Full text available

  • Yes

Peer reviewed?

  • Yes

Legacy Posted Date

2018-08-23

First Open Access (FOA) Date

2018-08-23

First Compliant Deposit (FCD) Date

2018-08-18

Usage metrics

    University of Sussex (Publications)

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC