University of Sussex
Browse
CEA2017_yamakata_FontEmbedded.pdf (144.08 kB)

A comparison of cooking recipe named entities between Japanese and English

Download (144.08 kB)
conference contribution
posted on 2023-06-09, 07:00 authored by Yoko Yamakata, John Carroll, Shinsuke Mori
In this paper, we analyze the structural differences between the instructional text in Japanese and English cooking recipes. First, we constructed an English recipe corpus of 100 recipes, designed to be comparable to an existing Japanese recipe corpus. We annotated recipe named entities (r-NEs) in the English corpus according to guidelines previously defined for Japanese. We trained a state-of-art NE recognizer, PWNER, on the English r-NEs, and achieved very similar accuracy and coverage to previous results for the Japanese corpus, thus demonstrating the quality and consistency of the annotations. Second, we compared the r-NEs annotated in the Japanese and English corpora, and uncovered lexical, semantic, and underlying structural differences between Japanese and English recipes. We discuss reasons for these differences, which have significant implications for cross-language retrieval and automatic translation of recipes.

History

Publication status

  • Published

File Version

  • Accepted version

Journal

Proceedings of the 9th Workshop on Multimedia for Cooking and Eating Activities (CEA2017); Melbourne, Australia; 20 August 2017

Publisher

Association for Computing Machinery

Page range

7-12

ISBN

9781450352673

Department affiliated with

  • Informatics Publications

Research groups affiliated with

  • Data Science Research Group Publications

Full text available

  • Yes

Peer reviewed?

  • Yes

Legacy Posted Date

2017-07-05

First Open Access (FOA) Date

2017-08-30

First Compliant Deposit (FCD) Date

2017-07-05

Usage metrics

    University of Sussex (Publications)

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC