CEA2017_yamakata_FontEmbedded.pdf (144.08 kB)
A comparison of cooking recipe named entities between Japanese and English
conference contribution
posted on 2023-06-09, 07:00 authored by Yoko Yamakata, John Carroll, Shinsuke MoriIn this paper, we analyze the structural differences between the instructional text in Japanese and English cooking recipes. First, we constructed an English recipe corpus of 100 recipes, designed to be comparable to an existing Japanese recipe corpus. We annotated recipe named entities (r-NEs) in the English corpus according to guidelines previously defined for Japanese. We trained a state-of-art NE recognizer, PWNER, on the English r-NEs, and achieved very similar accuracy and coverage to previous results for the Japanese corpus, thus demonstrating the quality and consistency of the annotations. Second, we compared the r-NEs annotated in the Japanese and English corpora, and uncovered lexical, semantic, and underlying structural differences between Japanese and English recipes. We discuss reasons for these differences, which have significant implications for cross-language retrieval and automatic translation of recipes.
History
Publication status
- Published
File Version
- Accepted version
Journal
Proceedings of the 9th Workshop on Multimedia for Cooking and Eating Activities (CEA2017); Melbourne, Australia; 20 August 2017Publisher
Association for Computing MachineryExternal DOI
Page range
7-12ISBN
9781450352673Department affiliated with
- Informatics Publications
Research groups affiliated with
- Data Science Research Group Publications
Full text available
- Yes
Peer reviewed?
- Yes
Legacy Posted Date
2017-07-05First Open Access (FOA) Date
2017-08-30First Compliant Deposit (FCD) Date
2017-07-05Usage metrics
Categories
No categories selectedLicence
Exports
RefWorks
BibTeX
Ref. manager
Endnote
DataCite
NLM
DC