Annotating a corpus of clinical text records for learning to recognize symptoms automatically
chapter
Posted on 2023-06-08, 00:08; authored by Rob Koeling, John Carroll, Rosemary Tate, Amanda Nicholson

We report on a research effort to create a corpus of clinical free text records enriched with annotation for symptoms of a particular disease (ovarian cancer). We describe the original data, the annotation procedure and the resulting corpus. The data (approximately 192K words) was annotated by three clinicians and a procedure was devised to resolve disagreements. We are using the corpus to investigate the amount of symptom-related information in clinical records that is not coded, and to develop techniques for recognizing these symptoms automatically in unseen text.
History
Publication status
- Published
File Version
- Published version
Publisher
Norwegian University of Science and Technology
Volume
744
Page range
43-50
Pages
82
Event name
Louhi 2011: The third international workshop on health documentation text mining and information analysis
Event location
Bled, Slovenia
Event type
conference
Event date
July 6, 2011
Book title
Proceedings of LOUHI 2011 Third International Workshop on Health Document Text Mining and Information Analysis
Place of publication
Trondheim, Norway
ISSN
1613-0073
Series
CEUR Workshop Proceedings
Department affiliated with
- Primary Care and Public Health Publications
Notes
E-publication
Full text available
- Yes
Peer reviewed?
- Yes
Editors
Laura Slaughter, Øystein Nytrø, Hans Moen
Legacy Posted Date
2012-02-06
First Open Access (FOA) Date
2016-03-22
First Compliant Deposit (FCD) Date
2016-03-22