brainsteam.co.uk/brainsteam/content/annotations/2022/11/23/1669236425.md

4.1 KiB
Raw Blame History

date hypothesis-meta in-reply-to tags type url
2022-11-23T20:47:05
created document flagged group hidden id links permissions tags target text updated uri user user_info
2022-11-23T20:47:05.414293+00:00
title
Towards Automatic Curation of Antibiotic Resistance Genes via Statement Extraction from Scientific Papers: A Benchmark Dataset and Models
false __world__ false _Vj2omtvEe2z-rfNY4eZiw
html incontext json
https://hypothes.is/a/_Vj2omtvEe2z-rfNY4eZiw https://hyp.is/_Vj2omtvEe2z-rfNY4eZiw/aclanthology.org/2022.bionlp-1.40.pdf https://hypothes.is/api/annotations/_Vj2omtvEe2z-rfNY4eZiw
admin delete read update
acct:ravenscroftj@hypothes.is
acct:ravenscroftj@hypothes.is
group:__world__
acct:ravenscroftj@hypothes.is
prompt-models
NLProc
selector source
end start type
1532 444 TextPositionSelector
exact prefix suffix type
Antibiotic resistance has become a growingworldwide concern as new resistance mech-anisms are emerging and spreading globally,and thus detecting and collecting the cause Antibiotic Resistance Genes (ARGs), havebeen more critical than ever. In this work,we aim to automate the curation of ARGs byextracting ARG-related assertive statementsfrom scientific papers. To support the researchtowards this direction, we build SCIARG, anew benchmark dataset containing 2,000 man-ually annotated statements as the evaluationset and 12,516 silver-standard training state-ments that are automatically created from sci-entific papers by a set of rules. To set upthe baseline performance on SCIARG, weexploit three state-of-the-art neural architec-tures based on pre-trained language modelsand prompt tuning, and further ensemble themto attain the highest 77.0% F-score. To the bestof our knowledge, we are the first to leveragenatural language processing techniques to cu-rate all validated ARGs from scientific papers.Both the code and data are publicly availableat https://github.com/VT-NLP/SciARG. g,clb21565,lifuh}@vt.eduAbstract 1 IntroductionAntibiotic resista TextQuoteSelector
https://aclanthology.org/2022.bionlp-1.40.pdf
The authors use prompt training on LLMs to build a classifier that can identify statements that describe whether or not micro-organisms have antibiotic resistant genes in scientific papers. 2022-11-23T20:47:05.414293+00:00 https://aclanthology.org/2022.bionlp-1.40.pdf acct:ravenscroftj@hypothes.is
display_name
James Ravenscroft
https://aclanthology.org/2022.bionlp-1.40.pdf
prompt-models
NLProc
hypothesis
annotation /annotation/2022/11/23/1669236425
Antibiotic resistance has become a growingworldwide concern as new resistance mech-anisms are emerging and spreading globally,and thus detecting and collecting the cause Antibiotic Resistance Genes (ARGs), havebeen more critical than ever. In this work,we aim to automate the curation of ARGs byextracting ARG-related assertive statementsfrom scientific papers. To support the researchtowards this direction, we build SCIARG, anew benchmark dataset containing 2,000 man-ually annotated statements as the evaluationset and 12,516 silver-standard training state-ments that are automatically created from sci-entific papers by a set of rules. To set upthe baseline performance on SCIARG, weexploit three state-of-the-art neural architec-tures based on pre-trained language modelsand prompt tuning, and further ensemble themto attain the highest 77.0% F-score. To the bestof our knowledge, we are the first to leveragenatural language processing techniques to cu-rate all validated ARGs from scientific papers.Both the code and data are publicly availableat https://github.com/VT-NLP/SciARG.
The authors use prompt training on LLMs to build a classifier that can identify statements that describe whether or not micro-organisms have antibiotic resistant genes in scientific papers.