Adverse Events in Twitter-Development of a Benchmark Reference Dataset: Results from IMI WEB-RADR

Dietrich, Juergen; Gattepaille, Lucie M.; Grum, Britta Anne; Jiri, Letitia; Lerch, Magnus; Sartori, Daniele; Wisniewski, Antoni

Adverse Events in Twitter-Development of a Benchmark Reference Dataset: Results from IMI WEB-RADR

Juergen Dietrich (), Lucie M. Gattepaille, Britta Anne Grum, Letitia Jiri, Magnus Lerch, Daniele Sartori and Antoni Wisniewski
Additional contact information
Juergen Dietrich: Pharmacovigilance, Bayer AG
Lucie M. Gattepaille: Uppsala Monitoring Centre
Britta Anne Grum: Pharmacovigilance, Bayer AG
Letitia Jiri: Global Patient Safety Pharmacovigilance Operations, Amgen Limited
Magnus Lerch: Lenolution GmbH
Daniele Sartori: Uppsala Monitoring Centre
Antoni Wisniewski: Global Regulatory Affairs, Patient Safety and Quality Assurance, Global Medicines Development, AstraZeneca

Drug Safety, 2020, vol. 43, issue 5, No 7, 467-478

Abstract: Abstract Introduction and Objective Social media has been suggested as a source for safety information, supplementing existing safety surveillance data sources. This article summarises the activities undertaken, and the associated challenges, to create a benchmark reference dataset that can be used to evaluate the performance of automated methods and systems for adverse event recognition. Methods A retrospective analysis of public English-language Twitter posts (Tweets) was performed. We sampled 57,473 Tweets out of 5,645,336 Tweets created between 1 March, 2012 and 1 March, 2015 that mentioned at least one of six medicinal products of interest (insulin glargine, levetiracetam, methylphenidate, sorafenib, terbinafine, zolpidem). Products, adverse events, indications, product-event combinations, and product-indication combinations were extracted and coded by two independent teams of safety reviewers. Results The benchmark reference dataset consisted of 1056 positive controls (“adverse event Tweets”) and 56,417 negative controls (“non-adverse event Tweets”). The 1056 adverse event Tweets contained 1396 product-event combinations referring to personal adverse event experiences, comprising 292 different MedDRA® Preferred Terms. The 1171 product-event combinations (83.9%) were confined to four MedDRA® System Organ Classes. The 195 Tweets (18.5%) contained indication information, comprising 25 different Preferred Terms. Conclusions A manually curated benchmark reference dataset based on Twitter data has been created and is made available to the research community to evaluate the performance of automated methods and systems for adverse event recognition in unstructured free-text information.

Date: 2020
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
http://link.springer.com/10.1007/s40264-020-00912-9 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:drugsa:v:43:y:2020:i:5:d:10.1007_s40264-020-00912-9

Ordering information: This journal article can be ordered from
http://www.springer.com/adis/journal/40264

DOI: 10.1007/s40264-020-00912-9

Access Statistics for this article

Drug Safety is currently edited by Nitin Joshi

More articles in Drug Safety from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().