Sökning: WFRF:(Westman Gabriel 1977 ) >
A natural language ...
A natural language processing approach towards harmonisation of European medicinal product information
-
- Bergman, Erik (författare)
- Läkemedelsverket
-
- Sherwood, Kim (författare)
- Läkemedelsverket
-
- Forslund, Markus (författare)
- Läkemedelsverket
-
visa fler...
-
- Arlett, Peter (författare)
- European Medicines Agency, Amsterdam, Netherlands
-
- Westman, Gabriel, 1977- (författare)
- Uppsala universitet,Infektionsmedicin,Läkemedelsverket
-
visa färre...
-
(creator_code:org_t)
- 2022-10-20
- 2022
- Engelska.
-
Ingår i: PLOS ONE. - : Public Library of Science (PLoS). - 1932-6203. ; 17:10
- Relaterad länk:
-
https://doi.org/10.1...
-
visa fler...
-
https://uu.diva-port... (primary) (Raw object)
-
https://urn.kb.se/re...
-
https://doi.org/10.1...
-
visa färre...
Abstract
Ämnesord
Stäng
- Product information (PI) is a vital part of any medicinal product approved for use within the European Union and consists of a summary of products characteristics (SmPC) for healthcare professionals and package leaflet (PL) for patients, together with the product packaging. In this study, based on the English corpus of the EMA product information documents for all centrally approved medicinal products within the EU, a BERT sentence embedding model was used together with clustering and dimensional reduction techniques to identify sentence similarity clusters that could be candidates for standardization. A total of 1258 medicinal products were included in the study. From these, a total of 783 K sentences were extracted from SmPC and PL documents which were aggregated into a total of 284 and 129 semantic similarity clusters, respectively. The spread distribution among clusters shows separation into different cluster types. Examples of clusters with low spread include those with identical word embeddings due to current standardization, such as section headings and standard phrases. Others show minor linguistic variations, while the group with the largest variability contains variable wording but with significant semantic overlap. The sentence clusters identified could serve as candidates for further standardization of the PI. Moving from free text human wording to auto-generated text elements based on multiple-choice input for appropriate parts of the package leaflet and summary of product characteristics, could reduce both time and complexity for applicants as well as regulators, and ultimately provide patients and prescribers with documents that are easier to understand and better adapted for search availabilities.
Ämnesord
- MEDICIN OCH HÄLSOVETENSKAP -- Medicinska och farmaceutiska grundvetenskaper -- Farmaceutiska vetenskaper (hsv//swe)
- MEDICAL AND HEALTH SCIENCES -- Basic Medicine -- Pharmaceutical Sciences (hsv//eng)
- HUMANIORA -- Språk och litteratur -- Jämförande språkvetenskap och allmän lingvistik (hsv//swe)
- HUMANITIES -- Languages and Literature -- General Language Studies and Linguistics (hsv//eng)
- SAMHÄLLSVETENSKAP -- Medie- och kommunikationsvetenskap -- Systemvetenskap, informationssystem och informatik med samhällsvetenskaplig inriktning (hsv//swe)
- SOCIAL SCIENCES -- Media and Communications -- Information Systems, Social aspects (hsv//eng)
Publikations- och innehållstyp
- ref (ämneskategori)
- art (ämneskategori)
Hitta via bibliotek
-
PLOS ONE
(Sök värdpublikationen i LIBRIS)
Till lärosätets databas