NLP-KG
Semantic Search

Publication:

Clinical Text Classification to SNOMED CT Codes Using Transformers Trained on Linked Open Medical Ontologies

Anton HristovPetar IvanovAnna AksenovaT. AsamovPavlin GyurovTodor PrimovS. Boytcheva • @International Conference Recent Advances in Natural Language Processing • 01 January 2023

TLDR: An approach for medical text coding with SNOMED CT using transformers enhanced with clustering and filtering of the candidates and a classical machine learning approach - support vector classification (SVC) using transformer embeddings is adopted.

Citations: 1
Abstract: We present an approach for medical text coding with SNOMED CT. Our approach uses publicly available linked open data from terminologies and ontologies as training data for the algorithms. We claim that even small training corpora made of short text snippets can be used to train models for the given task. We propose a method based on transformers enhanced with clustering and filtering of the candidates. Further, we adopt a classical machine learning approach - support vector classification (SVC) using transformer embeddings. The resulting approach proves to be more accurate than the predictions given by Large Language Models. We evaluate on a dataset generated from linked open data for SNOMED codes related to morphology and topography for four use cases. Our transformers-based approach achieves an F1-score of 0.82 for morphology and 0.99 for topography codes. Further, we validate the applicability of our approach in a clinical context using labelled real clinical data that are not used for model training.

Related Fields of Study

loading

Citations

Sort by
Previous
Next

Showing results 1 to 0 of 0

Previous
Next

References

Sort by
Previous
Next

Showing results 1 to 0 of 0

Previous
Next