From 7706faf9365c68100b0e92b9a5f89b7913b46fcd Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Mario=20Sa=CC=88nger?= <mario.saenger@student.hu-berlin.de> Date: Thu, 24 May 2018 15:09:30 +0200 Subject: [PATCH] Extend introduction text --- paper/10_introduction.tex | 13 ++++++++++++- 1 file changed, 12 insertions(+), 1 deletion(-) diff --git a/paper/10_introduction.tex b/paper/10_introduction.tex index e65ceac..d291b44 100644 --- a/paper/10_introduction.tex +++ b/paper/10_introduction.tex @@ -21,4 +21,15 @@ Classification of Disease version 10 (ICD-10). The task has been carried out the last two years of the lab, however was only concerned with French and English certificates. In contrast, the organizers provided annotated death reports as well as ICD-10 dictionaries for French, Italian and Hungarian this year. The -development of language-independent, multilingual approaches was encouraged. \ No newline at end of file +development of language-independent, multilingual approaches was encouraged. + +Inspired by the recent success of recurrent neural network models +\cite{cho_learning_2014,lample_neural_2016,dyer_transition-based_2015} in +general and the convincing performance of the work from Miftahutdinov and +Tutbalina \cite{miftakhutdinov_kfu_2017} in the last year's competition we opt +for the development of a deep learning model for this year's task. Our work +introduces a language independent approach for ICD-10 classification using +multi-language word embeddings and LSTM-based recurrent models. We divide the +the classification into two tasks. First, we extract symptoms from a certificate +line backed by an encoder-decoder model. Given the symptoms the actual ICD-10 +classification will be performed by a separate LSTM model. \ No newline at end of file -- GitLab