cv
Basics
Name | Kirill Milintsevich |
Label | Researcher (NLP / Medical AI) |
me@milintsevich.com | |
Url | https://scholar.google.com/citations?user=BQNVCjYAAAAJ&hl=en |
Summary | Throughout my academic career, I have explored a range of topics within the field of NLP, including, but not limited to, low-resource languages, morphology, language generation, and mental health. |
Work
-
2024.12 - Present Postdoctoral Researcher
Institut national de l'audiovisuel
Working with the transcriptions of the TV broadcastings.
-
2022.05 - 2022.08 NLP Engineer
TransPerfect
Developing the inverse text normalization system for the automatic speech recognition pipeline.
-
2021.06 - 2021.09 Computational Linguist
TransPerfect
Developing the inverse text normalization system for the automatic speech recognition pipeline.
Education
-
2020.10 - 2024.10 Caen, France
-
2018.09 - 2020.08 Tartu, Estonia
-
2016.09 - 2018.08 Moscow, Russia
-
2012.09 - 2016.08 Vladivostok, Russian
Publications
-
2024.06 Evaluating Lexicon Incorporation for Depression Symptom Estimation
6th Clinical Natural Language Processing Workshop (Clinical NLP) at NAACL 2024
This paper explores the impact of incorporating sentiment, emotion, and domain-specific lexicons into a transformer-based model for depression symptom estimation. Lexicon information is added by marking the words in the input transcripts of patient-therapist conversations as well as in social media posts. Overall results show that the introduction of external knowledge within pre-trained language models can be beneficial for prediction performance, while different lexicons show distinct behaviours depending on the targeted task. Additionally, new state-of-the-art results are obtained for the estimation of depression level over patient-therapist interviews.
-
2024.05 Analyzing Symptom-based Depression Level Estimation through the Prism of Psychiatric Expertise
Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
The ever-growing number of people suffering from mental distress has motivated significant research initiatives towards automated depression estimation. Despite the multidisciplinary nature of the task, very few of these approaches include medical professionals in their research process, thus ignoring a vital source of domain knowledge. In this paper, we propose to bring the domain experts back into the loop and incorporate their knowledge within the gold-standard DAIC-WOZ dataset. In particular, we define a novel transformer-based architecture and analyse its performance in light of our expert annotations. Overall findings demonstrate a strong correlation between the psychological tendencies of medical professionals and the behavior of the proposed model, which additionally provides new state-of-the-art results.
-
2024.03 Your Model Is Not Predicting Depression Well And That Is Why: A Case Study of PRIMATE Dataset
9th Workshop on Computational Linguistics and Clinical Psychology (CLPsych 2024)
This paper addresses the quality of annotations in mental health datasets used for NLP-based depression level estimation from social media texts. While previous research relies on social media-based datasets annotated with binary categories, i.e. depressed or non-depressed, recent datasets such as D2S and PRIMATE aim for nuanced annotations using PHQ-9 symptoms. However, most of these datasets rely on crowd workers without the domain knowledge for annotation. Focusing on the PRIMATE dataset, our study reveals concerns regarding annotation validity, particularly for the lack of interest or pleasure symptom. Through reannotation by a mental health professional, we introduce finer labels and textual spans as evidence, identifying a notable number of false positives. Our refined annotations, to be released under a Data Use Agreement, offer a higher-quality test set for anhedonia detection. This study underscores the necessity of addressing annotation quality issues in mental health datasets, advocating for improved methodologies to enhance NLP model reliability in mental health assessments.
Languages
Russian | |
Native speaker |
English | |
Fluent |
French | |
Fluent |