I am a PhD student at the Department of Computer and Systems Sciences at Stockholm University. The focus of my research is privacy-preserving techniques in natural language processing. My PhD project is funded primarily by the DataLEASH project and I am supervised by Professor Hercules Dalianis and Professor Aron Henriksson.
I have a MSc in computer science and engineering (civ.ing. i datateknik) from KTH Royal Institute of Technology. I worked in the tech industry as an IT consultant before starting my PhD, primarily as a back-end developer and data engineer.
In Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025)
BMC Medical Informatics and Decision Making special issue on Health information privacy and security (2024)
BMC Medical Informatics and Decision Making special issue on Health information privacy and security (2024)
In Proceedings of the 6th Clinical Natural Language Processing Workshop @ NAACL 2024
In Proceedings of the Workshop on Computational Approaches to Language Data Pseudonymization (CALD-pseudo) @ EACL2024
In AMIA Annual Symposium Proceedings 2023
In Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa 2023)
In Proceedings of the 18th Scandinavian Conference on Health Informatics (SHI 2022)
In Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022)
In Proceedings of the Legal and Ethical Issues Workshop @ LREC2022
In Proceedings of the TERM21 Workshop @ LREC 2022
In Proceedings of the 21st Workshop on Biomedical Language Processing @ ACL 2022
In Proceedings of the AAAI 2021 Fall Symposium on Human Partnership with Medical AI: Design, Operationalization, and Ethics (AAAI-HUMAN 2021)
An extended abstract of my master's thesis presented at the 2020 workshop on RESOURCEs and representations For Under-resourced Languages and domains.
Licentiate thesis at Stockholm University (2023).
Master's thesis at KTH - Royal Institute of Technology (2020).
Bachelor's thesis at KTH - Royal Institute of Technology (2016).
I teach several courses and I also supervise bachelor's and master's theses. I am teaching or have taught in the following courses:
My PhD project examines the extent to which LLMs leak information about their training data — and how to mitigate these risks. This includes exploring different attacks, such as training data extraction attacks and membership inference attacks. I have also conducted experiments on how automatic de-identification and data synthetization impact data utility for machine learning purposes.
In addition to these privacy-oriented research interests, I am also very excited by research in NLP for under-resourced languages, bias in machine learning, NLP for the social sciences, and clinical NLP.
Don't hesitate to contact me if you are interested in collaborating!