I am a PhD student at the Department of Computer and Systems Sciences at Stockholm University. The focus of my research is privacy-preserving techniques in natural language processing. My PhD project is funded primarily by the DataLEASH project and I am supervised by Professor Hercules Dalianis and Professor Aron Henriksson.
I have a MSc in computer science and engineering (civ.ing. i datateknik) from KTH Royal Institute of Technology. I worked in the tech industry as an IT consultant before starting my PhD, primarily as a back-end developer and data engineer.
BMC Medical Informatics and Decision Making special issue on Health information privacy and security (2024)
BMC Medical Informatics and Decision Making special issue on Health information privacy and security (2024)
In Proceedings of the 6th Clinical Natural Language Processing Workshop @ NAACL 2024
In Proceedings of the Workshop on Computational Approaches to Language Data Pseudonymization (CALD-pseudo) @ EACL2024
In AMIA Annual Symposium Proceedings 2023
In Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa 2023)
In Proceedings of the 18th Scandinavian Conference on Health Informatics (SHI 2022)
In Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022)
In Proceedings of the Legal and Ethical Issues Workshop @ LREC2022
In Proceedings of the TERM21 Workshop @ LREC 2022
In Proceedings of the 21st Workshop on Biomedical Language Processing @ ACL 2022
In Proceedings of the AAAI 2021 Fall Symposium on Human Partnership with Medical AI: Design, Operationalization, and Ethics (AAAI-HUMAN 2021)
An extended abstract of my master's thesis presented at the 2020 workshop on RESOURCEs and representations For Under-resourced Languages and domains.
Licentiate thesis at Stockholm University (2023).
Master's thesis at KTH - Royal Institute of Technology (2020).
Bachelor's thesis at KTH - Royal Institute of Technology (2016).
I teach several courses and I also supervise bachelor's and master's theses. I am teaching or have taught in the following courses:
I am currently working on to what extent masked language models (such as BERT) leak sensitive information about their training data. Since BERT-style models are very common, especially for lesser-resourced languages, this could have significant privacy implications.
Don't hesitate to contact me if you are interested in collaborating!