These are the professional pages of prof.dr. Marco Spruit. I am Full Professor of Advanced Data Science in Population Health at the department of Public Health & Primary Care (PHEG) of the Leiden University Medical Centre (LUMC) and the Leiden Institute of Advanced Computer Science (LIACS) of the faculty of Science (FWN) of Leiden University.

My overarching research objective is to establish an authoritative national infrastructure for Dutch Natural Language Processing and Machine Learning to facilitate and popularise Self-Service Data Science. Current and future work, as well as previous projects performed within UU's Applied Data Science Lab, are now embedded within the Translational Data Science Lab at Leiden University.

Within the broad spectrum of Data Science, the discipline of Translational Data Science encompasses the fields of Applied Data Science and Self-Service Data Science to translate new algorithms to novel applications and introduce new insights from these novel applications into daily practices, respectively. This is analogous to the well-known domain of Translational Medicine, where T1 refers to bench-to-bedside research, and T2 refers to clinial trials in daily practices. Our work, therefore, spans the entire knowledge discovery process (e.g. CRISP-DM). The overarching research objective is to establish an authoritative national infrastructure for Dutch Natural Language Processing and Machine Learning to facilitate and popularise Self-Service Data Science. We focus in particular on the Population Health and Wellbeing domain (more).

I have notably conducted several European Horizon2020 studies (OPERAM, SAF21, SMESEC, GEIGER) and nationally funded research projects (e.g. STRIMP). I have authored 200+ publications including 85+ journal papers. My research team currently consists of 10 Ph.D students and 1 postdoc. I have successfully completed 7 Ph.D projects as daily supervisor.

Recently Added Files

  • 2023 0130 EUniWell spruit NLP.pdf   1414k - Jan 31, 2023, 1:45 AM by Marco Spruit (v2)
    ‎Translational Data Science: A case of Natural Language Processing in Mental Health (30/01/2023). EUniWell Workshop on the Impact of Artificial Intelligence on Healthcare and Education, Online. [30 min]‎
  • 2022 - stats.jpg   157k - Dec 17, 2022, 5:10 AM by Marco Spruit (v2)
    ‎Certificate of Probability and Statistics: To p or not to p? course at Coursera ‎
Showing 2 files from page Files.

Recently Added News

  • Welzijn.AI at the Dutch AI Parade Last week I discussed my plans for the Welzijn.AI chatbot with a diverse and interested audience at the AI Parade Dialogue at the City Library Kennemerwaard in Alkmaar De ...
    Posted Nov 19, 2022, 11:15 AM by Marco Spruit
  • VACANCY: Assistant Professor in Health Data Science The Leiden Institute of Advanced Computer Science (LIACS) of the Faculty of Science at Leiden University, and the Public Health & Primary Care (PHEG) department at Leiden University Medical Center (LUMC ...
    Posted Oct 5, 2022, 4:11 AM by Marco Spruit
  • Dr. Yigit Ozkan: Cybersecurity Maturity Assessment and Standardisation Today Bilge Yigit Ozkan defended her dissertation Cybersecurity Maturity Assessment and Standardisation at Utrecht's Academiegebouw. Her opponents had many interesting and though questions for her, but she stood her ...
    Posted Jul 12, 2022, 1:07 AM by Marco Spruit
Showing posts 1 - 3 of 191. View more »

At the Leiden UMC Campus The Hague (LCDH) Marco is establishing the research line Translational Data Science in Population Health. This line has three themes. In Data Engineering for Population Health, we investigate the further consolidation, standardisation and enrichment of the ELAN information infrastructure, in line with Health-RI. In Data Analytics for Population Health, Natural Language Processing and Machine Learning techniques are investigated for their suitability to answer current and new types of LCDH research questions from a Self-Service Data Science perspective, in collaboration with the LIACS department. In eHealth Implementations for Population Health, we design and implement data science interventions through eHealth, mHealth and NLP software solutions within the LCDH region, in close collaboration with the LCDH partners and the National eHealth Living Lab (NeLL).

Before, I was an Associate Professor and, before that, Assistant Professor in the Natural Language Processing and Organisation & Information research groups, respectively, of the Department of Information and Computing Sciences within the Faculty of Science at Utrecht University for 13 years. Until 2007 I worked in industry for fourteen years as an Information Retrieval and Big Data engineer at ZyLAB Europe and the Intelligence Service, among others.