Phd Candidate Ai-driven Fair Data Extraction and Harmonization

3 weken geleden


Groningen, Nederland UMCG Voltijd

**Functiebeschrijving**:

- Data harmonization: develop methods to map free-text clinical data to standardized coding systems and ontologies, ensuring compliance with FAIR principles.
- AI model innovation: select, adapt, and refine large language models (local, cluster, or cloud-based) and frameworks (Ollama, OntoGPT, LangChain, etc) for automated data recoding.
- Prompt and agentic workflow engineering: devise and implement best practices for improving language model performance in data extraction and ontology mapping.
- Interdisciplinary collaboration: Work across data science, software engineering genomics, and clinical teams to create scalable solutions that enhance patient care and research outcomes.

**Project AI-driven FAIR data extraction and harmonization**
By converting clinical notes and cohort variables into standard coding systems you will help create sufficiently large datasets for automated analysis and advanced diagnostics. Imagine helping rare disease patients by mapping textual symptom descriptions to precise phenotypic codes, which then combine with genomic data to identify potential causative variants. Or envision scaling your methods to unify data from multiple large cohort studies to research healthy child development, by seamlessly integrating local data models with emerging APIs such as DataSHIELD, Beacon or FAIR Data Point to create discoverability and analysis, and build new global collaborations.

Your research will focus on leveraging state-of-the-art Large Language Models to drive this conversion process, driven by many open questions. Which model types and sizes are most effective? How should they be prompted, orchestrated, and validated for optimal accuracy? Could we deploy them locally on our own cluster, or should we tap into cloud resources? Can we enable our partner universities and hospitals to run them locally in a federation? You will experiment with existing agentic frameworks like Ollama, LangChain, and OntoGPT to discover and refine best practices.

You will develop novel methods that will have a direct real-world impact: from improving patient diagnoses and enabling large scale anonymized data reuse for research, to laying groundwork for deeper integration with electronic health records for healthcare mainstreaming. The UMCG is a world-leader in terms of integrating AI in healthcare processes and we will leverage this position in this project to achieve global impact. Join our team of forward-thinking researchers and clinicians to shape the future of AI-driven data extraction and harmonization for healthcare.

**Wat vragen wij**:

- Master’s degree in AI, Computer Science, Bioinformatics, or a related field.
- Passion for machine learning, natural language processing, and biomedical data.
- Strong analytical skills and a willingness to learn new techniques.
- Excellent communication skills and a collaborative mindset.
- Familiarity with relevant technologies are a big plus (e.g. ontologies, coding systems, FAIR data principles, agentic AI frameworks, programming in R, Java, or Python).

**Wat bieden wij**:

- A dynamic research environment at the forefront of AI-driven healthcare innovation.
- Access to diverse, real-world medical data sets and cutting-edge computational resources.
- Support and collaboration with MOLGENIS large open source scientific software team to help you deploy and test your methods in working solutions.
- Mentorship by leading experts in AI, genomics, and clinical informatics.
- Opportunities to publish in high-impact journals and present at international conferences.

This is a full-time PhD contract for 4 years in an excellent environment for further development. First, a temporary one-year position will be offered with the option of renewal for another 3 years. Your salary will be a minimum of € 2.901;
- gross per month in the first year and a maximum of € 3.677;
- (scale PhD) in the final (4th) year, based on a full-time appointment. In addition, the UMCG will offer you 8% holiday pay, and 8.3% end-of-year bonus. The conditions of employment comply with the Collective Labour Agreement for Medical Centres (CAO-UMC).

**Meer informatie**: Neem voor meer informatie contact op met:
Joeri van der Velde, telefoonnummer 06 1981 4646
**Solliciteren**:

- confirmation with further information.

The UMCG has a preventive Hepatitis B policy. The UMCG can provide you with the vaccination, should it be required for your position. In case of specific professions a ‘Certificate of Good Conduct’ is required.



  • Groningen, Nederland UMCG Voltijd

    **Functiebeschrijving**: **Your tasks are**: - Exploring molecular dynamics and proteomics methods to enhance variant interpretation, which may also include a role for non-coding variation (e.g. promotors, enhancers, TFBSs, TADs, lncRNAs, etc) - Developing AI models for classifying genetic variants based on functional impact that are explainable and...


  • Groningen, Nederland Rijksuniversiteit Groningen Voltijd

    The Centre for Media and Journalism Studies (CMJS) at the Faculty of Arts and the Bernoulli Institute for Mathematics, Computer Science and Artificial Intelligence at the Faculty of Science and Engineering are looking for a PhD student for the project “Assessing the reliability of news and online information: fostering critical digital literacy skills for...


  • Groningen, Nederland Rijksuniversiteit Groningen Voltijd

    Among the most challenging to develop catalytic reactions are stereoselective processes. Typically, a family of catalysts is explored based on a preliminary hypothesis. After initial experimental results, further research is guided by trial and error with the goal of deriving intuitive trends. Data-driven approaches are attractive alternatives. Descriptors...

  • Phd

    7 dagen geleden


    Groningen, Nederland UMCG Voltijd

    **Functiebeschrijving**: **PhD students working on these projects will be expected to**: - Integrate and analyze the genomic-, metagenomic-, metabolomics - and proteomics datasets - Perform big data analysis using advanced machine learning techniques - Summarize and report key analytical findings in both oral and written form at work meetings, at...


  • Groningen, Nederland Rijksuniversiteit Groningen Voltijd

    Since its foundation in 1614, the University of Groningen has enjoyed an international reputation as a dynamic and innovative center of higher education offering high-quality teaching and research. Belonging to the best research universities of Europe and joining forces with prestigious partner universities and networks, the University of Groningen is truly...


  • Groningen, Nederland Rijksuniversiteit Groningen Voltijd

    We have a vacancy for a 4-year PhD student in Applied Mathematics within the project “HiWAVE - Natural hazard prediction with adaptive hierarchical wave models”, funded by the Dutch Research Council (NWO) via the talent programme Vidi in the group of Assistant Professor Dr. Julian Koellermeier. HiWAVE predicts free-surface waves by developing new...


  • Groningen, Nederland Rijksuniversiteit Groningen Voltijd

    The Centre for Media and Journalism Studies (CMJS) at the Faculty of Arts and the Bernoulli Institute for Mathematics, Computer Science and Artificial Intelligence at the Faculty of Science and Engineering are looking for a PhD student in “Assessing the reliability of news and online information: fostering critical digital literacy skills for Generative...

  • Data Engineer AI

    4 weken geleden


    Groningen, Nederland CIMSOLUTIONS Voltijd € 8.000

    CIMSOLUTIONS AI is een nieuwe business unit van CIMSOLUTIONS. Onze missie is het leveren van innovatieve softwareproducten en oplossingen met de hoogste business value voor het bedrijfsleven en de overheid, gebaseerd op Artificial Intelligence en Data Science. We bevinden ons in een spannende fase van groei en expansie, en we zijn op zoek naar een...

  • Ai Programmeertrainer

    2 weken geleden


    Groningen, Nederland Outlier Ai Voltijd

    Outlier helps the world’s most innovative companies improve their AI models by providing human feedback. Are you an experienced software engineer who would like to lend your coding expertise to train AI models? We partner with organizations to train AI large language models, helping cutting-edge generative AI models write better code. Projects typically...

  • Data Engineer AI

    7 dagen geleden


    Groningen, Groningen, Nederland CIMSOLUTIONS Voltijd

    CIMSOLUTIONS AI is een nieuwe business unit van CIMSOLUTIONS. Onze missie is het leveren van innovatieve softwareproducten en oplossingen met de hoogste business value voor het bedrijfsleven en de overheid, gebaseerd op Artificial Intelligence en Data Science. We bevinden ons in een spannende fase van groei en expansie, en we zijn op zoek naar een...

  • Data Beheerder AI

    7 dagen geleden


    Groningen, Groningen, Nederland CIMSOLUTIONS Voltijd

    CIMSOLUTIONS AI is een nieuwe business unit van CIMSOLUTIONS. Onze missie is het leveren van innovatieve softwareproducten en oplossingen met de hoogste business value voor het bedrijfsleven en de overheid, gebaseerd op Artificial Intelligence en Data Science. We bevinden ons in een spannende fase van groei en expansie, en we zijn op zoek naar een...

  • Data Beheerder AI

    2 dagen geleden


    Groningen, Groningen, Nederland CIMSOLUTIONS Voltijd

    CIMSOLUTIONS AI is een nieuwe business unit van CIMSOLUTIONS. Onze missie is het leveren van innovatieve softwareproducten en oplossingen met de hoogste business value voor het bedrijfsleven en de overheid, gebaseerd op Artificial Intelligence en Data Science. We bevinden ons in een spannende fase van groei en expansie, en we zijn op zoek naar een...


  • Groningen, Nederland Rijksuniversiteit Groningen Voltijd

    Project Description The PhD researcher will work in the project ‘The roles of siblings and school peers in young adults’ life-course events’, led by Principal Investigator Clara H. Mulder and funded by an Open Competition Large grant from the Dutch Science Foundation (NWO). The research team will further include a postdoc researcher, an assistant...


  • Groningen, Groningen, Nederland UMCG Voltijd

    Posted on December 11, 2023 PhD Opportunity: Fully-Funded 4-Year PhD in Classifying hyperkinetic movement disorders using fMRI at UMCG/University of Groningen. Project description Hyperkinetic movement disorders are defined as excessive involuntary movements. Tremor is the most prevalent and best-known example. Presently, classification of such...


  • Groningen, Groningen, Nederland UMCG Voltijd

    Posted on December 14, 2023 Job description Multiple sclerosis (MS) is a chronic progressive demyelinating disease of the central nervous system characterized by neuroinflammation and neurodegeneration. Current disease-modifying therapies are non-curative and do not reverse progression. MS lesion progression is prevented by remyelination, a natural...

  • Practice Lead Data Science

    6 dagen geleden


    Groningen, Nederland Ordina Voltijd

    **Your Impact****: As a Data Science & AI Practice Lead at Ordina, you’ll put to use your proven knowledge and skills to good use in helping top-class players and developing the careers of a talented team. - We are active in themes such as Natural Language Processing (NLP), Computer Vision, Machine Learning Operations (MLOps), AI Ops and Explainable A.I....


  • Groningen, Nederland De Hanze Voltijd

    **About the Research** **Why DESTRESS?** Stress poses a significant threat to the health of employees as well as the health of the organizations they work for. The resilience of employees and organizations is interconnected; an organization can only be resilient if its employees are. The key question is whether we can identify and address rising stress...


  • Groningen, Nederland Rijksuniversiteit Groningen Voltijd

    Among the most challenging to develop catalytic reactions are stereoselective processes. Typically, a family of catalysts is explored based on a preliminary hypothesis. After initial experimental results, further research is guided by trial and error with the goal of deriving intuitive trends. Data-driven approaches are attractive alternatives. Descriptors...


  • Groningen, Groningen, Nederland University of Groningen Voltijd

    Applications are invited for a fully-funded four-year PhD position at the intersection of the fields of Journalism Studies and Architecture, focusing on the role of material space and artifacts in the construction of journalistic identity and practice in former Yugoslavia. Place and Identity in Journalism in Former Yugoslavia inquires into the role of...


  • Groningen, Nederland UMCG Voltijd

    **Functiebeschrijving**: Asthma and COPD are widespread chronic respiratory diseases that impose a heavy social and economic burden. Traditional treatments often follow a "one-size-fits-all" approach, merely suppressing symptoms without achieving true health improvements. The MSCA Doctoral Network RESPIRE-EXCEL is set to revolutionize this by introducing...