
AI tool for best practice in social innovation with large language models
PhD position | MSCA Doctoral Network
About Data2Action
Data2Action is a Marie Skłodowska-Curie European Doctoral Training Network uniting social innovation with data science & AI to co-create a healthier, fairer and more sustainable world for all. In practical terms, it is a network of 5 academic partners where 13 PhD students will work towards their degrees on topics related to data science for social innovation. The network also involves 25 associated partners – NGOs, research organizations and companies – with whom the students will be able to engage.
Read more about Data2Action here.
Doctoral candidate research project
The main objective of the research project of the doctoral candidate is to explore how advanced large language models (LLMs) and generative AI technologies can effectively scale the impact of best practices in social innovation. The aim is to identify and evaluate authoritative sources documenting successful social innovation practices and define key categories and concepts relevant to social innovation organisations. These data will be extracted, structured, and organised into a structured knowledge base using LLMs. The conversational AI tools resulting from this should enable social innovators to gain easy access to insights from peer experiences.
To achieve this, you are expected to:
- Conduct desk research and actively engage with a network of social innovation organizations to identify reliable sources documenting effective social innovation practices (e.g., Social Innovation Atlas, EU Social Innovation Match, openGRAPH).
- Employ qualitative methods (interviews, focus groups) and quantitative methods (surveys) to determine key information categories of practical relevance to social innovators, organizing these into a structured ontology.
- Develop methods for extracting and structuring information from identified sources using text embedding techniques (e.g., embeddings, vector databases such as Pinecone) and annotation methods using large language models (e.g., the Mistral family models adapted with low-rank techniques). Particular attention will be paid to issues of bias, equity, diversity and security in the content of sources and development of the knowledge base.
- Implement and validate a conversational AI interface, leveraging advanced search methods (e.g., FAISS) combined with large language models, to provide intuitive interaction with the knowledge base. This tool will be tested in other projects in the doctoral network.
You will complete two secondments at partner organizations. You are planned to work with
Shipyard Foundation to get insight into the social innovation process and another at KnowledgeBiz to deepen expertise in large language model and generative AI. However, the secondments can be adjusted to their needs.
You will also be expected to:
- Report on findings by publishing scientific articles, resulting in a PhD dissertation;
- Present findings at (inter)national meetings/conferences;
- Contribute to the wider work of the Data2Action project
- Contribute to educational activities of the department and within the consortium.
We offer
The MSCA programme offers competitive and attractive working conditions. You will have an employment contract with Jožef Stefan Institute, Slovenia’s leading scientific research institution, and enrol into a PhD programme at Jožef Stefan International Postgraduate School. You will benefit from a competitive salary, annual leave, social security coverage. The salary will be composed of:
A living allowance (3400 EUR per month – gross salary) which is adjusted by applying a country correction coefficient to the living allowance of the country in which the researcher is recruited. The country correction coefficients are indicated in Table 1 of the MSCA Work Programme
Monthly mobility allowance: An additional 600 EUR/month to cover travel expenses. Monthly family allowance, if applicable and depending on the family situation: 660 euro per month
For additional information see EU MSCA website. Please be aware that these amounts are subject to taxes and the exact salary will be confirmed upon appointment.
At Jožef Stefan Institute you will work in a department conducting applied research on AI. You can expect:
- Access to Premier Research Infrastructure: The Institute boasts state-of-the-art facilities, including advanced AI laboratories and high-performance computing resources, providing an environment conducive to ground-breaking research.
- Collaborative and Supportive Environment: You will interact daily with supervisors and colleagues who have extensive and diverse expertise on AI. You will collaborate
particularly closely with two other PhD students involved in the Data2Action network.
- Global Exposure and Networking: The Data2Action network will provide specialized training sessions beyond the PhD programme, and the opportunity to interact with highly motivated PhD students and researchers outside your own institution. With this, you will be enhancing your skill set and expanding your professional network. You will also be encouraged to attend and actively participate in international conferences.
Requirements
Essential
- Eligible applicants must possess or be finalising a Master’s degree or an equivalent degree in a relevant discipline for Data2Action, especially computer science, artificial intelligence, ICT engineering, and digital humanities.
- Proficiency in at least one programming language
- Excellent English language proficiency
- Open-minded, self-aware, independent, collaborative, critical thinker, team player, strong communicator.
- Available full-time to start the program in September 2025.
Desirable
- Prior experience in natural language processing or artificial intelligence
- Prior experience in academic writing or scientific publications
- Prior practical or research experience with social innovation and/or the public sector Eligibility Conditions
- Applicants do not already hold a doctoral degree.
- Applicants must not have resided or carried out their main activity (work, studies, etc.) in the country of recruitment for more than 12 months in the 36 months immediately before the recruitment date. Compulsory national service, short stays such as holidays, and time spent as part of a procedure for obtaining refugee status under the Geneva Convention are not taken into account.
Application
Instructions
To apply, send an e-mail to mitja.lustrek@ijs.si with:
- Cover letter outlining your research interest, motivation to participate in the MSCA project, and previous experience (studies, employments etc.)
- Explicit confirmation that you meet the eligibility conditions and requirements
- Curriculum vitae (CV)
- Degree Transcripts
- Two recommendation letters (may be provided by professors, teaching assistants or previous employers)
You will receive a confirmation of your application.
Deadline
Sunday 18th May 2025
Selection process
Our selection procedure is open, transparent, merit-based, impartial, and equitable, in line with the Code of Conduct for the Recruitment of Researchers (link).
The selection procedure will consist of the following steps:
- Eligibility check: The Recruitment Committee will check each application is complete and that applicants fulfil the eligibility criteria described in the previous section.
- Screening: Applications will be reviewed for eligibility and suitability based on the criteria listed for each position.
- Online interviews: The short-listed candidates will be interviewed by a Selection Committee that will include the recruiting Principal Investigators. Selection committees will bring together diverse expertise and competencies and have an adequate gender balance.
- The recruiting institution will notify the selection outcome after the interview.
Additional information
For more information about this position, you can contact Prof Mitja Luštrek (mitja.lustrek@ijs.si).
We are hosting two information sessions on Tuesday 29th April 2025 for those interested in applying for a PhD fellow position with Data2Action. The session will start with a 15-minute presentation and then there will be a Q&A. We are running two sessions to ensure maximum global reach. Both sessions will be the same so you should only attend one. Sign up here:
Tuesday 29th April 2025, 10am UK time – sign up here:
Tuesday 29th April 2025, 6pm UK time – sign up here: