Publications
An integrated requirements framework for analytical and AI projects
Trujillo, J., Lavalle, A., Reina-Reina, A., García-Carrasco, J., Maté, A., Maaß, W.
Data & Knowledge Engineering
To this day, the requirements of data warehouses, user visualizations and ML projects have been tackled in an independent manner, ignoring the possible cross-requirements, collective constraints and dependencies between the outputs of the different systems that should be taken into account to ensure a successful analytical project. In this work, we take a holistic approach and propose a methodology that supports modeling and subsequent analysis while taking into account these three aspects. This methodology has several advantages, mainly that (i) it enables us to identify possible conflicts between actors on different tasks that are overlooked if the systems are treated in an isolated manner and (ii) this holistic view enables modeling multi-company systems, where the information or even the analytical results can be provided by third-parties, identifying key participants in federated environments. After presenting the required formalism to carry out this kind of analysis, we showcase it on a real-world running example of the tourism sector.
To this day, the requirements of data warehouses, user visualizations and ML projects have been tackled in an independent manner, ignoring the possible cross-requirements, collective constraints and dependencies between the outputs of the different systems that should be taken into account to ensure a successful analytical project. In this work, we take a holistic approach and propose a methodology that supports modeling and subsequent analysis while taking into account these three aspects. This methodology has several advantages, mainly that (i) it enables us to identify possible conflicts between actors on different tasks that are overlooked if the systems are treated in an isolated manner and (ii) this holistic view enables modeling multi-company systems, where the information or even the analytical results can be provided by third-parties, identifying key participants in federated environments. After presenting the required formalism to carry out this kind of analysis, we showcase it on a real-world running example of the tourism sector.
Conceptualizing Personal Data Pricing Through Construal Level Theory
Stein, H., Reich, R.H., Visinescu, L., Maaß, W.
AMCIS 2025
Individuals' increasing generation of digital data and evolving data ownership rights reinforce monetization opportunities in personal data markets. Understanding how personal data valuation and pricing is construed becomes essential. So far, psychological factors such as data sensitivity or proximity are under-researched. This study examines how construal level theory contributes to construing and explaining individual data value through an exploratory quantitative survey (n = 633). The results show that different construal levels influence data pricing, psychological ownership and willingness-to-disclose.
Individuals' increasing generation of digital data and evolving data ownership rights reinforce monetization opportunities in personal data markets. Understanding how personal data valuation and pricing is construed becomes essential. So far, psychological factors such as data sensitivity or proximity are under-researched. This study examines how construal level theory contributes to construing and explaining individual data value through an exploratory quantitative survey (n = 633). The results show that different construal levels influence data pricing, psychological ownership and willingness-to-disclose.
Designing Decision Support Systems for Rural Mobility Enhancement
Janzen, S., Stein, H., Abdel Khaliq, L., Hergert, F., Maaß, W.
Springer
Mobility in rural regions is critical for ensuring access to essential services such as education, healthcare, and employment. However, rural mobility remains largely individualized, with heavy reliance on private cars—leading to environmental, social, and infrastructural challenges. Despite increasing data availability from GPS, mobile devices, and public transport applications, data-driven approaches for generating integrated mobility insights tailored to specific rural areas remain underdeveloped. This paper addresses this gap by investigating how decision support systems (DSS) can be designed to provide actionable, regionspecific mobility insights for policymakers, businesses, and public transport providers. Following a Design Science Research (DSR) methodology, we identified factors of mobility mode choice. From this, we derived meta-requirements and proposed a set of design principles for data-driven DSS in rural mobility contexts. These principles were implemented in a DSS for the Saarland region of Germany and evaluated with policymakers and public transport planners. Our findings contribute both conceptual design knowledge and practical guidance, demonstrating how DSS can foster more efficient, inclusive, and sustainable rural mobility planning.
Mobility in rural regions is critical for ensuring access to essential services such as education, healthcare, and employment. However, rural mobility remains largely individualized, with heavy reliance on private cars—leading to environmental, social, and infrastructural challenges. Despite increasing data availability from GPS, mobile devices, and public transport applications, data-driven approaches for generating integrated mobility insights tailored to specific rural areas remain underdeveloped. This paper addresses this gap by investigating how decision support systems (DSS) can be designed to provide actionable, regionspecific mobility insights for policymakers, businesses, and public transport providers. Following a Design Science Research (DSR) methodology, we identified factors of mobility mode choice. From this, we derived meta-requirements and proposed a set of design principles for data-driven DSS in rural mobility contexts. These principles were implemented in a DSS for the Saarland region of Germany and evaluated with policymakers and public transport planners. Our findings contribute both conceptual design knowledge and practical guidance, demonstrating how DSS can foster more efficient, inclusive, and sustainable rural mobility planning.
ESCADE: Energy-efficient Artificial Intelligence for Cost-effective and Sustainable Data Centers
Janzen, S., Stein, H., Trinley, K., Agnes, C., Jain, V., Rajeshkar, K., Shenoy, N., Rusch, A., Gosh, S., Maaß, W.
CAiSE
Data centers play a central role in digital transformation, especially in the field of artificial intelligence (AI). However, their energy consumption is enormous, e.g., 16 billion kWh in Germany in 2020. At the same time, energy costs are rising and climate neutrality requirements are increasing. These factors pose major challenges for the sustainable and cost-effective operation of data centers. This paper introduces the ESCADE project (05/2023 - 04/2026), an ongoing research initiative funded by the German Federal Ministry of Economics and Climate Action, aiming to optimize the energy-efficiency of AI in data centers. AI compression techniques such as knowledge distillation, quantization and neural architecture search result in smaller, more energy-efficient AI models that deliver comparable performance. When combined with neuromorphic hardware, these models can achieve energy savings of up to 80The ESCADE consortium, a multidisciplinary collaboration of seven industry and academic partners, explores energy- efficient AI in two use cases: visual computing for scrap sorting in steel industry and natural language processing for software development. This paper provides a comprehensive overview of the ESCADE project, outlining its objectives, work packages, and anticipated outcomes. A central contribution is the introduction of first results in terms of the information system EAVE: Energy Analytics for Cost-effective and Sustainable Operations. By using AI-based analyses, EAVE optimizes the relationship between AI performance and operating costs of AI applications in data centers. The system measures and predicts the energy consumption, CO emissions and operating costs of different AI model configurations, including hardware options. At the same time, it analyzes which factors significantly influence these values. This enables decision-makers to manage the operation of data centers in a data-based and efficient manner while meeting environmental targets.
Data centers play a central role in digital transformation, especially in the field of artificial intelligence (AI). However, their energy consumption is enormous, e.g., 16 billion kWh in Germany in 2020. At the same time, energy costs are rising and climate neutrality requirements are increasing. These factors pose major challenges for the sustainable and cost-effective operation of data centers. This paper introduces the ESCADE project (05/2023 - 04/2026), an ongoing research initiative funded by the German Federal Ministry of Economics and Climate Action, aiming to optimize the energy-efficiency of AI in data centers. AI compression techniques such as knowledge distillation, quantization and neural architecture search result in smaller, more energy-efficient AI models that deliver comparable performance. When combined with neuromorphic hardware, these models can achieve energy savings of up to 80The ESCADE consortium, a multidisciplinary collaboration of seven industry and academic partners, explores energy- efficient AI in two use cases: visual computing for scrap sorting in steel industry and natural language processing for software development. This paper provides a comprehensive overview of the ESCADE project, outlining its objectives, work packages, and anticipated outcomes. A central contribution is the introduction of first results in terms of the information system EAVE: Energy Analytics for Cost-effective and Sustainable Operations. By using AI-based analyses, EAVE optimizes the relationship between AI performance and operating costs of AI applications in data centers. The system measures and predicts the energy consumption, CO emissions and operating costs of different AI model configurations, including hardware options. At the same time, it analyzes which factors significantly influence these values. This enables decision-makers to manage the operation of data centers in a data-based and efficient manner while meeting environmental targets.
FEDWELL: Life-Long Federated User and Mental Modeling for Health and Well-being
Janzen, S., Saxena, P., Agnes, C., Kahn, M.E.U., Gomaa, A., Feld, M., Zenner, A., Lessel, P., Wolter, J., Daiber, F., Math, R., Kleer, N., Schwartz, T., Krüger, A., Maaß, W.
CAiSE
Adaptive and personalized AI systems in healthcare rely on user-specific and contextual information to provide support. However, incomplete, unreliable, and outdated data prevents both patients experiencing illness, pain, or cognitive impairment, as well as therapists, in making proper and informed decisions. Patients specifically may not have the knowledge to comprehend complex medical information, or effectively communicate symptoms. AI- driven mental models and user models can bridge these cognitive gaps, ensuring personalized and effective patient care. The FedWell research project (09/2023–08/2026), funded by the Federal Ministry of Education and Research (BMBF), explores the integration of artificial mental models (AMMs) and user models from various sources into adaptive AI systems to assist patients in decision-making. The project focuses on two key applications: rehabilitation support after knee/hip surgery and treatment decision assistance for patients with cognitive impairments (e.g., multiple sclerosis, dementia). FedWell employs a combination of structured surveys, contextual data collection, and AI techniques to model patient behavior, attitudes, and intentions. A decision support system MENTALYTICS is developed from fine-tuned large language models (LLaMA-2, LLaMA-3, Mistral, Phi-3), that employs AMMs. By the end of the project, FedWell aims to deliver robust AMMs capable of representing patient beliefs and decision-making processes, ultimately guiding them toward treatment options that best fit their individual needs.
Adaptive and personalized AI systems in healthcare rely on user-specific and contextual information to provide support. However, incomplete, unreliable, and outdated data prevents both patients experiencing illness, pain, or cognitive impairment, as well as therapists, in making proper and informed decisions. Patients specifically may not have the knowledge to comprehend complex medical information, or effectively communicate symptoms. AI- driven mental models and user models can bridge these cognitive gaps, ensuring personalized and effective patient care. The FedWell research project (09/2023–08/2026), funded by the Federal Ministry of Education and Research (BMBF), explores the integration of artificial mental models (AMMs) and user models from various sources into adaptive AI systems to assist patients in decision-making. The project focuses on two key applications: rehabilitation support after knee/hip surgery and treatment decision assistance for patients with cognitive impairments (e.g., multiple sclerosis, dementia). FedWell employs a combination of structured surveys, contextual data collection, and AI techniques to model patient behavior, attitudes, and intentions. A decision support system MENTALYTICS is developed from fine-tuned large language models (LLaMA-2, LLaMA-3, Mistral, Phi-3), that employs AMMs. By the end of the project, FedWell aims to deliver robust AMMs capable of representing patient beliefs and decision-making processes, ultimately guiding them toward treatment options that best fit their individual needs.
KI in der Rehabilitation: Anwendung künstlicher mentaler Modelle für eine personalisierte Medizin
Sabine Janzen, Prajvi Saxena, Cicy Agnes, Wolfgang Maaß
Bundesgesundheitsblatt - Gesundheitsforschung - Gesundheitsschutz
Artificial intelligence (AI) can support patient-centered care in prevention and rehabilitation. In Germany, almost 1.9 million patients were treated in rehabilitation hospitals in 2023, mostly due to musculoskeletal disorders. The success of rehabilitation depends on cooperation between patient, doctor, and therapist as well as active participation. However, cognitive limitations, language barriers, and psychological factors tackle decision-making and communication abilities of patients. This leads to incomplete or distorted data and impairs individualized therapy. A potential solution approach is to apply artificial mental models (AMMs) that anticipate patients’ unknown mental models. These concepts are based on cognitive science theories and world models from AI. AMMs can optimize treatment decisions, correct misjudgments, and thus increase the success of rehabilitation. Particularly in knee rehabilitation, an AI agent can determine how patients perceive their recovery and enable individual adjustments. The BMFTR project FedWELL investigates the use of AMM in rehabilitation. A non-discriminatory base model was developed using data from online forums, user studies, and machine learning models. Initial results show that AI-supported models can predict individual assumptions and expectations of patients within the rehabilitation process and enable personalized therapies. This article presents the research design of the project and reports the first results of the initial survey phase.
Artificial intelligence (AI) can support patient-centered care in prevention and rehabilitation. In Germany, almost 1.9 million patients were treated in rehabilitation hospitals in 2023, mostly due to musculoskeletal disorders. The success of rehabilitation depends on cooperation between patient, doctor, and therapist as well as active participation. However, cognitive limitations, language barriers, and psychological factors tackle decision-making and communication abilities of patients. This leads to incomplete or distorted data and impairs individualized therapy. A potential solution approach is to apply artificial mental models (AMMs) that anticipate patients’ unknown mental models. These concepts are based on cognitive science theories and world models from AI. AMMs can optimize treatment decisions, correct misjudgments, and thus increase the success of rehabilitation. Particularly in knee rehabilitation, an AI agent can determine how patients perceive their recovery and enable individual adjustments. The BMFTR project FedWELL investigates the use of AMM in rehabilitation. A non-discriminatory base model was developed using data from online forums, user studies, and machine learning models. Initial results show that AI-supported models can predict individual assumptions and expectations of patients within the rehabilitation process and enable personalized therapies. This article presents the research design of the project and reports the first results of the initial survey phase.
Large language models for conceptual modeling: Assessment and application potential
Veda C. Storey and Oscar Pastor and Giancarlo Guizzardi and Stephen W. Liddle and Wolfgang Maaß and Jeffrey Parsons and Jolita Ralyté and Maribel Yasmina Santos
Data & Knowledge Engineering
Large Language Models (LLMs) are being rapidly adopted for many activities in organizations, business, and education. Included in their applications are capabilities to generate text, code, and models. This leads to questions about their potential role in the conceptual modeling part of information systems development. This paper reports on a panel presented at the 43rd International Conference on Conceptual Modeling where researchers discussed the current and potential role of LLMs in conceptual modeling. The panelists discussed applications and interest levels and expressed both optimism and caution in the adoption of LLMs. Suggested is a need for much continued research by the conceptual modeling community on LLM development and their role in research and teaching.
Large Language Models (LLMs) are being rapidly adopted for many activities in organizations, business, and education. Included in their applications are capabilities to generate text, code, and models. This leads to questions about their potential role in the conceptual modeling part of information systems development. This paper reports on a panel presented at the 43rd International Conference on Conceptual Modeling where researchers discussed the current and potential role of LLMs in conceptual modeling. The panelists discussed applications and interest levels and expressed both optimism and caution in the adoption of LLMs. Suggested is a need for much continued research by the conceptual modeling community on LLM development and their role in research and teaching.
Streamlining LLMs: Adaptive Knowledge Distillation for Tailored Language Models
Saxena, P., Janzen, S., Maaß, W.
NAACL 2025
Large language models (LLMs) like GPT-4 and LLaMA-3 offer transformative potential across industries, e.g., enhancing customer service, revolutionizing medical diagnostics, or identifying crises in news articles. However, deploying LLMs faces challenges such as limited training data, high computational costs, and issues with transparency and explainability. Our research focuses on distilling compact, parameter-efficient tailored language models (TLMs) from LLMs for domain-specific tasks with comparable performance. Current approaches like knowledge distillation, fine-tuning, and model parallelism address computational efficiency but lack hybrid strategies to balance efficiency, adaptability, and accuracy. We present ANON - an adaptive knowledge distillation framework integrating knowledge distillation with adapters to generate computationally efficient TLMs without relying on labeled datasets. ANON uses cross-entropy loss to transfer knowledge from the teacher's outputs and internal representations while employing adaptive prompt engineering and a progressive distillation strategy for phased knowledge transfer. We evaluated ANON's performance in the crisis domain, where accuracy is critical and labeled data is scarce. Experiments showed that ANON outperforms recent approaches of knowledge distillation, both in terms of the resulting TLM performance and in reducing the computational costs for training and maintaining accuracy compared to LLMs for domain-specific applications.
Large language models (LLMs) like GPT-4 and LLaMA-3 offer transformative potential across industries, e.g., enhancing customer service, revolutionizing medical diagnostics, or identifying crises in news articles. However, deploying LLMs faces challenges such as limited training data, high computational costs, and issues with transparency and explainability. Our research focuses on distilling compact, parameter-efficient tailored language models (TLMs) from LLMs for domain-specific tasks with comparable performance. Current approaches like knowledge distillation, fine-tuning, and model parallelism address computational efficiency but lack hybrid strategies to balance efficiency, adaptability, and accuracy. We present ANON - an adaptive knowledge distillation framework integrating knowledge distillation with adapters to generate computationally efficient TLMs without relying on labeled datasets. ANON uses cross-entropy loss to transfer knowledge from the teacher's outputs and internal representations while employing adaptive prompt engineering and a progressive distillation strategy for phased knowledge transfer. We evaluated ANON's performance in the crisis domain, where accuracy is critical and labeled data is scarce. Experiments showed that ANON outperforms recent approaches of knowledge distillation, both in terms of the resulting TLM performance and in reducing the computational costs for training and maintaining accuracy compared to LLMs for domain-specific applications.
The impact of electricity price forecasting on the optimal day-ahead dispatch of battery energy storage systems
Tadayon, L., Detering, D., Maaß, W., Frey, G.
NEIS 2025
Energy storage operation strategies depend on forecasts of the expected prices in the dispatch period. The quality of the forecast influences the economic success of the dispatch strategy. Nevertheless, current literature only evaluates the fore-cast with performance metrics on the time series, not with realized revenue. In addition, dispatch strategies are mostly evaluated with perfect foresight, which is not available in real life applications, or synthetically generated price forecasts, that do not precisely reflect shortcomings of real forecasting models. To examine the influence of forecasting performance on the realized revenue and the robustness against imprecise forecasts, we examine the use case of energy arbitrage in the day-ahead energy market with a battery storage. Different forecasting models are implemented to predict the resulting clearing price of the day-ahead auction before its closure based on the historic realized price of the past days. The price forecast is then used in a mixed integer linear programming-based dispatch model for the battery storage to derive the optimal schedule for market participation. To benchmark the realized revenue, we use the actual realized price to compute the optimal dispatch decision, resulting in the theoretical maximal revenue under perfect foresight. The impact of different forecasting models is evaluated by comparing the resulting revenues. Furthermore, we study the impact of including the ageing costs of the battery in the dispatch decision-making, which affects the forecast-based dispatch. The results show significantly higher missed revenue of 84.84 % compared to the theoretical maximum revenue when ageing costs are included and only 28.27 % missed revenue when ageing costs are neglected.
Energy storage operation strategies depend on forecasts of the expected prices in the dispatch period. The quality of the forecast influences the economic success of the dispatch strategy. Nevertheless, current literature only evaluates the fore-cast with performance metrics on the time series, not with realized revenue. In addition, dispatch strategies are mostly evaluated with perfect foresight, which is not available in real life applications, or synthetically generated price forecasts, that do not precisely reflect shortcomings of real forecasting models. To examine the influence of forecasting performance on the realized revenue and the robustness against imprecise forecasts, we examine the use case of energy arbitrage in the day-ahead energy market with a battery storage. Different forecasting models are implemented to predict the resulting clearing price of the day-ahead auction before its closure based on the historic realized price of the past days. The price forecast is then used in a mixed integer linear programming-based dispatch model for the battery storage to derive the optimal schedule for market participation. To benchmark the realized revenue, we use the actual realized price to compute the optimal dispatch decision, resulting in the theoretical maximal revenue under perfect foresight. The impact of different forecasting models is evaluated by comparing the resulting revenues. Furthermore, we study the impact of including the ageing costs of the battery in the dispatch decision-making, which affects the forecast-based dispatch. The results show significantly higher missed revenue of 84.84 % compared to the theoretical maximum revenue when ageing costs are included and only 28.27 % missed revenue when ageing costs are neglected.
AI-Driven Adaptive Systems for Knee Rehabilitation: Leveraging Artificial Mental Models for Personalized Patient Support
Janzen, S., Saxena, P., Agnes, C., Maaß, W.
KonKIS
In the evolving domain of prevention and rehabilitation, adaptive and personalized Artificial Intelligence (AI) systems are pivotal for delivering patient-centric care. This study investigates the application of Artificial Mental Models (AMM) within healthcare AI systems, specifically in knee rehabilitation, to address cognitive challenges that result in incomplete or biased data, impacting patient decision-making and communication. Leveraging Large Language Models (LLMs), we develop and fine-tune AMMs to accurately capture individual patients' mental models and enhance support during rehabilitation. Our research adopts a Design Science Research (DSR) methodology encompassing two phases: elicitation and individualization. In the elicitation phase, a domain-specific AMM is generated through a quantitative study involving 150 participants and indirect patient observations, ensuring a discrimination- and bias-free model. The individualization phase utilizes curated and non-curated patient data to fine-tune the AMM for individual patients. The effectiveness of these patient-specific AMMs is evaluated in real-world rehabilitation settings through A/B testing, comparing patient and AMM predictions of pain with actual pain assessments. The study's outcomes highlight the potential of AI systems to provide accurate, personalized, and bias-free patient care, significantly improving rehabilitation outcomes. The methodology and findings suggest broader applicability across various healthcare domains, enhancing the integration of AI in routine patient care and advancing the effectiveness of therapeutic interventions.
In the evolving domain of prevention and rehabilitation, adaptive and personalized Artificial Intelligence (AI) systems are pivotal for delivering patient-centric care. This study investigates the application of Artificial Mental Models (AMM) within healthcare AI systems, specifically in knee rehabilitation, to address cognitive challenges that result in incomplete or biased data, impacting patient decision-making and communication. Leveraging Large Language Models (LLMs), we develop and fine-tune AMMs to accurately capture individual patients' mental models and enhance support during rehabilitation. Our research adopts a Design Science Research (DSR) methodology encompassing two phases: elicitation and individualization. In the elicitation phase, a domain-specific AMM is generated through a quantitative study involving 150 participants and indirect patient observations, ensuring a discrimination- and bias-free model. The individualization phase utilizes curated and non-curated patient data to fine-tune the AMM for individual patients. The effectiveness of these patient-specific AMMs is evaluated in real-world rehabilitation settings through A/B testing, comparing patient and AMM predictions of pain with actual pain assessments. The study's outcomes highlight the potential of AI systems to provide accurate, personalized, and bias-free patient care, significantly improving rehabilitation outcomes. The methodology and findings suggest broader applicability across various healthcare domains, enhancing the integration of AI in routine patient care and advancing the effectiveness of therapeutic interventions.
AI-Driven Software Engineering – The Role of Conceptual Modeling
Fill, H.-G., Cabot, J., Maaß, W., Van Sinderen, M.
International Journal of Conceptual Modeling
The following discussion paper summarizes the results of a panel discussion conducted on July 10, 2023 at the International Conference on Software Technologies (ICSOFT) in Rome, Italy. The panelists included Jordi Cabot from Luxembourg Institute of Science and Technology, Luxembourg, Wolfgang Maass from University of Saarland, Germany, and Marten van Sinderen from University of Twente, Netherlands. The panel was moderated by Hans-Georg Fill from University of Fribourg, Switzerland.
The following discussion paper summarizes the results of a panel discussion conducted on July 10, 2023 at the International Conference on Software Technologies (ICSOFT) in Rome, Italy. The panelists included Jordi Cabot from Luxembourg Institute of Science and Technology, Luxembourg, Wolfgang Maass from University of Saarland, Germany, and Marten van Sinderen from University of Twente, Netherlands. The panel was moderated by Hans-Georg Fill from University of Fribourg, Switzerland.
Adapting Multilingual LLMs to Low-Resource Languages with Knowledge Graphs via Adapters
Gurgurov, D., Hartmann, M., Ostermann, S.
Association for Computational Linguistics
This paper explores the integration of graph knowledge from linguistic ontologies into multilingual Large Language Models (LLMs) using adapters to improve performance for low-resource languages (LRLs) in sentiment analysis (SA) and named entity recognition (NER). Building upon successful parameter-efficient fine-tuning techniques, such as K-ADAPTER and MAD-X, we propose a similar approach for incorporating knowledge from multilingual graphs, connecting concepts in various languages with each other through linguistic relationships, into multilingual LLMs for LRLs. Specifically, we focus on eight LRLs --- Maltese, Bulgarian, Indonesian, Nepali, Javanese, Uyghur, Tibetan, and Sinhala --- and employ language-specific adapters fine-tuned on data extracted from the language-specific section of ConceptNet, aiming to enable knowledge transfer across the languages covered by the knowledge graph. We compare various fine-tuning objectives, including standard Masked Language Modeling (MLM), MLM with full-word masking, and MLM with targeted masking, to analyze their effectiveness in learning and integrating the extracted graph data. Through empirical evaluation on language-specific tasks, we assess how structured graph knowledge affects the performance of multilingual LLMs for LRLs in SA and NER, providing insights into the potential benefits of adapting language models for low-resource scenarios.
This paper explores the integration of graph knowledge from linguistic ontologies into multilingual Large Language Models (LLMs) using adapters to improve performance for low-resource languages (LRLs) in sentiment analysis (SA) and named entity recognition (NER). Building upon successful parameter-efficient fine-tuning techniques, such as K-ADAPTER and MAD-X, we propose a similar approach for incorporating knowledge from multilingual graphs, connecting concepts in various languages with each other through linguistic relationships, into multilingual LLMs for LRLs. Specifically, we focus on eight LRLs --- Maltese, Bulgarian, Indonesian, Nepali, Javanese, Uyghur, Tibetan, and Sinhala --- and employ language-specific adapters fine-tuned on data extracted from the language-specific section of ConceptNet, aiming to enable knowledge transfer across the languages covered by the knowledge graph. We compare various fine-tuning objectives, including standard Masked Language Modeling (MLM), MLM with full-word masking, and MLM with targeted masking, to analyze their effectiveness in learning and integrating the extracted graph data. Through empirical evaluation on language-specific tasks, we assess how structured graph knowledge affects the performance of multilingual LLMs for LRLs in SA and NER, providing insights into the potential benefits of adapting language models for low-resource scenarios.
Adaptive AI Systems in Knee Rehabilitation: Integrating Artificial Mental Models for Personalized Patient Support
Janzen, S., Saxena, P., Agnes, C., Maaß, W.
13. Jahreskongress der Deutschen Kniegesellschaft
In the field of prevention and rehabilitation, adaptive and personalized Artificial Intelligence (AI) systems play a vital role in enhancing patient-centric care. These AI systems analyze extensive data on user behavior and situational context to provide tailored support that meets unique needs of individuals, such as patients recovering from knee surgeries. Such patients often face cognitive challenges that interfere with their ability to process complex medical information, make informed decisions, and communicate effectively about their symptoms. These challenges may lead to the generation of incomplete, inaccurate, or biased data, significantly impacting the effectiveness of their treatment. This research introduces the concept of artificial mental models (AMM) integrated into healthcare AI systems. AMMs are cognitive frameworks that encapsulate patient's perceptions and expectations about their therapy and recovery journey. They are beneficial in scenarios requiring nuanced understanding and adaptation to patient's changing conditions, e.g., to assist an amateur soccer player recovering from knee surgery. Here, the AMM acts as a liaison between patient and therapist, helping to devise a personalized exercise regimen that adapts to patient's pain and progress. This work explores the generation of AMMs using Large Language Models instrumental in both eliciting and fine-tuning AMMs to individual patient needs. The research encompasses a prospective study with two phases: elicitation and individualization. The elicitation phase involves creating a bias-free, domain-specific basis AMM through extensive data collection from both quantitative studies and indirect observations, including data on personality traits and expected pain during specific exercises. This basis AMM is evaluated in a technical experiment to ensure it is free from bias and discrimination. In the individualization phase, the basis AMM is refined using direct observations of specific patients, incorporating curated data such as medication details, rehabilitation plans, and patient-reported outcomes, as well as non-curated data like movement patterns and fitness status. The final AMM tailored for individual patients is assessed through action research, involving real-world application to evaluate its impact on rehabilitation outcomes. The research questions focus on whether the predictions made by the AMM about pain are consistent with the patients' experiences and the assessments made by therapists. Results validate the effectiveness of AMMs in real-time clinical settings, demonstrating their potential to significantly enhance the personalization and effectiveness of patient care in knee rehabilitation. Broader implications suggest that AI-driven approaches could enhance patient care across various areas of healthcare, supporting the ability of systems to predict patient needs and improve overall outcomes by more targeted and effective interventions.
In the field of prevention and rehabilitation, adaptive and personalized Artificial Intelligence (AI) systems play a vital role in enhancing patient-centric care. These AI systems analyze extensive data on user behavior and situational context to provide tailored support that meets unique needs of individuals, such as patients recovering from knee surgeries. Such patients often face cognitive challenges that interfere with their ability to process complex medical information, make informed decisions, and communicate effectively about their symptoms. These challenges may lead to the generation of incomplete, inaccurate, or biased data, significantly impacting the effectiveness of their treatment. This research introduces the concept of artificial mental models (AMM) integrated into healthcare AI systems. AMMs are cognitive frameworks that encapsulate patient's perceptions and expectations about their therapy and recovery journey. They are beneficial in scenarios requiring nuanced understanding and adaptation to patient's changing conditions, e.g., to assist an amateur soccer player recovering from knee surgery. Here, the AMM acts as a liaison between patient and therapist, helping to devise a personalized exercise regimen that adapts to patient's pain and progress. This work explores the generation of AMMs using Large Language Models instrumental in both eliciting and fine-tuning AMMs to individual patient needs. The research encompasses a prospective study with two phases: elicitation and individualization. The elicitation phase involves creating a bias-free, domain-specific basis AMM through extensive data collection from both quantitative studies and indirect observations, including data on personality traits and expected pain during specific exercises. This basis AMM is evaluated in a technical experiment to ensure it is free from bias and discrimination. In the individualization phase, the basis AMM is refined using direct observations of specific patients, incorporating curated data such as medication details, rehabilitation plans, and patient-reported outcomes, as well as non-curated data like movement patterns and fitness status. The final AMM tailored for individual patients is assessed through action research, involving real-world application to evaluate its impact on rehabilitation outcomes. The research questions focus on whether the predictions made by the AMM about pain are consistent with the patients' experiences and the assessments made by therapists. Results validate the effectiveness of AMMs in real-time clinical settings, demonstrating their potential to significantly enhance the personalization and effectiveness of patient care in knee rehabilitation. Broader implications suggest that AI-driven approaches could enhance patient care across various areas of healthcare, supporting the ability of systems to predict patient needs and improve overall outcomes by more targeted and effective interventions.
Adaptive Knowledge Distillation for Efficient Domain-Specific Language Models
Saxena, P., Janzen, S., & Maass, W.
19th Women in Machine Learning workshop (WiML 2024) at NeurIPS
Presents an adaptive knowledge distillation framework to create efficient domain-specific language models.
Beyond One-Fits-All: A Case Study Approach to AI System Design Methods
Janzen, S., Stein, H.
Springer
Despite the widespread application of artificial intelligence (AI) as universal solution for complex business problems, there remains a significant gap in design methods for AI systems, distinguishing them sharply from traditional software systems. This research aims to address the lack of standardized design methods tailored for AI projects, which are often impeded by unique challenges such as data sensitivity, model performance, and regulatory compliance. Through an exploratory case study of four AI projects, this paper investigates correlations between characteristics of AI projects and the design methods applied, introducing a set of If-This-Then-That (IFTTT) patterns. These patterns are intended to aid in selecting and combining design method components that align with the specific needs of AI projects. Results highlight the importance of understanding project-specific characteristics to enhance the effectiveness of design methods in AI engineering, offering practitioners actionable insights for improving quality and reliability of AI systems through tailored design approaches.
Despite the widespread application of artificial intelligence (AI) as universal solution for complex business problems, there remains a significant gap in design methods for AI systems, distinguishing them sharply from traditional software systems. This research aims to address the lack of standardized design methods tailored for AI projects, which are often impeded by unique challenges such as data sensitivity, model performance, and regulatory compliance. Through an exploratory case study of four AI projects, this paper investigates correlations between characteristics of AI projects and the design methods applied, introducing a set of If-This-Then-That (IFTTT) patterns. These patterns are intended to aid in selecting and combining design method components that align with the specific needs of AI projects. Results highlight the importance of understanding project-specific characteristics to enhance the effectiveness of design methods in AI engineering, offering practitioners actionable insights for improving quality and reliability of AI systems through tailored design approaches.