This work explores the applicability of synthetic data for training deep learning models aimed at real-time classification of astronomical radio signals. Building on previous research where lightweight convolutional neural networks (CNNs) using DM-time representations showed promising performance in detecting transient signals, we now turn to the question of whether synthetic datasets can...
The Boolean Satisfiability Problem (SAT) is a foundational problem in computer science with applications across a wide range of domains. Because SAT solvers exhibit varying behavior across different problem classes, the ability to generate synthetic SAT instances is valuable for benchmarking and solver-specific analysis. Recent methods have introduced Deep Learning approaches into this...
Tractography enables the reconstruction of white matter pathways from diffusion MRI and is a key tool for studying brain connectivity in both research and clinical contexts. Within the overall tractography pipeline, the parcellation step assigns individual streamlines to specific anatomical bundles, or discards them as false positive detections. We introduce PETParc (Parallel Efficient...
Stochastically sampling word segmentations from a subword tokeniser, also called subword regularisation, is a known way to increase robustness of language models to out-of-distribution inputs, such as text containing spelling errors. Recent work has observed that usual augmentations that make popular deterministic subword tokenisers stochastic still cause only a handful of all possible...
Multi-Agent Path Finding (MAPF) focuses on determining conflict-free paths for multiple agents navigating through a shared space to reach specified goal locations. This problem becomes computationally challenging, particularly when handling large numbers of agents, as frequently encountered in practical applications like coordinating autonomous vehicles. Quantum Computing (QC) is a promising...
Forecasting high-energy flares in blazars—active galactic nuclei with relativistic plasma jets oriented toward Earth—over extended temporal horizons presents a significant challenge due to the complex variability inherent in their light curves. In this study, we investigate the long-term predictability of flare activity using over 15 years of photon flux observations from the Fermi-LAT...
Emergent Misalignment (EMA) is a puzzling phenomenon where models finetuned on a narrowly misaligned task (e.g., including insecure backdoors in code) learn to be broadly misaligned. EMA is concerning, as models trained on superficially harmless data might become broadly misaligned. At the same time, the fact that alignment behavior across different domains is so strongly correlated during...
Hyperbolic representations are effective in modeling knowledge graph data which is prevalently used to facilitate multi-hop reasoning. However, a rigorous and detailed comparison of the two spaces for this task is lacking. In this paper, through a simple integration of hyperbolic representations with an encoder-decoder model, we perform a controlled and comprehensive set of experiments to...
The AI research ecosystem is a demanding, high-pressure environment that profoundly shapes the future of technology. Its effectiveness and sustainability depend not only on technical innovation but also on the people who sustain its progress. Investigating the psychosocial factors that link individual traits to work experiences and mental health is therefore essential for enabling sustainable,...
Traditional interpretability techniques such as rule-based models and feature attribution methods, each offer complementary strengths, however are often applied in isolation. Rule-based approaches are intuitive and logically structured, making them easy to understand, but they often struggle to scale effectively. On the other hand, feature attribution techniques like SHAP are well-suited to...
This abstract outlines my current research for my PhD thesis, focusing specifically on creating a synthetic dataset for multi-camera multi-object tracking (MCMOT) within logistics applications.
Motivation: Tracking moving assets such as trucks, trailers, or containers in logistics yards is crucial for developing digital twins, measuring key performance indicators, and enhancing operational...
Understanding causal relationships in oncology is essential for improving treatment strategies and generating testable medical hypotheses. We present CaDSIm (Causal Discovery with Simultaneous Imputation), a new method for learning causal structures and associated Structural Equation Models from real world pan-cancer data, which is typically high dimensional, noisy, and incomplete.
Our...
Dynamical systems governed by ordinary differential equations (ODEs) serve as models for a vast number of natural and social phenomena. In this work, we offer a fresh perspective on the classical problem of imputing missing time series data, whose underlying dynamics are assumed to be determined by ODEs. Specifically, we revisit ideas from amortized inference and neural operators, and propose...
The Lamarr Scientific Forum is rounding off the first day with a closing and all information needed on dinner plans.
We begin program day number 2 with a short look back at the previous day and ahead at today's program.
Recent works for time-series forecasting more and more leverage the high predictive power of Deep Learning models.
With this increase in model complexity, however, comes a lack in understanding of the underlying model decision process, which is problematic for high-stakes application scenarios. At the same time, simple, interpretable forecasting methods such as ARIMA still perform very...
Chirality information (i.e., information that allows distinguishing left from right) is ubiquitous for various data modes in computer vision, including images, videos, point clouds, and meshes. Contrary to symmetry, for which there has been a lot of research in the image domain, chirality information in shape analysis (point clouds and meshes) has remained underdeveloped. Although many shape...
Despite advances in conversational systems, the evaluation of such systems remains a challenging problem. Current evaluation paradigms often rely on costly homogeneous human annotators or oversimplified automated metrics, leading to a critical gap in socially aligned conversational agents, where pluralistic values (i.e., acknowledging diverse human experiences) are essential to reflect the...
We explore what it means to build a scientific "theory" of a black-box model, drawing on van Fraassen's Constructive Empiricism (CE), and demonstrate how such a theory can be used for explainable AI (XAI).
A scientific theory is more than just an explanation: it not only has value in its own right, but also serves as a robust framework for answering different questions.
According to CE, a...
Service robots operating in cluttered human environments such as homes, offices, and schools cannot rely on predefined object arrangements and must continuously update their semantic and spatial estimates while dealing with possible frequent rearrangement. Identifying all objects in cluttered, occlusion-heavy environments, such as shelves, requires selecting informative viewpoints and...
The post-surgical gauze retention can lead to serious complications and necessitate additional surgery for its removal. Due to data scarcity, the research on gauze segmentation on real-world surgical data remains underexplored. This work presents first investigation of gauze segmentation on real-surgical data. We use prevalently used segmentation architectures, including CNN-based,...
In this work, we address unsupervised temporal action segmentation, which segments a set of long, untrimmed videos into semantically meaningful segments that are consistent across videos. While recent approaches combine representation learning and clustering in a single step for this task, they do not cope with large variations within temporal segments of the same class. To address this...
Large Language Models (LLMs) remain vulnerable to adversarial jailbreaks, yet existing attacks rely on handcrafted priors or require white-box access for gradient propagation. We show that token-level iterative optimization can succeed without gradients and introduce RAILS (RAndom Iterative Local Search), a simple yet effective method using only model logits with a query budget comparable to...
In the healthcare domain, sensitive patient data is inherently decentralized across institutions and cannot be centralized due to strict privacy regulations. Federated learning offers a collaborative model training without explicitly sharing patient data by communicating model parameters or soft labels. These approaches, however, are still vulnerable to privacy leakage and often limit model...
Social sciences define values as preferred behaviors or outcomes that motivate an individual's actions or judgments.
While LLMs often reflect biases from their training data, it remains unclear what values underlie their generation processes, and whether such internal value systems can be measured or modified.
In this paper, we investigate whether fine-tuning can steer a model’s internal...
Detecting temporal abnormal patterns over streaming data is challenging due to volatile data properties and the lack of real-time labels. The abnormal patterns are usually hidden in the temporal context, which cannot be detected by evaluating single points. Furthermore, the normal state evolves over time due to concept drifts. A single model does not fit all data over time. Autoencoders are...
While many have analyzed the resource efficiency of trained models, an important question remains: How can one be sustainable and resource-aware during AI development, or in other words, when looking for a suitable model to train on a specific learning task? AutoML can help with finding well-performing models on given data, however these frameworks overly focus on predictive quality and...
Pallets are one of the most important load carriers for international supply chains. Yet, continuously tracking activities such as driving, lifting or standing along their life cycle is hardly possible. As part of a preliminary project, it was shown that it is possible to develop a prediction model for pallet activities using data from inertial measurements units mounted on a pallet. A...