Unified Pipeline for Reproducible Benchmarking of Sequence Models on Biomedical Data

2 June 2026, by Viktoria Wrobel

Photo: base.camp

This project aims to design and implement a unified, modular pipeline for training and evaluating sequence models within the context of computational biology. While modern architectures such as Transformers, state-space models, and emerging alternatives are widely used, their comparison is often hindered by inconsistent preprocessing, training procedures, and evaluation protocols. We propose a standardized framework that enables fair, reproducible, and efficient benchmarking across heterogeneous model classes on structured biomedical data. The pipeline will support interchangeable model components, configurable experiments, and consistent metrics, thereby facilitating systematic analysis of architectural trade-offs. Its key contribution lies in bridging methodological gaps between models and enabling transparent, scalable experimentation in real-world research settings.

Latest articles

Photo: base.camp

02.06.2026|basecamp-projects

Fine-Tuning a General-Purpose Large Language Model for Financial Process Mining

Financial process mining reconstructs business processes based on journal entries. This becomes problematic when system-specific terms lead to unclear or ambiguous activity descriptions. The aim of this project is to solve this problem by fine-tuning a general-purpose large language model (LLM) for...

Sensory Support for Loss of Breathing Sensation (Device)

Photo: base.camp

19.05.2026|basecamp-projects

Sensory Support for Loss of Breathing Sensation

Empty Nose Syndrome is a debilitating condition where people can lose the physical sensation of airflow which often leads to severe psychological distress. This project aims to refine a wearable haptic feedback device that translates breathing chest movements into vibrations to restore breathing...