
LabMed 2026,3, 2 18 of 24
rather than precise population parameters. These sample sizes were constrained by pre-
operational feasibility, adequate for detecting major system failures (which were absent),
but insufficient for narrow precision on secondary metrics.
2. Single-observer assessment. Pre-operational drills used single observers to classify
anomaly severity and assess drill pass/fail criteria, precluding inter-rater reliability estima-
tion. This introduces potential for classification bias: different observers might categorize
the same event differently (e.g., Minor vs. Major severity), particularly for borderline cases.
The impact on our results is likely limited. We observed zero Major/Critical events, so mis-
classification would only affect distribution within the Minor category, but the consistency
of severity classification remains unvalidated. Dual-observer protocols with inter-rater
reliability analysis (Cohen’s kappa) were not feasible during pre-operational simulations
but represent an important operational validation step.
3. Simulation fidelity and Hawthorne effects. Despite prioritizing functional, phys-
ical, and psychological fidelity, simulations conducted in pre-operational contexts may
not predict operational performance. Staff awareness of observation and assessment may
have produced optimal “best behavior” responses (Hawthorne effect), yielding faster
alarm response times, heightened attention to labeling, and more careful documentation
than would occur under routine operational pressures. Additionally, simulations cannot
replicate cumulative fatigue, competing priorities during high donor volume, equipment
degradation over time, or staff turnover effects. Our observed performance (zero Criti-
cal/Major anomalies, 7-min mean alarm response) represents controlled conditions, not
sustained operational reality.
4. Inherent limitations of pre-operational validation. Simulations fundamentally
cannot answer questions that require longitudinal operational data: How quickly are
deviations closed in practice when CAPA investigations compete with daily operations?
Do alarm response times remain stable over months, or do they drift as urgency perception
fades? Do different staff members classify deviation severity consistently when facing
actual product-impacting events? Pre-operational validation demonstrates that system
components function correctly (procedures execute, approvals route, records link), but
cannot prove that these systems will perform reliably at scale over extended periods. This
distinction between system validation (demonstrated here) and process validation (requires
operational data) is fundamental to quality system maturation.
5. Single-site generalizability. This validation was conducted at one center with
specific characteristics: small-scale operation (two apheresis devices, one freezer), urban
location with rapid vendor access, and staff experienced in clinical laboratory practices.
Performance at centers with different scales (high-throughput operations), geographic
contexts (rural locations with delayed vendor response), or staffing profiles (newly hired
teams) remains unvalidated. The adaptation guidance (Table 6) provides a framework for
addressing these variations, but actual multi-site performance has not been demonstrated.
6. Conclusions
This study presents the first published pre-operational validation of a deviation-ready
quality management system for a source plasma center, using simulation-based testing to
demonstrate system functionality before first donor collections. The validation confirmed
that critical quality system components (document control, deviation escalation pathways,
CAPA linkages, alarm response protocols, cold-chain documentation, and training gates)
function correctly under controlled conditions and produce inspection-ready audit trails
aligned with 21 CFR Parts 606/640, BPDR reporting requirements, and EU Directive
2005/62/EC.
Key achievements include the following:
https://doi.org/10.3390/labmed3010002