Safe Offline Reinforcement Learning for Sepsis Treatment: A Two-Stage Framework Combining Constraint-Aware Learning with Runtime Safety Filtering
Bailing Zhang, Yuwei Mi
2026, 2(1): 103-118. doi: 10.53941/tai.2026.100007