Safe Offline Reinforcement Learning for Sepsis Treatment: A Two-Stage Framework Combining Constraint-Aware Learning with Runtime Safety Filtering
Bailing Zhang, Yuwei Mi
2026, 2(1): 103-118. doi: 10.53941/tai.2026.100007
Stable CDE Autoencoders with Acuity Regularization for Offline Reinforcement Learning in Sepsis Treatment
Yue Gao
2025, 1(1): 307-325. doi: 10.53941/tai.2025.100021