Zhengrong Jia* ; Kwong-Cheong Wong*
Presenter: Zhengrong Jia
From: Asia AI Education and Future Technology Association, Hong Kong SAR, China
Research Area: Deep Learning · Clinical EHR · Stroke Mortality Prediction
Email: zhengrong.jia.academic@gmail.com
Paper CI5896 · CCAI 2026 · May 22–24, Nanjing
Paper CI5896 · DT-Transformer · eICU · Stroke Mortality Prediction
Why Standard DL Still Fails Here
The challenge is not data volume — it is architectural mismatch between standard DL and the structure of clinical tabular data.
Multi-center · freely available · Pollard et al., 2018
14% mortality → AUPRC is the right metric. Our 0.6171 is far above random-classifier level.
Heterogeneous feature types demand separate encoders. Raw EHR data demands an online safety layer.
C1: Dual-Tower Encoder · C2: Adaptive Safeguard · C3: Optuna HPO
Dual-Tower encoding + Adaptive Safeguard + Optuna HPO — three contributions, one differentiable pipeline.
High-dimensional sparse indices → dense 22-d vector h_cat. Each feature gets its own learned embedding space.
Self-Attention explicitly computes pairwise interactions across all 94 physiological features — something MLPs only approximate.
The outputs of both towers are concatenated into h_joint ∈ ℝ⁸⁶ — bridging static demographic features and dynamic vital signs.
XGBoost retains a marginal AUPRC lead (0.6467) — our deep learning model closes the gap with full differentiability.
Replacing the Transformer encoder with fully connected layers drops AUPRC by 14.41% — the gain is driven by self-attention modeling global feature interactions.
Beyond empirical gains: three design principles for applying deep learning to heterogeneous clinical tabular data.
Top 5 features by attention weight
The model autonomously prioritizes clinically significant risk factors, improving interpretability compared to black-box baselines.
A fully differentiable alternative to gradient boosting — extensible to multimodal integration with unstructured clinical notes.
Zhengrong Jia · zhengrong.jia.academic@gmail.com