Skip to content
Education

B.Sc. Software Engineering

Macau University of Science and Technology

2021 – 2025 (Graduated Aug 2025)

  • Data Structures & Algorithms, OS, Computer Networks
  • Database Systems (SQL), Software Architecture & Design Patterns
  • Full-Stack Web Development, Cloud & Distributed Systems
  • Machine Learning & AI Fundamentals, Final-Year Research Project
Languages
  • Mandarin — Fluent (Native)
  • English — Intermediate (Academic reading & writing proficient)
Technical Skills
Languages
Python Java C / C++ SQL JavaScript
ML Frameworks
PyTorch Scikit-learn XGBoost YOLOv8 NumPy Pandas
Architectures
Transformer LSTM GRU MLP CNN (YOLO)
Data & Viz
Matplotlib Seaborn OpenCV
Web & Systems
Vue.js Spring Boot REST API HTML / CSS
DevOps & Tools
Git Docker Linux LaTeX
Research Interests
Clinical Predictive Modeling

Transformer-based temporal modeling on longitudinal EHR data; early warning systems for ICU critical events with missing-data-aware architectures, building on prior work in stroke risk prediction (CCAI 2026).

Medical Image Analysis

Vision Transformer and hybrid CNN-ViT architectures for brain MRI segmentation; semi-supervised learning strategies to address annotation scarcity in radiology AI, with focus on multi-class lesion delineation.

Explainable AI for Healthcare

Faithfulness evaluation of attention-based saliency methods for deep radiology models; human-in-the-loop clinical validation frameworks to bridge the gap between model accuracy and clinician trust in high-stakes diagnosis.

Engineering Projects

DT-Transformer — Clinical AI Training Pipeline

2025 – Present

Python · PyTorch · Scikit-learn · Pandas · Docker · Linux

  • Engineered an end-to-end ML pipeline from raw eICU CSV ingestion to model serialization, handling 200k+ patient records with reproducible preprocessing via custom DataLoader classes
  • Designed a modular architecture separating data, model, training, and evaluation layers for maintainability and extensibility
  • Implemented stratified 5-fold cross-validation harness and automated metric logging (AUROC, AUPRC, F1) across all baselines
  • Containerized the full training environment with Docker for environment reproducibility; open-source release in preparation

Time-Series Prediction System — Undergraduate AI Project

2023 – 2024

Python · PyTorch · LSTM · GRU · MLP · Scikit-learn · Pandas · NumPy · Matplotlib · Seaborn

  • Designed and trained LSTM, GRU, and vanilla RNN models for sequential data prediction; best LSTM configuration achieved 1.42% MAPE on the held-out test set
  • Built a full data preprocessing pipeline: missing-value imputation, min-max normalization, sliding-window feature construction, and train/val/test splitting with no data leakage
  • Benchmarked deep sequence models against classical baselines (MLP, Linear Regression, XGBoost) using RMSE, MAE, and MAPE; produced publication-quality visualizations with Matplotlib and Seaborn
  • Implemented early stopping, learning-rate scheduling, and gradient clipping to stabilize RNN training on long sequences

AI-Integrated Web Platform — B.Sc. Capstone

2024 – 2025

Vue.js · Java (Spring Boot) · REST API · SQL · Python · Scikit-learn · Docker

  • Built a production-grade full-stack system with a Vue.js SPA frontend calling a Java Spring Boot RESTful backend, with an embedded Python ML microservice exposed via REST for real-time inference
  • Integrated trained Scikit-learn classification model into the backend serving layer, enabling on-demand predictions without re-training overhead
  • Designed normalized relational schema, implemented JWT-based auth with role-based access control, and deployed the full stack via Docker Compose on a Linux server

Personal Academic Website — github.com/ZR-JIA

2025 – Present

Jekyll · Liquid · CSS Custom Properties · JavaScript · GitHub Pages CI/CD

  • Designed and implemented a fully custom responsive Bento-grid design system from scratch — zero third-party UI frameworks
  • Architected a component-based layout using Jekyll includes, Liquid templating, and CSS custom properties (design tokens) for consistent theming
  • Integrated Font Awesome iconography, WebP image optimization, and Web App Manifest for PWA-ready delivery
  • Automated deployment via GitHub Pages CI/CD; includes SEO metadata, sitemap, and robots.txt configuration
Research Experience

Independent Researcher — Clinical AI

Aug 2025 – Present

Deep Learning on Tabular EHR Data · Stroke Mortality Prediction

Designed and implemented a Dual-Tower Transformer (DT-Transformer) for in-hospital stroke mortality prediction using the multicenter eICU Collaborative Research Database. The architecture decouples categorical demographics from numerical vitals into separate encoding towers with Self-Attention, achieving state-of-the-art performance with an AUPRC of 0.6171 (std 0.006) — a 14.4% relative improvement over the strongest neural baseline (NN, AUPRC 0.5394). An Adaptive Runtime Safeguard was integrated to detect out-of-distribution physiological inputs at inference time, ensuring clinical deployment safety.

  • Outperformed strongest neural baseline NN (AUPRC 0.5394) and Standard Transformer (0.5279); competitive with XGBoost (0.6467)
  • Implemented attention map visualization for clinical interpretability of feature importance
  • Paper accepted at CCAI 2026 (The 6th International Conference on Computer Communication and Artificial Intelligence, May 2026)
Professional Experience

Machine Learning Engineer Intern

Jul 2024 – Aug 2024

China Southern Power Grid AI Technology Co., Ltd. · Guangzhou

Python · PyTorch · YOLOv8 · OpenCV · Pandas · NumPy · Linux

  • Developed a computer vision pipeline for power inspection robots to automate the recognition of circuit breaker states (open / closed / fault), replacing manual visual inspection on high-voltage transmission lines
  • Fine-tuned a YOLOv8 object detection model on a domain-specific dataset of power equipment imagery, achieving reliable detection of switching-state indicators and structural anomalies in varying lighting and weather conditions
  • Built and maintained a 60 GB image dataset — covering substations across Guangdong Province — by standardizing annotation formats, deduplicating frames, and writing automated ingestion scripts to feed the training pipeline
  • Conducted fault-type analysis via exploratory data analysis (Pandas / Matplotlib), quantifying the distribution of insulator cracking, corona discharge, and loose-connector faults to guide class-balanced sampling strategies
  • Contributed to technical bid documentation translating ML algorithm performance benchmarks into procurement specifications for robot deployment contracts
  • Outcome: Received written commendation from supervising engineer; earned full score (100/100) on the formal internship evaluation