Golden Gate Bridge
A young HPC + AI infra enthusiast

Haoquan Chen
陈淏泉

I build the infrastructure layer between ambitious workloads and the heterogeneous machines that run them.

Research Interests

Where I want to push the limits

The questions I keep coming back to when hardware, compilers, and distributed systems meet.

HPC SystemsSystem software, scheduling, and runtime support for large-scale scientific computing.
GPU ArchitectureMulti-GPU execution, accelerator memory systems, and architecture-aware performance.
Distributed AI InfraTraining, serving, scheduling, and resource management for AI clusters.
Performance PortabilityCompiler and container layers that adapt applications across heterogeneous platforms.

Experience

How I learned systems from real SUPERCOMPUTERS

Labs taught me abstractions; supercomputers taught me constraints; competitions taught me to measure.

arcSYSu Lab, Sun Yat-sen University

Worked around HPC containers, compiler-assisted deployment, and distributed system support for heterogeneous clusters. The experience shaped my understanding of container runtimes, compiler toolchains, reproducible evaluation, and production systems on Tianhe-Xingyi supercomputer.

HPC ContainersDistributed SystemsDockerKubernetes

SYSU Supercomputing Team

The team gave me full-stack HPC training from the ground up, from hardware setup and cluster configuration to operator development, GPU tuning, application optimization, and performance profiling, with hands-on experience on self-built clusters and Bridges-2 supercomputer.

Parallel ProgrammingGPUKernel OptimizationPerformance Profiling

AIR, Tsinghua University

Worked around large-scale AI for Science applications and the system infrastructure behind real parallel workloads. The experience helped me connect AI frameworks, MPI-style execution, scientific simulation, and deployment constraints across Tianhe-3, Sunway, and Tecorigin.

AI4SLarge-scale Simulation

Research

What came out of the experiments

Artifacts from turning systems ideas into papers, prototypes, and evaluations that have to survive real machines.

SC'25 · CCF-A

coMtainer: Compilation-assisted HPC Container Images with Enhanced Adaptability

Built the container transformation and evaluation workflow for heterogeneous HPC clusters, preserving near-native execution while improving deployment adaptability.

coMtainer build graph and image model
Submitting to SC'26

HorizonAKMC for Large-scale Atomistic Simulation

Working on distributed reinforcement-learning infrastructure for Kinetic Monte Carlo simulation across national supercomputing platforms.

ICCCS'25

CWL-Bubble: Extending CWL for Dynamic Scientific Workflows

Co-authored a workflow-language extension for loops and recursive task generation in complex scientific workflows.

Projects

Things I built along the way

Small enough to ship, technical enough to teach me something about performance, platforms, or play.

OpenMX · Performance optimization

Performance Optimization of OpenMX

Optimized the nanomaterials simulation package on Bridges-2 using profiling, kernel refactoring, and hybrid MPI + OpenMP parallelization. Achieved 2.2x end-to-end performance improvement.

C/C++FortranMPI+OpenMPVTuneuProfScaLAPACK/ELPA
YatCC-OL · HPC code-servers

Lightweight HPC Development Platform

Built an online development platform above the Tianhe-Xingyi supercomputer for the YatCC-AI course, with Flask backend services and Kubernetes integration for standard cloud clusters.

FlaskKubernetesRedisDocker
WebGL · Space fighter game

PlanetWar: Maverick

Press to play

WebGLJavaScriptGame

Awards

A few milestones from the journey

World ChampionISC25 Student Cluster CompetitionJune 2025
Top PrizeTencent Scholarship, 1 / 424Nov 2025
Third PrizeCompiler Design Competition 2024Aug 2024
Outstanding StudentsScholarship, Sun Yat-sen University2023-2025

Technical Skills

Tools I reach for when things get REAL

ProgrammingPython, C/C++, Shell scripting
Parallel ComputingMPI, OpenMP, SIMD
Performance ToolsNsight, VTune, uProf, Perf
HPC & CloudDocker, Kubernetes, Slurm
AI InfrastructurePyTorch, NCCL, distributed training systems

Writing

What I noticed while building and traveling