avatar

Sizhuang He

Second-Year Ph.D. Student in Computer Science
Yale University
sizhuang.he (at) yale.edu

Loading...
Citations
Loading...
H-Index
Last updated: Loading...

Curriculum Vitae

Last updated: January 30, 2026 (Updated with STRIDE).

PDF TeX

Research Interest

Generative Modeling: Flow Matching, Diffusion, Discrete Diffusion, Operator Learning: Modeling Continuous Spatiotemporal Dynamics, Integral Equations, Computational Biology: Single-cell Transcriptomics Data Analysis, LLMs and Agentic AI: Autonomous Systems for Biological Discovery

Currently, I work on discrete diffusion models on the finite symmetric group and develop LLM multi-agent systems for single-cell perturbation response prediction and DNA methylation data curation.

Education

Yale University New Haven, CT
Ph.D. in Computer Science Aug. 2024 -- Present
  • Advisor: Dr. David van Dijk
  • Research Focus: Machine Learning for Computational Biology
University of Michigan, Ann Arbor Ann Arbor, MI
Bachelor of Science in Honors Mathematics (Minor in Computer Science) Sep. 2019 -- May 2023
  • Graduated with Highest Distinction
  • GPA: 4.0 / 4.0

Publications

  1. Learning Permutation Distributions via Reflected Diffusion on Ranks
    S. He*, Y. Zhang*, et al.
    In Review
  2. Non-Markovian Discrete Diffusion with Causal Language Models
    Y. Zhang*, S. He*, et al.
    NeurIPS 2025
  3. STRIDE: Post-Training LLMs to Reason and Refine Bio-Sequences via Edit Trajectories
    D. Zhang, S. Zhang, S. He, Y. Zhang and D. van Dijk
    In Review
  4. TANTE: Time-Adaptive Operator Learning via Neural Taylor Expansion
    Z. Wu, S. Wang, S. Zhang, S. He, et al.
    In Review
  5. Intelligence at the Edge of Chaos
    S. Zhang*, A. Patel*, S. Rizvi, N. Liu, S. He, et al.
    ICLR 2025
  6. COAST: Intelligent Time-Adaptive Neural Operators
    Z. Wu, S. Zhang, S. He, et al.
    AI4MATH Workshop at ICML 2025
  7. Scaling Large Language Models for Next-Generation Single-Cell Analysis
    S. Rizvi*, D. Levine*, A. Patel*, S. Zhang*, E. Wang*, S. He, et al.
    In Review
  8. CaLMFlow: Flow Matching using Causal Language Models
    S. He*, D. Levine*, et al.
    arXiv
  9. Operator Learning Meets Numerical Analysis: Improving Neural Networks through Iterative Methods
    E. Zappala, D. Levine, S. He, et al.
    arXiv

* denotes equal contribution.

Honors & Awards

Services


Powered by Jekyll and Minimal Light theme.