avatar

Sizhuang He

Second-Year Ph.D. Student in Computer Science
Yale University
sizhuang.he (at) yale.edu

Loading...
Citations
Loading...
H-Index
Last updated: Loading...

About Me

I’m a second-year Computer Science Ph.D. student at Yale University, advised by Dr. David van Dijk. Previously, I completed my Bachelor’s degree at University of Michigan, Ann Arbor, majoring in Honors Mathematics and minoring in Computer Science.

I work on the intersection of machine learning and biology:

Check out my CV here: Curriculum Vitae

News

Selected Publications

  1. Cell2Sentence
    C2S-Scale scales this framework to 27 billion parameters trained on a billion-token multimodal corpus—achieving state-of-the-art predictive and generative performance for complex, multicellular analyses. Read more about this work in our Google Research blog post. — bioRxiv

  2. CaDDI
    We introduce a novel approach to discrete diffusion models that conditions on the entire generative trajectory, thereby lifting the Markov constraint and allowing the model to revisit and improve past states. CaDDi treats standard causal language models as a special case and permits the direct reuse of pretrained LLM weights with no architectural changes. — NeurIPS 2025 (Poster)

  3. CaLMFlow
    We present Volterra Flow Matching, a novel generative modeling framework that reformulates ODE-based flow matching frameworks with Volterra Integral Equations, hence avoiding a core challenge in ODE-based methods, known as stiffness. We show the connection between Volterra Integral Equations and causal transformers, the backbone of modern Large Language Models and hence demonstrates that causal language models can be naturally extended to generative modeling over continuous data domains through the lens of Volterra Flow Matching. — arXiv

  4. Intelligence at the Edge of Chaos
    By training LLMs on elementary cellular automata rules of varying complexity, we pinpoint a 'sweet spot' of data complexity that maximizes downstream predictive and reasoning abilities. Our findings suggest that exposing models to appropriately complex patterns is key to unlocking emergent intelligence. — ICLR 2025 (Poster)

Complete Publication List

(*: Equal Contribution; Last Updated: Sep 2025)

Filter by Research Area
# Causal Language Models # Cellular Automata # Complexity Theory # Computational Biology # Diffusion Models # Flow Matching # Generative Modeling # Integral Equations # Large Language Models # Neural Operators # Numerical Analysis # Operator Learning # Single-Cell Analysis

2025

  1. Scaling Large Language Models for Next-Generation Single-Cell Analysis
    Syed Rizvi*, Daniel Levine*, Aakash Patel*, Shiyang Zhang*, Eric Wang*, Sizhuang He and 17 more authors
    bioRxiv
    # Large Language Models # Computational Biology # Single-Cell Analysis
  2. Non-Markovian Discrete Diffusion with Causal Language Models
    Yangtian Zhang *, Sizhuang He*, Daniel Levine, Lawrence Zhao, David Zhang and 4 more authors
    NeurIPS 2025 (Poster)
    # Diffusion Models # Causal Language Models # Generative Modeling
  3. TANTE: Time-Adaptive Operator Learning via Neural Taylor Expansion
    Zhikai Wu, Sifan Wang, Shiyang Zhang, Sizhuang He, Min Zhu and 3 more authors
    In Review
    # Neural Operators # Operator Learning
  4. COAST: Intelligent Time-Adaptive Neural Operators
    Zhikai Wu, Shiyang Zhang, Sizhuang He, Sifan Wang, Min Zhu and 3 more authors
    AI4MATH Workshop at ICML 2025 (Poster)
    # Neural Operators # Operator Learning

2024

  1. CaLMFlow: Flow Matching using Causal Language Models
    Sizhuang He*, Daniel Levine*, Ivan Vrkic, Marco Bressana, David Zhang and 4 more authors
    arXiv
    # Flow Matching # Causal Language Models # Generative Modeling # Integral Equations
  2. Intelligence at the Edge of Chaos
    Shiyang Zhang*, Aakash Patel*, Syed Rizvi, Nianchen Liu, Sizhuang He and 3 more authors
    ICLR 2025 (Poster)
    # Large Language Models # Complexity Theory # Cellular Automata

2023

  1. Operator Learning Meets Numerical Analysis: Improving Neural Networks through Iterative Methods
    Emanuele Zappala, Daniel Levine, Sizhuang He, Syed Rizvi, Sacha Lévy and 1 more authors
    arXiv
    # Numerical Analysis # Operator Learning # Integral Equations

Services

Apart from research

Beyond research, my biggest passions include:


Powered by Jekyll and Minimal Light theme.