Yiming Qin

I am a computer science PhD student at the École Polytechnique Fédérale de Lausanne (EPFL). I am currently working on graph generation and have an interest in drug discovery and conditional generation. I am fortunate to be advised by Prof. Pascal Frossard.

Prior to my PhD, I completed my Master's degree at the at Shanghai Jiao Tong University (SJTU), where I worked with Prof. Ya Zhang, Prof. Yanfeng Wang, Prof. Jiaochao Yao, and Dr. Huangjie Zheng. At the same time, I earned the double degree (title of Ingénieur) from École polytechnique (X) on applied mathematics, finished with an internship at École Normale Supérieure (ENS), Paris under the supervision of Dr. Lisa Bedin and Prof. Auguste Genovesio. Before that, I completed my Bachelor's degree a B.E. in Information Engineering and a B.A. in French at SJTU, within the SPEIT.

Email  /  GitHub  /  Google Scholar  /  LinkedIn

profile photo

News

Jun 2026
“DBMol” is accepted as a poster at the ICML 2026 GenBio workshop. See you in Seoul!
Jun 2026
Our work on scalable graph generation was accepted at the ICML 2026 Graph Foundation Models Workshop.
May 2026
“CoFRe” was published on arXiv and accepted at the ICML 2026 SPIGM workshop.
Mar 2026
I will revisit Cambridge for a short term. Happy to chat!
Feb 2026
Our work “DIRECTO” was accepted as a poster at ICLR 2026.
Oct 2025
I am excited to begin an academic visit at the University of Cambridge, hosted by Pietro Liò's group and LMB-MRC.
Jul 2025
Our work “DeFoG” was presented as an Oral at ICML 2025.

Research

My research focuses on generative models and graphs. Specifically, I am interested in controlling generative models, such as model editing and conditional generation, the intersection of autoregressive and diffusion language models, and applying graph generative models to diverse domains, such as drug discovery.

project image

DBMol: Design of High-Affinity, Target-Specific Small Molecules through Structure Prediction Models


Yiming Qin*, Kai Yi*, Miruna Cretu, Sjors H. W. Scheres, Pietro Liò, Pascal Frossard
ICML 2026 GenBio Workshop, 2026

DBMol is a docking-dataset-free framework that uses structure prediction models to guide de novo small-molecule design, achieving strong affinity and pocket specificity without reinforcement learning or docking supervision. This project applies 2D graph generation for small-molecule design.

project image

Fixed-Point Masked Generative Modeling


Andrea Miele, Yiming Qin, Alba Carballo-Castro, Justin Deschenaux, Pascal Frossard
ICML 2026 SPIGM Workshop, 2026
arxiv /

We introduce CoFRe, a fixed-point framework for masked generative models with a looped architecture that reuses shared parameters, reducing parameters, training cost, and memory usage while improving low-budget generation across text and image modalities.

project image

Efficient Edge-aware Attention Network for Graph Generation


Vincent Jung*, Alba Carballo-Castro*, Yiming Qin*, Lonneke van der Plas, Pascal Frossard
ICML 2026 Graph Foundation Models Workshop, 2026

We propose a kernel-friendly edge-aware attention layer for graph generation that is directly compatible with PyTorch FlexAttention, reducing GPU memory by around 65%. This work offers a complementary angle to SparseDiff for scalable graph generation.

project image

Generating Directed Graphs with Dual Attention and Asymmetric Encoding


Alba Carballo-Castro, Manuel Madeira, Yiming Qin, Dorina Thanou, Pascal Frossard
ICLR, 2026
arxiv / code /

We propose DIRECTO, a discrete flow matching model with direction-aware encodings and dual-attention, plus new benchmarks, for better directed graph generation. It extends ideas from DeFoG to directed graphs and shows their robustness in a new setting.

project image

DeFoG: Discrete Flow Matching for Graph Generation


Yiming Qin*, Manuel Madeira*, Dorina Thanou, Pascal Frossard
ICML (Oral Presentation), 2025
arxiv / code /

Graph generation models often face limitations in sampling efficiency and flexibility due to tightly coupled training and sampling stages. We introduce DeFoG, a discrete flow matching framework that disentangles these stages, improving both efficiency and performance over existing diffusion models.

project image

SparseDiff: Sparse Discrete Diffusion for Scalable Graph Generation


Yiming Qin, Clément Vignac, Pascal Frossard
TMLR, 2025
arxiv / code /

Generative models for graphs struggle with scalability due to predicting interactions for all node pairs. We introduce SparseDiff, a denoising diffusion model that leverages sparsity to overcome this. It reformulates diffusion, denoising, and the model architecture around sparse graph representations.

project image

Class-balancing Diffusion Models


Yiming Qin, Huangjie Zheng, Jiangchao Yao, Mingyuan Zhou, Ya Zhang
CVPR, 2023
arxiv / code /

We observe significant performance degradation on tail classes in both diversity and fidelity, and propose Class-Balancing Diffusion Models (CBDM) that are trained with a distribution adjustment regularizer as a solution.




Supervised projects

Here are some research projects that I have supervised. Open projects (for EPFL students) can be found here. Feel free to get in touch.

project image

Balancing Symmetry and Efficiency in Graph Flow Matching


Benjamin Honoré, Alba Carballo-Castro, Yiming Qin, Pascal Frossard
arXiv, 2026
arxiv /

We study the trade-off between symmetry and efficiency in graph flow matching, showing that properly modulating equivariance can accelerate convergence while delaying overfitting.

project image

On the Role of Structure in Hierarchical Graph Neural Networks


Luca Sbicego, Sevda Öğüt, Manuel Madeira, Yiming Qin, Dorina Thanou, Pascal Frossard
ICLR (Workshop), 2025
arxiv /

Hierarchical GNNs still don’t outperform flat ones overall but recover better from structural perturbations.




Projects

Here are some projects that I have worked on.

project image

Learning Cell Dynamic Features for Unlabeled Video-Microscopies from Synthetic Data



ENS Ulm, 2021

We design a multi-task model with two heads: one trained on synthetic mitosis videos (SynDiv) using supervised/self-supervised tasks, and one trained on real microscopy videos (LSP) using self-supervised tasks. Joint training improves the learned representations, yielding better correlations with cell counts, division events, and genetic perturbations than models trained on SynDiv or LSP alone.





Design and source code from Leonid Keselman and Jon Barron's website