Lanqing Li (李蓝青)

I am a principal investigator (PI) at Zhejiang Lab, leading the molecular design & synthesis group at the Research Center for Computational Drug Discovery starting from Jan 2023. Prior to that, I was a senior research scientist at Tencent AI Lab, where I work on machine learning and its applications in drug discovery and autonomous control. I also have experience in developing computer-aided detection (CAD) algorithms and softwares as a Tech Lead at a pre-IPO startup, InferVision.

Starting from August 2022, I'm fortunate to study as a part-time PhD supervised by Prof. Pheng Ann Heng, at The Chinese University of Hong Kong. Previously, I hold a Master's Degree in Physics (PhD program) from The University of Chicago and received my Bechelor's Degree in Physics from MIT in 2015, where I was advised by Prof. Alan Guth, Prof. David Kaiser and Prof. Nevin Weinberg to conduct research in theoretical physics.

At Tencent I've worked on iDrug and iGrow solutions. I'm enthusiastic about artificial intelligence and its prospect of making a better world.

Email  /  CV  /  Bio  /  Google Scholar  /  LinkedIn  /  Github

profile photo
Research

I'm interested in robust machine learning, AI-aided drug discovery (AIDD), reinforcement learning (RL), and AI for science. In particular, I work on the understanding and development of machine learning algorithms that are robust to distribution shift, imbalanced/offline data, etc., to better facilitate the computational design and synthesis of drugs. For those keen visitors, please find my related publications below.

Hiring! We are looking for full-time researchers, engineers, post docs and highly-motivated Ph.D. students and interns to join our team. The openings cover the following exciting areas:

  • Computational Drug Design & Synthesis, e.g., De Novo Drug/Protein Design, Generative Models, Retrosynthesis
  • Robust Machine Learning, e.g., OOD/Imbalanced/Continual Learning
  • Reinforcement Learning, e.g., Offline/Meta RL, RL for Molecular Design, Physics-informed RL

Feel free to shot an email if you are interested.

News
  • 2023/09/27: Our work on protein inverse folding and protein large language model at Zhejiang Lab are reported by CCTV News, check out 01:41:00-01:46:00 in this recording!
  • 2023/09/26: Our CVPR paper CC-SAM is selected as a highlight by CCF多媒体专委会.
  • 2023/06/18: 2 CVPR papers, CC-SAM on deep long-tailed recognition, and MIT on uncertainty calibration are published.
  • 2023/02/28: 2/2 papers accepted to CVPR 2023.
  • 2022/11/20: 3/3 papers, ImGCL, DrugOOD and MEGAE, are accepted to AAAI 2023.
  • 2022/09/19: Our AIDD benchmark for imbalanced learning algorithms, ImDrug, now appears on arXiv.
  • 2022/09/15: 1 paper accepted to NeurIPS 2022.
  • 2022/07/11: 1 paper VPQ appears at ACM SIGIR 2022.
  • 2022/05/15: 1 paper LA-GNN accepted to ICML 2022.
  • 2022/04/30: I will co-mentors this year‘s Tencent AI Lab Rhino-Bird Focused Research Program, with a focus on 3D de novo drug design and drug-target interaction.
  • 2022/03/02: I will co-mentor this year's Tencent AI Lab Rhino-Bird Elite Training Program, with a focus on deep graph learning and its applications in OOD and long-tailed settings.
  • 2022/01/24: Our AIDD benchmark for out-of-distribution algorithms, DrugOOD, now appears on arXiv.
Selected Publications (*: co-first author, : corresponding author, : my group members or interns)
AI for Drug Discovery
ImGCL: Revisiting Graph Contrastive Learning on Imbalanced Node Classification
Liang Zeng, Lanqing Li, Ziqi Gao, Peilin Zhao , Jian Li
AAAI, 2023
paper / arXiv / poster / code

We introduce an important yet under-explored problem, namely graph contrastive learning on imbalanced node classification. We propose a novel ImGCL self-training framework as an effective solution, which utilizes the node centrality based progressively balanced sampling methods to obtain balanced labels, with theoretical insight for its convergence.

Handling Missing Data via Max-Entropy Regularized Graph Autoencoder
Ziqi Gao, Yifan Niu, Jiashun Cheng, Jianheng Tang, Tingyang Xu , Peilin Zhao , Lanqing Li, Fugee Tsung, Jia Li,
AAAI, 2023
paper / poster / code

We present a regularized graph autoencoder for graph attribute imputation, named MEGAE, which aims at mitigating spectral concentration problem by maximizing the graph spectral entropy. Notably, we first present the method for estimating graph spectral entropy without the eigen-decomposition of Laplacian matrix and provide the theoretical upper error bound.

Hierarchical Graph Learning for Protein–Protein Interaction
Ziqi Gao, Yifan Niu, Chenran Jiang, Jiawen Zhang, Xiaosen Jiang, Lanqing Li, Peilin Zhao , Huanming Yang, Yong Huang, Jia Li,
Nature Communications 14.1 (2023): 1093. (selected to Editors’ Highlight collection)
paper / poster / code

We present a double-viewed hierarchical graph learning model, HIGH-PPI, to predict PPIs and extrapolate the molecular details involved. In this model, we create a hierarchical graph, in which a node in the PPI network (top outside-of-protein view) is a protein graph (bottom inside-of-protein view). In the bottom view, a group of chemically relevant descriptors, instead of the protein sequences, are used to better capture the structure-function relationship of the protein.

Local Augmentation for Graph Neural Networks
Songtao Liu, Rex Ying, Hanze Dong, Lanqing Li, Tingyang Xu , Yu Rong , Peilin Zhao , Junzhou Huang , Dinghao Wu
ICML, 2022
paper / poster / code

We propose a general graph augmentation strategy by generate conditioned node features in their local neighborhood to enhance the expressive power of existing GNN.

Robust Machine Learning
Class-Conditional Sharpness-Aware Minimization for Deep Long-Tailed Recognition
Zhipeng Zhou*, Lanqing Li*, Peilin Zhao, Pheng-Ann Heng, Wei Gong,
CVPR, 2023
paper / code

Based on PAC-Bayesian framework, we develop a novel sharpness-aware robust optimization scheme to improve the generalization of deep long-tailed learning methods.

On the Pitfall of Mixup for Uncertainty Calibration
Deng-Bao Wang, Lanqing Li, Peilin Zhao, Pheng-Ann Heng, Min-Ling Zhang,
CVPR, 2023
paper / code

We interrogate the general perception that mixup training improves uncertainty calibration. Through systematic empirical studies, we conclude that the answer is quite the opposite when post-hoc calibration is considered. To circumvent this "pitfall", we propose a general strategy named mixup inference in training (MIT), which adopts a simple decoupling principle for recovering the outputs of raw samples at the end of forward network pass.

Reinforcement Learning
Value Penalized Q-Learning for Recommender Systems
Chengqian Gao, Ke Xu, Kuangqi Zhou, Lanqing Li, Xueqian Wang , Bo Yuan , Peilin Zhao
SIGIR, 2022
paper / poster / code

We propose Value Penalized Q-learning (VPQ), a novel uncertainty-based offline RL algorithm that penalizes the unstable Q-values in the regression target using uncertainty-aware weights, achieving the conservative Q-function without the need of estimating the behavior policy.

iGrow: A Smart Agriculture Solution to Autonomous Greenhouse Control
Xiaoyan Cao, Yao Yao, Lanqing Li, Wanpeng Zhang , Zhicheng An , Zhong Zhang, Shihui Guo , Li Xiao, Xiaoyu Cao , Dijun Luo
AAAI, 2022
paper / arXiv / poster / code

A pioneering smart agriculture solution to autonomous greenhouse control based on reinforcement learning, genetic algorithms and black-box simulator.

FOCAL: Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior Regularization
Lanqing Li, Rui Yang, Dijun Luo
ICLR, 2021
paper / poster / presentation / arXiv / code

The first end-to-end model-free offline meta-RL algorithm, in pursuit of more practical RL.

A Simulator-based Planning Framework for Optimizing Autonomous Greenhouse Control Strategy
Zhicheng An , Xiaoyan Cao, Yao Yao, Wanpeng Zhang , Lanqing Li, Yue Wang, Shihui Guo , Dijun Luo
Proceedings of the Thirty-First International Conference on Automated Planning and Scheduling (ICAPS), 2021
paper / arXiv / poster / code

We explore and examine the feasibility of black-box optimization based on a data-driven simulator for optimal control of autonomous greenhouses.

Physics
Theory of self-resonance after inflation. I. Adiabatic and isocurvature Goldstone modes
Mark P Hertzberg , Johanna Karouby, William Spitzer, Juana C. Becerra , Lanqing Li
Physical Review D 90 (12), 123528, 2014
paper / arXiv

We develop a theory of self-resonance after inflation.

Theory of self-resonance after inflation. II. Quantum mechanics and particle-antiparticle asymmetry
Mark P Hertzberg , Johanna Karouby, William Spitzer, Juana C. Becerra , Lanqing Li
Physical Review D 90 (12), 123529, 2014
paper / arXiv

We develop a theory of self-resonance after inflation.

Scientific Community Activities

Invited Talks and Seminars

  • Intelligent Drug Discovery Platform and Its Applications, presented at the "Computation + Biology" Youth Academic Research Symposium, Zhejiang lab. (09/2023)
  • Guest lecture on reinforcement learning applications. The Chinese University of Hong Kong, Shenzhen. (02/2023) Slides

Academic Services

  • Reviewer, ICLR 2024
  • Reviewer, CVPR 2023
  • Reviewer, ICML 2022, 2023
  • Reviewer, NeurIPS 2022, 2023
  • Reviewer, IJCAI 2021, 2022
  • Reviewer, TPAMI

Source code credit to Dr. Jon Barron