ONR-YIP, Young Investigator Award from the Office of Naval Research
AFOSR-YIP, Young Investigator Award from the Air Force Office of Scientific Research
Best Papers, Best Student Papers, and Best Paper Honorable Mentions from NeurIPS, KDD, SIGIR, ASONAM, etc.
Publications
Reinforcement learning algorithms and applications in search, NLP, and knowledge graphs
On Penalization in Stochastic Multi-armed Bandits, IEEE Transactions on Information Theory, 2025. pdf
Extreme Bandits using Robust Statistics, IEEE Transactions on Information Theory, 2023. pdf
Piecewise Stationary Bandits under Risk Criteria, AISTATS 2023. pdf
Regret Analysis for RL using Renewal Bandit Feedback, ITW 2022. pdf
Learning to Selectively Learn for Weakly Supervised Paraphrase Generation with Model-based Reinforcement Learning, NAACL 2022. pdf
Extracting Knowledge from Web Text with Monte Carlo Tree Search, WWW 2020. pdf
An Advantage Actor-Critic Algorithm with Confidence Exploration for Open Information Extraction, SDM 2020. pdf
A Reinforced Semi-supervised Neural Network for Helpful Review Identification, CIKM 2020. pdf
Video Recommendation with Multi-gate Mixture of Experts Soft Actor Critic, SIGIR 2020. pdf
Reinforced Product Metadata Selection for Helpfulness Assessment of Customer Reviews, EMNLP 2019. pdf
End-to-end Deep Reinforcement Learning Based Coreference Resolution, ACL 2019. pdf
Logician and Orator: Learning from the Duality between Language and Knowledge in Open Domain, EMNLP 2018. pdf
Optimization algorithms for machine learning and deep learning
Sketching for Convex and Nonconvex Regularized Least Squares with Sharp Guarantees, ICLR, 2025. pdf
Empirical Risk Minimization for Losses without Variance, Statistica Sinica, 2025. pdf
Projective Proximal Gradient Descent for Nonconvex Nonsmooth Optimization: Fast Convergence Without Kurdyka-Lojasiewicz (KL) Property, ICLR 2023. pdf
Dataset Pruning: Reducing Training Data by Examining Generalization Influence, ICLR 2023. pdf
Exponential Generalization Bounds with Near-Optimal Rates for Lq-Stable Algorithms, ICLR 2023. pdf
One-Step Estimator for Permuted Sparse Recovery, ICML 2023. pdf
LSDS++: Dual Sampling for Accelerated k-means++, ICML 2023. pdf
Regression with Label Permutation in Generalized Linear Model, ICML 2023. pdf
k-Median Clustering via Metric Embedding: Towards Better Initialization with Differential Privacy, NeurIPS 2023. pdf
On the Overlooked Structure of Stochastic Gradients, NeurIPS 2023. pdf
L2-Uniform Stability of Randomized Learning Algorithms: Sharper Generalization Bounds and Confidence Boosting, NeurIPS 2023. pdf
Constructing Orthogonal Convolutions in an Explicit Manner, ICLR 2022. pdf
Discriminative Similarity for Data Clustering, ICLR 2022. pdf
Optimal Transport for Long-Tailed Recognition with Learnable Cost Matrix, ICLR 2022. pdf
Minimax M-estimation under Adversarial Corruption, ICML 2022. pdf
Nearly Optimal Catoni’s M-estimator for Infinite Variance, ICML 2022. pdf
The benefits of diversity: Permutation recovery in unlabeled sensing from multiple measurement vectors, IEEE Transactions on Information Theory, 2022. pdf
Stability and Risk Bounds of Iterative Hard Thresholding, IEEE Transactions on Information Theory, 2022. pdf
Transformer and generative AI models
Visual Transformer with Differentiable Channel Selection: An Information Bottleneck Inspired Approach, ICML 2024. pdf
Less is More: Sparse Watermarking in LLMs with Enhanced Text Quality, Preprint 2024. pdf
CoopHash: Cooperative Learning of Multipurpose Descriptor & Contrastive Pair Generator via Variational MCMC Teaching for Image Hashing, Preprint 2024. pdf
Word Embedding with Neural Probabilistic Prior, SDM 2024. pdf
An Energy-Based Prior for Generative Saliency, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) 2023. pdf
Likelihood-Based Generative Radiance Field with Latent Space Energy-Based Model for 3D-Aware Disentangled Image Representation, AISTATS 2023. pdf
CoopInit: Initializing Generative Adversarial Networks via Cooperative Learning, AAAI 2023. pdf
A Tale of Two Latent Flows: Learning Latent Space Normalizing Flow with Short-run Langevin Flow for Approximate Inference, AAAI 2023. pdf
Learning Latent Structural Relations with Message Passing Prior, WACV 2023. pdf
Efficient Learning to Learn a Robust CTR Model for Web-scale Online Sponsored Search Advertising, CIKM 2021. pdf
Multi-modal Dictionary BERT for Cross-modal Video Search in Baidu Advertising, CIKM 2021. pdf
MixBERT for Multi-modal Matching in Image Advertising, CIKM 2021. pdf
Assorted Attention Network for Cross-Lingual Language-to-Vision Retrieval, CIKM 2021. pdf
Combo-Attention Network for Baidu Video Advertising, KDD 2020. pdf
Video Recommendation with Multi-gate Mixture of Experts Soft Actor Critic, SIGIR 2020. pdf
Distributed Hierarchical GPU Parameter Server for Massive Scale Deep Learning Ads Systems, MLSys 2020. pdf
Sample Optimization For Display Advertising, CIKM 2020. pdf
AIBox: CTR Prediction Model Training on a Single Node, CIKM 2019. pdf
MOBIUS: Towards the Next Generation of Query-Ad Matching in Baidu’s Sponsored Search, KDD 2019. pdf
Input method editor (IME)
Improved Touch-screen Inputting Using Sequence-level Prediction Generation, WWW 2020. pdf
FastInput: Improving Input Efficiency on Mobile Devices, CIKM 2018. pdf
Knowledge graphs (KG)
OIE@OIA: an Adaptable and Efficient Open Information Extraction Framework, ACL 2022. pdf
SpaceE: Knowledge Graph Embedding by Relational Linear Transformation in the Entity Space, HT 2022. pdf
Explainable Concept Graph Completion by Bridging Open-Domain Relations and Concepts, SDM 2022. pdf
MQuadE: a Unified Model for Knowledge Fact Embedding, WWW 2021. pdf
ReadsRE: Retrieval-Augmented Distantly Supervised Relation Extraction, SIGIR 2021. pdf
Extracting Knowledge from Web Text with Monte Carlo Tree Search, WWW 2020. pdf
A Predicate-Function-Argument Annotation of Natural Language for Open-Domain Information eXpression, EMNLP 2020. pdf
Learning Interpretable Relationships between Entities, Relations and Concepts via Bayesian Structure Learning on Open Domain Facts, ACL 2020. pdf
Integration of Knowledge Graph Embedding into Topic Modeling with Hierarchical Dirichlet Process, NAACL 2019. pdf
Knowledge Graph Embedding Based Question Answering, WSDM 2019. pdf
Logician and Orator: Learning from the Duality between Language and Knowledge in Open Domain, EMNLP 2018. pdf
Logician: A Unified End-to-End Neural Approach for Open-Domain Information Extraction, WSDM 2018. pdf
Federated learning, distributed training, and deep learning training algorithms
Stochastic Controlled Averaging for Federated Learning with Communication Compression, ICLR 2024. pdf
Sharper Analysis for Minibatch Stochastic Proximal Point Method: Stability, Smoothness, and Deviation, Journal of Machine Learning Research (JMLR), 2023. pdf
Analysis of Error Feedback in Federated Non-Convex Optimization with Biased Compression, ICML 2023. pdf
On Convergence of FedProx: Local Dissimilarity Invariant Bounds, Non-smoothness and Beyond, NeurIPS 2022. pdf
Fed-LAMB: Layer-wise and Dimension-wise Locally Adaptive Federated Learning, UAI 2023. pdf
On Distributed Adaptive Optimization with Gradient Compression, ICLR 2022. pdf
On the Convergence of Decentralized Adaptive Gradient Methods, ACML 2022. pdf
An Optimistic Acceleration of AMSGrad for Nonconvex Optimization, ACML 2021. pdf
Toward Communication Efficient Adaptive Gradient Method, FODS 2020. pdf
Understanding and Detecting Convergence for Stochastic Gradient Descent with Momentum, BIGDATA 2020. pdf
On Convergence of Distributed Approximate Newton Methods: Globalization, Sharper Bounds and Beyond, Journal of Machine Learning Research 2020. pdf
Towards Better Generalization of Adaptive Gradient Methods, NeurIPS 2020. pdf
Data privacy protection
Improved Convergence of Differential Private SGD with Gradient Clipping, ICLR 2023. pdf
Differential Privacy with Random Projections and Sign Random Projections. pdf
Differentially Private One Permutation Hashing and Bin-wise Consistent Weighted Sampling. pdf
Building K-Anonymous User Cohorts with Consecutive Consistent Weighted Sampling (CCWS), SIGIR 2023. pdf
k-Median Clustering via Metric Embedding: Towards Better Initialization with Differential Privacy. pdf
Breaking the Linear Error Barrier in Differentially Private Graph Distance Release, NeurIPS 2022. pdf
NL2GDPR: Automatically Develop GDPR Compliant Android Application Features from Natural Language, IEEE CNS 2022. pdf
Distances Release with Differential Privacy in Tree and Grid Graph, ISIT 2022. pdf
AI model security
Defending Backdoor Attacks on Vision Transformer via Patch Processing, AAAI 2023. pdf
Marksman Backdoor: Backdoor Attacks with Arbitrary Target Class, NeurIPS 2022. pdf
Integrity Authentication in Tree Models, KDD 2022. pdf
Identification for Deep Neural Network: Simply Adjusting Few Weights!, ICDE 2022. pdf
DeepAuth: A DNN Authentication Framework by Model-Unique and Fragile Signature Embedding, AAAI 2022. pdf
Backdoor Attack with Imperceptible Input and Latent Modification, NeurIPS 2021. pdf
LIRA: Learnable, Imperceptible and Robust Backdoor Attacks, ICCV 2021. pdf
Robust Watermarking for Deep Neural Networks via Bi-level Optimization, ICCV 2021. pdf
A/B testing
Cluster-Adaptive Network A/B Testing: From Randomization to Estimation, Journal of Machine Learning Research (JMLR) 2024. pdf
A New and Unified Family of Covariate Adaptive Randomization Procedures and Their Properties, JASA 2024. pdf
Adaptive Randomization in Network Data, Electronic Journal of Statistics 2024. pdf
A/B Testing in Network Data with Covariate-Adaptive Randomization, ICML 2023. pdf
Adaptive A/B Test on Networks with Cluster Structures, AISTATS 2022. pdf
Embedding based retrieval (EBR) with graph-based approximate near neighbor (ANN) search
GUITAR: Gradient Pruning toward Fast Neural Ranking, SIGIR 2024. pdf
Asymmetric Hashing for Fast Ranking via Neural Network Measures, SIGIR 2023. pdf
Proximity Graph Maintenance for Fast Online Nearest Neighbor Search. pdf
Constrained Approximate Similarity Search on Proximity Graph. pdf
Fast Neural Ranking on Bipartite Graph Indices, VLDB 2022. pdf
Norm Adjusted Proximity Graph for Fast Inner Product Retrieval, KDD 2021. pdf
SONG: Approximate Nearest Neighbor Search on GPU, ICDE 2020. pdf
Fast Item Ranking under Neural Network based Measures, WSDM 2020. pdf
On Efficient Retrieval of Top Similarity Vectors, EMNLP 2019. pdf
Möbius Transformation for Fast Inner Product Search on Graph, NeurIPS 2019. pdf
Embedding based retrieval (EBR), compression, and hashing methods
On the Trade-Off Between Bit Depth and Number of Samples for a Basic Approach to Structured Signal Recovery From b-Bit Quantized Linear Measurements, IEEE Transactions on Information Theory, 2018. pdf
Linearized GMM Kernels and Normalized Random Fourier Features, KDD 2017. pdf
Quantized Random Projections and Non-Linear Estimation of Cosine Similarity, NIPS 2016. pdf
Improved Asymmetric Locality Sensitive Hashing (ALSH) for Maximum Inner Product Search (MIPS), UAI 2015. pdf
Asymmetric Minwise Hashing for Indexing Binary Inner Products and Set Containment, WWW 2015. pdf
0-Bit Consistent Weighted Sampling, KDD 2015. pdf
Densifying One Permutation Hashing via Rotation for Fast Near Neighbor Search, ICML 2014. pdf