I am a final year Ph.D. candidate of Cyberspace Science and Technology in Beijing Jiaotong University, advised by Prof. Wenjia Niu and A/P Endong Tong with Beijing Key Laboratory of Security and Privacy in Intelligent Transportation. I have been visiting Deakin University (working with Prof. Gang Li ) from Feb, 2024 to Aug, 2024.

My research interests lie in (1) AI security including robust and privacy-preserving reinforcement learning, and (2) Intelligent Transportation security including adversarial attack and defense in Intelligent Signal System.

For more detailed personal information, please refer to my CV (English version/ Chinese version).

I'm on the job market this year, please feel free to contact me if you are interested in my background!

Contact: yikeli@bjtu.edu.cn; yileak823@gmail.com

My research focuses on Trustworthy RL, with emphasis on policy robustness and policy privacy:

(1) We address robust RL by formulating it as a max-expectation optimization problem and introducing the DRRL framework, which dynamically generates and sequences tasks of varying difficulty to enhance policy robustness and training stability. (Li et al, IJCAI'23)

(2) We treat robust RL as a multi-task problem, using a GAN-based task generation model and curricular learning to iteratively train policies with progressive tasks, achieving improved stability and robustness across diverse environments. (Li et al, TST'22 and Tian et al, AutoSec'21)

(3) To protect the privacy of RL models from imitation learning attacks, we propose a hierarchical Policy Confusion Defense (PCD) framework that models privacy-preserving training as finding optimal dynamic switching among diverse expert policies to induce confusion. (Li et al, submitted to TDSC'23)

We target the data poisoning congestion attacks in USDOT-sponsored Intelligent Traffic Signal System:

(1) To collaborative defend against data poinsing attacks in multi-intersection traffic network, we model multi-intersection signal planning as a multi-agent RL problem and propose an actor-attention-critic algorithm to enhance robustness while improving traffic and energy efficiency. (Li et al, TGCN'22)

(2) To bridge the gap in timely visualized attack predictions, we introduce a CycleGAN-based framework leveraging traffic image feature learning to predict congestion caused by attacks. (Li et al, WCMC'20)


  • Yike Li, Yunzhe Tian, Endong Tong, Wenjia Niu, and Jiqiang Liu. Robust Reinforcement Learning via Progressive Task Sequence. In the 32nd International Joint Conference on Artificial Intelligence (IJCAI), 2023.
  • Yike Li, Jiayin Song, Yunzhe Tian, Endong Tong, Yuling Liu, Guozhu Meng, Yalun Wu, Jianhua Li, Wenjia Niu, and Jiqiang Liu. Towards Preventing Imitation Learning Attack via Policy Confusion Defense. Summited to IEEE Transactions on Dependable and Secure Computing (TDSC), 2023 (Under Second Review).
  • Yike Li, Wenjia Niu, Yunzhe Tian, Tong Chen, Zhiqiang Xie, Yalun Wu, Yingxiao Xiang, Endong Tong, Thar Baker, and Jiqiang Liu. Multiagent Reinforcement Learning-Based Signal Planning for Resisting Congestion Attack in Green Transportation. In IEEE Transactions on Green Communications and Networking (TGCN), 2022.
  • Yike Li, Yunzhe Tian, Endong Tong, Wenjia Niu, Yingxiao Xiang, Tong Chen, Yalun Wu, and Jiqiang Liu. Curricular Robust Reinforcement Learning via GAN-Based Perturbation Through Continuously Scheduled Task Sequence. In TSINGHUA Science and Technology (TST), 2022.
  • Yike Li, Yingxiao Xiang, Endong Tong, Wenjia Niu, Bowei Jia, Long Li, Jiqiang Liu, and Zhen Han. An Empirical Study on GAN-Based Traffic Congestion Attack Analysis: A Visualized Method. In Wireless Communications and Mobile Computing (WCMC), 2020.
  • Yunzhe Tian, Yike Li, Kang Chen, Zhenguo Zhang, Endong Tong, Jiqiang Liu, Fangyun Qin, Zheng Zheng, and Wenjia Niu. Towards Label-Efficient Dep Learning-based Aging-rcelated Bug Prediction with Spiking Convolutional Neural Nctworks. Submmited to IEEE Transactions on Emerging Topics in Computing (TETC), 2024 (Under Second Review).
  • Jiayin Song, Yike Li, Yunzhe Tian, Xingyu Wu, Qiong Li, Endong Tong, Wenjia Niu, Zhenguo Zhang, and Jiqiang Liu. Knowledge-Driven Backdoor Removal in Deep Neural Networks via Reinforcement Learning. In the 17th International Conference on Knowledge Science, Engineering and Management (KSEM), 2024.
  • Yunzhe Tian, Yike Li, Kang Chen, Endong Tong, Wenjia Niu, Jiqiang Liu, Fangyun Qin, and Zheng Zheng. Mitigating Overfitting for Deep Learning-based Aging-related Bug Prediction via Brain-inspired Regularization in Spiking Neural Networks. In the 16th International Workshop on Software Aging and Rejuvenation (WoSAR), 2023.
  • Yunzhe Tian, Yike Li, Yingxiao Xiang, Wenjia Niu, Endong Tong, and Jiqiang Liu. Curricular Reinforcement Learning for Robust Policy in Unmanned CarRacing Game. In NDSS 2021, Workshop on Automotive and Autonomous Vehicle Security (AutoSec).
  • 相迎宵,李轶珂,刘吉强,王潇瑾,陈彤,童恩栋,牛温佳,韩臻. 面向降频污染攻击的智能交通拥堵态势量化分析. 软件学报, 2021.
  • Tong Chen, Yingxiao Xiang, Yike Li, Yunzhe Tian, Endong Tong, Wenjia Niu, Jiqiang Liu, Li Gang and Qi Alfred Chen. Protecting Reward Function of Reinforcement Learning via Minimal and Non-catastrophic Adversarial Trajectory. In the 40th International Symposium on Reliable Distributed Systems (SRDS 2021), 2021.
  • Zhiqiang Xie, Yingxiao Xiang, Yike Li, Shuang Zhao, Endong Tong, Wenjia Niu, Jiqiang Liu, Jian Wang. Security Analysis of Poisoning Attacks Against Multi-agent Reinforcement Learning. In the 21st International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP), 2021.
  • Yalun Wu, Minglu Song, Yike Li, Yunzhe Tian, Endong Tong, Wenjia Niu, Bowei Jia, Haixiang Huang, Qiong Li and Jiqiang Liu. Improving Convolutional Neural Network-based Webshell Detection through Reinforcement Learning. In the 23rd International Conference on Information and Communications Security (ICICS 2021), 2021.
  • Xu Gao, Jiqiang Liu, Yike Li, Xiaojin Wang, Yingxiao Xiang, Endong Tong, Wenjia Niu, and Zhen Han. Queue Length Estimation Based Defence Against Data Poisoning Attack for Traffic Signal Control. In the 10th International Conference on Intelligent Information Processing (IIP 2020), 2020.
  • Reviewer of Journal of Information and Knowledge Management (JIKM).
  • Reviewer of Journal of Intelligent & Fuzzy Systems (JIFS).
  • Oral Presentation in CTCIS 2021, Baoding, China.
  • Oral Presentation in IJCAI 2023, Macao, China.
  • Teaching Assistant (Jun. 2021 - Jul. 2021)
    80S504Q: Information Security Professional Practice and Training
  • Teaching Assistant (Feb. 2021 - Apr. 2022)
    M602031B: Situation Awareness of Cyberspace Security
  • Guest Lecturer (Feb. 2023 - Aug. 2023)
    M402055B: Artificial Intelligence Security
    Guest Lecture on Reinforcement Learning Security
  • Research Advising and Mentoring
    Team leader for the RL group, a subgroup within the THETA Lab led by Prof. Wenjia Niu.
  • 2022, 2023, 2024, First-class Doctoral Academic Scholarship of Beijing Jiaotong University
  • 2024, Second Prize of the 34th Huiguang Cup Academic Cultural Festival, Academic Poster Track (Coverage: 34th慧光杯 | 优秀学术海报和创新实践竞赛成果展示 )
  • 2023, Team First Prize of the DataCon Big Data Security Analysis Competition, AI security Track
  • 2023, Fourth Place of IEEE Trojan Removal Competition at ICLR 2023.
  • 2022, Team First Prize of The Vulnerability Mining Competition for Olympic Winter Games Beijing 2022 (Coverage: 计算机学院信安团队参与冬奥卫士演练活动荣获一等奖)
  • 2020, Second Prize of The 17th China Post-Graduate Mathematical Contest in Modeling (Huawei Cup)
  • 2017, Second Prize of The 3th “Internet+” Innovation and Entrepreneurship Competition