Publications

publications by categories in reversed chronological order.

2025

  1. iclr2025_preview.png
    BadJudge: Backdoor Vulnerabilities of LLM-As-A-Judge
    Terry Tong, Fei Wang, Zhe Zhao, and 1 more author
    In The Thirteenth International Conference on Learning Representations. More Information can be found here , 2025
  2. Unraveling Indirect In-Context Learning Using Influence Functions
    Hadi Askari, Shivanshu Gupta, Terry Tong, and 3 more authors
    2025

2024

  1. emnlp.png
    Securing Multi-turn Conversational Language Models From Distributed Backdoor Attacks
    Terry Tong, Qin Liu, Jiashu Xu, and 1 more author
    In Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024
  2. allerton.png
    Mitigating Backdoor Threats to Large Language Models: Advancement and Challenges
    Qin Liu, Wenjie Mo, Terry Tong, and 4 more authors
    In 2024 60th Annual Allerton Conference on Communication, Control, and Computing, Nov 2024