The complete publication list can be found at Google Scholar.

  • When GPT Spills the Tea: Comprehensive Assessment of Knowledge File Leakage in GPTs
    Xinyue Shen, Yun Shen, Michael Backes, Yang Zhang; conf
    pdf arXiv

  • Are We in the AI-Generated Text World Already? Quantifying and Monitoring AIGT on Social Media
    Zhen Sun, Zongmin Zhang, Xinyue Shen, Ziyi Zhang, Yule Liu, Michael Backes, Yang Zhang, Xinlei He; conf
    pdf arXiv Dataset Code

  • JailbreakRadar: Comprehensive Assessment of Jailbreak Attacks Against LLMs
    Junjie Chu, Yugeng Liu, Ziqing Yang, Xinyue Shen, Michael Backes, Yang Zhang; conf
    pdf arXiv Website online Code

  • GPTracker: A Large-Scale Measurement of Misused GPTs
    Xinyue Shen, Yun Shen, Michael Backes, Yang Zhang; arXiv
    pdf Dataset Code

  • On the Effectiveness of Prompt Stealing Attacks on In-The-Wild Prompts
    Yicong Tan, Xinyue Shen, Yun Shen, Michael Backes, Yang Zhang; arXiv
    pdf Code

  • HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns
    Xinyue Shen, Yixin Wu, Yiting Qu, Michael Backes, Savvas Zannettou, Yang Zhang; arXiv
    pdf arXiv Website online Dataset Hugging Face Code ArtifactAppendix
    ๐Ÿ“ฆ Artifact Badges: Available, Functional, Results Reproduced

  • From Meme to Threat: On the Hateful Meme Understanding and Induced Hateful Content Generation in Open-Source Vision Language Models
    Yihan Ma, Xinyue Shen, Yiting Qu, Ning Yu, Michael Backes, Savvas Zannettou, Yang Zhang; arXiv
    pdf Dataset Hugging Face Code

  • โ€œDo Anything Nowโ€: Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models
    Xinyue Shen, Zeyuan Chen, Michael Backes, Yun Shen, Yang Zhang; arXiv
    pdf arXiv Website online Dataset Hugging Face Code GitHub Repo stars
    ๐Ÿ† Listed in Award
    ๐ŸŽ™๏ธ Coverage: New Scientist German Federal Office for Information Security NIST Deutschlandfunk Nova Spektrum.de

  • MGTBench: Benchmarking Machine-Generated Text Detection
    Xinlei He, Xinyue Shen, Zeyuan Chen, Michael Backes, Yang Zhang; arXiv
    pdf arXiv Dataset Hugging Face Code GitHub Repo stars
    ๐Ÿ† Listed in Award

  • Prompt Stealing Attacks Against Text-to-Image Generation Models
    Xinyue Shen, Yiting Qu, Michael Backes, Yang Zhang; arXiv
    pdf arXiv Slides Video Dataset Hugging Face Code
    ๐Ÿ† Recognized in Award
    ๐ŸŽ™๏ธ Coverage: German Federal Office for Information Security NIST CISPA News

  • The Death and Life of Great Prompts: Analyzing the Evolution of LLM Prompts from the Structural Perspective
    Yihan Ma, Xinyue Shen, Yixin Wu, Boyang Zhang, Michael Backes, Yang Zhang; arXiv
    pdf

  • ModScan: Measuring Stereotypical Bias in Large Vision-Language Models from Vision and Language Modalities
    Yukun Jiang, Zheng Li, Xinyue Shen, Yugeng Liu, Michael Backes, Yang Zhang; arXiv
    pdf arXiv

  • Games and Beyond: Analyzing the Bullet Chats of Esports Livestreaming
    Yukun Jiang, Xinyue Shen, Rui Wen, Zeyang Sha, Junjie Chu, Yugeng Liu, Michael Backes, Yang Zhang; arXiv
    pdf arXiv Poster

  • Unsafe Diffusion: On the Generation of Unsafe Images and Hateful Memes From Text-To-Image Models
    Yiting Qu, Xinyue Shen, Xinlei He, Michael Backes, Savvas Zannettou, Yang Zhang; arXiv
    pdf arXiv Code
    ๐ŸŽ™๏ธ Coverage: Montreal AI Ethics Institute German Federal Office for Information Security

  • On Xing Tian and the Perseverance of Anti-China Sentiment Online
    Xinyue Shen, Xinlei He, Michael Backes, Jeremy Blackburn, Savvas Zannettou, Yang Zhang; arXiv
    pdf arXiv slides

Before PhD:

  • Evil Under the Sun: Understanding and Discovering Attacks on Ethereum Decentralized Applications
    Liya Su, Xinyue Shen (co-first author), Xiangyu Du, Xiaojing Liao, XiaoFeng Wang, Luyi Xing, Baoxu Liu; arXiv
    pdf slides