Papers

  • “Do Anything Now”: Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models
    Xinyue Shen, Zeyuan Chen, Michael Backes, Yun Shen, Yang Zhang; CCS 2024
    [arxiv] [website] [code] Media Coverage: New Scientist, Deutschlandfunk Nova

  • Prompt Stealing Attacks Against Text-to-Image Generation Models
    Xinyue Shen, Yiting Qu, Michael Backes, Yang Zhang; USENIX Security 2024
    [arxiv]

  • Games and Beyond: Analyzing the Bullet Chats of Esports Livestreaming
    Yukun Jiang, Xinyue Shen, Rui Wen, Zeyang Sha, Junjie Chu, Yugeng Liu, Michael Backes, Yang Zhang; ICWSM 2024
    [arxiv]

  • Unsafe Diffusion: On the Generation of Unsafe Images and Hateful Memes From Text-To-Image Models
    Yiting Qu, Xinyue Shen, Xinlei He, Michael Backes, Savvas Zannettou, Yang Zhang; CCS 2023
    [arxiv] [code] Media Coverage: Montreal AI Ethics Institute

  • On Xing Tian and the Perseverance of Anti-China Sentiment Online
    Xinyue Shen, Xinlei He, Michael Backes, Jeremy Blackburn, Savvas Zannettou, Yang Zhang; ICWSM 2022
    [pdf] [arxiv] [slides]

  • Evil Under the Sun: Understanding and Discovering Attacks on Ethereum Decentralized Applications
    Liya Su, Xinyue Shen (co-first author), Xiangyu Du, Xiaojing Liao, XiaoFeng Wang, Luyi Xing, Baoxu Liu; USENIX Security 2021
    [pdf] [slides]

  • In ChatGPT We Trust? Measuring and Characterizing the Reliability of ChatGPT
    Xinyue Shen, Zeyuan Chen, Michael Backes, Yang Zhang
    [arxiv]

  • MGTBench: Benchmarking Machine-Generated Text Detection
    Xinlei He, Xinyue Shen, Zeyuan Chen, Michael Backes, Yang Zhang
    [arxiv] [code]

  • Backdoor Attacks in the Supply Chain of Masked Image Modeling
    Xinyue Shen, Xinlei He, Zheng Li, Yun Shen, Michael Backes, Yang Zhang
    [arxiv]

  • Comprehensive Assessment of Toxicity in ChatGPT
    Boyang Zhang, Xinyue Shen, Wai Man Si, Zeyang Sha, Zeyuan Chen, Ahmed Salem, Yun Shen, Michael Backes, Yang Zhang
    [arxiv]

  • Comprehensive Assessment of Jailbreak Attacks Against LLMs
    Junjie Chu, Yugeng Liu, Ziqing Yang, Xinyue Shen, Michael Backes, Yang Zhang
    [arxiv]

Talk

  • Solving The Last Mile Problem Between Machine Learning and Security Operations
    Xiangyu Liu, Xinyue Shen
    Hack In The Box Conference (HITBConf 2018) [pdf] [link]

Pop-science & Novels

Also, as a science fiction writer, I am grateful for the opportunity to meet you through my stories (Full List).

  • Back Home. Exploration Discovery, 2024.01-02.
  • Hacking Storm. Exploration Discovery, 2023.01-02.
  • Is Artificial Intelligence a “Tower”? Science Fiction World (Youth), 2022.09.
  • Empty Yellow Crane Tower Here. The EELISA Science Fiction Contest, Chinese-Language Category Winner, 2022.02.
  • Lady White Bone. The Ninth “Light-Year” Award, First Prize, 2021.01.
  • Hack! A Seven-Day Invasion Diary of Trojan Horse. “Pop-science and Sci-fiction Youth Star” Award from CSWA, 2020.11.
  • Stars on the Wrist. Science Fiction Cube, 2020.07.
  • A War without Smoke: the Evolution History of Hacker Empire. Science Fiction World, 2019.06.