conference paper
"Prompter Says": A Linguistic Approach to Understanding and Detecting Jailbreak Attacks Against Large-Language Models
Proceedings of the 1st ACM Workshop on Large AI Systems and Models with Privacy and Safety Analysis
Publication Date
November 19, 2023
Author(s)
Dylan Lee, Shaoyuan Xie, Shagoto Rahman, Kenneth Pat, David Lee, Qi Alfred Chen