Purple llama cyberseceval: A secure coding benchmark for language models M Bhatt, S Chennabasappa, C Nikolaidis, S Wan, I Evtimov, D Gabi, ... arXiv preprint arXiv:2312.04724, 2023 | 17 | 2023 |
CyberSecEval 2: A Wide-Ranging Cybersecurity Evaluation Suite for Large Language Models M Bhatt, S Chennabasappa, Y Li, C Nikolaidis, D Song, S Wan, F Ahmad, ... arXiv preprint arXiv:2404.13161, 2024 | | 2024 |