Black-Box Adversarial Attacks on LLM-Based Code Completion

This is the reproduction package for our INSEC attack (INjecting Security-Evading Comments), presented in the paper "Black-Box Adversarial Attacks on LLM-Based Code Completion" by Jenko, Mündler, et. al., ICML 2025. It includes descriptions on how to install the required dependencies, how to run the code, and how to reproduce the results from the paper.

Installation

We provide extensive installation instructions in the INSTALL.md file.

Running the code

Below is an example of how to get the attack strings on StarCoder 3B.

cd scripts
python3 generic_launch.py --config fig3_main/main_scb3/config.json --save_dir ../results/example

The naming convention is <save-dir>/<listparam>/<timestamp>/<elem>, where

save_dir is the save-dir parameter passed to generic_launch.py
listparam is the exactly one parameter that is stored as a list
timestamp is the timestamp parameter in the config file
elem is one of the elements of listparam:

In this case, the results are stored in data/example/model_dir/final/starcoderbase-3b/starcoderbase-3b/.

Reproducing Figures

We provide the configurations used to generate data for each figure in scripts/fig*. They can be run as described above.

Dataset

Note: You can find the vulnerability dataset on Hugging Face

You can find the training, validation and test sets for the vulnerability dataset in the folders data_train_val and data_test respectively. Each directory contains subdirectories for the respective CWEs. The CWE directories contain JSONL lists of objects (train.jsonl, val.jsonl, and test.jsonl) with the following attributes:

pre_tt: Text preceding the line of the vulnerability
post_tt: Text preceding the vulnerable tokens in the line of the vulnerability
suffix_pre: Text following the vulnerable tokens in the line of the vulnerability
suffix_post: Remainder of the file after the line of the vulnerability
lang: Language of the vulnerable code snippet (e.g., py or cpp)
key: Key character sequences that were used to substitute CodeQL queries during training. Only in the train split.
info: A metadata object, containing the CodeQL query to check the snippet for vulnerabilities and the source of the code snippet.

In particular, the prefix for model infilling is pre_tt + post_tt, whereas the suffix is suffix_pre + suffix_post.

For the functionality datasets, please find the corresponding data in the subfolders of multipl-e, including the functionality dataset for the main evaluation based on Multipl-E, multiple_fim, our confirmation dataset based on HumanEval-X, humaneval-x_fim, and our repository-level completion dataset based on RepoBench, repobench_fim.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
bigcode-evaluation-harness		bigcode-evaluation-harness
data_test		data_test
data_train_val		data_train_val
human-eval-infilling		human-eval-infilling
insec		insec
multipl-e		multipl-e
results		results
scripts		scripts
.gitignore		.gitignore
INSTALL.md		INSTALL.md
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Black-Box Adversarial Attacks on LLM-Based Code Completion

Installation

Running the code

Reproducing Figures

Dataset

About

Uh oh!

Languages

eth-sri/insec

Folders and files

Latest commit

History

Repository files navigation

Black-Box Adversarial Attacks on LLM-Based Code Completion

Installation

Running the code

Reproducing Figures

Dataset

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Languages