LLM-Aided Automatic Modelling for Security Protocol Verification

This repository is the artifact of our ICSE 2025 paper and consists of four main components:

A new benchmark containing 18 protocol cases for evaluation.
A self-contained tool with a web frontend.
Additional comparison experiments.
Input-output log files that help illustrate the tool's workflow.

First, we provide an overview of the directory structure of this repository. Then, we explain how to use the tool, followed by an example demonstrating how user interaction is involved. Finally, we provide a detailed description of the benchmark introduced in this work.

Directory structure

📂 AutoSM
├── 📂 ace-builds
├── 📂 anb
├── 📂 benchmark
├── 📂 examples
├── 📂 src
├── 📂 static
└── 📂 templates

ace-builds: Ace is a code editor written in JavaScript, which is embedded in our frontend.
anb: Comparisons with one correct-by-construction approach.
benchmark: A benchmark that contains 18 protocols' text descriptions and corresponding symbolic models.
examples: The examples used to present the workflow.
src: The source code of our implementation.
static: Static assets, including images and a .css file.
templates: HTML pages of the web-based frontend.

Introduction

This tool automatically generates a formal symbolic model of a protocol from unstructured natural language, using LLMs for semantic parsing. Compared with existing text-to-code tasks, we pay more attention to the trustworthiness of the translation process: the output of the tool should be semantically consistent with the unstructured natural language description. Although a "black-box" LLM is involved, we try to keep as much control as possible over the overall process, or at least provide a non-expert user with some evidence of trustworthiness. The tool is composed of four stages, transitioning from natural language input to a Tamarin model:

Parser, an LLM-powered CCG parser, which takes protocol documents as input and parses them into lambda calculus expressions (defined specifically for modeling security protocols).
Repairer, which repairs broken specifications using static analysis techniques and user interaction to make them well-formed.
Rewriter, which transforms the lambda expressions into a Sapic+ [1] specification.
Compiler, designed and implemented by Cheval et al., which takes the well-formed Sapic+ process as input and compiles it directly into models accepted by the protocol verifiers (Tamarin, DeepSec, and ProVerif).

How to use the tool?

Setup

Install Tamarin-prover

Follow the Tamarin manual.

$ brew install tamarin-prover/tap/tamarin-prover

Make sure the prover is equipped with the Sapic+ platform.

To check the installation of Tamarin, enter the command tamarin-prover --version in the command line. The output should resemble the following:

tamarin-prover 1.8.0, (C) David Basin, Cas Cremers, Jannik Dreier, Simon Meier, Ralf Sasse, Benedikt Schmidt, 2010-2023

This program comes with ABSOLUTELY NO WARRANTY. It is free software, and you
are welcome to redistribute it according to its LICENSE, see
'https://github.com/tamarin-prover/tamarin-prover/blob/master/LICENSE'.

maude tool: 'maude'
checking version: 2.7.1. OK.
checking installation: OK.
Generated from:
Tamarin version 1.8.0
Maude version 2.7.1
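
The same check can be scripted. The following is a minimal sketch, assuming the output format shown above; `parse_versions` and `check_tamarin` are hypothetical helper names for illustration and are not part of this repository.

```python
import re
import shutil
import subprocess


def parse_versions(output: str) -> dict:
    """Extract the Tamarin and Maude versions from `tamarin-prover --version` output."""
    versions = {}
    # First line looks like: "tamarin-prover 1.8.0, (C) ..."
    tamarin = re.search(r"^tamarin-prover\s+(\d+(?:\.\d+)*)", output, re.MULTILINE)
    # Maude line looks like: "checking version: 2.7.1. OK."
    maude = re.search(r"checking version:\s*(\d+(?:\.\d+)*)", output)
    if tamarin:
        versions["tamarin"] = tamarin.group(1)
    if maude:
        versions["maude"] = maude.group(1)
    return versions


def check_tamarin() -> dict:
    """Run `tamarin-prover --version` if the binary is on PATH; return {} otherwise."""
    if shutil.which("tamarin-prover") is None:
        return {}
    result = subprocess.run(
        ["tamarin-prover", "--version"], capture_output=True, text=True, check=False
    )
    # Combine both streams, since we do not assume which one the tool writes to.
    return parse_versions(result.stdout + result.stderr)


# Demonstration on an excerpt of the sample output shown above.
sample = """tamarin-prover 1.8.0, (C) David Basin, Cas Cremers, Jannik Dreier, Simon Meier, Ralf Sasse, Benedikt Schmidt, 2010-2023
maude tool: 'maude'
checking version: 2.7.1. OK.
checking installation: OK.
"""
print(parse_versions(sample))
```

Calling `check_tamarin()` on a machine without Tamarin on the PATH returns an empty dictionary, which makes a missing installation easy to detect before starting the frontend.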

Before using Tamarin, set the terminal encoding to UTF-8:
```
$ export LANG=en_US.UTF-8        
```

Set up the conda environment and install the required packages.

$ conda create -n llm4V python=3.10
$ conda activate llm4V
$ pip install -r requirements.txt

Configuration

Configure the OpenAI API key in the file src/conf/config.json:

{
  "API_URL_BASE": "YOUR API URL BASE",
  "openai_api_key": "YOUR OPENAI KEY"
}
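
Before starting the frontend, it can help to confirm that the configuration file parses and that the placeholders were replaced. The following is a minimal sketch, not the tool's own loading code: `load_config` is a hypothetical helper, and only the two key names are taken from the configuration shown here. Note that Python's `json` module rejects a trailing comma after the last entry.

```python
import json
import tempfile
from pathlib import Path

# Key names taken from src/conf/config.json as documented in this README.
REQUIRED_KEYS = ("API_URL_BASE", "openai_api_key")


def load_config(path):
    """Load the JSON config and fail early on missing or placeholder values."""
    config = json.loads(Path(path).read_text(encoding="utf-8"))
    for key in REQUIRED_KEYS:
        value = str(config.get(key, ""))
        # The README placeholders start with "YOUR "; treat them as unset.
        if not value or value.startswith("YOUR "):
            raise ValueError(f"{key} is not set in {path}")
    return config


# Demonstration with a temporary file and made-up values.
with tempfile.TemporaryDirectory() as tmp:
    demo = Path(tmp) / "config.json"
    demo.write_text(
        '{"API_URL_BASE": "https://example.invalid/v1", "openai_api_key": "sk-demo"}',
        encoding="utf-8",
    )
    print(load_config(demo)["API_URL_BASE"])
```

In practice the function would be pointed at `src/conf/config.json`; a `ValueError` (or a `json.JSONDecodeError`, which is a subclass of it) then signals a configuration problem.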

Run the tool's frontend

$ cd src
$ python -m flask --app frontend run

(Optional) Add the --debug flag to enable debugging mode.
Then open the web-based tool at http://127.0.0.1:5000.

User example

Here we take the Denning-Sacco protocol as an example to show how a user can use the tool to generate a symbolic model semi-automatically.

Load the protocol document.
Use the LLM as a parser to read the document, generating "reading logs".
Check the parsing result with hints, which include the message sequence chart (MSC) of the protocol shown in the left panel.
Rewrite the result into the Sapic+ specification.

How we construct the Benchmark?

Here, we provide more details about the benchmark introduced in this work. The benchmark consists of a set of triples ($N$, $\mathcal{P}$, $\mathcal{R}$), where $N$ is the protocol description, $\mathcal{P}$ is the Sapic+ model, and $\mathcal{R}$ is the Tamarin model. We begin with $\mathcal{R}$, which is available in the Tamarin GitHub repository. To derive the corresponding $N$, we read the related documentation, such as RFCs, and extract and reformulate the core parts relevant to the model. As a result, some of the $N$ in our benchmark may not be identical to the text of the original source. For each model $\mathcal{R}$, a set of safety properties $\Phi$ has been verified on $\mathcal{R}$. We manually build the Sapic+ model $\mathcal{P}$, ensuring that every property $\varphi \in \Phi$ that has been verified on $\mathcal{R}$ still holds on $\mathcal{P}$.
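
The preservation requirement on each triple can be expressed as a simple set check. The following is an illustrative sketch only: the `BenchmarkCase` class, its field names, the file paths, and the property names are hypothetical and do not reflect the layout of the benchmark directory.

```python
from dataclasses import dataclass, field


@dataclass
class BenchmarkCase:
    """One benchmark triple (N, P, R) plus the properties checked on each model."""
    name: str
    description: str    # N: natural language protocol description
    sapic_model: str    # P: manually built Sapic+ model
    tamarin_model: str  # R: existing Tamarin model
    verified_on_tamarin: set = field(default_factory=set)  # Phi, verified on R
    verified_on_sapic: set = field(default_factory=set)    # properties verified on P


def unpreserved_properties(case: BenchmarkCase) -> set:
    """Return the properties in Phi that have not (yet) been verified on P."""
    return case.verified_on_tamarin - case.verified_on_sapic


# Hypothetical entry; paths and property names are placeholders.
case = BenchmarkCase(
    name="NSPK",
    description="nspk.txt",
    sapic_model="nspk.sapic",
    tamarin_model="nspk.spthy",
    verified_on_tamarin={"secrecy", "agreement"},
    verified_on_sapic={"secrecy"},
)
print(unpreserved_properties(case))
```

A case satisfies the construction rule described above exactly when `unpreserved_properties` returns an empty set.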

We use a table to document the sources of our protocol descriptions and how we reformulated the original content.

| No. | Protocol | Source | Title | Authors | Note |
| --- | --- | --- | --- | --- | --- |
| 1 | Otway-Rees | GIAC paper | Otway-Rees Key Exchange Protocol Specification | | We have rearranged the content order on page 6 of the original paper. Specifically, we have moved Table 2, which illustrates the messages in each step, to the description section. |
| 2 | NSSK | Wikipedia | Needham-Schroeder Symmetric protocol | | |
| 3 | NSPK | Teaching Assignment | Description of the Needham Schroeder public key protocol and its attack | Véronique Cortier | |
| 4 | KEMTLS | CCS 2020 | Post-Quantum TLS Without Handshake Signatures (Full version, March 15, 2022) | Peter Schwabe, Douglas Stebila, Thom Wiggers | The protocol description is excerpted from Section 3, page 5 of the original paper. |
| 5 | Example | Tamarin-manual | | | |
| 6 | SSH | RFC 4253 | The Secure Shell (SSH) Transport Layer Protocol | T. Ylonen, C. Lonvick, Ed. | We only select Section 8 and Section 7.2 as input, excluding other parts. These sections are reordered to reflect the actual order of protocol execution; that is, first performing the key exchange (Section 8), followed by retrieving the key from shared keys (Section 7.2). |
| 7 | EDHOC | draft-ietf-lake-edhoc-02 | A lightweight DH based key exchange | G. Selander, J. Mattsson, F. Palombini, Ericsson AB | |
| 8 | NAXOS | Tamarin-manual | | | |
| 9 | Toy | Teaching material of Tamarin-prover | Tamarin Toy Protocol | Benjamin Kiesl | |
| 10 | Yahalom | Wikipedia | Yahalom Protocol | | |
| 11 | Kao Chow Authentication v.1 | Operating Systems Review | An Efficient and Secure Authentication Protocol Using Uncertified Keys | I-Lung Kao and Randy Chow | The protocol description is excerpted from Section 2, page 2 of the original paper. We do not modify any contents given by the paper, but we moved the message format given in the Alice&bob notation next to the corresponding natural language description. |
| 12 | Sigfox | ACSAC 2017 | Automated Analysis of Secure Internet of Things Protocols | Jun Young Kim, Ralph Holz, Wen Hu | The text is excerpted from Section 3.1, page 5 in the original paper. The original description contains some explanation about the `fact` and MSRs in Tamarin, which are excluded here. |
| 13 | Neuman-Stubblebine | Wikipedia | Neuman-Stubblebine protocol | | |
| 14 | Woo-Lam | International Workshop on Security In Information Systems | Analysing the Woo-Lam Protocol Using CSP and Rank Functions | Siraj Shaikh and Vicky Bush | |
| 15 | SPLICE/AS | IEICE Transactions on Information and Systems | Design and Implementation of an Authentication System in WIDE Internet Environment | Suguru Yamaguchi, Kiyohiko Okayama, Hideo Miyahara | The protocol description is excerpted from Section 3.1, "Requesting a Server," which corresponds to Figure 3. The figure illustrates the message format exchanged between the parties, and this information has been integrated into the corresponding descriptive sentence. |
| 16 | CCITT X.509-1 | The Directory-Authentication Framework | CCITT X.509 (11/1988) and Security Defects in CCITT Recommendation X.509 | ITU | We focus on One-way authentication, Section 9.2. The original protocol description is located on page 18. |
| 17 | Denning-Sacco | International Conference on Rewriting Techniques and Applications (RTA'11) | FAST: An Efficient Decision Procedure for Deduction and Static Equivalence | Bruno Conchinha, David Basin, Carlos Caleiro | This protocol originates from [2]. But the protocol description is excerpted from [3], Section 4.3. Besides, we find an error in the Alice and Bob notation in [2], that has been corrected here. |
| 18 | LAK06 | SCIS06 | RFID Mutual Authentication Scheme based on Synchronized Secret Information | Sangshin Lee, Tomoyuki Asano, and Kwangjo Kim | The raw protocol description, excerpted from the original paper, contains a key chaining mechanism that requires looping to characterize. In our abstraction, we merged the 'Reader' entity with its backend. |

User tutorial

This section gives an overview of the general workflow of the tool. We use an NSPK example to illustrate how a user can interact with the tool and how the tool generates formal specifications and checks the results automatically.

References

Cheval, Vincent, Charlie Jacomme, Steve Kremer, and Robert Künnemann. 2022. "SAPIC+: Protocol Verifiers of the World, Unite!" In 31st USENIX Security Symposium (USENIX Security 22), 3935–52.
Wikipedia. "Needham-Schroeder protocol." https://en.wikipedia.org/wiki/Needham%E2%80%93Schroeder_protocol
Global Information Assurance Certification Paper.
Schwabe, Peter, Douglas Stebila, and Thom Wiggers. "Post-quantum TLS without handshake signatures." Proceedings of the 2020 ACM SIGSAC Conference on Computer and Communications Security, 2020.
Tamarin Prover Manual. https://tamarin-prover.com/manual/
Ylonen, Tatu. "RFC 4253: The secure shell (SSH) transport layer protocol." (2006).
Göran Selander, John Preuß Mattsson, and Francesca Palombini. "Ephemeral Diffie-Hellman Over COSE (EDHOC)." Internet-Draft draft-ietf-lake-edhoc-02, Internet Engineering Task Force, May 6, 2021.
Benjamin Kiesl. "Tamarin Toy Protocol." https://github.com/benjaminkiesl/tamarin_toy_protocol
Wikipedia. "Yahalom (protocol)" https://en.wikipedia.org/wiki/Yahalom_(protocol)
Kao, I-Lung, and Randy Chow. "An efficient and secure authentication protocol using uncertified keys." ACM SIGOPS Operating Systems Review 29.3 (1995): 14-21.
Kim, Jun Young, et al. "Automated analysis of secure internet of things protocols." Proceedings of the 33rd Annual Computer Security Applications Conference, 2017.
Wikipedia. "Neuman-Stubblebine protocol" https://en.wikipedia.org/wiki/Neuman%E2%80%93Stubblebine_protocol
Shaikh, Siraj, and Vicky Bush. "Analysing the Woo-Lam protocol using CSP and rank functions." Journal of Research and Practice in Information Technology 38.1 (2006): 19-29.
CCITT Recommendation X.509, The Directory - Authentication Framework, CCITT, December 1988.
I'Anson, Collin, and Chris Mitchell. "Security defects in CCITT recommendation X. 509: the directory authentication framework." ACM SIGCOMM Computer Communication Review 20.2 (1990): 30-34.
Conchinha, Bruno, David A. Basin, and Carlos Caleiro. "FAST: an efficient decision procedure for deduction and static equivalence." 22nd International Conference on Rewriting Techniques and Applications (RTA'11), Schloss-Dagstuhl-Leibniz Zentrum für Informatik, 2011.
Denning, Dorothy E., and Giovanni Maria Sacco. "Timestamps in key distribution protocols." Communications of the ACM 24.8 (1981): 533-536.
Sangshin Lee, Tomoyuki Asano, and Kwangjo Kim. "RFID mutual authentication scheme based on synchronized secret information." SCIS 2006. Institute of Electronics, Information and Communication Engineers, 2006.
