Add concurrency checking based on GenMC - serial queue by KurtWu10 · Pull Request #561 · au-ts/sddf

KurtWu10 · 2025-11-17T06:50:01Z

This PR is a variant of PR #523 ("Add concurrency checking based on GenMC") that targets serial queues. It should also fix issue #528 ("serial example failures on Rasberry Pi 4B").

The following invalid assertion

Lines 159 to 172 in 63ac0fd

    
    /**
     * Update the value of the tail in the shared data structure to make
     * locally enqueued data visible.
     *
     * @param queue_handle queue to update.
     * @param local_tail tail which points to the last character enqueued.
     */
    static inline void serial_update_shared_tail(serial_queue_handle_t *queue_handle, uint32_t local_tail)
    {
        uint32_t current_length = serial_queue_length(queue_handle);
        uint32_t new_length = local_tail - queue_handle->queue->head;

        /* Ensure updates to tail don't overwrite existing data */
        assert(new_length >= current_length);

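in L172 has also been removed. It is invalid because the consumer can consume multiple queue entries and update the head between L168 and L169, so new_length may be computed from a newer head than current_length and be smaller even though no data is overwritten.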

A similar assertion in serial_update_shared_head() has also been removed.
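To make the failing interleaving concrete, here is a minimal single-threaded replay of it. It is only a sketch: `struct demo_queue`, `removed_assertion_holds()` and the index values are hypothetical and are not taken from the sDDF sources. The consumer's head update is simulated as a plain step between the producer's two reads.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for the shared queue indices; not the sDDF struct. */
struct demo_queue {
    uint32_t head; /* written only by the consumer */
    uint32_t tail; /* written only by the producer */
};

/*
 * Replays the interleaving step by step and reports whether the removed
 * check (new_length >= current_length) would have held.
 * `consumed` is the number of entries the consumer dequeues between the
 * producer's two reads of the head.
 */
static bool removed_assertion_holds(uint32_t consumed)
{
    struct demo_queue q = { .head = 0, .tail = 5 };
    uint32_t local_tail = 6; /* producer has enqueued one more character locally */

    /* Producer, first read (L168 in the snippet): length from the current head. */
    uint32_t current_length = q.tail - q.head;

    /* Consumer runs in between and advances the head. */
    q.head += consumed;

    /* Producer, second read (L169 in the snippet): length from the newer head. */
    uint32_t new_length = local_tail - q.head;

    return new_length >= current_length;
}
```

With these example values the check still holds if the consumer dequeues zero or one entry in between, but it fails as soon as two or more entries are consumed, although the producer's tail update is perfectly valid.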

Queue operations with the _length suffix always use relaxed atomic operations, which prevents data races in all situations. These operations do not provide any synchronisation. In contrast, operations with the _empty / _full / _free suffixes use acquire atomic operations where appropriate for synchronisation.
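The distinction can be sketched with C11 atomics as follows. This is an illustration under assumptions, not the code in this PR: `struct demo_queue`, `DEMO_CAPACITY` and the `demo_queue_*` functions are hypothetical names, and the real sDDF functions take a queue handle rather than explicit local indices.

```c
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

#define DEMO_CAPACITY 8u /* hypothetical capacity */

/* Hypothetical shared queue layout; not the sDDF struct. */
struct demo_queue {
    _Atomic uint32_t head; /* written only by the consumer */
    _Atomic uint32_t tail; /* written only by the producer */
    char data[DEMO_CAPACITY];
};

/*
 * "_length"-style query: both loads are relaxed. This is free of data
 * races for any caller, but it orders nothing, so the result is only a
 * snapshot and must not be used to justify touching the data array.
 */
static inline uint32_t demo_queue_length(struct demo_queue *q)
{
    uint32_t tail = atomic_load_explicit(&q->tail, memory_order_relaxed);
    uint32_t head = atomic_load_explicit(&q->head, memory_order_relaxed);
    return tail - head;
}

/*
 * "_empty"-style query, consumer side: the acquire load of the tail pairs
 * with the producer's release store, so data written before that store is
 * visible once this returns false.
 */
static inline bool demo_queue_empty(struct demo_queue *q, uint32_t local_head)
{
    return atomic_load_explicit(&q->tail, memory_order_acquire) == local_head;
}

/*
 * "_full"-style query, producer side: the acquire load of the head pairs
 * with the consumer's release store, so freed slots are safe to reuse.
 */
static inline bool demo_queue_full(struct demo_queue *q, uint32_t local_tail)
{
    uint32_t head = atomic_load_explicit(&q->head, memory_order_acquire);
    return local_tail - head == DEMO_CAPACITY;
}
```

In this sketch a caller that only wants a statistic uses the length query, while a caller that is about to read or write queue entries checks empty or full first, because only those checks establish the ordering with the other side.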

Limitations

On the memory model side, compared with PR Add concurrency checking based on GenMC #523, this PR uses additional relaxed atomic operations that may be a concern for verifiers like GenMC.
The test does not cover all serial queue APIs.
There are some redundant memory operations that will not be removed by the compiler's dead-code elimination optimisation, e.g. in the updated serial_update_shared_tail().

Courtney3141

I'm going to make some additional modifications to the serial queue, but otherwise this is a great PR!

Courtney3141 · 2025-11-20T06:28:05Z

Also, I will leave it to you to add the additional comments to all the queue functions:

"This function is only to be called by the CONSUMER/PRODUCER of the queue."

(I'll leave the phrasing up to you)

Courtney3141 · 2025-11-20T07:37:09Z

Never mind, I did this in my last commit. Please just check that my comments are in fact correct!

midnightveil · 2025-11-21T02:01:52Z

+ */
+static inline uint32_t load_acquire_32(const uint32_t *ptr)
+{
+#ifdef CONFIG_ENABLE_SMP_SUPPORT


Is it really worth having this?

I'd thought we'd discussed this with Gernot and decided the queues should just always be correct for cross-core since it's misleading if not.

(and when I tested it made no performance impact).

I remember the discussion on GitHub as well. I'll run additional benchmarks on this. The current implementation is based on my recollection of recent benchmarks, in which it increased utilisation by 5-10%.

In addition, I will also benchmark whether the dmb ishst barrier is beneficial to the store-release function, which cannot be represented by the C standard library.
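For readers following the thread, this is roughly what a configuration-dependent pair of helpers could look like. It is a sketch under assumptions: only the `load_acquire_32` signature and the `CONFIG_ENABLE_SMP_SUPPORT` guard appear in the diff above, so the bodies, the `demo_` names and the `DEMO_USE_DMB_ISHST` switch are hypothetical. The `__atomic_*` builtins are the GCC/Clang ones.

```c
#include <stdint.h>

/* Hypothetical helper modelled on the signature quoted in the diff. */
static inline uint32_t demo_load_acquire_32(const uint32_t *ptr)
{
#ifdef CONFIG_ENABLE_SMP_SUPPORT
    /* Cross-core case: a real acquire load. */
    return __atomic_load_n(ptr, __ATOMIC_ACQUIRE);
#else
    /* Assumed unicore case: relaxed load plus a compiler-only barrier. */
    uint32_t value = __atomic_load_n(ptr, __ATOMIC_RELAXED);
    __atomic_signal_fence(__ATOMIC_ACQUIRE);
    return value;
#endif
}

/* Hypothetical store-side counterpart; the name is not from the PR. */
static inline void demo_store_release_32(uint32_t *ptr, uint32_t value)
{
#if defined(__aarch64__) && defined(DEMO_USE_DMB_ISHST)
    /*
     * Store-store barrier followed by a relaxed store. This orders earlier
     * stores only, which is weaker than a C11 release store and has no
     * direct equivalent in <stdatomic.h>.
     */
    __asm__ volatile("dmb ishst" ::: "memory");
    __atomic_store_n(ptr, value, __ATOMIC_RELAXED);
#else
    __atomic_store_n(ptr, value, __ATOMIC_RELEASE);
#endif
}
```

Whether the weaker barrier is sufficient depends on the queue only needing earlier stores, not earlier loads, to be ordered before the index update, which is exactly the kind of question the benchmarks and GenMC checks in this PR are meant to inform.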

I am very surprised by you finding it had no performance impact, as when I tested it a long time ago I remember it being very significant. Although I can't seem to find those results for the life of me...

I finally found the graph!

I suspect it probably depends on what system we test on, as they will have different costs? But yeah, IDK. All my tests were run with it forced to 1 as per the docs I had in the description.

Yes, it is system dependent. There is about a 5-10% system-wide relative overhead on maaxboard (cortex-a53), with higher cycles per packet as well. There is no overhead on odroid c4 (cortex-a55, under investigation). These are unicore UDP benchmarks comparing different implementations of network queues.

Courtney3141 · 2025-11-21T04:30:10Z

I'm thinking it may be worth adding an introductory comment at the top of the serial queue file saying something along the lines of:

The serial queue, like all sDDF queues, is an implementation of a single-producer, single-consumer FIFO queue. The key assumption is that only the producer is permitted to modify the tail, and only the consumer is permitted to modify the head. Both components are permitted to read both indices. The library's atomic operations are written to ensure correctness under these assumptions, so each function's description contains an explicit note on its assumed caller.
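The ownership rule described above can be illustrated with a small single-producer, single-consumer queue. This is a generic sketch, not the sDDF implementation: `struct spsc_queue`, `SPSC_CAPACITY`, `spsc_enqueue()` and `spsc_dequeue()` are hypothetical names chosen for the example.

```c
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

#define SPSC_CAPACITY 4u /* hypothetical; a power of two so index wrap-around stays consistent */

struct spsc_queue {
    _Atomic uint32_t head; /* modified only by the consumer */
    _Atomic uint32_t tail; /* modified only by the producer */
    char data[SPSC_CAPACITY];
};

/* Only to be called by the PRODUCER. Returns false if the queue is full. */
static inline bool spsc_enqueue(struct spsc_queue *q, char c)
{
    /* The producer owns the tail, so a relaxed read of it is enough. */
    uint32_t tail = atomic_load_explicit(&q->tail, memory_order_relaxed);
    /* Acquire the head so slots freed by the consumer are safe to reuse. */
    uint32_t head = atomic_load_explicit(&q->head, memory_order_acquire);
    if (tail - head == SPSC_CAPACITY) {
        return false;
    }
    q->data[tail % SPSC_CAPACITY] = c;
    /* Release publishes the data write together with the new tail. */
    atomic_store_explicit(&q->tail, tail + 1, memory_order_release);
    return true;
}

/* Only to be called by the CONSUMER. Returns false if the queue is empty. */
static inline bool spsc_dequeue(struct spsc_queue *q, char *out)
{
    /* The consumer owns the head, so a relaxed read of it is enough. */
    uint32_t head = atomic_load_explicit(&q->head, memory_order_relaxed);
    /* Acquire the tail so the producer's data write is visible. */
    uint32_t tail = atomic_load_explicit(&q->tail, memory_order_acquire);
    if (tail == head) {
        return false;
    }
    *out = q->data[head % SPSC_CAPACITY];
    /* Release hands the slot back to the producer. */
    atomic_store_explicit(&q->head, head + 1, memory_order_release);
    return true;
}
```

Because each index has exactly one writer, neither function needs a compare-and-swap or a lock; calling either function from the wrong side breaks that assumption, which is why a per-function note on the assumed caller is useful.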

KurtWu10 force-pushed the genmc-serial branch from 296e51d to 8a231aa Compare November 17, 2025 06:51

KurtWu10 marked this pull request as ready for review November 17, 2025 06:58

Ivan-Velickovic added the hardware-test Run the hardware tests on this PR. label Nov 17, 2025

KurtWu10 force-pushed the genmc-serial branch from 8a231aa to e09fd1a Compare November 17, 2025 07:11

KurtWu10 removed the hardware-test Run the hardware tests on this PR. label Nov 17, 2025

KurtWu10 force-pushed the genmc-serial branch from 3ca0bc9 to aadc40e Compare November 17, 2025 09:37

KurtWu10 added the hardware-test Run the hardware tests on this PR. label Nov 17, 2025

KurtWu10 force-pushed the genmc-serial branch from aadc40e to d5e1b60 Compare November 17, 2025 09:44

KurtWu10 removed the hardware-test Run the hardware tests on this PR. label Nov 17, 2025

KurtWu10 force-pushed the genmc-serial branch from 6c0e8dd to b1ff1df Compare November 18, 2025 00:27

KurtWu10 added the hardware-test Run the hardware tests on this PR. label Nov 18, 2025

KurtWu10 force-pushed the genmc-serial branch from b1ff1df to 9249fce Compare November 18, 2025 00:29

KurtWu10 linked an issue Nov 18, 2025 that may be closed by this pull request

serial example failures on Rasberry Pi 4B #528

Closed

KurtWu10 requested a review from Courtney3141 November 20, 2025 04:35

Courtney3141 requested changes Nov 20, 2025

Comment thread ci/genmc/genmc.sh

Comment thread ci/genmc/README.md Outdated

Comment thread include/sddf/serial/queue.h Outdated

Comment thread include/sddf/serial/queue.h

Comment thread include/sddf/serial/queue.h

KurtWu10 force-pushed the genmc-serial branch from 9300ac7 to 35216e4 Compare November 20, 2025 09:52

midnightveil reviewed Nov 21, 2025

KurtWu10 force-pushed the genmc-serial branch from bafadd8 to 1810982 Compare November 21, 2025 03:05

KurtWu10 removed the hardware-test Run the hardware tests on this PR. label Nov 21, 2025

Courtney3141 force-pushed the genmc-serial branch 4 times, most recently from 4711f06 to 8a99a20 Compare November 24, 2025 05:32

KurtWu10 added 2 commits November 24, 2025 16:35

ci/genmc: add genmc and serial queue library tests to ci

be9b975

Signed-off-by: Ivan Velickovic <i.velickovic@unsw.edu.au> Signed-off-by: Kurt Wu <rihui.wu@unsw.edu.au>

serial: fix queue implementation

9e62c2e

Signed-off-by: Kurt Wu <rihui.wu@unsw.edu.au>

Courtney3141 force-pushed the genmc-serial branch from 8a99a20 to 9e62c2e Compare November 24, 2025 05:36

Courtney3141 enabled auto-merge (rebase) November 24, 2025 05:38

Courtney3141 approved these changes Nov 24, 2025

Courtney3141 merged commit 82e1306 into main Nov 24, 2025
14 checks passed

Courtney3141 deleted the genmc-serial branch November 24, 2025 05:45

KurtWu10 mentioned this pull request Jan 13, 2026

sddf queue functional correctness #511

Open
