afl-bridge: fix race between main thread and a vCPU thread #102

lorc · 2025-02-11T01:06:58Z

In some cases qemu_main_loop() can exit before libafl_sync_exit_cpu() completes. This causes a race between the Rust code that restarts QEMU and the vCPU thread that updates last_exit_reason. What I observed is:

libafl_exit_signal_vm_start() from a new iteration cleared last_exit_reason.cpu before libafl_sync_exit_cpu() tried to access *last_exit_reason.cpu. This caused a NULL pointer dereference.

Fix this by not setting cpu->exit in prepare_qemu_exit() and updating it only in rr_cpu_thread_fn(). This ensures that qemu_main_loop() waits for the vCPU thread to actually stop before returning control to the Rust code.
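
For readers unfamiliar with the pattern, here is a minimal sketch of the ordering the fix relies on, written with plain pthreads. It is not the QEMU or LibAFL code: every name in it (struct vcpu, struct exit_reason, vcpu_stopped, vcpu_thread_fn, run_one_iteration) is a hypothetical stand-in. The worker thread marks itself stopped only after it has finished reading the shared exit state, and the main thread clears that state only after it has seen the stopped flag.

```c
#include <pthread.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-ins; none of these are QEMU or LibAFL definitions. */
struct vcpu { int index; };

struct exit_reason {
    struct vcpu *cpu;   /* set when an exit is requested, cleared on restart */
    int last_cpu_index; /* written by the vCPU thread during its sync step */
};

static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;
static pthread_cond_t stopped_cond = PTHREAD_COND_INITIALIZER;
static bool vcpu_stopped;
static struct exit_reason last_exit;

/* vCPU thread: finish every access to last_exit, then report the stop. */
static void *vcpu_thread_fn(void *arg)
{
    (void)arg;
    pthread_mutex_lock(&lock);
    last_exit.last_cpu_index = last_exit.cpu->index; /* the "sync" step */
    vcpu_stopped = true; /* set only after the sync step is done */
    pthread_cond_signal(&stopped_cond);
    pthread_mutex_unlock(&lock);
    return NULL;
}

/* Main thread: request an exit, wait for the stop, then reset the state. */
int run_one_iteration(void)
{
    static struct vcpu cpu0 = { .index = 7 };
    pthread_t tid;
    int seen;

    pthread_mutex_lock(&lock);
    vcpu_stopped = false;
    last_exit.cpu = &cpu0;
    pthread_mutex_unlock(&lock);

    if (pthread_create(&tid, NULL, vcpu_thread_fn, NULL) != 0) {
        return -1;
    }

    pthread_mutex_lock(&lock);
    while (!vcpu_stopped) {
        /* Do not return to the caller until the vCPU thread has stopped. */
        pthread_cond_wait(&stopped_cond, &lock);
    }
    seen = last_exit.last_cpu_index;
    last_exit.cpu = NULL; /* safe: the vCPU thread no longer reads it */
    pthread_mutex_unlock(&lock);

    pthread_join(tid, NULL);
    return seen;
}
```

If the stopped flag were set before the sync step (the analogue of setting cpu->exit early in prepare_qemu_exit()), the main thread could clear last_exit.cpu while the worker still needs it, which is the NULL dereference described above.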

lorc · 2025-02-13T11:13:13Z

Hi, @rmalmain,

Could you please take a look?

reformat code with more typedefs.

lorc · 2025-03-07T20:14:51Z

@rmalmain, @domenukk, @vanhauser-thc , @hexcoder-, @andreafioraldi, @tokatoka,

May I ask you to take a look at this PR? We really want to use LibAFL for Xen fuzzing on our CI server, but there is an issue that crashes the whole fuzzer. This PR fixes it.

If you need more information, please let me know.

tokatoka · 2025-03-07T21:16:17Z

I'll tell romain to check it.
but he'll be back only after next Wednesday

lorc · 2025-03-07T21:43:04Z

Thanks! I've waited almost a month, so I can wait a couple more days :)

rmalmain

yeah sorry for the delay, didn't have much time lately.
the fix looks good to me, thanks for the pr!

rmalmain · 2025-03-13T11:22:18Z

accel/tcg/tcg-accel-ops-rr.c

@@ -273,6 +279,12 @@ static void *rr_cpu_thread_fn(void *arg)
                    bql_lock();
                    break;
                }
+//// --- Begin LibAFL code ---


can you also propagate this change to accel/tcg/tcg-accel-ops-mttcg.c please?

Sure, will do this today.

rmalmain · 2025-03-13T12:29:45Z

Thank you, I started to run the tests on the main repo.

…

-------- Original message -------- On 13/03/2025 12:46, Volodymyr Babchuk wrote:

@lorc commented on this pull request.

In accel/tcg/tcg-accel-ops-rr.c:

> @@ -273,6 +279,12 @@ static void *rr_cpu_thread_fn(void *arg)
>                     bql_lock();
>                     break;
>                 }
> +//// --- Begin LibAFL code ---

Sure, will do this today.

In some cases qemu_main_loop() can exit before libafl_sync_exit_cpu() completes. This causes a race between the Rust code that restarts QEMU and the vCPU thread that updates last_exit_reason. What I observed is:

libafl_exit_signal_vm_start() from a new iteration cleared last_exit_reason.cpu before libafl_sync_exit_cpu() tried to access *last_exit_reason.cpu. This caused a NULL pointer dereference.

Fix this by not setting cpu->exit in prepare_qemu_exit() and updating it only in rr_cpu_thread_fn() and its MTTCG counterpart. This ensures that qemu_main_loop() waits for the vCPU thread to actually stop before returning control to the Rust code.

Signed-off-by: Volodymyr Babchuk <volodymyr_babchuk@epam.com>

lorc · 2025-03-13T13:29:44Z

Okay, I've updated the PR by adding the complementary change to MTTCG. I haven't tested the MTTCG case, though.

rmalmain · 2025-03-13T16:09:37Z

it should behave similarly.
i'll re-run the tests on the libafl pr.

rename symbols to fit with rust 2024.

3c60ef9


domenukk requested a review from rmalmain March 9, 2025 19:05

rmalmain requested changes Mar 13, 2025


rmalmain mentioned this pull request Mar 13, 2025

Fix LibAFL QEMU race condition between main loop and a vcpu AFLplusplus/LibAFL#3072

Merged

rmalmain self-requested a review March 13, 2025 13:02

rmalmain approved these changes Mar 13, 2025


lorc force-pushed the pr_fix_race branch from 6bc6c5e to 9a3918b Compare March 13, 2025 13:26

rmalmain merged commit 0b9d826 into AFLplusplus:main Mar 14, 2025
1 check passed

lorc deleted the pr_fix_race branch March 14, 2025 16:40
