gh-126895: fix readline module in free-threaded build #131208

tom-pytel · 2025-03-13T19:54:23Z

This is nothing more than adding @critical_section to functions which use the readline library. It is not thread-safe (because readline is not thread-safe), but it doesn't crash like before.

The test test_free_threading_doctest_difflib is probably overkill, but included at first to show that it works. Can comment out or remove as desired.

Also, without test_free_threading_doctest_difflib (that uses stuff which is not free-thread safe), test_readline fails without crashing with --parallel-threads, where previously it crashed. Also without it doesn't complain when tested with TSAN.

Subinterpreters are not an issue because readline just doesn't load in them: module readline does not support loading in subinterpreters

Issue: Segfault/aborts calling readline.set_completer_delims in threads in a free-threaded build #126895

tom-pytel · 2025-03-14T11:57:13Z

ping @ZeroIntensity , @colesbury

ZeroIntensity

This covers free-threading, but after re-reading some of my analysis from back in November, I'm pretty sure it's the GNU readline library itself that's not thread-safe. Critical sections are per-interpreter, so this will continue to race in subinterpreters. I'd rather handle both cases in one fix.

That said, I'm not sure whether it's unsafe to call any readline function concurrently, or just the same function across multiple threads. I suspect its the former. In that case, let's go with a global lock around readline calls, but otherwise we can go with a local static PyMutex to serialize single calls.

tom-pytel · 2025-03-14T13:57:58Z

Critical sections are per-interpreter, so this will continue to race in subinterpreters.

What I meant above is that readline just straight up doesn't import in a subinterpreter, it gives that error: module readline does not support loading in subinterpreters, so this is not an issue.

That said, I'm not sure whether it's unsafe to call any readline function concurrently,

You can call across threads and it won't crash because it is synchronized with GIL, you will just get wrong results. But if you called free-threaded it would crash, now it just behaves like not free-threaded and doesn't crash.

but otherwise we can go with a local static PyMutex to serialize single calls

Could do that, but is that safe if one call dies in an unexpected way and mutex is left locked?

ZeroIntensity · 2025-03-14T14:09:21Z

Oh, I thought we got everything in the stdlib over to multi-phase init :(. It might be worth fixing that and going with global locks for 3.14, but this fix makes sense for 3.13.
Yes, but I'm trying to figure out the actual implications here. Can we call unrelated readline functions concurrently, or does all of readline need to be protected by a single lock?
Critical sections have the same issue. We shouldn't worry about threads randomly dying and leaving things locked.

tom-pytel · 2025-03-14T14:19:17Z

Yes, but I'm trying to figure out the actual implications here. Can we call unrelated readline functions concurrently, or does all of readline need to be protected by a single lock?

The single lock on module is what I implemented here, so they are all mutually exclusive (since readline lib is specifically not thread-safe).

Critical sections have the same issue. We shouldn't worry about threads randomly dying and leaving things locked.

I was under the impression that critical sections, as opposed to pure PyMutex, are fairly deadlock proof? Don't they release after a period of time? And also: https://docs.python.org/3/c-api/init.html#python-critical-section-api -

"Critical sections avoid deadlocks by implicitly suspending active critical sections and releasing the locks during calls to PyEval_SaveThread(). When PyEval_RestoreThread() is called, the most recent critical section is resumed, and its locks reacquired. This means the critical section API provides weaker guarantees than traditional locks ..."

ZeroIntensity · 2025-03-14T14:24:08Z

Critical sections are based off an object's PyMutex, it just implicitly releases that mutex when the thread state is detached (for example, when waiting on another lock), which prevents deadlocks. If the thread crashes, the lock is still held. (These details don't matter much here, anyway.)

tom-pytel · 2025-03-14T14:26:36Z

So what's the verdict then wrt this, changes?

ZeroIntensity · 2025-03-14T14:31:42Z

Let's just confirm that we really do need a global lock here; I don't want to drastically inhibit scaling if we don't have to.

tom-pytel · 2025-03-14T14:43:27Z

Let's just confirm that we really do need a global lock here; I don't want to drastically inhibit scaling if we don't have to.

Short: Yes it needs to be global because readline is not thread-safe. But its only applied on things that actually call into readline.

colesbury

Adding critical sections seems fine to me:

I don't think the new tests are worth it
There are a bunch of functions that don't have critical sections: readline.write_history_file, readline.append_history_file, readline.set_history_length, ...

Modules/readline.c

ZeroIntensity · 2025-03-15T14:40:07Z

I'm fine with critical sections too, I'm just worried about basically enabling the GIL for all usage of readline.

Yes it needs to be global because readline is not thread-safe. But its only applied on things that actually call into readline.

I think you're misinterpreting what I mean a little. "Thread-safety" is a blanket term; I mean that we might not need locks synchronizing the entire module. For example, it makes sense to me that write_history_file isn't thread-safe against read_history_file, but I see no reason that set_startup_hook couldn't run in another thread while those two are synchronized.

colesbury · 2025-03-15T15:04:15Z

Adding a lock to the readline module is not like enabling the GIL. The GIL affects everything in the process, including things that don't touch the readline module. The readline lock only affects things that use readline. Using multiple threads to concurrently muck around with readline is not a real use case that we should optimize for. So far the only time I've seen anyone try to use readline from multiple threads at all is when fuzzing CPython.

...but I see no reason that set_startup_hook couldn't run in another thread while those two are synchronized...

In general, we should prefer the simplest approach to thread safety that is efficient. In this case, that means a single lock for the entire readline module.

tom-pytel · 2025-03-15T15:51:47Z

read_history_file had critical section but I did miss write_history_file and append_history_file. Other points:

@critical_section module -> @critical_section.
Added atomics instead of critical sections to _history_length stuff.
Removed tests but left one single minimized one because there really should be at least one to check for crash?
readline_set_completion_display_matches_hook_impl has critical section because is setting rl_ global var which may be used inside locked readline functions. Anything that touches global rl_ stuff is critical sectioned (except setup / constructor stuff).
I didn't give stuff set_hook and set_startup_hook and the like critical sections because they just setting py vars, not readline stuff, maybe should have at least atomics, but does it matter?

One potential issue. call_readline() uses readline stuff but not locked here because is super low level and no clean way to access lock. Also not used here but exposed via PyOS_ReadlineFunctionPointer which is used by myreadline.c in Parser. Could store module in a global to use for lock or look it up everytime in sys.modules in this function, but is it worth it? Or can just count on caller to sync and that nothing else is using readline simultaneously?

colesbury · 2025-03-15T16:12:24Z

set_hook should not have it's own critical section, but the callers should have a critical section
call_readline cannot (and should not) use a critical section. It's called without the GIL held in the default build.

bedevere-bot · 2025-03-17T13:19:27Z

🤖 New build scheduled with the buildbot fleet by @colesbury for commit 0dca261 🤖

Results will be shown at:

https://buildbot.python.org/all/#/grid?branch=refs%2Fpull%2F131208%2Fmerge

If you want to schedule another build, you need to add the 🔨 test-with-refleak-buildbots label again.

colesbury · 2025-03-17T15:58:04Z

Thanks Tom!

I'm not going to backport this to 3.13. I think it's unlikely that anyone is going to try to use the readline module from multiple threads concurrently outside of fuzzing.

…31208) The underlying readline library is not thread-safe so this adds `@critical_section` to functions that use it.

pythongh-126895: fix readline module in free-threaded build

991ca84

bedevere-app bot added the awaiting review label Mar 13, 2025

bedevere-app bot mentioned this pull request Mar 13, 2025

Segfault/aborts calling readline.set_completer_delims in threads in a free-threaded build #126895

Closed

blurb-it bot and others added 3 commits March 13, 2025 19:54

📜🤖 Added by blurb_it.

70293ec

disable test_free_threading_doctest_difflib()

b92b20c

alternate method run test_free_threading_doctest_difflib

d4ea7c2

ZeroIntensity reviewed Mar 14, 2025

View reviewed changes

ZeroIntensity added topic-free-threading needs backport to 3.13 bugs and security fixes labels Mar 14, 2025

colesbury self-requested a review March 14, 2025 20:59

colesbury reviewed Mar 15, 2025

View reviewed changes

Modules/readline.c Outdated Show resolved Hide resolved

requested changes

7124d20

generate clinic stuff

089821e

add @critical_section to callers of set_hook

0dca261

colesbury approved these changes Mar 17, 2025

View reviewed changes

bedevere-app bot added awaiting merge and removed awaiting review labels Mar 17, 2025

colesbury added the 🔨 test-with-refleak-buildbots Test PR w/ refleak buildbots; report in status section label Mar 17, 2025

bedevere-bot removed the 🔨 test-with-refleak-buildbots Test PR w/ refleak buildbots; report in status section label Mar 17, 2025

colesbury self-assigned this Mar 17, 2025

colesbury removed the needs backport to 3.13 bugs and security fixes label Mar 17, 2025

colesbury merged commit 863d54c into python:main Mar 17, 2025
59 checks passed

bedevere-app bot removed the awaiting merge label Mar 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gh-126895: fix readline module in free-threaded build #131208

gh-126895: fix readline module in free-threaded build #131208

tom-pytel commented Mar 13, 2025 •

edited

Loading

tom-pytel commented Mar 14, 2025

ZeroIntensity left a comment

tom-pytel commented Mar 14, 2025 •

edited

Loading

ZeroIntensity commented Mar 14, 2025

tom-pytel commented Mar 14, 2025 •

edited

Loading

ZeroIntensity commented Mar 14, 2025

tom-pytel commented Mar 14, 2025

ZeroIntensity commented Mar 14, 2025

tom-pytel commented Mar 14, 2025 •

edited

Loading

colesbury left a comment

ZeroIntensity commented Mar 15, 2025

colesbury commented Mar 15, 2025

tom-pytel commented Mar 15, 2025

colesbury commented Mar 15, 2025

bedevere-bot commented Mar 17, 2025

colesbury commented Mar 17, 2025

gh-126895: fix readline module in free-threaded build #131208

gh-126895: fix readline module in free-threaded build #131208

Conversation

tom-pytel commented Mar 13, 2025 • edited Loading

tom-pytel commented Mar 14, 2025

ZeroIntensity left a comment

Choose a reason for hiding this comment

tom-pytel commented Mar 14, 2025 • edited Loading

ZeroIntensity commented Mar 14, 2025

tom-pytel commented Mar 14, 2025 • edited Loading

ZeroIntensity commented Mar 14, 2025

tom-pytel commented Mar 14, 2025

ZeroIntensity commented Mar 14, 2025

tom-pytel commented Mar 14, 2025 • edited Loading

colesbury left a comment

Choose a reason for hiding this comment

ZeroIntensity commented Mar 15, 2025

colesbury commented Mar 15, 2025

tom-pytel commented Mar 15, 2025

colesbury commented Mar 15, 2025

bedevere-bot commented Mar 17, 2025

colesbury commented Mar 17, 2025

tom-pytel commented Mar 13, 2025 •

edited

Loading

tom-pytel commented Mar 14, 2025 •

edited

Loading

tom-pytel commented Mar 14, 2025 •

edited

Loading

tom-pytel commented Mar 14, 2025 •

edited

Loading