Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Python's help utility crashes when collections.abc module is asked. #123339

Closed
taaaf11 opened this issue Aug 26, 2024 · 10 comments
Closed

Python's help utility crashes when collections.abc module is asked. #123339

taaaf11 opened this issue Aug 26, 2024 · 10 comments
Labels
stdlib Python modules in the Lib dir type-bug An unexpected behavior, bug, or error

Comments

@taaaf11
Copy link

taaaf11 commented Aug 26, 2024

Bug report

Bug description:

I am running Python 3.13.0-b2 on Debian 12 i386. When collections.abc is entered into python interactive help utility, the interpreter crashes.

>>> help
>>> collections.abc
(Python interpreter crashes)

The traceback is indented. So I am attaching video.

recording.mp4

CPython versions tested on:

3.13

Operating systems tested on:

Linux

Linked PRs

@taaaf11 taaaf11 added the type-bug An unexpected behavior, bug, or error label Aug 26, 2024
@taaaf11

This comment was marked as resolved.

@python python deleted a comment Aug 26, 2024
@Eclips4

This comment was marked as resolved.

@taaaf11

This comment was marked as resolved.

@picnixz
Copy link
Member

picnixz commented Aug 26, 2024

It appears that lines, lnum = findsource(object) returns objects with len(lines) < lnum. It appears that:

    if isclass(object):
        try:
            firstlineno = vars(object)['__firstlineno__']
        except (TypeError, KeyError):
            raise OSError('source code not available')
        return lines, firstlineno - 1

and I assume that firstlineno is too large for lines. There is a check if lnum >= len(lines) but only for non-class objects. I can prepare a patch (and it seems it fixes the issue).

@serhiy-storchaka
Copy link
Member

Why these classes have the wrong value at first place? I suspect there is a different issue behind this, and clipping the line number can just sweep it under cover.

@picnixz
Copy link
Member

picnixz commented Aug 28, 2024

The issue is likely due to the fact that _collections_abc has it's name set to collections.abc (#123347 (comment))

@serhiy-storchaka
Copy link
Member

This is a severe issue. Modules collections.abc and decimal are affected completely, because their content is defined in other files. But several other classes in other modules, like abc.ABCMeta or tomllib.TOMLDecodeError are affected too.

For collections.abc and decimal the solution can be replacing these modules completely in sys.modules This allows to get sources of classes in these modules (previously this did not work). For other classes with changed __module__ the only solution is to delete __firstlineno__. Since this issue can affect not only the stdlib classes, it is better to do this on the low level: setting __module__ will delete __firstlineno__.

@picnixz
Copy link
Member

picnixz commented Sep 2, 2024

I'll work on something now. I'm done with what I wanted to do today.

@picnixz
Copy link
Member

picnixz commented Sep 2, 2024

Ok, I've tried to simply change the type.__module__ setter but it appears it's not sufficient enough. I have the idea but I don't have the time today. So I'll do it tomorrow.

@picnixz picnixz self-assigned this Sep 2, 2024
serhiy-storchaka added a commit to serhiy-storchaka/cpython that referenced this issue Sep 2, 2024
…lineno__ in classes

* Setting the __module__ attribute for a class now removes the
  __firstlineno__ item from the type's dict.
* The _collections_abc and _pydecimal modules now completely replace the
  collections.abc and decimal modules after importing the latters. This
  allows to get the source of classes and functions defined in these
  modules.
* inspect.findsource() now checks whether the first line number for a
  class is out of bound.
@serhiy-storchaka
Copy link
Member

I already had a ready solution, just was not able to create a PR due to a power outage.

serhiy-storchaka added a commit to serhiy-storchaka/cpython that referenced this issue Sep 2, 2024
…lineno__ in classes

* Setting the __module__ attribute for a class now removes the
  __firstlineno__ item from the type's dict.
* The _collections_abc and _pydecimal modules now completely replace the
  collections.abc and decimal modules after importing the latters. This
  allows to get the source of classes and functions defined in these
  modules.
* inspect.findsource() now checks whether the first line number for a
  class is out of bound.
@picnixz picnixz removed their assignment Sep 3, 2024
serhiy-storchaka added a commit that referenced this issue Sep 28, 2024
…__ in classes (GH-123613)

* Setting the __module__ attribute for a class now removes the
  __firstlineno__ item from the type's dict.
* The _collections_abc and _pydecimal modules now completely replace the
  collections.abc and decimal modules after importing them. This
  allows to get the source of classes and functions defined in these
  modules.
* inspect.findsource() now checks whether the first line number for a
  class is out of bound.
serhiy-storchaka added a commit to serhiy-storchaka/cpython that referenced this issue Sep 28, 2024
…__firstlineno__ in classes (pythonGH-123613)

* Setting the __module__ attribute for a class now removes the
  __firstlineno__ item from the type's dict.
* The _collections_abc and _pydecimal modules now completely replace the
  collections.abc and decimal modules after importing them. This
  allows to get the source of classes and functions defined in these
  modules.
* inspect.findsource() now checks whether the first line number for a
  class is out of bound.
(cherry picked from commit 69a4063)

Co-authored-by: Serhiy Storchaka <storchaka@gmail.com>
Yhg1s pushed a commit that referenced this issue Sep 30, 2024
…tlineno__ in classes (GH-123613) (#124735)

* Setting the __module__ attribute for a class now removes the
  __firstlineno__ item from the type's dict.
* The _collections_abc and _pydecimal modules now completely replace the
  collections.abc and decimal modules after importing them. This
  allows to get the source of classes and functions defined in these
  modules.
* inspect.findsource() now checks whether the first line number for a
  class is out of bound.
(cherry picked from commit 69a4063)
@hugovk hugovk closed this as completed Jan 28, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
stdlib Python modules in the Lib dir type-bug An unexpected behavior, bug, or error
Projects
None yet
Development

No branches or pull requests

5 participants