Unclear documentation for TarFile

### Documentation

The tarfile docs does not make it clear how a programmer can read data from a tarfile into memory without doing a round-trip writing it to the file system. As far as I understand, reading partial data from a tar file essentially amounts to the following steps:

```python
import tarfile

with open("myfile.tar") as f:
    tar = tarfile.TarFile(fileobj=f)

    tar_info = next(member for member in f.getmembers() if member.is_file())
    f.seek(tar_info.offset_data)
    data = f.read(tar_info.size)
```

However, to arrive at this, you either need to be confident enough to read the CPython source code, or you need to know that tar-files stores the byte-contents unchanged, and that `TarInfo.size` is the size of the data without the file header. Neither of these are obvious for less experienced programmers.

I suggest that we make two changes to the tarfile docs:

1. Expand the documentation for [`TarInfo.size`](https://docs.python.org/3/library/tarfile.html#tarfile.TarInfo.size) so it says more than just "Size in bytes". Size of what exactly? The archived file as far as I can tell.
2. Include a minimal example (like I have above, but slightly more pedagogical maybe) to the [Reading Examples](https://docs.python.org/3/library/tarfile.html#reading-examples) section.

I can propose a PR with these changes if you think that is useful.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Unclear documentation for TarFile #146396

Documentation

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Uh oh!

Unclear documentation for TarFile #146396

Description

Documentation

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions