Bases: _RecordBatchFileReader
Class for reading Arrow record batch data from the Arrow binary file format
pyarrow.NativeFile
, or file-like Python object
Either an in-memory buffer, or a readable file object. If you want to use memory map use MemoryMappedFile as source.
int
, default None
If the file is embedded in some larger file, this is the byte offset to the very end of the file data
pyarrow.ipc.IpcReadOptions
Options for IPC serialization. If None, default values will be used.
MemoryPool
, default None
If None, default memory pool is used.
Methods
Attributes
Read the record batch with the given index.
int
The index of the record batch in the IPC file.
RecordBatch
Read the record batch with the given index along with its custom metadata
int
The index of the record batch in the IPC file.
RecordBatch
KeyValueMetadata
Read the record batch with the given index.
int
The index of the record batch in the IPC file.
RecordBatch
The number of record batches in the IPC file.
Read all record batches as a pyarrow.Table
Read contents of stream to a pandas.DataFrame.
Read all record batches as a pyarrow.Table then convert it to a pandas.DataFrame using Table.to_pandas.
Arguments to forward to Table.to_pandas()
.
pandas.DataFrame
Current IPC read statistics.
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4