Bases: object
Reader interface for a single ORC file
str
or pyarrow.NativeFile
Readable source. For passing Python file objects or byte buffers, see pyarrow.io.PythonFileInterface or pyarrow.io.BufferReader.
Methods
Attributes
Compression codec of the file
Number of bytes to buffer for the compression codec in the file
Length of the data stripes in the file in bytes
The number of compressed bytes in the file footer
The number of bytes in the file
The number of bytes in the file postscript
Format version of the ORC file, must be 0.11 or 0.12
The file metadata, as an arrow KeyValueMetadata
The number of rows in the file
Number of stripe statistics
The number of stripes in the file
Read the whole file.
list
If not None, only these columns will be read from the file. A column name may be a prefix of a nested field, e.g. âaâ will select âa.bâ, âa.câ, and âa.d.eâ. Output always follows the ordering of the file and not the columns list.
pyarrow.Table
Content of the file as a Table.
Read a single stripe from the file.
int
The stripe index
list
If not None, only these columns will be read from the stripe. A column name may be a prefix of a nested field, e.g. âaâ will select âa.bâ, âa.câ, and âa.d.eâ
pyarrow.RecordBatch
Content of the stripe as a RecordBatch.
Number of rows per an entry in the row index or 0 if there is no row index
The file schema, as an arrow schema
Software instance and version that wrote this file
The number of compressed bytes in the file stripe statistics
Name of the writer that wrote this file. If the writer is unknown then its Writer ID (a number) is returned
Version of the writer
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4