Load an ORC object from the file path, returning a DataFrame.
String, path object (implementing os.PathLike[str]), or file-like object implementing a binary read() function. The string could be a URL. Valid URL schemes include http, ftp, s3, and file. For file URLs, a host is expected. A local file could be: file://localhost/path/to/table.orc.
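As a small illustration of the file URL form described above, the following sketch builds a URL with an explicit host from an absolute POSIX path, using only the standard library. The helper name local_orc_url is hypothetical and not part of pandas; the path is the placeholder from the text, so the resulting string is only printed, not opened.

```python
from pathlib import PurePosixPath
from urllib.parse import quote, urlunsplit


def local_orc_url(path: str, host: str = "localhost") -> str:
    """Build a file:// URL with an explicit host (hypothetical helper)."""
    posix_path = PurePosixPath(path)
    if not posix_path.is_absolute():
        raise ValueError("expected an absolute POSIX path")
    # Percent-encode characters that are not URL-safe; "/" is left as-is.
    return urlunsplit(("file", host, quote(posix_path.as_posix()), "", ""))


url = local_orc_url("/path/to/table.orc")
print(url)  # file://localhost/path/to/table.orc
```

A string of this shape could then be passed as the first argument to pd.read_orc, assuming the file actually exists at that location.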
If not None, only these columns will be read from the file. Output always follows the ordering of the file and not the columns list. This mirrors the original behaviour of pyarrow.orc.ORCFile.read().
Back-end data type applied to the resultant DataFrame (still experimental). Behaviour is as follows:
"numpy_nullable": returns nullable-dtype-backed DataFrame (default).
"pyarrow": returns pyarrow-backed nullable ArrowDtype DataFrame.
Added in version 2.0.
Filesystem object to use when reading the ORC file.
Added in version 2.1.0.
Any additional kwargs are passed to pyarrow.
Notes
Before using this function you should read the user guide about ORC and install optional dependencies.
If path is a URI with a scheme pointing to a local or remote file (e.g. "s3://"), pandas will attempt to read the file with a pyarrow.fs filesystem. You can also pass a pyarrow or fsspec filesystem object into the filesystem keyword to override this behavior.
Examples
>>> result = pd.read_orc("example_pa.orc")