It is using all available memory and is terminated by OOM.
To ReproduceAllocate a machine with 32 GB RAM, like c6a.4xlarge on AWS, with Ubuntu 22.04.
ssh into that machine.
Run the following commands:
sudo apt-get update
sudo apt-get install python3-pip
pip install duckdb
wget 'https://datasets.clickhouse.com/hits_compatible/hits.parquet'
Create the following run.py file:
#!/usr/bin/env python3
import duckdb
import timeit
con = duckdb.connect(database='my-db.duckdb', read_only=False)
print("Will load the data")
start = timeit.timeit()
con.execute("CREATE TABLE hits AS SELECT * FROM parquet_scan('hits.parquet')")
end = timeit.timeit()
print(end - start)
Make it executable:
Run it:
Wait around 10 minutes...
Will load the data
Killed
Environment (please complete the following information):
With OOM it cannot qualify in the ClickHouse benchmark.
Before Submittingmaster
branch? No.pip install duckdb --upgrade --pre
It installs the same version 0.4.0.RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4