A RetroSearch Logo

Home - News ( United States | United Kingdom | Italy | Germany ) - Football scores

Search Query:

Showing content from https://github.com/ClickHouse/ClickHouse/issues/34437 below:

It makes sense to compress marks and primary.idx · Issue #34437 · ClickHouse/ClickHouse · GitHub

Describe the issue

root@ip-172-31-22-38:/opt/clickhouse/store/308/30855e33-a73f-481b-942c-04710e028fe9# find . -name *.mrk2 | xargs wc -c
7602201768 total
root@ip-172-31-22-38:/opt/clickhouse/store/308/30855e33-a73f-481b-942c-04710e028fe9# find . -name *.mrk2 | xargs cat | clickhouse-compressor | wc -c
1367956184
root@ip-172-31-22-38:/opt/clickhouse/store/308/30855e33-a73f-481b-942c-04710e028fe9# find . -name *.mrk2 | xargs cat | clickhouse-compressor --codec 'ZSTD' | wc -c
455141209
root@ip-172-31-22-38:/opt/clickhouse/store/308/30855e33-a73f-481b-942c-04710e028fe9# find . -name *.mrk2 | xargs cat | clickhouse-compressor --codec 'ZSTD(3)' | wc -c
364667373
root@ip-172-31-22-38:/opt/clickhouse/store/308/30855e33-a73f-481b-942c-04710e028fe9# find . -name primary.idx | xargs wc -c
1184498877 total
root@ip-172-31-22-38:/opt/clickhouse/store/308/30855e33-a73f-481b-942c-04710e028fe9# find . -name primary.idx | xargs cat | clickhouse-compressor | wc -c
756816544
root@ip-172-31-22-38:/opt/clickhouse/store/308/30855e33-a73f-481b-942c-04710e028fe9# find . -name primary.idx | xargs cat | clickhouse-compressor --codec 'ZSTD' | wc -c
602538200
root@ip-172-31-22-38:/opt/clickhouse/store/308/30855e33-a73f-481b-942c-04710e028fe9# find . -name primary.idx | xargs cat | clickhouse-compressor --codec 'ZSTD(3)' | wc -c
552119618

In this example, marks compressed 20 times and primary key - 2 times.
That's a huge difference if they are loaded over slow network (example: using MergeTree over web disk plugged on my desktop).


RetroSearch is an open source project built by @garambo | Open a GitHub Issue

Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo

HTML: 3.2 | Encoding: UTF-8 | Version: 0.7.4