Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIP: MergeTree Folders #2

Draft
wants to merge 2 commits into
base: main
Choose a base branch
from
Draft

WIP: MergeTree Folders #2

wants to merge 2 commits into from

Conversation

lmangani
Copy link
Contributor

@lmangani lmangani commented Dec 21, 2024

This WIP PR attempts implementing a discovery function to read data from ClickHouse generated MergeTree leveraging the columns.txt and count.txt files to read the data.bin values.

-rw-r--r-- 1 root root  259 Nov 19  2023 checksums.txt
-rw-r--r-- 1 root root   94 Nov 19  2023 columns.txt
-rw-r--r-- 1 root root    1 Nov 19  2023 count.txt
-rw-r--r-- 1 root root   92 Nov 19  2023 data.bin
-rw-r--r-- 1 root root   62 Nov 19  2023 data.cmrk3
-rw-r--r-- 1 root root   10 Nov 19  2023 default_compression_codec.txt
-rw-r--r-- 1 root root    1 Nov 19  2023 metadata_version.txt
-rw-r--r-- 1 root root   50 Nov 19  2023 primary.cidx
-rw-r--r-- 1 root root  229 Nov 19  2023 serialization.json

The function is not yet fully functional and has lots of casting issues but has potential:

D SELECT * FROM clickhouse_folder('/root/.chdb_data/data/a/test/all_1_1_0/');

@lmangani lmangani changed the title ClickHouse Folders WIP: MergeTree Folders Dec 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant