Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Old MV partitions not deleted in shared data mode from Minio #56355

Open
swapkh91 opened this issue Feb 27, 2025 · 5 comments
Open

Old MV partitions not deleted in shared data mode from Minio #56355

swapkh91 opened this issue Feb 27, 2025 · 5 comments
Assignees
Labels
type/bug Something isn't working

Comments

@swapkh91
Copy link

Steps to reproduce the behavior (Required)

CREATE TABLE `events` (
  `anonymousId` varchar(65533) NULL COMMENT "",
  `event` varchar(65533) NULL COMMENT "",
  `timestamp` datetime NULL COMMENT ""
) ENGINE=OLAP 
DUPLICATE KEY(`anonymousId`)
COMMENT "OLAP"
PARTITION BY date_trunc('DAY', timestamp)
DISTRIBUTED BY RANDOM
PROPERTIES (
"bloom_filter_columns" = "timestamp",
"bucket_size" = "4294967296",
"compression" = "LZ4",
"datacache.enable" = "true",
"datacache.partition_duration" = "1 days",
"enable_async_write_back" = "false",
"partition_live_number" = "1",
"replication_num" = "1",
"storage_volume" = "default_volume"
);
CREATE MATERIALIZED VIEW `events_mv` (`anonymousId`, `timestamp_ist`, `timestamp`,
  INDEX segment_id_idx (`anonymousId`) USING BITMAP COMMENT '')
COMMENT "MATERIALIZED_VIEW"
PARTITION BY (date_trunc('day', `timestamp`))
DISTRIBUTED BY RANDOM
REFRESH ASYNC START("2025-02-06 10:00:00") EVERY(INTERVAL 1 MINUTE)
PROPERTIES (
"replicated_storage" = "true",
"partition_ttl_number" = "2",
"replication_num" = "1",
"datacache.enable" = "true",
"enable_async_write_back" = "false",
"storage_volume" = "default_volume",
"warehouse" = "default_warehouse"
)
AS SELECT `events`.`timestamp_ist`, `events`.`timestamp`
FROM (SELECT convert_tz(`events`.`timestamp`, 'GMT', '+05:30') AS `timestamp_ist`, `events`.`timestamp`
FROM `segment`.`events`) `events`
WHERE (`events`.`event` = 'EXPIRED');

Expected behavior (Required)

show partitions from events_mv;
this command shows 1-2 partitions, but in Minio previous partition folders are still present. The older partition folders usually contain only SCHEMA file and sometimes meta folder and SCHEMA file both, refer screenshots.

Real behavior (Required)

This issue was present for tables too and resolved in #41675. Older folders should automatically be deleted from Minio.

StarRocks version (Required)

3.3.8

Image Image Image
@swapkh91 swapkh91 added the type/bug Something isn't working label Feb 27, 2025
@srlch
Copy link
Contributor

srlch commented Feb 27, 2025

@swapkh91 Hi, is the cluster upgraded from v3.2? or other lower version?

@swapkh91
Copy link
Author

@swapkh91 Hi, is the cluster upgraded from v3.2? or other lower version?

@srlch i upgraded it from 3.2.10 -> 3.3.1 -> 3.3.8

@srlch
Copy link
Contributor

srlch commented Feb 27, 2025

@swapkh91 before v3.3, the older partition folders can not be remove directly, could you reproduce this problem in 3.3.8 for a new MV?

@swapkh91
Copy link
Author

@swapkh91 before v3.3, the older partition folders can not be remove directly, could you reproduce this problem in 3.3.8 for a new MV?

i'll check for new MV
for tables it was automatically resolved after upgrading, why not MV?

@srlch
Copy link
Contributor

srlch commented Feb 27, 2025

@swapkh91 before v3.3, the older partition folders can not be remove directly, could you reproduce this problem in 3.3.8 for a new MV?

i'll check for new MV for tables it was automatically resolved after upgrading, why not MV?

The partition removed after upgrading would be erased the folder, both for table and MV, i will also try to reproduce this problem.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type/bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants