Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CALCITE-6927] Add rule for join condition remove IS NOT DISTINCT FROM #4277

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

xiedeyantu
Copy link
Contributor

please see: CALCITE-6927

@xiedeyantu xiedeyantu changed the title [CALCITE-6927] Join condition remove IS NOT DISTINCT FROM [CALCITE-6927] Add rule for join condition remove IS NOT DISTINCT FROM Mar 29, 2025
@xiedeyantu xiedeyantu force-pushed the join-cond branch 2 times, most recently from 34fd858 to 6e4ffb6 Compare March 29, 2025 12:31
//~ Inner Classes ----------------------------------------------------------

/** Shuttle that removes 'x IS NOT DISTINCT FROM y' and converts it
* to '(COALESCE(x, '') = COALESCE(y, ''))
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

'' is not an accurate description of what's happening.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done

RexNode operand1 = tmpCall.operands.get(1);

RexNode operand0Zero =
rexBuilder.makeZeroLiteral(operand0.getType());
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this only works if the type is a numeric type.
joins work on arbitrary types, including arrays and maps

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I missed this question. I will think about how to make it compatible with complex types.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done

@mihaibudiu
Copy link
Contributor

This does not look like a traditional optimization rule, since it does not really change the plan in any way, it just replaces some expressions with equivalent ones. I don't know if there is a better place for it, though.

@xiedeyantu xiedeyantu force-pushed the join-cond branch 3 times, most recently from b0f202c to 035adc1 Compare April 1, 2025 12:47
}

for (RexNode op : found.getOperands()) {
if (op.getType().getSqlTypeName() == SqlTypeName.MAP) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I actually think maps can be compared, but it's fine to skip this case too.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I referred to the implementation of spark, it does not support map comparison. But I agree with your, map should be comparable. I will skip the map comparison for now, and modify it later if I have more references. WDYT

if (operand.getType().getSqlTypeName() == SqlTypeName.ARRAY) {
ArraySqlType arrayType = (ArraySqlType) operand.getType();
RelDataType elementType = arrayType.getComponentType();
RexNode elementZero = rexBuilder.makeZeroLiteral(elementType);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You need to call the function recursively, because ARRAY can have ARRAY as elements.
Also, this may work for MULTISET too.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

there is also the case of ROW types.
Maybe you want to enhance the makeZeroLiteral function to handle these as well?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great Idea! I think I can create a new ticket to complete this suggestion first.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants