feat: add bft function testcases in new substrait testfile format #738

srikrishnak · 2024-11-11T12:48:26Z

No description provided.

srikrishnak · 2024-11-12T02:40:37Z

The breaking changes check failure is not related to this commit. It pointed to changes in update rel. Once we rebase, it will pass.

jacques-n · 2024-11-12T03:17:22Z

We should get CI running first for the coverage code before adding these. That way we won't accidentally add broken things.

Second, we should also add the automated tool that does the conversion so other people can use it if they have their own tests.

EpsilonPrime

This turned into a review of the BFT source testcases since the conversion process is working correctly.

EpsilonPrime · 2024-11-13T00:05:39Z

tests/cases/aggregate_generic/count.test

I don't see the count_star test file.

The count_star() function name doesn't match any function in substrait functions. So, unable to add it as coverage breaks.

tests/cases/arithmetic/bitwise_or.test

tests/cases/arithmetic/max.test

tests/cases/arithmetic/min.test

tests/cases/string/like.test

EpsilonPrime · 2024-11-13T02:41:49Z

tests/cases/string/lower.test

+lower('aBc'::str) = 'abc'::str
+lower('abc'::str) = 'abc'::str
+lower(''::str) = ''::str
+


future: we are going to need to revisit all of the string cases once we add collation and character set. (For instance, the capital letter I in Turkish does not become i but a version without a dot.)

tests/cases/string/regexp_replace.test

EpsilonPrime · 2024-11-13T02:48:01Z

tests/cases/string/repeat.test

+repeat(''::str, 2::i64) = ''::str
+
+# null_input: Examples with null as input
+repeat(null::str, 2::i64) = null::str


later: add tests for null and negative counts

tests/cases/string/starts_with.test

srikrishnak · 2024-11-15T07:20:31Z

We should get CI running first for the coverage code before adding these. That way we won't accidentally add broken things.

Second, we should also add the automated tool that does the conversion so other people can use it if they have their own tests.

Raised a bft PR substrait-io/bft#97 for the script as well.

srikrishnak · 2024-11-15T14:58:43Z

I took care of review comments but had to rebase because of conflicting file.
After rebase, I new counter in coverage pointed to around 400 signature matches although the testcases would pass on bft.

Below are the different cases which caused signature mismatches even for functions which have the right test cases, addressed almost all of them (4 is not yet fixed). Now I am left with around 250 odd test cases. Now, mostly I see failures which are actual signature mismatches or (4.)

tests with return type SubstraitError()
tests for functions like or having variadic arguments
tests with return type decimal
tests with return type any1
tests with return type any

srikrishnak · 2024-11-18T14:30:01Z

I took care of review comments but had to rebase because of conflicting file. After rebase, I new counter in coverage pointed to around 400 signature matches although the testcases would pass on bft.

Below are the different cases which caused signature mismatches even for functions which have the right test cases, addressed almost all of them (4 is not yet fixed). Now I am left with around 250 odd test cases. Now, mostly I see failures which are actual signature mismatches or (4.)

tests with return type SubstraitError()

tests for functions like or having variadic arguments

tests with return type decimal

tests with return type any1

tests with return type any

raised PR #744 to fix issues with function lookup in coverage tool.
This PR will need rebase on #744

EpsilonPrime

There's still the question of extract with microsecond but since that's mirroring the existing test it's fine to investigate it later.

srikrishnak · 2024-11-20T09:32:26Z

I raised a draft PR substrait-io/bft#98 to check how pipeline runs go with these testcases on bft.

srikrishnak · 2024-11-21T02:26:12Z

I raised a draft PR substrait-io/bft#98 to check how pipeline runs go with these testcases on bft.

I checked locally by using the DRAFT PR for duckdb, snowflake, postgres and sqllite. Made sure there are no regressions on them.

The previous patch had one offending test case for sqllite, removed that testcase.

jacques-n

Great work @srikrishnak and @EpsilonPrime ! Thanks for thorough update and review.

srikrishnak requested review from jacques-n, cpcloud, westonpace, EpsilonPrime and vbarua as code owners November 11, 2024 12:48

srikrishnak force-pushed the port-bft-tests branch from 06e36de to 83a1f51 Compare November 12, 2024 02:40

srikrishnak changed the title ~~chore: port tests from bft to substrait~~ feat: add bft function testcases in new substrait testfile format Nov 12, 2024

EpsilonPrime reviewed Nov 13, 2024

View reviewed changes

chore: add tests from bft to substrait

a5adfec

srikrishnak force-pushed the port-bft-tests branch 2 times, most recently from b9a17c8 to 16ce9ed Compare November 19, 2024 17:52

fix substrait signature mismatches in ported tests

cf8851f

srikrishnak force-pushed the port-bft-tests branch from 4bbcfe9 to cf8851f Compare November 19, 2024 17:58

srikrishnak requested a review from EpsilonPrime November 19, 2024 17:58

EpsilonPrime previously approved these changes Nov 19, 2024

View reviewed changes

sqllite's like is case insensitive, remove the offending test case

5e71e89

srikrishnak dismissed EpsilonPrime’s stale review via 5e71e89 November 21, 2024 02:23

srikrishnak requested a review from EpsilonPrime November 21, 2024 02:26

EpsilonPrime approved these changes Nov 21, 2024

View reviewed changes

jacques-n approved these changes Nov 21, 2024

View reviewed changes

jacques-n merged commit d84ccd1 into substrait-io:main Nov 21, 2024
13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add bft function testcases in new substrait testfile format #738

feat: add bft function testcases in new substrait testfile format #738

srikrishnak commented Nov 11, 2024

srikrishnak commented Nov 12, 2024

jacques-n commented Nov 12, 2024

EpsilonPrime left a comment

EpsilonPrime Nov 13, 2024

srikrishnak Nov 19, 2024

EpsilonPrime Nov 13, 2024

EpsilonPrime Nov 13, 2024

srikrishnak commented Nov 15, 2024

srikrishnak commented Nov 15, 2024

srikrishnak commented Nov 18, 2024

EpsilonPrime left a comment

srikrishnak commented Nov 20, 2024

srikrishnak commented Nov 21, 2024

jacques-n left a comment

feat: add bft function testcases in new substrait testfile format #738

feat: add bft function testcases in new substrait testfile format #738

Conversation

srikrishnak commented Nov 11, 2024

srikrishnak commented Nov 12, 2024

jacques-n commented Nov 12, 2024

EpsilonPrime left a comment

Choose a reason for hiding this comment

EpsilonPrime Nov 13, 2024

Choose a reason for hiding this comment

srikrishnak Nov 19, 2024

Choose a reason for hiding this comment

EpsilonPrime Nov 13, 2024

Choose a reason for hiding this comment

EpsilonPrime Nov 13, 2024

Choose a reason for hiding this comment

srikrishnak commented Nov 15, 2024

srikrishnak commented Nov 15, 2024

srikrishnak commented Nov 18, 2024

EpsilonPrime left a comment

Choose a reason for hiding this comment

srikrishnak commented Nov 20, 2024

srikrishnak commented Nov 21, 2024

jacques-n left a comment

Choose a reason for hiding this comment