Remove semi colon on all transforms #1022

ahuang11 · 2025-01-31T20:02:34Z

I think at some point, I encountered some Lumen AI syntax error related to ; or comments

WITH gold_medals AS (
    SELECT
        "athlete_full_name",
        COUNT(*) AS gold_count
    FROM read_csv('olympic_medals.csv')
    WHERE "medal_type" = 'GOLD'
      AND "slug_game" LIKE '%-2022'  -- Assuming Winter Olympics are identified by the year
    GROUP BY "athlete_full_name"
)
SELECT
    "athlete_full_name",
    gold_count
FROM gold_medals
ORDER BY gold_count DESC
LIMIT 1

Not sure if this will fix, but I think it should be applied to all transforms anyways.

Ideally, these transforms would be applied to the generated SQL for visibility, and not after under the hood in pipeline (okay applied)

codecov · 2025-01-31T20:05:17Z

Codecov Report

Attention: Patch coverage is 33.33333% with 4 lines in your changes missing coverage. Please review.

Project coverage is 57.69%. Comparing base (60ea02f) to head (1b746ee).

Files with missing lines	Patch %	Lines
lumen/ai/agents.py	0.00%	4 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1022      +/-   ##
==========================================
- Coverage   57.70%   57.69%   -0.01%     
==========================================
  Files         109      109              
  Lines       14197    14199       +2     
==========================================
  Hits         8192     8192              
- Misses       6005     6007       +2

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

philippjfr · 2025-02-04T18:47:29Z

lumen/ai/agents.py

@@ -628,16 +628,17 @@ async def _create_valid_sql(
        else:
            source = next(iter(sources))

+        sql_transforms = [SQLLimit(limit=1_000_000)]
+        for transform in sql_transforms:
+            sql_query = transform.apply(sql_query)


Confused, why iterate over a loop when there is only a single item? Also why bake the transform into the query?

Was just keeping the original structure, in case there were more transforms. I'm fine with dropping the loop.

Applying it on the original query so that it shows up in the SQL code editor, in case there's a bug in the SQLTransform output.

Baking it into the query means it falls outside of the programmable interface, e.g. if we want to toggle the limit on and off. So I'm -1 on that part.

On the flip side, if the transform (limit) isn't visible to the user, the user might think it's an issue with the data (unaware there's a limit to 1 million rows; the "Full data" checkbox is pretty tiny and I didn't notice it initially because I was looking at the query):

if we want to toggle the limit on and off

I don't follow; can't it still be toggled on and off (e.g. SELECT * FROM read_parquet('num.parquet') AS num_table LIMIT 1000000, changes into SELECT * FROM read_parquet('num.parquet') AS num_table?

By the way, clicking "Full data" checkbox doesn't work as expected:

ahuang11 requested a review from philippjfr January 31, 2025 20:02

remove semi colon on all transforms

54d0ae2

apply limit visibly

1b746ee

philippjfr reviewed Feb 4, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove semi colon on all transforms #1022

Remove semi colon on all transforms #1022

ahuang11 commented Jan 31, 2025 •

edited

Loading

codecov bot commented Jan 31, 2025 •

edited

Loading

philippjfr Feb 4, 2025

ahuang11 Feb 4, 2025

philippjfr Feb 4, 2025

ahuang11 Feb 5, 2025 •

edited

Loading

Remove semi colon on all transforms #1022

Are you sure you want to change the base?

Remove semi colon on all transforms #1022

Conversation

ahuang11 commented Jan 31, 2025 • edited Loading

codecov bot commented Jan 31, 2025 • edited Loading

Codecov Report

philippjfr Feb 4, 2025

Choose a reason for hiding this comment

ahuang11 Feb 4, 2025

Choose a reason for hiding this comment

philippjfr Feb 4, 2025

Choose a reason for hiding this comment

ahuang11 Feb 5, 2025 • edited Loading

Choose a reason for hiding this comment

ahuang11 commented Jan 31, 2025 •

edited

Loading

codecov bot commented Jan 31, 2025 •

edited

Loading

ahuang11 Feb 5, 2025 •

edited

Loading