Maintain line and character number after parsing #86

gregorybchris · 2024-01-10T02:02:07Z

Evaluation has many failure cases, many of which have pretty reasonable error messages. However, we do not maintain the line and character numbers associated with nodes in the AST, so the interpreter can't point to the part of the code associated with a failure that occurs during evaluation.

This feature request is to make code index information available in the evaluation stage.

tekknolagi · 2024-01-10T04:55:45Z

This is a little finicky right now unless we want to store (invalid) line number information on all objects because the AST structures are objects themselves. Perhaps we should separate them out (ehhhh) or find another way to do a mixin or something.

tekknolagi · 2024-01-10T16:31:45Z

came across this out of nowhere this morning

tekknolagi · 2024-01-10T16:32:30Z

the reasoning is a little weird but i think we can calculate the line/col from the byte position easily and it's only one number to store
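
A minimal sketch of that calculation, assuming the source is available as `bytes` and that each node stores a single byte offset. The names `line_starts` and `line_col` are hypothetical and not part of the project; columns here count bytes, not characters.

```python
import bisect


def line_starts(source: bytes) -> list[int]:
    """Return the byte offset at which each line of `source` begins."""
    starts = [0]
    for i, byte in enumerate(source):
        if byte == ord("\n"):
            # The next line starts right after the newline byte.
            starts.append(i + 1)
    return starts


def line_col(starts: list[int], byte_pos: int) -> tuple[int, int]:
    """Map a byte offset to a 1-based (line, column) pair.

    `starts` must be the sorted list produced by `line_starts`.
    """
    # Find the last line start that is <= byte_pos.
    line_index = bisect.bisect_right(starts, byte_pos) - 1
    return line_index + 1, byte_pos - starts[line_index] + 1


source = b"a = 1\nb = a + c\n"
starts = line_starts(source)
print(line_col(starts, source.index(b"c")))  # position of the unbound `c`
```

The table of line starts is built once per source file, so each lookup is a binary search rather than a rescan of the text.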

tekknolagi · 2024-01-10T16:33:06Z

I saw/skimmed part of a cool talk that did this for Circle, I think. Perhaps this talk https://www.youtube.com/watch?v=1m_5SVmGA4k

They have some cool tricks

tekknolagi · 2024-01-10T16:34:53Z

Oh, no, lmao, it was Carbon: https://www.youtube.com/watch?v=ZI198eFghJk
Starting ~30mins?

tekknolagi · 2024-12-26T14:15:21Z

cc @neuroevolutus maybe this is also interesting

I think we want to keep line/col on all AST objects but not on interstitial/value objects. This probably means setting them dynamically after construction and having helper functions has_sourcepos and sourcepos. Then hopefully we can use this in parse/eval/compile errors!
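
One way this could look, as a rough sketch rather than the actual implementation: the position is attached as a plain attribute after construction, so it is not a dataclass field. `SourcePos`, `set_sourcepos`, and the stand-in `Object`/`Var` classes are assumptions made for illustration; only the helper names `has_sourcepos` and `sourcepos` come from the comment above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SourcePos:
    """Hypothetical position record; a single byte offset would also work."""
    line: int
    col: int


@dataclass(frozen=True)
class Object:
    """Stand-in for the interpreter's base object class."""


@dataclass(frozen=True)
class Var(Object):
    """Stand-in AST node."""
    name: str


def set_sourcepos(obj: Object, pos: SourcePos) -> Object:
    # object.__setattr__ bypasses the frozen-dataclass guard, so this works
    # whether or not the class is frozen (it would not work with __slots__).
    object.__setattr__(obj, "_sourcepos", pos)
    return obj


def has_sourcepos(obj: Object) -> bool:
    return hasattr(obj, "_sourcepos")


def sourcepos(obj: Object) -> SourcePos:
    # Raises AttributeError if the object was never given a position.
    return getattr(obj, "_sourcepos")


node = set_sourcepos(Var("x"), SourcePos(line=2, col=9))
assert has_sourcepos(node)
assert not has_sourcepos(Var("x"))  # value objects simply never get one
assert node == Var("x")  # equality only compares declared fields
```

Because the generated `__eq__` only looks at declared fields, tests that compare parsed ASTs against hand-built ones would keep passing under this approach.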

Maybe we can one day even emit #line directives in the compiler.

neuroevolutus · 2024-12-26T21:27:24Z

Neat, I'll have to take a look at the referenced article and video.

neuroevolutus · 2025-01-01T21:41:10Z

@tekknolagi I started working on a pull request for this, and I realised that adding fields to the Token class to represent column numbers and/or byte positions would cause the golden tests that check the string contents of ParseErrors to fail. Would it be acceptable to change those tests to check for the presence of the relevant fields and their specific values instead, so that they become more immune to changes in the string representation of ParseErrors?
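
To illustrate the kind of test being proposed, here is a sketch under assumptions: the `Token` and `ParseError` shapes below, the field names `lineno` and `col`, and the helper `expect_closing_paren` are all hypothetical and may not match the project's real classes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Token:
    """Hypothetical token carrying its source position."""
    text: str
    lineno: int
    col: int


class ParseError(Exception):
    """Hypothetical error that keeps the offending token as a field."""

    def __init__(self, message: str, token: Token) -> None:
        super().__init__(f"{message} at line {token.lineno}, column {token.col}")
        self.message = message
        self.token = token


def expect_closing_paren(token: Token) -> None:
    if token.text != ")":
        raise ParseError("expected ')'", token)


def catch_parse_error(token: Token) -> ParseError:
    try:
        expect_closing_paren(token)
    except ParseError as err:
        return err
    raise AssertionError("expected a ParseError")


err = catch_parse_error(Token("]", lineno=3, col=7))
# Field-based checks: these survive rewording of the error string.
assert err.token.lineno == 3
assert err.token.col == 7
assert err.token.text == "]"
```

Checks like these only depend on the structured data, so the wording of the message can change without touching them.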

tekknolagi · 2025-01-01T22:31:18Z

Sure! Please split that out into a separate commit (same PR) onto which you stack your lineno commit.

tekknolagi · 2025-01-01T22:32:20Z

In general, tests are as mutable as the rest of the code, and I like your philosophy of having stable tests.

Some tests should check end-to-end string stuff though so that we have tests for stuff users see.
