Edward Langley d14ec443c2 feat(formula): improve tokenizer to correctly handle keywords and delimiters
The tokenizer greedily consumed spaces inside identifiers, so an
identifier could absorb a keyword that followed it. This change fixes
that by:
- Peeking ahead past spaces to find the next word/token.
- Breaking the identifier if the next word is a known keyword (WHERE, SUM,
  AVG, MIN, MAX, COUNT, IF).
- Adding support for more delimiter characters (<, >, =, !, ").

This fixes a regression where "Revenue WHERE" was treated as a single
identifier instead of an identifier followed by a WHERE clause.
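
The peek-ahead behaviour described above can be sketched roughly as follows.
This is a minimal illustration, not the repository's actual tokenizer: the
names `Token`, `tokenize`, `read_word`, `is_delim`, and `is_keyword` are
hypothetical, keyword matching is assumed to be case-sensitive, and the
delimiters `(`, `)`, and `,` are assumed in addition to the ones listed in
this commit.

```rust
// Hypothetical sketch; names and structure are assumptions, not the real code.
const KEYWORDS: [&str; 7] = ["WHERE", "SUM", "AVG", "MIN", "MAX", "COUNT", "IF"];

#[derive(Debug, PartialEq)]
enum Token {
    Ident(String),
    Keyword(String),
    Delim(char),
}

fn is_delim(c: char) -> bool {
    // '(' ')' ',' are assumed; '<' '>' '=' '!' '"' are the ones named in the commit.
    matches!(c, '(' | ')' | ',' | '<' | '>' | '=' | '!' | '"')
}

fn is_keyword(word: &str) -> bool {
    // Assumption: keywords are matched case-sensitively, in upper case.
    KEYWORDS.contains(&word)
}

// Read one word starting at `start`; stop at whitespace or a delimiter.
// Returns the word and the index just past it.
fn read_word(chars: &[char], start: usize) -> (String, usize) {
    let mut end = start;
    while end < chars.len() && !chars[end].is_whitespace() && !is_delim(chars[end]) {
        end += 1;
    }
    (chars[start..end].iter().collect(), end)
}

fn tokenize(input: &str) -> Vec<Token> {
    let chars: Vec<char> = input.chars().collect();
    let mut tokens = Vec::new();
    let mut i = 0;
    while i < chars.len() {
        let c = chars[i];
        if c.is_whitespace() {
            i += 1;
            continue;
        }
        if is_delim(c) {
            tokens.push(Token::Delim(c));
            i += 1;
            continue;
        }
        let (word, end) = read_word(&chars, i);
        i = end;
        if is_keyword(&word) {
            tokens.push(Token::Keyword(word));
            continue;
        }
        // Multi-word identifier: peek past spaces and only absorb the next
        // word if it is neither a keyword nor the start of a delimiter.
        let mut ident = word;
        loop {
            let mut j = i;
            while j < chars.len() && chars[j].is_whitespace() {
                j += 1;
            }
            if j == i || j >= chars.len() || is_delim(chars[j]) {
                break;
            }
            let (next, next_end) = read_word(&chars, j);
            if is_keyword(&next) {
                break; // leave the keyword for the outer loop
            }
            ident.push(' ');
            ident.push_str(&next);
            i = next_end;
        }
        tokens.push(Token::Ident(ident));
    }
    tokens
}
```

With this sketch, `tokenize("Revenue WHERE")` yields an identifier token
followed by a keyword token, while a multi-word name such as
"Total Revenue" still comes out as a single identifier.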

Includes a new regression test for inline WHERE filters in aggregate
functions.

Co-Authored-By: fiddlerwoaroof/git-smart-commit (unsloth/gemma-4-26B-A4B-it-GGUF:UD-Q5_K_XL)
2026-04-11 00:06:50 -07:00
Apache-2.0 2.5 MiB
Languages
Rust 98.1%
Python 1.1%
Shell 0.6%
Nix 0.2%