Skip to content

feat(ingest/sigma): add formula bracket-ref extractor utility#17195

Open
kyungsoo-datahub wants to merge 3 commits intomasterfrom
feat/sigma-cll-formula-parser
Open

feat(ingest/sigma): add formula bracket-ref extractor utility#17195
kyungsoo-datahub wants to merge 3 commits intomasterfrom
feat/sigma-cll-formula-parser

Conversation

@kyungsoo-datahub
Copy link
Copy Markdown
Contributor

Summary

Adds a small, standalone utility module that extracts [Source/Column]
bracket references from Sigma formula strings — the foundational piece
needed by the Sigma column-level-lineage program.

Pure regex-based extraction of  references from
Sigma formula strings. Handles the  escape (verified live across
~150 real workbook formulas) and  parameter refs. Consumed by
the upcoming DM-element FGL resolver (T1.RESOLVE-A) and chart
InputFields fix; landing standalone for review isolation.
@github-actions
Copy link
Copy Markdown
Contributor

Linear: ING-2455

@github-actions github-actions Bot added the ingestion PR or Issue related to the ingestion of metadata label Apr 26, 2026
@codecov
Copy link
Copy Markdown

codecov Bot commented Apr 26, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ All tests successful. No failed tests found.

📢 Thoughts on this report? Let us know!

@datahub-connector-tests
Copy link
Copy Markdown

datahub-connector-tests Bot commented Apr 26, 2026

Connector Tests Results

All connector tests passed for commit d606fec

View full test logs →

To skip connector tests, add the skip-connector-tests label (org members only).

Autogenerated by the connector-tests CI pipeline.

@maggiehays maggiehays added the needs-review Label for PRs that need review from a maintainer. label Apr 26, 2026
…rser

Bracket refs inside double-quoted string literals (e.g. "[failed]")
are not column references and must not produce lineage edges. Add
_STRING_LITERAL_RE + _strip_string_literals() to blank out string
literal spans before the bracket scan, preserving string length for
positional debugging. Add two regression tests: the reviewer's exact
example (If([status] = "FAILURE", "[failed]", [fallback]) -> 2 refs)
plus an escaped-quote case.
Copy link
Copy Markdown
Contributor

@treff7es treff7es left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@maggiehays maggiehays added pending-submitter-merge and removed needs-review Label for PRs that need review from a maintainer. labels Apr 27, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ingestion PR or Issue related to the ingestion of metadata pending-submitter-merge

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants