lake: add incremental updates 427#22783
lake: add incremental updates 427#22783ti-chi-bot[bot] merged 17 commits intopingcap:feature/preview-cloud-lakefrom
Conversation
There was a problem hiding this comment.
Code Review
This pull request adds documentation for numerous new geospatial and JSON functions, updates the geospatial overview to specify GEOMETRY/GEOGRAPHY support, and introduces a "Query Result Defaults" section to the worksheet guide. It also refines the STRIP_NULL_VALUE documentation. Feedback suggests using standard blockquote formatting for notes, applying backticks to technical terms like float32, and correcting minor grammatical inconsistencies for better alignment with the style guide.
Add documentation for the new SYSTEM$SET_CACHE_CAPACITY call and wire it into the TOC. Create tidb-cloud-lake/sql/set-cache-capacity.md describing syntax, notes, and examples. Expand system.caches doc (columns, cache list, example queries) to describe cache names, units, and usage; include note that disk_cache_column_data cannot be adjusted. Add Tag Operations sections to ALTER TABLE and ALTER VIEW docs (syntax and examples for SET/UNSET TAG). Also update an example ROLE_ARN value in alter-table.md.
|
/test pull-verify |
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Add documentation for unloading Lance datasets and table versioning (snapshot tags), and wire these features into the TOC and SQL reference. New pages: unload-lance-dataset.md, create-snapshot-tag.md, drop-snapshot-tag.md, fuse-tag.md, table-versioning.md. Update SQL pages to document LANCE as an unload-only FILE_FORMAT and COPY INTO <location> behavior (copy-into-location.md, create-file-format.md, input-output-file-formats.md), add AT(TAG) and snapshot-tag operations (at.md, alter-table.md), and expose the new table-versioning commands in the TOC. Also update Fuse Engine docs (fuse-engine-tables.md) with new options and version bump notes, fix a few links and small editorial/formatting issues (data-lifecycle.md, mcp-server.md, mindsdb.md, unload-csv-file.md, set-cache-capacity.md, tutorials/backup-restore-with-bendsave.md).
Add a new documentation page tidb-cloud-lake/guides/task-flow.md describing the Task Flow feature (concepts, tasks/flows, DAGs, creating/configuring flows, UI visualization, monitoring, runs/version history, scheduling, stream triggers, best practices, and permissions). Also update TOC-tidb-cloud-lake.md to include a link to the new Task Flow guide.
Add documentation for dictionary management and geospatial aggregation, plus TSV/TEXT alias notes and VACUUM VIRTUAL COLUMN. New pages added: CREATE DICTIONARY, DROP DICTIONARY, RENAME DICTIONARY, SHOW CREATE DICTIONARY, SHOW DICTIONARIES, SHOW CATALOGS, ST_COLLECT, ST_ENVELOPE_AGG, ST_INTERSECTION_AGG, ST_UNION_AGG, and VACUUM VIRTUAL COLUMN. Updated files and navigation: TOC, DDL overview, administration commands, aggregate-functions (new Geospatial Aggregation section), COPY INTO table, and guides for TSV/querying TSV to note that TEXT is an alias for TSV in v1.2.890-nightly. Also expanded create-function with a Python UDF server example to demonstrate end-to-end usage. These changes document new features and improve compatibility and examples.
Add documentation for worker management and spatial indexes, plus related reference and TOC updates. - Introduce worker SQL docs: CREATE WORKER, ALTER WORKER, DROP WORKER, SHOW WORKERS, worker-overview and worker-examples; add worker entries to DDL and TOC. - Add spatial index docs: spatial-index-overview, CREATE SPATIAL INDEX, REFRESH SPATIAL INDEX, DROP SPATIAL INDEX; add Spatial Index section to TOC and DDL. - Add DESCRIBE NOTIFICATION INTEGRATION and link from notifications and TOC. - Expand and improve existing pages: richer CREATE INVERTED INDEX examples and QUERY() guidance; add worker management section to CREATE SCALAR FUNCTION; clarify READ_FILE syntax/usage and examples; add tagging and MySQL Endpoint guidance to warehouse guide; normalize example identifiers (use quoted names) and fix minor examples (query-history, range-between, etc.). These changes add new features documentation, clarify usage patterns, and align examples across the Lake docs.
Update internal docs links that pointed to the old anchor (#connecting) to the correct anchor (#connecting-to-a-warehouse) in multiple TiDB Cloud Lake guides for consistency and to avoid broken/incorrect anchors. Affected files: connect-using-dbeaver.md, deepnote.md, grafana.md, jupyter-notebook.md, metabase.md, mindsdb.md, redash.md, superset.md, tableau.md.
Reflow and clean up Markdown formatting in the file: consolidate split blockquote lines, remove stray line breaks and extra blank lines, and fix a broken MySQL mysqldump example. These are cosmetic changes to improve readability and ensure blockquotes and lists render correctly; no functional content changes.
Rename tidb-cloud-lake/worker-examples.md → tidb-cloud-lake/sql/worker-examples.md and update the TOC link to /tidb-cloud-lake/sql/worker-examples.md so SQL-related examples are grouped under the sql/ directory.
|
/test pull-verify |
This reverts commit 0feb8ca.
|
Note For the unexpected pull-verify failure error, I'll improve the check-tags.py script (because it was too strict previously) in #22825 and then merge this PR. |
|
/test pull-verify |
Replace the custom admonition in create-dictionary.md with a Markdown blockquote ("Note") for proper rendering, and add a blank line before the SQL code block in fuse-engine-tables.md so the "Examples" heading and code render correctly. Small formatting tweaks to improve documentation appearance.
Renumber steps in tidb-cloud-lake/guides/task-flow.md to start at 1 instead of 5–7. Promote headings from #### to ### for ENGINE, CLUSTER BY, and <Options> in tidb-cloud-lake/sql/fuse-engine-tables.md for consistent section hierarchy and improved readability.
|
/approve |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: lilin90 The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
70888a4
into
pingcap:feature/preview-cloud-lake
What is changed, added or deleted? (Required)
Incremental Lake docs updates till 4/27
Note
Renamed
/tidb-cloud-lake/sql/rename-database.mdto/tidb-cloud-lake/sql/alter-database.md.Which TiDB version(s) do your changes apply to? (Required)
Tips for choosing the affected version(s):
By default, CHOOSE MASTER ONLY so your changes will be applied to the next TiDB major or minor releases. If your PR involves a product feature behavior change or a compatibility change, CHOOSE THE AFFECTED RELEASE BRANCH(ES) AND MASTER.
For details, see tips for choosing the affected versions.
What is the related PR or file link(s)?
Do your changes match any of the following descriptions?