Skip to content

Latest commit

 

History

History
15 lines (7 loc) · 547 Bytes

File metadata and controls

15 lines (7 loc) · 547 Bytes

DocumentIdStrategy

Strategy for generating deterministic document IDs. Values: URL: hash(page_url + chunk_index) - stable across re-crawls POSITION: hash(seed_url + page_index + chunk_index) - order-based CONTENT: hash(content) - deduplicates identical content

Enum

  • URL (value: 'url')

  • POSITION (value: 'position')

  • CONTENT (value: 'content')

[Back to Model list] [Back to API list] [Back to README]