SentenceTransformersTokenTextSplitter

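| Property | Pattern | Type | Deprecated | Definition | Title/Description |
| --- | --- | --- | --- | --- | --- |
| + implementation | No | const | No | - | Implementation |
| - chunk_size | No | integer | No | - | Chunk Size |
| - chunk_overlap | No | integer | No | - | Chunk Overlap |
| - keep_separator | No | boolean | No | - | Keep Separator |
| - strip_whitespace | No | boolean | No | - | Strip Whitespace |
| - model | No | string | No | - | Model |
| - tokens_per_chunk | No | integer | No | - | Tokens Per Chunk |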

1. Property implementation

Title: Implementation

Type: const
Required: Yes

Specific value: "SentenceTransformersTokenTextSplitter"

2. Property chunk_size

Title: Chunk Size

Type: integer
Required: No
Default: 4000

Description: Maximum size of chunks to return

3. Property chunk_overlap

Title: Chunk Overlap

Type: integer
Required: No
Default: 50
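
The schema gives no description for chunk_overlap. As a rough illustration only, the sketch below shows one common way a size limit and an overlap interact when windowing a token sequence. The function name `sliding_windows` is hypothetical, and this is an assumption about the general technique, not the splitter's actual implementation.

```python
from typing import List, Sequence


def sliding_windows(
    tokens: Sequence[int], chunk_size: int, chunk_overlap: int
) -> List[List[int]]:
    """Split tokens into windows of at most chunk_size items.

    Consecutive windows share chunk_overlap items. Hypothetical helper,
    written for illustration; it is not taken from the library.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= chunk_overlap < chunk_size:
        raise ValueError("chunk_overlap must be >= 0 and smaller than chunk_size")

    step = chunk_size - chunk_overlap  # how far each window advances
    windows: List[List[int]] = []
    start = 0
    while start < len(tokens):
        windows.append(list(tokens[start:start + chunk_size]))
        if start + chunk_size >= len(tokens):
            break  # the last window already reaches the end of the input
        start += step
    return windows


if __name__ == "__main__":
    # Ten tokens, windows of four, one token shared between neighbours.
    for window in sliding_windows(list(range(10)), chunk_size=4, chunk_overlap=1):
        print(window)
```

Under this reading, a larger overlap repeats more context between neighbouring chunks at the cost of producing more chunks.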

4. Property keep_separator

Title: Keep Separator

Type: boolean
Required: No
Default: false

Description: Whether to keep the separator in the chunks

5. Property strip_whitespace

Title: Strip Whitespace

Type: boolean
Required: No
Default: true

Description: If True, strips whitespace from the start and end of every document

6. Property model

Title: Model

Type: string
Required: No
Default: "sentence-transformers/all-mpnet-base-v2"

Description: Model name

7. Property tokens_per_chunk

Title: Tokens Per Chunk

Type: integer
Required: No
Default: null

Description: Number of tokens per chunk
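
The properties above can be summarised as a small configuration check. The following is a minimal sketch that fills in the documented defaults and verifies the documented types. The names `resolve_config`, `DEFAULTS`, and `EXPECTED_TYPES` are hypothetical and not part of any library, and the handling of keys outside the schema is an assumption, since the source does not say whether additional properties are allowed.

```python
from typing import Any, Dict

IMPLEMENTATION = "SentenceTransformersTokenTextSplitter"

# Defaults as listed in the property sections above.
DEFAULTS: Dict[str, Any] = {
    "chunk_size": 4000,
    "chunk_overlap": 50,
    "keep_separator": False,
    "strip_whitespace": True,
    "model": "sentence-transformers/all-mpnet-base-v2",
    "tokens_per_chunk": None,
}

# Python types corresponding to the documented schema types.
EXPECTED_TYPES: Dict[str, type] = {
    "chunk_size": int,
    "chunk_overlap": int,
    "keep_separator": bool,
    "strip_whitespace": bool,
    "model": str,
    "tokens_per_chunk": int,
}


def resolve_config(user_config: Dict[str, Any]) -> Dict[str, Any]:
    """Return a config with defaults applied (hypothetical helper)."""
    # "implementation" is the only required property and must match the const.
    if user_config.get("implementation") != IMPLEMENTATION:
        raise ValueError(f'implementation must be "{IMPLEMENTATION}"')

    resolved: Dict[str, Any] = {"implementation": IMPLEMENTATION, **DEFAULTS}
    for key, expected in EXPECTED_TYPES.items():
        if key not in user_config:
            continue  # keep the documented default
        value = user_config[key]
        if key == "tokens_per_chunk" and value is None:
            pass  # null is the documented default, so accept it explicitly
        elif expected is int and isinstance(value, bool):
            # bool is a subclass of int in Python; reject it for integer fields.
            raise TypeError(f"{key} must be an integer")
        elif not isinstance(value, expected):
            raise TypeError(f"{key} must be of type {expected.__name__}")
        resolved[key] = value
    # Assumption: keys outside the schema are silently ignored.
    return resolved


if __name__ == "__main__":
    print(resolve_config({"implementation": IMPLEMENTATION, "chunk_overlap": 10}))
```

Only `implementation` has to be supplied; every other key falls back to its documented default when omitted.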