purrfectmeow.tc02_mlt package
Submodules
purrfectmeow.tc02_mlt.base module
- class purrfectmeow.tc02_mlt.base.Malet
Bases: object
- DEFAULT_MODEL_NAME = 'sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2'
- DEFAULT_CHUNK_SIZE = 500
- DEFAULT_CHUNK_OVERLAP = 0
- DEFAULT_CHUNK_SEPARATOR = '\n\n'
- classmethod chunking(text, chunk_method='token', **kwargs)
  - Parameters:
    - text (str)
    - chunk_method (Literal['token', 'separate'] | None)
    - kwargs (Any)
  - Return type: TokenTextSplitter | CharacterSeparator
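The signature above suggests that `chunk_method` selects between a token-based and a separator-based splitter, with `**kwargs` forwarded to whichever one is chosen. The following is a minimal, self-contained sketch of that dispatch pattern and not purrfectmeow's actual implementation. `WhitespaceTokenSplitter`, `SeparatorSplitter`, `make_splitter`, and the `split` method are hypothetical stand-ins; treating `None` as `'token'` and counting whitespace-separated words as tokens are assumptions. Only the default values are taken from the class attributes listed above.

```python
from dataclasses import dataclass
from typing import Any, List, Literal, Optional, Union

# Defaults copied from the documented Malet class attributes.
DEFAULT_CHUNK_SIZE = 500
DEFAULT_CHUNK_OVERLAP = 0
DEFAULT_CHUNK_SEPARATOR = "\n\n"


@dataclass
class WhitespaceTokenSplitter:
    """Hypothetical stand-in: whitespace-separated words count as tokens."""

    chunk_size: int = DEFAULT_CHUNK_SIZE
    chunk_overlap: int = DEFAULT_CHUNK_OVERLAP

    def split(self, text: str) -> List[str]:
        step = self.chunk_size - self.chunk_overlap
        if step <= 0:
            raise ValueError("chunk_overlap must be smaller than chunk_size")
        tokens = text.split()
        # Slide a window of chunk_size tokens, advancing by step each time.
        return [
            " ".join(tokens[i : i + self.chunk_size])
            for i in range(0, len(tokens), step)
        ]


@dataclass
class SeparatorSplitter:
    """Hypothetical stand-in: splits on a literal separator string."""

    separator: str = DEFAULT_CHUNK_SEPARATOR

    def split(self, text: str) -> List[str]:
        # Drop pieces that are empty or whitespace-only.
        return [part for part in text.split(self.separator) if part.strip()]


def make_splitter(
    chunk_method: Optional[Literal["token", "separate"]] = "token",
    **kwargs: Any,
) -> Union[WhitespaceTokenSplitter, SeparatorSplitter]:
    # Assumption: None falls back to the 'token' method.
    if chunk_method in (None, "token"):
        return WhitespaceTokenSplitter(**kwargs)
    if chunk_method == "separate":
        return SeparatorSplitter(**kwargs)
    raise ValueError(f"unknown chunk_method: {chunk_method!r}")


if __name__ == "__main__":
    print(make_splitter("token", chunk_size=2).split("a b c d e"))
    print(make_splitter("separate").split("first block\n\nsecond block"))
```

Forwarding `**kwargs` keeps the entry point small, at the cost that an option meant for one splitter raises `TypeError` when passed to the other.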
purrfectmeow.tc02_mlt.separate module
purrfectmeow.tc02_mlt.token module
Module contents
- class purrfectmeow.tc02_mlt.Malet
Bases: object
- DEFAULT_CHUNK_OVERLAP = 0
- DEFAULT_CHUNK_SEPARATOR = '\n\n'
- DEFAULT_CHUNK_SIZE = 500
- DEFAULT_MODEL_NAME = 'sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2'
- classmethod chunking(text, chunk_method='token', **kwargs)
  - Parameters:
    - text (str)
    - chunk_method (Literal['token', 'separate'] | None)
    - kwargs (Any)
  - Return type: TokenTextSplitter | CharacterSeparator