HamaWhiteGG / langchain-java

Java version of LangChain, while empowering LLM for Big Data.
Apache License 2.0
545 stars 106 forks source link

如何实现langchain中的文件加载且文本分割 #77

Closed TheBlindM closed 1 year ago

TheBlindM commented 1 year ago

就像下面这样

    documents = load_docs(app.config['UPLOAD_FOLDER'])
    # 初始化加载器
    text_splitter = CharacterTextSplitter(chunk_size=100, chunk_overlap=0)
    # 切割加载的 document
    split_docs = text_splitter.split_documents(documents)