run-llama / llama-hub

A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain
https://llamahub.ai/
MIT License

[Bug]: Semantic Chunking Llama Pack UnicodeEncodeError when using llamaindex-cli #929

Open jcalebsmith opened 5 months ago

jcalebsmith commented 5 months ago

Bug Description

When attempting to load LLMCompilerAgentPack with llamaindex-cli on Windows 11, the following error is thrown:

UnicodeEncodeError: 'charmap' codec can't encode characters in position 6859-6865: character maps to <undefined>

I've verified that other llama packs can be downloaded successfully (e.g., SelfRAGPack).

Version

0.9.44

Steps to Reproduce

Run the following from the Windows Command Prompt:

llamaindex-cli download-llamapack LLMCompilerAgentPack --download-dir ./llm_compiler_agent_pack

Relevant Logs/Tracebacks

Traceback (most recent call last):
  File "<frozen runpy>", line 198, in _run_module_as_main
  File "<frozen runpy>", line 88, in _run_code
  File "C:\Users\repo\anaconda3\envs\repo-library\Scripts\llamaindex-cli.exe\__main__.py", line 7, in <module>
  File "C:\Users\repo\anaconda3\envs\repo-library\Lib\site-packages\llama_index\command_line\command_line.py", line 168, in main
    args.func(args)
  File "C:\Users\repo\anaconda3\envs\repo-library\Lib\site-packages\llama_index\command_line\command_line.py", line 126, in <lambda>
    func=lambda args: handle_download_llama_pack(**vars(args))
                      ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\repo\anaconda3\envs\repo-library\Lib\site-packages\llama_index\command_line\command_line.py", line 27, in handle_download_llama_pack
    download_llama_pack(
  File "C:\Users\repo\anaconda3\envs\repo-library\Lib\site-packages\llama_index\llama_pack\download.py", line 31, in download_llama_pack
    pack_cls = download_llama_module(
               ^^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\repo\anaconda3\envs\repo-library\Lib\site-packages\llama_index\download\module.py", line 229, in download_llama_module
    download_module_and_reqs(
  File "C:\Users\repo\anaconda3\envs\repo-library\Lib\site-packages\llama_index\download\module.py", line 151, in download_module_and_reqs
    f.write(extra_file_raw_content)
  File "C:\Users\repo\anaconda3\envs\repo-library\Lib\encodings\cp1252.py", line 19, in encode
    return codecs.charmap_encode(input,self.errors,encoding_table)[0]
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
UnicodeEncodeError: 'charmap' codec can't encode characters in position 6859-6865: character maps to <undefined>
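
The last two frames suggest the file is being written with the Windows default encoding (cp1252) rather than UTF-8. The sketch below reproduces that failure mode in isolation and shows that passing an explicit `encoding="utf-8"` avoids it. It is an illustration under assumptions, not the library's code: the `open()` call in `module.py` is not visible in the traceback, so the idea that it omits `encoding` is a guess, and `write_text`, `content`, and the file name are hypothetical stand-ins for the real downloaded content.

```python
import os
import tempfile

# Hypothetical stand-in for the downloaded file content. It contains U+2192
# (rightwards arrow), a character that cp1252 has no mapping for.
content = "plan \u2192 execute"


def write_text(path, text, encoding):
    """Write text to path using the given encoding (hypothetical helper)."""
    with open(path, "w", encoding=encoding) as f:
        f.write(text)


with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "extra_file.txt")

    # Forcing cp1252 mimics what open() picks by default on many Windows setups.
    try:
        write_text(path, content, "cp1252")
        cp1252_failed = False
        error_message = ""
    except UnicodeEncodeError as exc:
        cp1252_failed = True
        error_message = str(exc)

    # An explicit UTF-8 encoding can represent any Unicode character.
    write_text(path, content, "utf-8")
    with open(path, encoding="utf-8") as f:
        round_trip = f.read()

print("cp1252 write failed:", cp1252_failed)
print("utf-8 round trip ok:", round_trip == content)
```

If that assumption holds, a possible workaround until the write call is changed is to enable Python's UTF-8 mode before running the command (for example, by setting the `PYTHONUTF8` environment variable to `1`), which makes `open()` default to UTF-8.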