Allow loading a model using relative path

This makes Wllama correctly load the model from relative paths.

Tested with locally loaded Felladrin/gguf-sharded-Qwen1.5-0.5B-Chat and tinyllamas/stories15M-q4_0.gguf.
Fixes #63

Tests

The following 4 ways of loading a model were tested:

wllama.loadModelFromUrl("models/Qwen1.5-0.5B-Chat.Q4_k_m.shard-00001-of-00003.gguf")

wllama.loadModelFromUrl("/models/Qwen1.5-0.5B-Chat.Q4_k_m.shard-00001-of-00003.gguf")

wllama.loadModelFromUrl("./models/Qwen1.5-0.5B-Chat.Q4_k_m.shard-00001-of-00003.gguf")

wllama.loadModelFromUrl("https://huggingface.co/Felladrin/gguf-sharded-Qwen1.5-0.5B-Chat/resolve/main/Qwen1.5-0.5B-Chat.Q4_k_m.shard-00001-of-00003.gguf")

It works for both non-sharded and sharded models. For example, this non-sharded model was also tested:

wllama.loadModelFromUrl("models/stories15M-q4_0.gguf")

wllama.loadModelFromUrl("/models/stories15M-q4_0.gguf")

wllama.loadModelFromUrl("./models/stories15M-q4_0.gguf")

wllama.loadModelFromUrl("https://huggingface.co/ggml-org/models/resolve/main/tinyllamas/stories15M-q4_0.gguf")

Screenshots

About typings

The type RequestInfo | URL comes from the argument from the browser's fetch().

ngxson / wllama

Allow loading a model using relative path #64

Tests

Screenshots

About typings