-
If I understand correctly the idea should be that model generate belief states, dbsearch results, action and response conditioned on some dialog context. Then shouldn't we mask the context in between …
-
I get this error when running '!python agent.py --config config.yaml'
/content/muzic/musicagent
2023-12-04 10:03:54.786370: I tensorflow/core/platform/cpu_feature_guard.cc:193] This TensorFlow bin…
-
- [ ] [HongyeJ on X: "Despite the mixed feelings about Google's latest Gemma model, we're big fans! @GoogleAI Why? Coz we found it pairs incredibly well with our SelfExtend 🤣🤣🤣 - like, perfectly! With…
-
Most of the people do not have access to 8XA100 40GB systems. But a single M1 Max laptop with 64 GB memory could host the training. How difficult is it to port this code to "MPS" ?
-
### Description
I tried 2 transformer models on HF, both of which didn't work.
- We should provide a list of models that can run out of box for people to try out.
- We also need to add a warning th…
-
Testing the inference numbers
-
This is a pinned issue directed to the [Model Request Tracking Board](https://github.com/orgs/mlc-ai/projects/2).
- To submit a model request, create a [Model Request issue](https://github.com/mlc-…
-
This has been a topic of some discussion in #4 and on the Discord, so I figured I'd document our initial findings so far.
We would like to switch away from `ggml` at some point so that we can remov…
-
Hey,
I've been playing around with the nanoGPT repo for a while now. I'm familiar with the basics of neural networks and LLMs but there's still a lot I've yet to learn.
I've been using nanoGPT as a…
-
I test a gpt2 model using trt8.6.1;
the gpt2 onnx model is from https://github.com/NVIDIA/TensorRT/tree/release/8.6/demo/HuggingFace/GPT2
build cmd is **trtexec --onnx=temp/GPT2/gpt2/GPT2-gpt2/dec…