-
### Proposal
Add support for TracrBench transformers
### Motivation
@JeremyAlain and I recently wrote a paper in which we introduced a dataset of 121 tracr-transformers. Tracr transformers a…
-
In my PR to JabRef, we implemented RAG manually:
1. It's local-first
2. No need for special setup
3. No need for external application
4. It is a fully implemented RAG architecture (though, may be not…
-
Torchtune is a great project that explains such a complex fine-tuning process in such an elegant way.
I think having a simple benchmark against other popular LLM fine-tuning approaches is valu…
-
**Problem**
Please consider adding Core ML model package format support to utilize the Apple Silicon Neural Engine + GPU.
**Success Criteria**
Utilize both ANE & GPU, not just GPU on Apple Silico…
-
When exporting dynamic-shape llama2, in the attached [piece of the partitioned model](https://github.com/user-attachments/files/16568296/partitioned-model_piece-2.txt), there are symbol manipulations suc…
-
Google Gemini Pro 1.5 (especially the new experimental version) is one of the top models on the LLM leaderboard, showcasing its exceptional capabilities and potential.
The model perfor…
-
Hey guys,
I appreciated reading your paper. However, I just wanted to see your eval results for myself, and running the script results in the following error when I execute ```python evaluate_hotpot_q…
-
### Summary
LLMs are a hot topic, and there are more and more frameworks to make LLM execution faster. WasmEdge already integrated the [llama.cpp](https://github.com/ggerganov/llama.cpp) as one of…
hydai updated
9 months ago
-
![IMG_0303](https://github.com/user-attachments/assets/872bbc7c-db96-4fbd-990f-ca49ab9c6ef1)
**Project Abstract**
This project delivers a user-friendly network traffic analysis tool that empow…
-
Dear authors,
I have some questions about the paper content:
(1) What are MedVicuna and RadVicuna in Table 1? I cannot find them in the paper or on the Internet;
(2) According to Figure 1, it s…