Open shylee2021 opened 3 weeks ago
Speculative RAG: Enhancing retrieval augmented generation through drafting
https://arxiv.org/abs/2407.08223
Leveraging AI for efficient incident response
LLMs Know More Than What They Say
https://arjunbansal.substack.com/p/llms-know-more-than-what-they-say
What Do Language Models Hear? Probing for Auditory Representations in Language Models https://arxiv.org/abs/2402.16998
PyTorch is dead. Long live JAX. https://neel04.github.io/my-website/blog/pytorch_rant/
LLM Compressor https://github.com/vllm-project/llm-compressor https://neuralmagic.com/blog/llm-compressor-is-here-faster-inference-with-vllm/
Liger-Kernel https://github.com/linkedin/Liger-Kernel
Llama-3.1-s https://homebrew.ltd/blog/can-llama-3-listen https://huggingface.co/homebrewltd/llama3.1-s-instruct-v0.2
Jamba-1.5 https://arxiv.org/pdf/2408.12570 https://huggingface.co/ai21labs/AI21-Jamba-1.5-Large