Open rezzie-rich opened 5 months ago
Hmm, it'll be supported once we add automatic support for all models!
https://huggingface.co/papers/2404.08801
Is there any ongoing plan to convert models with the Megalodon architecture?
This may not be very useful for a standalone LLM interface, but for AI agents it could be the biggest breakthrough, since they have separate memory management.
Currently not, sorry :(
Similar to Mistral-fying an LLM, it would be great if we could get Megalodonian models based on Meta's Megalodon.
https://github.com/XuezheMax/megalodon
It is said to be weak on recall. However, it should be a great fit for agent frameworks, since agents tend to work better with larger context windows (in this case, unlimited), and most of them integrate a short- and long-term memory system to help with recall.