swarmauri / swarmauri-sdk

a modular multimodal framework for ai applications
https://swarmauri.com
Apache License 2.0
73 stars 42 forks source link

[Feature Research]: Llava-next -34B #299

Closed abdulsamodazeez closed 1 month ago

abdulsamodazeez commented 2 months ago

Feature Name

Llava-next -34B

Feature Description

Research about Llava-next -34B

Research Findings

LLaVA-NeXT-34B

LLaVA-NeXT-34B is a model in the LLaVA-NeXT series, which enhances the capabilities of Large Multimodal Models (LMMs). Designed for a variety of scenarios, including multi-image, multi-frame (video), multi-view (3D), and single-image tasks, it boasts several advanced features.

Key Features

Features Supported

Vision

Text

Speech

Multimodal Capabilities

Resources

Potential Impact

LLaVA-NeXT-34B has the potential to revolutionize various domains due to its advanced multimodal capabilities. Here are some areas where it could make a substantial difference:

Healthcare

Education

Customer Service

Content Creation

Robotics and Automation

Accessibility

Research and Development

The versatility and advanced features of LLaVA-NeXT-34B can lead to significant advancements in these areas, improving efficiency, accessibility, and overall user experience.

Additional Resources (optional)

No response

Feature Priority

High

cobycloud commented 2 months ago

we support liuhaotian/llava-13b and liuhaotian/llava-yi-34b, fireworks/firellava-13b

we do not support a llava-next-34b yet