comet-ml / opik

Open-source end-to-end LLM Development Platform
Apache License 2.0
2.2k stars 135 forks source link

[FR]: Enable Opik to display additional media formats, including audio, PDF, and video. #567

Open pleomax0730 opened 2 weeks ago

pleomax0730 commented 2 weeks ago

Proposal summary

Feature Request

Enable Opik to display additional media formats, including audio, PDF, and video.

Background

Opik currently supports only image display, which limits its flexibility for monitoring, testing, and evaluating multimodal LLM applications that may involve other data formats. Expanding support for audio, PDF, and video would allow users to fully leverage Opik’s capabilities across a broader range of use cases.

Proposed Use Cases:

Benefits:

Summary

Adding support for audio, PDF, and video display will make Opik a more versatile platform, suitable for a wide range of LLM applications beyond text and images. This enhancement will empower users to develop, evaluate, and monitor their applications seamlessly across all media types.

Motivation

Many existing solutions for LLM evaluation and monitoring are limited to text and image formats, with little to no support for other media types like audio, PDFs, or video. This lack of multimodal support forces teams to use multiple tools or rely on custom workarounds, creating friction in their workflows and hindering a comprehensive evaluation process.

By introducing audio, PDF, and video support, Opik could become the first open-source platform to offer complete multimodal monitoring and evaluation capabilities. This would make Opik highly attractive to teams working on complex applications that require seamless integration of various data formats, such as multimedia retrieval, interactive voice systems, and document processing pipelines.

Competitive Advantage:

Leading the way with these features would position Opik as a go-to solution for multimodal LLM applications, setting it apart from other evaluation and monitoring tools. This could significantly increase Opik’s user base by attracting organizations and researchers who need comprehensive, media-agnostic monitoring for their LLM projects.

jverre commented 2 weeks ago

Hi @pleomax0730 Thanks for the detailed request ! I really like this idea, we could introduce the concept of an attachment for a trace or span that allows you to log any additional data. Depending on the data type, we could introduce some ways to view the data in the UI.

Is there a specific type of data you would like to us to support first ? We have gotten requests for better PDF support so could be a good candidate to start with

pleomax0730 commented 2 weeks ago

@jverre Thanks for your reply to my feature request! In terms of the current community, I think the best order might be PDF, then audio, and finally video.

jverre commented 2 weeks ago

Makes sense, I'll keep this ticket open. It's quite a big feature so we might not get to it straightaway