modelscope / data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
Apache License 2.0
2.72k stars 168 forks source link

Feedback on image_face_ratio_filter and Suggestion for a New image_face_counter_filter Operator #446

Open TobyJasper opened 5 days ago

TobyJasper commented 5 days ago

Search before continuing 先搜索,再继续

Description 描述

Hi there,

I'd like to provide some feedback on the image_face_ratio_filter operator and also propose a new feature.

  1. image_face_ratio_filter Performance: I've been using this operator for image filtering, but I've noticed that the dlib-based face detection doesn't perform as well as expected, especially in terms of detection accuracy. Additionally, installing the CUDA-supported version of dlib is quite cumbersome on some systems, which could be a barrier for users who want to leverage GPU acceleration. Would you consider using a different model, such as MTCNN or a similar alternative, which might offer better performance and easier installation?

  2. Suggestion for image_face_counter_filter: I'd also like to suggest adding a new operator, image_face_counter_filter, which would allow users to filter images based on the number of faces detected. This would be especially useful for tasks where only images with a specific number of faces (e.g., exactly one face, multiple faces) are needed. What are your thoughts on including this in the project? If needed, I would be happy to submit a PR to implement this feature.

Use case 使用场景

No response

Additional 额外信息

No response

Are you willing to submit a PR for this feature? 您是否乐意为此功能提交一个 PR?

drcege commented 3 days ago

Thank you for your feedback.

  1. We are aware of the challenges involved in installing dlib, so we have switched the face detection implementation to OpenCV, completely eliminating the dependency on dlib. Both implementations aim to provide ease of use for quick setup; however, the accuracy may not reach state-of-the-art levels. We plan to support a series of state-of-the-art face detection models through the ModelScope interface, including the mentioned MTCNN, as well as RetinaFace, MogFace, and others, which we expect to implement in the coming weeks.

  2. Your suggestion regarding the image_face_counter_filter is excellent. Please feel free to submit an implementation PR, and we can work together to enhance it.

TobyJasper commented 1 day ago

My apologies. I haven’t updated the code in almost a month and didn’t notice that dlib was removed.

I’ve submitted the PR for the new operator and tried to keep it in line with image_face_ratio_filter so it’s easier to update related features in the future.