Nandaka / PixivUtil2

Download images from Pixiv and more!
http://nandaka.devnull.zone/
BSD 2-Clause "Simplified" License
2.39k stars 254 forks source link

[Feature Request] AI Image Filtering #1189

Open YukihoAA opened 2 years ago

YukihoAA commented 2 years ago

Prerequisites

Description

[Description of the bug or feature]

I'd like to suggest adding filtering function for NovelAI (AI generated Images) just like R-18 Filtering.

  1. AI Image filter on/off
  2. Generate Folder name string just like %R-18%/
Nandaka commented 2 years ago

You can do this from the website settings. image

photonometric commented 1 year ago

I guess the wording "display fewer" implies that it's a soft option which reduces the AI results, rather than eliminating them entirely (like R-18/R-18G modes).

Anyone who really wants to not see anything AI, should probably just employ software blacklisting (useBlacklistTags/blacklist_tags.txt) and add at least the most popular general tags...rather than wonder how many soft-blacklisted images some pixiv algorithm is letting through.

YukihoAA commented 1 year ago

I can remove AI Photo by Blacklist -> Yes

I'd like to have some features that can categorize AI images by tags or something

Nandaka commented 1 year ago

There is a flag in the json details for each individual images and search by tags. I think value = 1 (non-AI), and value = 2 (AI Generated)?

image image

photonometric commented 1 year ago

Here's some notes about the AI tags, and the "display fewer" toggle, which may be helpful for others confused about what it does. The Pixiv Help Center page for this setting is not helpful at all. From what I have found on some reddit threads, it does seem to aim for blocking all AI pictures, but uses the wording "fewer" as qualifier because it probably won't eliminate 100% of such pictures?

I've also read on reddit that it only blocks AI submissions on "main" pages (searches, recommended works, probably home page)—meaning that places like users' bookmarks will not be filtered. So this may be important to people who download bookmarks a lot

For those making blacklists, or maybe programming such a list into the software somehow, here's the most popular AI-related tags (with current tag population):

#NovelAI (187.1k) #AI (90.7k) WARNING see below #NovelAIDiffusion (53.4k) #AIイラスト (32.1k) #StableDiffusion (29.3k) #AI生成 (9.1k)

Note that the tag #AI can also return other unrelated tags which contain these letters...things like #CLIPSTUDIOPAINT, #SAI龍, #haikyuufanart, kaito, etc. It produces problems when trying to use boolean searches with a tag and another tag that is its subset (eg clipstudiopaint -ai), so in some functions pixiv obviously sees these as a single tag, even though #AI is a perfectly valid tag people can enter. I guess it should just be avoided, as it contains potentially thousands of false positives.

Then there is the general AI-generated "toggle" tag, at the beginning of the list on an image page, where R-18 and "Original" usually goe. But R-18 also functions as a normal tag (i.e. can be used in the search bar etc), albeit one with 6.7 million results! But typing AI-generated into the search only yields 1.2k results, which all contain the english tag #AI-generated. Images are sorted into this category if the artist selects the option when uploading, but since this option took a while to appear, I suspect(?) that the big tags above have been auto-sorted into this category, at least to some extent. I'm not sure if this is ongoing, or if marking as AI depends entirely upon the user upload option.

Nandaka commented 1 year ago

use blacklist tag feature then?

image