gokayfem / ComfyUI_VLM_nodes

Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
Apache License 2.0
382 stars 31 forks source link

Get Keywords bug #74

Open 807502278 opened 5 months ago

807502278 commented 5 months ago

consistently generate repetitive prompt words:

Output Model: Analysis Description: Represents entries about an analysis. Output Fields: main_character (list of str): Description: Description of the main objects of the analysis artform (list of str): Description: List of Artforms of the analysis photo_type (list of str): Description: List of Types of the photo used in the analysis color_with_objects (list of str): Description: List of objects and their colors of the analysis digital_artform (list of str): Description: List of Digital artforms of the analysis background (list of str): Description: List of Background of the analysis lighting (list of str): Description: List of Lighting settings of the analysis.<|im_end|> <|im_start|>user The image features a large room with a round news desk at its center. Around the desk, there are multiple television screens displaying various news items. A total of seven TVs can be seen, some placed higher and others lower on the walls.

In addition to the TVs, there is a clock hanging prominently above the news desk, likely for time reference during newscasts. There are also three people present in the room, possibly anchors, reporters, or staff members. One person is located near the left side of the desk, another person is closer to the right side, and the third person is situated between them.<|im_end|> <|im_start|>assistantLlama.generate: prefix-match hit

2024-04-15 18:31:33.265128 { 2024-04-15 18:31:33.303200 "main_character": [ 2024-04-15 18:31:33.457650 "anchors", 2024-04-15 18:31:33.575321 "reporters", 2024-04-15 18:31:33.714463 "staff members", 2024-04-15 18:31:33.851172 "artform" 2024-04-15 18:31:33.987056 , 2024-04-15 18:31:34.044436 "digital artform", 2024-04-15 18:31:34.204979 "photo_type", 2024-04-15 18:31:34.365375 "color_with_objects", 2024-04-15 18:31:34.559231 "lighting", 2024-04-15 18:31:34.692425 "background", 2024-04-15 18:31:34.802942 "analysis" 2024-04-15 18:31:34.919105 , 2024-04-15 18:31:34.976868 "image features" 2024-04-15 18:31:35.100664 , 2024-04-15 18:31:35.161095 "room" 2024-04-15 18:31:35.267457 , 2024-04-15 18:31:35.331190 "desk", 2024-04-15 18:31:35.473443 "tvs", 2024-04-15 18:31:35.606691 "clock", 2024-04-15 18:31:35.716958 "people" 2024-04-15 18:31:35.827036 , 2024-04-15 18:31:35.890752 "analysis" 2024-04-15 18:31:36.013149 , 2024-04-15 18:31:36.078554 "image features" 2024-04-15 18:31:36.213116 , 2024-04-15 18:31:36.277273 "room" 2024-04-15 18:31:36.386780 , 2024-04-15 18:31:36.465637 "desk", 2024-04-15 18:31:36.605609 "tvs", 2024-04-15 18:31:36.742462 "clock", 2024-04-15 18:31:36.855544 "people" 2024-04-15 18:31:36.982089 , 2024-04-15 18:31:37.044732 "analysis" 2024-04-15 18:31:37.159626 , 2024-04-15 18:31:37.226000 "image features" 2024-04-15 18:31:37.360219 , 2024-04-15 18:31:37.424708 "room" 2024-04-15 18:31:37.541965 , 2024-04-15 18:31:37.606752 "desk", 2024-04-15 18:31:37.742806 "tvs", 2024-04-15 18:31:37.878617 "clock", 2024-04-15 18:31:37.989823 "people" 2024-04-15 18:31:38.118167 , 2024-04-15 18:31:38.184827 "analysis" 2024-04-15 18:31:38.298161 , 2024-04-15 18:31:38.363939 "image features" 2024-04-15 18:31:38.498918 , 2024-04-15 18:31:38.581974 "room" 2024-04-15 18:31:38.694329 , 2024-04-15 18:31:38.762286 "desk", 2024-04-15 18:31:38.897694 "tvs", 2024-04-15 18:31:39.031190 "clock", 2024-04-15 18:31:39.158660 "people" 2024-04-15 18:31:39.273200 , 2024-04-15 18:31:39.339450 "analysis" 2024-04-15 18:31:39.453478 , 2024-04-15 18:31:39.520263 "image features" 2024-04-15 18:31:39.668803 , 2024-04-15 18:31:39.734674 "room" 2024-04-15 18:31:39.848697 , 2024-04-15 18:31:39.915852 "desk", 2024-04-15 18:31:40.053225 "tvs", 2024-04-15 18:31:40.202175 "clock", 2024-04-15 18:31:40.316517 "people" 2024-04-15 18:31:40.430814 , 2024-04-15 18:31:40.496635 "analysis" 2024-04-15 18:31:40.610507 , 2024-04-15 18:31:40.676111 "image features" 2024-04-15 18:31:40.826118 , 2024-04-15 18:31:40.894840 "room" 2024-04-15 18:31:41.016678 , 2024-04-15 18:31:41.082351 "desk", 2024-04-15 18:31:41.218605 "tvs", 2024-04-15 18:31:41.361152 "clock", 2024-04-15 18:31:41.474544 "people" 2024-04-15 18:31:41.589392 , 2024-04-15 18:31:41.657500 "analysis" 2024-04-15 18:31:41.781417 , 2024-04-15 18:31:41.849050 "image features" 2024-04-15 18:31:41.993225 , 2024-04-15 18:31:42.060375 "room" 2024-04-15 18:31:42.177306 , 2024-04-15 18:31:42.244486 "desk", 2024-04-15 18:31:42.390916 "tvs", 2024-04-15 18:31:42.531261 "clock", 2024-04-15 18:31:42.645462 "people" 2024-04-15 18:31:42.761553 , 2024-04-15 18:31:42.845879 "analysis" 2024-04-15 18:31:42.959771 , 2024-04-15 18:31:43.026580 "image features" 2024-04-15 18:31:43.169524 , 2024-04-15 18:31:43.238133 "room" 2024-04-15 18:31:43.353762 , 2024-04-15 18:31:43.428755 "desk", 2024-04-15 18:31:43.574780 "tvs", 2024-04-15 18:31:43.715942 "clock", 2024-04-15 18:31:43.831681 "people" 2024-04-15 18:31:43.949530 , 2024-04-15 18:31:44.016630 "analysis" 2024-04-15 18:31:44.134030 , 2024-04-15 18:31:44.202369 "image features" 2024-04-15 18:31:44.341285 , 2024-04-15 18:31:44.407523 "room" 2024-04-15 18:31:44.537399 , 2024-04-15 18:31:44.606050 "desk", 2024-04-15 18:31:44.749099 "tvs", 2024-04-15 18:31:44.892176 "clock", 2024-04-15 18:31:45.023651 "people" 2024-04-15 18:31:45.142036 , 2024-04-15 18:31:45.212807 "analysis" 2024-04-15 18:31:45.329278 , 2024-04-15 18:31:45.397552 "image features" 2024-04-15 18:31:45.539993 , 2024-04-15 18:31:45.608259 "room" 2024-04-15 18:31:45.725290 , 2024-04-15 18:31:45.792047 "desk", 2024-04-15 18:31:45.933051 "tvs", 2024-04-15 18:31:46.095922 "clock", 2024-04-15 18:31:46.212102 "people" 2024-04-15 18:31:46.324569 , 2024-04-15 18:31:46.388775 "analysis" 2024-04-15 18:31:46.499294 , 2024-04-15 18:31:46.578324 "image features" 2024-04-15 18:31:46.719749 , 2024-04-15 18:31:46.790859 "room" 2024-04-15 18:31:46.909119 , 2024-04-15 18:31:46.978038 "desk", 2024-04-15 18:31:47.137039 "tvs", 2024-04-15 18:31:47.278027 "clock", 2024-04-15 18:31:47.398354 "people" 2024-04-15 18:31:47.511525 , 2024-04-15 18:31:47.574912 "analysis" 2024-04-15 18:31:47.692474 , 2024-04-15 18:31:47.759242 "image features" 2024-04-15 18:31:47.900012 , 2024-04-15 18:31:47.968781 "room" 2024-04-15 18:31:48.086766 , 2024-04-15 18:31:48.173679 "desk", 2024-04-15 18:31:48.318240 "tvs", 2024-04-15 18:31:48.465233 "clock", 2024-04-15 18:31:48.584936 "people" 2024-04-15 18:31:48.710267 , 2024-04-15 18:31:48.780356 "analysis" 2024-04-15 18:31:48.900935 , 2024-04-15 18:31:48.972195 "image features" 2024-04-15 18:31:49.119150 , 2024-04-15 18:31:49.189979 "room" 2024-04-15 18:31:49.312361 , 2024-04-15 18:31:49.383706 "desk", 2024-04-15 18:31:49.529818 "tvs", 2024-04-15 18:31:49.677399 "clock", 2024-04-15 18:31:49.814975 "people" 2024-04-15 18:31:49.939324 , 2024-04-15 18:31:50.013115 "analysis" 2024-04-15 18:31:50.132472 , 2024-04-15 18:31:50.202386 "image features" 2024-04-15 18:31:50.355348 , 2024-04-15 18:31:50.425269 "room" 2024-04-15 18:31:50.547352 , 2024-04-15 18:31:50.619154 "desk", 2024-04-15 18:31:50.766593 "tvs", 2024-04-15 18:31:50.912947 "clock", 2024-04-15 18:31:51.034890 "people" 2024-04-15 18:31:51.156894 , 2024-04-15 18:31:51.229298 "analysis" 2024-04-15 18:31:51.367105 , 2024-04-15 18:31:51.438196 "image features" 2024-04-15 18:31:51.590013 , 2024-04-15 18:31:51.662001 "room" 2024-04-15 18:31:51.784339 , 2024-04-15 18:31:51.854073 "desk", 2024-04-15 18:31:52.005261 "tvs", 2024-04-15 18:31:52.149676 "clock", 2024-04-15 18:31:52.270133 "people" 2024-04-15 18:31:52.392112 , 2024-04-15 18:31:52.484386 "analysis" 2024-04-15 18:31:52.610757 , 2024-04-15 18:31:52.683965 "image features" 2024-04-15 18:31:52.836487 , 2024-04-15 18:31:52.912075 "room" 2024-04-15 18:31:53.038677 , 2024-04-15 18:31:53.112798 "desk", 2024-04-15 18:31:53.264232 "tvs", 2024-04-15 18:31:53.415371 "clock", 2024-04-15 18:31:53.563378 "people" 2024-04-15 18:31:53.691030 , 2024-04-15 18:31:53.765591 "analysis" 2024-04-15 18:31:53.889717 , 2024-04-15 18:31:53.962596 "image features" 2024-04-15 18:31:54.126987 , 2024-04-15 18:31:54.201364 "room" 2024-04-15 18:31:54.328464 , 2024-04-15 18:31:54.403220 "desk", 2024-04-15 18:31:54.553668 "tvs", 2024-04-15 18:31:54.710219 "clock", 2024-04-15 18:31:54.837153 "people" 2024-04-15 18:31:54.964075 , 2024-04-15 18:31:55.036777 "analysis" 2024-04-15 18:31:55.162835 , 2024-04-15 18:31:55.234777 "image features" 2024-04-15 18:31:55.385104 , 2024-04-15 18:31:55.459660 "room" 2024-04-15 18:31:55.582359 , 2024-04-15 18:31:55.665530 "desk", 2024-04-15 18:31:55.815113 "tvs", 2024-04-15 18:31:55.974462 "clock", 2024-04-15 18:31:56.096790 "people" 2024-04-15 18:31:56.222769 , 2024-04-15 18:31:56.297794 "analysis" 2024-04-15 18:31:56.424673 , 2024-04-15 18:31:56.496533 "image features" 2024-04-15 18:31:56.644702 , 2024-04-15 18:31:56.727489 "room" 2024-04-15 18:31:56.851415 , 2024-04-15 18:31:56.923423 "desk", 2024-04-15 18:31:57.083083 "tvs", 2024-04-15 18:31:57.242780 "clock", 2024-04-15 18:31:57.369668 "people" 2024-04-15 18:31:57.495875 , 2024-04-15 18:31:57.569931 "analysis" 2024-04-15 18:31:57.694924 , 2024-04-15 18:31:57.770628 "image features" 2024-04-15 18:31:57.922185 , 2024-04-15 18:31:57.996375 "room" 2024-04-15 18:31:58.119884 , 2024-04-15 18:31:58.192573 "desk", 2024-04-15 18:31:58.351298 "tvs", 2024-04-15 18:31:58.504407 "clock", 2024-04-15 18:31:58.632002 "people" 2024-04-15 18:31:58.758870 , 2024-04-15 18:31:58.846581 "analysis" 2024-04-15 18:31:58.976038 , 2024-04-15 18:31:59.051661 "image features" 2024-04-15 18:31:59.207093 , 2024-04-15 18:31:59.282747 "room" 2024-04-15 18:31:59.424751 , 2024-04-15 18:31:59.500946 "desk", 2024-04-15 18:31:59.655094 "tvs", 2024-04-15 18:31:59.812598 "clock", 2024-04-15 18:31:59.953561 "people" 2024-04-15 18:32:00.084301 , 2024-04-15 18:32:00.161492 "analysis" 2024-04-15 18:32:00.290444 , 2024-04-15 18:32:00.367563 "image features" 2024-04-15 18:32:00.536950 , 2024-04-15 18:32:00.612357 "room" 2024-04-15 18:32:00.743332 , 2024-04-15 18:32:00.819735 "desk", 2024-04-15 18:32:00.984734 "tvs", 2024-04-15 18:32:01.138504 "clock", 2024-04-15 18:32:01.267203 "people" 2024-04-15 18:32:01.397409 , 2024-04-15 18:32:01.471380 "analysis" 2024-04-15 18:32:01.617081 , 2024-04-15 18:32:01.694365 "image features" 2024-04-15 18:32:01.851777 , 2024-04-15 18:32:01.925724 "room" 2024-04-15 18:32:02.063951 , 2024-04-15 18:32:02.140475 "desk", 2024-04-15 18:32:02.297451 "tvs", 2024-04-15 18:32:02.457738 "clock", 2024-04-15 18:32:02.602534 "people" 2024-04-15 18:32:02.730414 , 2024-04-15 18:32:02.805055 "analysis" 2024-04-15 18:32:02.931286 , 2024-04-15 18:32:03.004109 "image features" 2024-04-15 18:32:03.169809 , 2024-04-15 18:32:03.245883 "room" 2024-04-15 18:32:03.379364 , 2024-04-15 18:32:03.458167 "desk", 2024-04-15 18:32:03.613707 "tvs", 2024-04-15 18:32:03.783399 "clock", 2024-04-15 18:32:03.910739 "people" 2024-04-15 18:32:04.037982 , 2024-04-15 18:32:04.111149 "analysis" 2024-04-15 18:32:04.246778 , 2024-04-15 18:32:04.322762 "image features" 2024-04-15 18:32:04.480715 , 2024-04-15 18:32:04.558014 "room" 2024-04-15 18:32:04.688538 , 2024-04-15 18:32:04.774259 "desk", 2024-04-15 18:32:04.934834 "tvs", 2024-04-15 18:32:05.095191 "clock", 2024-04-15 18:32:05.244696 "people" 2024-04-15 18:32:05.374315 , 2024-04-15 18:32:05.451615 "analysis" 2024-04-15 18:32:05.586033 , 2024-04-15 18:32:05.664223 "image features" 2024-04-15 18:32:05.831247 , 2024-04-15 18:32:05.908003 "room" 2024-04-15 18:32:06.041198 , 2024-04-15 18:32:06.118027 "desk", 2024-04-15 18:32:06.271462 "tvs", 2024-04-15 18:32:06.441928 "clock", 2024-04-15 18:32:06.581326 "people" 2024-04-15 18:32:06.714194 , 2024-04-15 18:32:06.790401 "analysis" 2024-04-15 18:32:06.928002 , 2024-04-15 18:32:07.004238 "image features" 2024-04-15 18:32:07.162574 , 2024-04-15 18:32:07.240648 "room" 2024-04-15 18:32:07.378256 , 2024-04-15 18:32:07.452874 "desk", 2024-04-15 18:32:07.612124 "tvs", 2024-04-15 18:32:07.770781 "clock", 2024-04-15 18:32:07.916399 "people" 2024-04-15 18:32:08.046919 , 2024-04-15 18:32:08.126607 "analysis" 2024-04-15 18:32:08.262187 , 2024-04-15 18:32:08.341204 "image features" 2024-04-15 18:32:08.522671 , 2024-04-15 18:32:08.595382 "room" 2024-04-15 18:32:08.725197 , 2024-04-15 18:32:08.801579 "desk", 2024-04-15 18:32:08.973932 "tvs", 2024-04-15 18:32:09.136800 "clock", 2024-04-15 18:32:09.278714 "people" 2024-04-15 18:32:09.411326 , 2024-04-15 18:32:09.508930 "analysis" 2024-04-15 18:32:09.643261 , 2024-04-15 18:32:09.724054 "image features" 2024-04-15 18:32:09.888202 , 2024-04-15 18:32:09.966982 "room" 2024-04-15 18:32:10.116235 , 2024-04-15 18:32:10.193864 "desk", 2024-04-15 18:32:10.358211 "tvs", 2024-04-15 18:32:10.518111 "clock", 2024-04-15 18:32:10.654396 "people" 2024-04-15 18:32:10.787800 , 2024-04-15 18:32:10.866721 "analysis" 2024-04-15 18:32:11.009994 , 2024-04-15 18:32:11.100342 "image features" 2024-04-15 18:32:11.261507 , 2024-04-15 18:32:11.342771 "room" 2024-04-15 18:32:11.478209 , 2024-04-15 18:32:11.555945 "desk", 2024-04-15 18:32:11.735736 "tvs", 2024-04-15 18:32:11.900343 "clock", 2024-04-15 18:32:12.045865 "people" 2024-04-15 18:32:12.194862 , 2024-04-15 18:32:12.275682 "analysis" 2024-04-15 18:32:12.413921 , 2024-04-15 18:32:12.493232 "image features" 2024-04-15 18:32:12.657300 , 2024-04-15 18:32:12.752684 "room" 2024-04-15 18:32:12.890795 , 2024-04-15 18:32:12.973205 "desk", 2024-04-15 18:32:13.137593 "tvs", 2024-04-15 18:32:13.317995 "clock", 2024-04-15 18:32:13.454379 "people" 2024-04-15 18:32:13.603317 , 2024-04-15 18:32:13.682514 "analysis" 2024-04-15 18:32:13.833219 , 2024-04-15 18:32:13.916146 "image features" 2024-04-15 18:32:14.083885 , 2024-04-15 18:32:14.163674 "room" 2024-04-15 18:32:14.319174 , 2024-04-15 18:32:14.398787 "desk", 2024-04-15 18:32:14.600022 "tvs", 2024-04-15 18:32:14.788315 "clock", 2024-04-15 18:32:14.961443 "people" 2024-04-15 18:32:15.116361 , 2024-04-15 18:32:15.208522 "analysis" 2024-04-15 18:32:15.366586 llama_print_timings: load time = 2106.67 ms llama_print_timings: sample time = 8106.38 ms / 1681 runs ( 4.82 ms per token, 207.37 tokens per second) llama_print_timings: prompt eval time = 0.00 ms / 1 tokens ( 0.00 ms per token, inf tokens per second) llama_print_timings: eval time = 29274.86 ms / 1681 runs ( 17.42 ms per token, 57.42 tokens per second) llama_print_timings: total time = 42262.14 ms

gokayfem commented 5 months ago

what is the llm or vlm you are using ?

807502278 commented 5 months ago

The model in the example:ggml-model-g4.gguf