Open AmericanPresidentJimmyCarter opened 3 days ago
It crashes later on too at:
# Check for empty strings
if (df[caption_column] == "").sum() > 0 and not fallback_caption_column:
raise ValueError(
f"Parquet file {parquet_path} contains empty strings in the '{caption_column}' column."
)
if (df[filename_column] == "").sum() > 0:
raise ValueError(
f"Parquet file {parquet_path} contains empty strings in the '{filename_column}' column."
)
Needs also in prompts.py
if type(image_caption) == bytes:
image_caption = image_caption.decode("utf-8")
if type(image_caption) == str:
image_caption = image_caption.strip()
+ if type(image_caption) in (list, tuple, numpy.ndarray, pd.Series):
+ image_caption = [str(item).strip() for item in image_caption if item is not None]
if prepend_instance_prompt:
if type(image_caption) == list:
image_caption = [instance_prompt + " " + x for x in image_caption]
else:
image_caption = instance_prompt + " " + image_caption
return image_caption
Caption is checked for existence, but if it's a list it causes a crash.
You should