Dataset structure for train_text_to_image_lora.py #12038

@HripsimeS

Hello. I am trying to use the train_text_to_image_lora.py script, following the instructions at https://github.com/huggingface/diffusers/tree/main/examples/text_to_image

I get errors about the data structure and can't tell what is wrong on my side.
I have a folder named data that contains an images folder and a CSV file.

C:/Users/XXX/data/

├── images/
│ ├── image1.jpg
│ ├── image2.jpg
│ └── ...
└── captions.csv

The images folder contains the images, and the CSV file contains two columns (image names and captions):

image, caption
image1.jpg, A dragon flying through fire
image2.jpg, A knight in shining armor
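
For reference, here is a minimal sketch of one layout that may work. It assumes the training script loads a local folder through the `datasets` library's `imagefolder` builder, which reads a `metadata.jsonl` file with a `file_name` column placed next to the images. The helper name `write_metadata` and the `text` caption column are illustrative assumptions, not something taken from the script's documentation. It also assumes captions contain no unquoted commas.

```python
import csv
import json
from pathlib import Path


def write_metadata(data_dir, csv_name="captions.csv", image_subdir="images"):
    """Hypothetical helper: turn captions.csv into images/metadata.jsonl."""
    data_dir = Path(data_dir)
    image_dir = data_dir / image_subdir
    rows = []
    with open(data_dir / csv_name, newline="", encoding="utf-8") as f:
        # skipinitialspace drops the blank after each comma ("image, caption").
        reader = csv.DictReader(f, skipinitialspace=True)
        for row in reader:
            name = row["image"].strip()
            caption = row["caption"].strip()
            # Fail early if the CSV names a file that is not in the image folder.
            if not (image_dir / name).is_file():
                raise FileNotFoundError(f"{name} is listed in the CSV but missing from {image_dir}")
            # "file_name" is the column the imagefolder builder matches against;
            # "text" is an assumed caption column name.
            rows.append({"file_name": name, "text": caption})
    # One JSON object per line, written next to the images.
    with open(image_dir / "metadata.jsonl", "w", encoding="utf-8") as f:
        for r in rows:
            f.write(json.dumps(r) + "\n")
    return rows
```

If this assumption holds, the images folder would then be passed as `--train_data_dir`, with `--caption_column text` matching the column name written above.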

Could you please let me know how I should organize my dataset so that I can run the training?
