Skip to content

opinionated changes for working with larger datasets.

Notifications You must be signed in to change notification settings

mel0n/BooruDatasetTagManager

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

55 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BooruDatasetTagManager

A simple tag editor for a dataset created for training hypernetworks, embeddings, lora, etc. You can create a dataset from scratch using only images, or you can use a program to edit a dataset created using automatic tagging (wd14-tagger, stable-diffusion-webui, etc.) The editor is primarily intended for booru-style tagged data, but you can adapt it for other datasets as well.

Using

You need a dataset like the following:

You can also specify a dataset without text files if you want to create tags from scratch. In this case, text files will be created on save.

In the program, select "File->Load folder" and specify the directory with the dataset.

In the left column, tags are edited for the selected image, in the right column, tags are edited for all images of the dataset.

After editing, you will select "File->Save all changes".

You can select multiple images at once in a dataset. This will allow you to easily edit tags for images of the same type.

Through the "file" menu, you can open the settings window to customize the application for yourself. Users who have Google Translate blocked can change the translation service to Chinese.

Tag translation

Before using tag translation, you need to select the translation language and translation service in the settings. From the "view" menu, select "Translate tags" to display columns with translated values. When displaying columns, all tags will be automatically translated into the language you selected. The translation is saved in the "Translations" folder with the name of the selected language. You can manually edit the translation in this file as the translation is taken from this file first. Manual translation is recommended to be marked with the "*" symbol.

Translation file example:

//Translation format: <original>=<translation>
black hair=÷åðíûå âîëîñû
*solo=Ñîëî
1girl=1 äåâóøêà

Currently, the manual translation filter can only be used in tag autocompletion (with the option enabled in the settings). But in the future, it can be used somewhere else.

Tag list for autocomplete

The application supports loading tags from csv files of the format used in "Booru tag autocompletion for A1111". You can also create your own txt files with a list of tags (line by line). But since loading data from these files takes a long time, the program converts them to its own format and loads data from it. Therefore, if you change the list of tags, be prepared to wait quite a long time. All files with tags are located in the "Tags" folder.

Other

Also, you can load loss statistics after training. After pressing the "Interrupt" button, in the console you will see the loss statistics for each image.

Copy all text to file. File should look like this:

Loss statistics for file C:\NAI\stable-diffusion-webui\train\NishinoOut2\00006-0-00003-0-98028336_p0.png
loss:0.045±(0.002)
recent 32 loss:0.055±(0.007)
Loss statistics for file C:\NAI\stable-diffusion-webui\train\NishinoOut2\00014-0-00007-0-98909113_p0.png
loss:0.045±(0.002)
recent 32 loss:0.048±(0.007)
...

In the program, select "File->Load loss from file", and you will see:

You can automatically translate tags into the language you need. Specify the code of the language you need in the setting.json file. In the program select "View->Translate tags".

About

opinionated changes for working with larger datasets.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C# 100.0%