iPentec.com / Document / カテゴリ: 画像生成 / タグ: 画像生成 LoRA Anima sd-scripts

Anima用のLoRAを作成する

事前準備

LoRA学習をするためのプログラム、sd-scriptsをインストールします。インストール手順はこちらの記事を参照してください。

学習データの準備

LoRAで学習させる画像データを準備します。今回は1枚だけの画像で学習してみます。

学習データの配置

学習させたい画像データと画像を説明するテキストファイルを配置します。

Anima用のLoRAを作成する:画像1

img.png 今回学習する画像 - Anima用のLoRAを作成する:画像2 — img.png 今回学習する画像

img.txt

girl, face, face close up

dataset.tomlの準備

学習データを配置したディレクトリの一つ上のディレクトリに、dataset.tomlファイルを作成します。
ファイルの内容は以下となります。

dataset.toml (書式)

[general]
enable_bucket = true 

[[datasets]]
resolution = 1024
batch_size = 12

[[datasets.subsets]]
image_dir = '(学習データの配置パス)'
caption_extension = '.txt'
num_repeats = (学習回数)

dataset.toml (今回のデモの例)

[general]
enable_bucket = true 

[[datasets]]
resolution = 1024
batch_size = 12

[[datasets.subsets]]
image_dir = 'D:\data\lora-anima-small-eye-single\image'
caption_extension = '.txt'
num_repeats = 192

config.tomlの準備

学習データを配置したディレクトリの一つ上のディレクトリに、config.tomlファイルを作成します。
ファイルの内容は以下となります。

config.toml (書式)

[model_arguments]
pretrained_model_name_or_path = "(事前学習済みモデルデータのパス)"
qwen3="(Qwen3のモデルデータのパス)"
vae="(VAEのモデルデータのパス)"

[dataset_arguments]
dataset_config = "(dataset.toml のパス)"

[cache_arguments]
cache_latents = false
cache_text_encoder_outputs = false

[network_arguments]
network_module="networks.lora_anima"
network_dim=8 

[optimizer_arguments]
optimizer_type = "AdamW8bit"
learning_rate=1e-4
lr_scheduler="constant"

[flow_matching_arguments]
timestep_sampling="sigmoid"
discrete_flow_shift=1.0

[training_arguments]
max_train_epochs=10
mixed_precision= "bf16"
gradient_checkpointing = true

[saving_arguments]
output_dir = "(LoRAの保存先パス)"
output_name = "(LoRAの名前)"
save_model_as = "safetensors"
save_every_n_epochs = 1

[vae_arguments]
vae_chunk_size=64
vae_disable_cache=true

[sample_arguments]
sample_every_n_epochs = 2
sample_prompts = "(サンプル画像生成のプロンプトのテキストファイルのパス)"

config.toml (今回のデモの例)

[model_arguments]
pretrained_model_name_or_path = "D:\\data\\model\\anima-base-v1.0.safetensors"
qwen3="D:\\data\\model\\qwen_3_06b_base.safetensors"
vae="D:\\data\\model\\qwen_image_vae.safetensors"

[dataset_arguments]
dataset_config = "D:\\data\\lora-anima-small-eye-single\\dataset.toml"

[cache_arguments]
cache_latents = false
cache_text_encoder_outputs = false

[network_arguments]
network_module="networks.lora_anima"
network_dim=8 

[optimizer_arguments]
optimizer_type = "AdamW8bit"
learning_rate=1e-4
lr_scheduler="constant"

[flow_matching_arguments]
timestep_sampling="sigmoid"
discrete_flow_shift=1.0

[training_arguments]
max_train_epochs=10
mixed_precision= "bf16"
gradient_checkpointing = true

[saving_arguments]
output_dir = "D:\\data\\lora-anima-small-eye-single\\output"
output_name = "eye-small"
save_model_as = "safetensors"
save_every_n_epochs = 1

[vae_arguments]
vae_chunk_size=64
vae_disable_cache=true

[sample_arguments]
sample_every_n_epochs = 2
sample_prompts = "D:\\data\\lora-anima-small-eye-single\\prompts.txt"

サンプル画像作成プロンプト

サンプル画像を作成する際に利用するプロンプトを記述したテキストファイルを配置します。

prompts.txt

girl, face, face close up --w 1024 --h 1024 --d 10000
girl, face, face close up --w 1024 --h 1024 --d 20000

起動コマンドの準備

sd-scriptsを実行するバッチファイルを作成します。

exec.bat

accelerate launch --num_cpu_threads_per_process 1 anima_train_network.py --config_file=(config.toml のパス)

exec.bat (今回のデモの例)

accelerate launch --num_cpu_threads_per_process 1 anima_train_network.py --config_file=D:\data\lora-anima-small-eye-single\config.toml

ファイル配置

ファイル配置は下図の状態となります。

Anima用のLoRAを作成する:画像3

学習の実行

コマンドプロンプトを表示して、または直接作成したexec.batを実行します。徐々に学習画像に近づいている様子が確認できます。

学習処理が始まり、完了しました。
Anima用のLoRAを作成する:画像4

出力ディレクトリのsampleディレクトリに出力された画像を確認します。
Anima用のLoRAを作成する:画像5

出力ディレクトリにLoRAのファイルが保存されています。
Anima用のLoRAを作成する:画像6

LoRAモデルのインストール・読み込み

今回はComfyUIで利用します。

以下のディレクトリに作成したLoRAのsafetensorsファイルを配置します。

(ComfyUIの配置ディレクトリ)\ComfyUI\models\loras

以下のワークフローを作成します。基本はこちらの記事で紹介しているAnimaのノードと同様です。

Anima用のLoRAを作成する:画像8

[LoRAを読み込む]ノードのlora_nameに作成したLoRAのファイルを指定します。
Anima用のLoRAを作成する:画像9

プロンプトは以下を利用します。

Prompt

Prompt: 1girl, blue shirt, collared shirt, necktie, masterpiece, best_quality, score_9, score_8
Negative prompt: worst quality, low quality,score_1, score_2, score_3, blurry, jpeg artifacts, lowres

[実行する]ボタンをクリックして画像を生成します。[画像を保存]ノードに生成された画像のプレビューが表示できました。
Anima用のLoRAを作成する:画像10