Update README.md
README.md CHANGED
@@ -42,21 +42,21 @@ Various experiments, some decent, some bad. Trained on publicly accessible image
|
|
| 42 |
**Pretty good** but the originals tended to gravitate towards a certain size in the chest area and it can be difficult to change that aspect.
|
| 43 |
|
| 44 |
**digital painting**
|
| 45 |
-
144 * 2048px images, 1024px
|
| 46 |
**Pretty bad** - details look sharp but the style is inconsistent.
|
| 47 |
|
| 48 |
**digital painting v2**
|
| 49 |
-
72 *
|
| 50 |
-
**Decent** - details look slightly worse than v1 but the style is much more consistent. I'd probably use even fewer images
|
| 51 |
|
| 52 |
**game art**
|
| 53 |
270 * 1024px images extracted from a certain cute and funny game, 1024px buckets, very long Qwen3-VL-32B captions.
|
| 54 |
**Very bad** - way too many images, looks like it averaged the style instead of learning it. Definitely fewer images for v2 and maybe shorter captions.
|
| 55 |
|
| 56 |
**ink art**
|
| 57 |
-
82 * 2048px images, 1024px
|
| 58 |
**Bad** - very inconsistent at applying the style, seems to work better at higher resolutions. Same recommendations as above.
|
| 59 |
|
| 60 |
**oil painting**
|
| 61 |
-
60 * 2048px images, 1024px
|
| 62 |
**Good** - no real complaints. I tried training a v2 with longer captions but it didn't change much. The style seems very easy for the model to grasp.
|
|
|
|
| 42 |
**Pretty good** but the originals tended to gravitate towards a certain size in the chest area and it can be difficult to change that aspect.
|
| 43 |
|
| 44 |
**digital painting**
|
| 45 |
+
144 * 2048px images, 1024px and 1536px buckets. Short captions generated with some old LLM.
|
| 46 |
**Pretty bad** - details look sharp but the style is inconsistent.
|
| 47 |
|
| 48 |
**digital painting v2**
|
| 49 |
+
72 * 2048px images, 1024px buckets. Very long captions generated with Qwen3-VL-32B-Instruct.
|
| 50 |
+
**Decent** - details look slightly worse than v1 but the style is much more consistent. I'd probably use even fewer images for v3 but train with 1536px buckets and shorter captions.
|
| 51 |
|
| 52 |
**game art**
|
| 53 |
270 * 1024px images extracted from a certain cute and funny game, 1024px buckets, very long Qwen3-VL-32B captions.
|
| 54 |
**Very bad** - way too many images, looks like it averaged the style instead of learning it. Definitely fewer images for v2 and maybe shorter captions.
|
| 55 |
|
| 56 |
**ink art**
|
| 57 |
+
82 * 2048px images, 1024px and 1536px buckets, very long Qwen3-VL-32B captions.
|
| 58 |
**Bad** - very inconsistent at applying the style, seems to work better at higher resolutions. Same recommendations as above.
|
| 59 |
|
| 60 |
**oil painting**
|
| 61 |
+
60 * 2048px images, 1024px and 1536px buckets. Short captions generated with some old LLM.
|
| 62 |
**Good** - no real complaints. I tried training a v2 with longer captions but it didn't change much. The style seems very easy for the model to grasp.
|