jimmycarter commited on
Commit
8d16cf5
1 Parent(s): dd47519

Model card auto-generated by SimpleTuner

Browse files
Files changed (1) hide show
  1. README.md +339 -0
README.md ADDED
@@ -0,0 +1,339 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ base_model: "black-forest-labs/FLUX.1-dev"
4
+ tags:
5
+ - flux
6
+ - flux-diffusers
7
+ - text-to-image
8
+ - diffusers
9
+ - simpletuner
10
+ - not-for-all-audiences
11
+ - lora
12
+ - template:sd-lora
13
+ - lycoris
14
+ inference: true
15
+ widget:
16
+ - text: 'unconditional (blank prompt)'
17
+ parameters:
18
+ negative_prompt: 'blurry, cropped, ugly'
19
+ output:
20
+ url: ./assets/image_0_0.png
21
+ - text: 'In this scene from the animated series "Helluva Boss," Loona, the wolf-like receptionist of the Immediate Murder Professionals (I.M.P), is depicted leaning against a wall outside the office. She is casually engrossed in her phone, displaying her typical aloof and detached demeanor. Loona''s appearance includes her usual whitish fur, light grey hair, black-tipped ears, and red eyes, complemented by her punk-inspired attire featuring a black choker with spikes, a dark grey top, fingerless wrist-length black gloves, and black shorts.'
22
+ parameters:
23
+ negative_prompt: 'blurry, cropped, ugly'
24
+ output:
25
+ url: ./assets/image_1_0.png
26
+ - text: 'Loona shrugs with an exasperated expression, her red eyes wide and frustrated, as she seemingly questions or challenges something said in the I.M.P office. Still from Helluva boss. Loona''s appearance includes her usual whitish fur, light grey hair, black-tipped ears, and red eyes, complemented by her punk-inspired attire featuring a black choker with spikes, a dark grey top, fingerless wrist-length black gloves, and black shorts.'
27
+ parameters:
28
+ negative_prompt: 'blurry, cropped, ugly'
29
+ output:
30
+ url: ./assets/image_2_0.png
31
+ - text: 'A scene from the animated series "Helluva Boss," set in the office. Loona, the wolf-like receptionist with white fur, black-tipped ears, and red eyes, is seated on a couch, facing towards the viewer. Loona''s appearance is complemented by her punk-inspired attire featuring a black choker with spikes, a dark grey top, fingerless wrist-length black gloves, and black shorts. She holds a piece of paper that says,"Welcome to Losercity, jerks". In the background, the office has a striped wall pattern and visible damage on the ceiling, indicating a chaotic or rough environment. On the right side of the image, two imp characters appear to be engaged in conversation.'
32
+ parameters:
33
+ negative_prompt: 'blurry, cropped, ugly'
34
+ output:
35
+ url: ./assets/image_3_0.png
36
+ - text: 'Loona from Helluva Boss is dressed in an oversized taco costume, looking visibly irritated and embarrassed. Her red eyes convey her annoyance as she crosses her arms and glares to the side. Loona''s appearance includes her usual whitish fur, light grey hair, black-tipped ears, and red eyes'
37
+ parameters:
38
+ negative_prompt: 'blurry, cropped, ugly'
39
+ output:
40
+ url: ./assets/image_4_0.png
41
+ - text: 'Loona is standing next to Blitzo (Helluva boss)'
42
+ parameters:
43
+ negative_prompt: 'blurry, cropped, ugly'
44
+ output:
45
+ url: ./assets/image_5_0.png
46
+ - text: 'In this "Helluva Boss" scene, Loona, the wolf-like receptionist, stands in an elevator with a tense and irritated expression, her teeth bared in a snarl. Blitzø, the red demon with distinctive black and white horns, leans close and makes an adorable look, as if asking for a favor. The ornate elevator setting hints at a tense moment, possibly involving a challenging mission or conflict within the I.M.P team.'
47
+ parameters:
48
+ negative_prompt: 'blurry, cropped, ugly'
49
+ output:
50
+ url: ./assets/image_6_0.png
51
+ - text: 'a 2D simple drawing of a madeleine cake, with a green cloud drawn next to it'
52
+ parameters:
53
+ negative_prompt: 'blurry, cropped, ugly'
54
+ output:
55
+ url: ./assets/image_7_0.png
56
+ - text: 'a 3D captivating YouTube thumbnail depicting of a full detailed,it''s on a party real people like, on front there is a giant pulling a nose of a black African real like lady down to size of elephant nose,be creative and unique'
57
+ parameters:
58
+ negative_prompt: 'blurry, cropped, ugly'
59
+ output:
60
+ url: ./assets/image_8_0.png
61
+ - text: 'Whiskers the cat. Whiskers becomes a mentor to other animals.Impressed by Whiskers'' intelligence, other animals in the neighborhood seek his guidance. Whiskers sets up a virtual learning platform using AI technology, where animals can ask questions, receive personalized lessons, and acquire knowledge in various subjects. Whiskers becomes a mentor, helping others unlock their potential.'
62
+ parameters:
63
+ negative_prompt: 'blurry, cropped, ugly'
64
+ output:
65
+ url: ./assets/image_9_0.png
66
+ - text: 'As the stock market fluctuates, the investor remains calm and collected at their desk, surrounded by charts and graphs. Their tailored suit and polished briefcase are a symbol of their expertise and experience in the world of finance. '
67
+ parameters:
68
+ negative_prompt: 'blurry, cropped, ugly'
69
+ output:
70
+ url: ./assets/image_10_0.png
71
+ - text: 'loona from helluva boss is eating a donut'
72
+ parameters:
73
+ negative_prompt: 'blurry, cropped, ugly'
74
+ output:
75
+ url: ./assets/image_11_0.png
76
+ ---
77
+
78
+ # flux-training-losercity-next-lycoris12
79
+
80
+ This is a LyCORIS adapter derived from [black-forest-labs/FLUX.1-dev](https://huggingface.co/black-forest-labs/FLUX.1-dev).
81
+
82
+
83
+ The main validation prompt used during training was:
84
+
85
+
86
+
87
+ ```
88
+ loona from helluva boss is eating a donut
89
+ ```
90
+
91
+ ## Validation settings
92
+ - CFG: `3.5`
93
+ - CFG Rescale: `0.0`
94
+ - Steps: `15`
95
+ - Sampler: `None`
96
+ - Seed: `42`
97
+ - Resolution: `1024`
98
+
99
+ Note: The validation settings are not necessarily the same as the [training settings](#training-settings).
100
+
101
+ You can find some example images in the following gallery:
102
+
103
+
104
+ <Gallery />
105
+
106
+ The text encoder **was not** trained.
107
+ You may reuse the base model text encoder for inference.
108
+
109
+
110
+ ## Training settings
111
+
112
+ - Training epochs: 0
113
+ - Training steps: 100
114
+ - Learning rate: 4e-05
115
+ - Effective batch size: 16
116
+ - Micro-batch size: 1
117
+ - Gradient accumulation steps: 16
118
+ - Number of GPUs: 1
119
+ - Prediction type: flow-matching
120
+ - Rescaled betas zero SNR: False
121
+ - Optimizer: adamw_bf16
122
+ - Precision: Pure BF16
123
+ - Quantised: Yes: fp8-quanto
124
+ - Xformers: Not used
125
+ - LyCORIS Config:
126
+ ```json
127
+ {
128
+ "algo": "lokr",
129
+ "multiplier": 1.0,
130
+ "linear_dim": 1000000,
131
+ "linear_alpha": 1,
132
+ "factor": 10,
133
+ "full_matrix": true,
134
+ "apply_preset": {
135
+ "target_module": [
136
+ "FluxTransformerBlock",
137
+ "FluxSingleTransformerBlock"
138
+ ],
139
+ "name_algo_map": {
140
+ "transformer_blocks.[0-7]*": {
141
+ "algo": "lokr",
142
+ "factor": 4,
143
+ "linear_dim": 1000000,
144
+ "linear_alpha": 1,
145
+ "full_matrix": true
146
+ },
147
+ "transformer_blocks.[8-15]*": {
148
+ "algo": "lokr",
149
+ "factor": 6,
150
+ "linear_dim": 1000000,
151
+ "linear_alpha": 1,
152
+ "full_matrix": true
153
+ },
154
+ "transformer_blocks.[16-18]*": {
155
+ "algo": "lokr",
156
+ "factor": 12,
157
+ "linear_dim": 1000000,
158
+ "linear_alpha": 1,
159
+ "full_matrix": true
160
+ },
161
+ "single_transformer_blocks.[0-15]*": {
162
+ "algo": "lokr",
163
+ "factor": 8,
164
+ "linear_dim": 1000000,
165
+ "linear_alpha": 1,
166
+ "full_matrix": true
167
+ },
168
+ "single_transformer_blocks.[16-23]*": {
169
+ "algo": "lokr",
170
+ "factor": 6,
171
+ "linear_dim": 1000000,
172
+ "linear_alpha": 1,
173
+ "full_matrix": true
174
+ },
175
+ "single_transformer_blocks.[24-37]*": {
176
+ "algo": "lokr",
177
+ "factor": 4,
178
+ "linear_dim": 1000000,
179
+ "linear_alpha": 1,
180
+ "full_matrix": true
181
+ }
182
+ },
183
+ "use_fnmatch": true
184
+ }
185
+ }
186
+ ```
187
+
188
+ ## Datasets
189
+
190
+ ### default_dataset_arb
191
+ - Repeats: 9999
192
+ - Total number of images: 41
193
+ - Total number of aspect buckets: 11
194
+ - Resolution: 1.33 megapixels
195
+ - Cropped: False
196
+ - Crop style: None
197
+ - Crop aspect: None
198
+ ### default_dataset_arb2
199
+ - Repeats: 9999
200
+ - Total number of images: 2565
201
+ - Total number of aspect buckets: 1
202
+ - Resolution: 1.33 megapixels
203
+ - Cropped: False
204
+ - Crop style: None
205
+ - Crop aspect: None
206
+ ### default_dataset_arb3
207
+ - Repeats: 9999
208
+ - Total number of images: 3220
209
+ - Total number of aspect buckets: 23
210
+ - Resolution: 1.33 megapixels
211
+ - Cropped: False
212
+ - Crop style: None
213
+ - Crop aspect: None
214
+ ### default_dataset
215
+ - Repeats: 9999
216
+ - Total number of images: 42
217
+ - Total number of aspect buckets: 1
218
+ - Resolution: 1.048576 megapixels
219
+ - Cropped: True
220
+ - Crop style: center
221
+ - Crop aspect: square
222
+ ### default_dataset_512
223
+ - Repeats: 9999
224
+ - Total number of images: 42
225
+ - Total number of aspect buckets: 1
226
+ - Resolution: 0.262144 megapixels
227
+ - Cropped: True
228
+ - Crop style: center
229
+ - Crop aspect: square
230
+ ### default_dataset_640
231
+ - Repeats: 9999
232
+ - Total number of images: 42
233
+ - Total number of aspect buckets: 1
234
+ - Resolution: 0.4096 megapixels
235
+ - Cropped: True
236
+ - Crop style: center
237
+ - Crop aspect: square
238
+ ### default_dataset_768
239
+ - Repeats: 9999
240
+ - Total number of images: 42
241
+ - Total number of aspect buckets: 1
242
+ - Resolution: 0.589824 megapixels
243
+ - Cropped: True
244
+ - Crop style: center
245
+ - Crop aspect: square
246
+ ### default_dataset_896
247
+ - Repeats: 9999
248
+ - Total number of images: 42
249
+ - Total number of aspect buckets: 1
250
+ - Resolution: 0.802816 megapixels
251
+ - Cropped: True
252
+ - Crop style: center
253
+ - Crop aspect: square
254
+ ### default_dataset_uncaptioned
255
+ - Repeats: 9999
256
+ - Total number of images: 2565
257
+ - Total number of aspect buckets: 1
258
+ - Resolution: 1.048576 megapixels
259
+ - Cropped: True
260
+ - Crop style: center
261
+ - Crop aspect: square
262
+ ### default_dataset_uncaptioned_512
263
+ - Repeats: 9999
264
+ - Total number of images: 2565
265
+ - Total number of aspect buckets: 1
266
+ - Resolution: 0.262144 megapixels
267
+ - Cropped: True
268
+ - Crop style: center
269
+ - Crop aspect: square
270
+ ### default_dataset_art
271
+ - Repeats: 9999
272
+ - Total number of images: 2482
273
+ - Total number of aspect buckets: 1
274
+ - Resolution: 1.048576 megapixels
275
+ - Cropped: True
276
+ - Crop style: center
277
+ - Crop aspect: square
278
+ ### default_dataset_art_512
279
+ - Repeats: 9999
280
+ - Total number of images: 3193
281
+ - Total number of aspect buckets: 1
282
+ - Resolution: 0.262144 megapixels
283
+ - Cropped: True
284
+ - Crop style: center
285
+ - Crop aspect: square
286
+ ### default_dataset_art_640
287
+ - Repeats: 9999
288
+ - Total number of images: 3115
289
+ - Total number of aspect buckets: 1
290
+ - Resolution: 0.4096 megapixels
291
+ - Cropped: True
292
+ - Crop style: random
293
+ - Crop aspect: square
294
+ ### default_dataset_art_768
295
+ - Repeats: 9999
296
+ - Total number of images: 2989
297
+ - Total number of aspect buckets: 1
298
+ - Resolution: 0.589824 megapixels
299
+ - Cropped: True
300
+ - Crop style: random
301
+ - Crop aspect: square
302
+ ### default_dataset_art_896
303
+ - Repeats: 9999
304
+ - Total number of images: 2787
305
+ - Total number of aspect buckets: 1
306
+ - Resolution: 0.802816 megapixels
307
+ - Cropped: True
308
+ - Crop style: random
309
+ - Crop aspect: square
310
+
311
+
312
+ ## Inference
313
+
314
+
315
+ ```python
316
+ import torch
317
+ from diffusers import DiffusionPipeline
318
+ from lycoris import create_lycoris_from_weights
319
+
320
+ model_id = 'black-forest-labs/FLUX.1-dev'
321
+ adapter_id = 'pytorch_lora_weights.safetensors' # you will have to download this manually
322
+ lora_scale = 1.0
323
+ wrapper, _ = create_lycoris_from_weights(lora_scale, adapter_id, pipeline.transformer)
324
+ wrapper.merge_to()
325
+
326
+ prompt = "loona from helluva boss is eating a donut"
327
+
328
+ pipeline.to('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu')
329
+ image = pipeline(
330
+ prompt=prompt,
331
+ num_inference_steps=15,
332
+ generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(1641421826),
333
+ width=1024,
334
+ height=1024,
335
+ guidance_scale=3.5,
336
+ ).images[0]
337
+ image.save("output.png", format="PNG")
338
+ ```
339
+