Simple and cool.
Florence 2 image captioning sounds interesting to use.
Do people know of any other image-to-text models (apart from CLIP) ?
Wow, yeah, I found a demo here: https://huggingface.co/spaces/Qwen/Qwen2.5
A whole host of LLMs seems to have been released. Thanks for the tip!
I’ll see if I can turn them into something useful 👍
That’s good to know. I’ll try them out. Thanks.
Hmm. I mean, the FLUX model looks good,
so maybe there's some magic with the T5?
I have no clue, so any insights are welcome.
T5 Huggingface: https://huggingface.co/docs/transformers/model_doc/t5
T5 paper : https://arxiv.org/pdf/1910.10683
Any suggestions on what LLM I ought to use instead of T5?
Good find! Fixed. Much appreciated.
Fair enough
I get it. I hope you don’t interpret this as arguing against results etc.
What I want to say is:
If implemented correctly, the same seed does give the same output for a given prompt.
If there is variation, then something in the pipeline must be approximating things.
This may be good (for performance), or it may be bad.
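To illustrate the determinism point (a minimal sketch using Python's stdlib as a stand-in, not the actual SD pipeline):

```python
import random

def generate(seed: int, steps: int = 5) -> list[float]:
    # Stand-in for a diffusion sampler: a fixed seed fully
    # determines the pseudo-random draws, so the output repeats.
    rng = random.Random(seed)
    return [rng.random() for _ in range(steps)]

# Same seed -> identical "image"; different seed -> different one.
a = generate(42)
b = generate(42)
c = generate(43)
print(a == b)  # True
print(a == c)  # False
```

If a real pipeline breaks this property, the non-determinism comes from somewhere else (e.g. non-deterministic GPU kernels), not from the seed mechanism itself.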
You are 100% correct in highlighting this issue to the dev.
Though it's not a legal document or a science paper.
Just a guide to explain seeds to newbies.
Omitting non-essential information, for the sake of making the concept clearer, can be good too.
The Perchance dev is correct here, Allo;
the same seed will generate the exact same picture.
If you see variety , it will be due to factors outside the SD model. That stuff happens.
But it’s good that you fact check stuff.
Do you know where I can find documentation on the perchance API?
Specifically createPerchanceTree ?
I need to know which functions there are, and what inputs/outputs they take.
Thanks! I appreciate the support. Helps a lot to know where to start looking ( ; v ;)b!
New stuff
Paper: https://arxiv.org/abs/2303.03032
Takes only a few seconds to calculate.
Most similar suffix tokens: "vfx "
Most similar prefix tokens: "imperi-"
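For anyone curious, nearest-token lookups like this usually boil down to cosine similarity over the embedding matrix. A minimal sketch with made-up toy vectors (the function name and the 3-d "embeddings" are illustrative, not from the paper):

```python
import numpy as np

def most_similar(query: np.ndarray, embeddings: np.ndarray, k: int = 1) -> np.ndarray:
    # Cosine similarity between one query vector and every row of
    # the embedding matrix; returns the indices of the top-k rows.
    q = query / np.linalg.norm(query)
    e = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    sims = e @ q
    return np.argsort(-sims)[:k]

# Tiny toy example: row 1 points almost the same way as row 0.
emb = np.array([[1.0, 0.0, 0.0],
                [0.9, 0.1, 0.0],
                [0.0, 1.0, 0.0]])
print(most_similar(emb[0], emb, k=2))  # [0 1]
```

With real token embeddings this is the same computation, just over the model's full vocabulary, which is why it only takes seconds.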
I compute casualty_rate = number_shot / (number_shot + number_subdued),
which in this case is 22/64 ≈ 34% casualty rate for civilians
and 98/131 ≈ 75% casualty rate for police.
So it's 64 vs. 131 between work done by bystanders vs. work done by police?
And casualty rate is actually lower for bystanders doing the work (with their guns) than the police?
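The arithmetic above as a quick sanity check (numbers taken from the comment; the subdued counts are just the totals minus the shot counts):

```python
def casualty_rate(number_shot: int, number_subdued: int) -> float:
    # Fraction of incidents where someone was shot, out of all
    # incidents (shot + subdued), per the formula in the comment.
    return number_shot / (number_shot + number_subdued)

# Civilians: 22 shot out of 64 total incidents.
print(round(casualty_rate(22, 64 - 22) * 100))   # 34
# Police: 98 shot out of 131 total incidents.
print(round(casualty_rate(98, 131 - 98) * 100))  # 75
```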
I can’t speculate.
If you feel up for the task, I'd suggest running prompts that use Euler a at 20 steps for a given seed with that model, and seeing if the results match images on the perchance site.
If they do, then we know the furry model = Pony Diffusion.
(Though IIRC the furry model on perchance existed before Pony Diffusion. )
Aha. So what you wanted to say was that "Starlight" and/or "Glimmer" are trigger words for the furry model. Gotcha!
Those are both the furry model tho?
From what I know it is possible to bypass the keyword trigger by writing something like _anime or _1girl
I appreciate that you took the time to write a sincere question.
Kinda rude for people to downvote you.