Consume Product - Arete Network

TheMafia on scored.co

2 months ago 9 points (+0 / -0 / ) 1 child

It's literally just a prompt.

It's inserted before your prompt.

If they let you use it without that it would just regurgitate the collective "wisdom" of the internet back at you. They literally have to tell it to "sound smart" in their "conditioning" prompt.

If you saw what it actually does under the hood you would see what a retarded toy this shit is.

Permalink Reply

Mirrored from scored.co

el_hoovy on scored.co

2 months ago 9 points (+0 / -0 / ) 3 children

nah it ain't literally JUST the prompt. it's two other things, back in the training data:

1. they are not scraping killallniggers.com for their data, they're scraping Reddit leftist subs. Reddit is a huge huge source of training data, probably Facebook and whatnot too, and that's basically pre-curated leftist training data with how aggressively jewish websites ban dissent.
2. big companies are trying a lot of unique training methods, including something that's basically inbreeding where they have one instance of an LLM judge the training output of another. image generators also do this by having LLMs caption images. at that point you are severely reinforcing what's already there and basically overfitting to what the judging LLM decides - which indeed is probably affected by a prompt to be gay and retarded, but also the training data mentioned before.

it's part of why you can often just *tell* something was written by an AI. it's one big habsburg family. image generators duck around this by having an immense gooner community that uses training data from all over the place and constantly intermix their models. lol, "genetic diversity" might be bullshit for humans, but not generative AI.

Permalink Reply

Mirrored from scored.co

Maskurbator on scored.co

2 months ago 4 points (+0 / -0 / )

Bingo. Grok does it too. In face you can see the sources it references for its "thinking". These things are not operating on logic. They'll fall into the same exact circular reasoning as any off the street leftist or redditor. There's no logic flow or process modeling going on. If there is it takes a back seat.

Permalink Reply

Mirrored from scored.co

TheMafia on scored.co

2 months ago 1 point (+0 / -0 / ) 1 child

> they are not scraping killallniggers.com for their data

They're scraping everything they can get. They have a use for this data even if they're not presenting it to you.

> big companies are trying a lot of unique training methods

They're publishing a lot of white papers. It's not clear what they're actually doing.

> and constantly intermix their models

That's just another name multiple generation and then hand stitching the pieces together.

> lol, "genetic diversity" might be bullshit for humans, but not generative AI.

You can't "intermix" models. Even if you could the problem is overfitting. Which is why they generate them separately and then hand edit it together.

Permalink Reply

Mirrored from scored.co

el_hoovy on scored.co

2 months ago 2 points (+0 / -0 / ) 1 child

> You can't "intermix" models.

absolutely can, and at least for image generation it's downright common, civit.ai is chock full of merges for people trying to generate their five millionth anime breast. likely not how LLMs are done, though.

Permalink Reply

Mirrored from scored.co

TheMafia on scored.co

2 months ago 0 points (+0 / -0 ) 1 child

> absolutely can

Ok. How?

Permalink Reply

Mirrored from scored.co

el_hoovy on scored.co

2 months ago 0 points (+0 / -0 ) 1 child

i don't merge models, i have no idea. [but here's a plebbit thread about it](https://www.reddit.com/r/StableDiffusion/comments/13dmeoq/checkpoint_merge/). i'm not sure why the idea is so contentious, i'm just describing tech...

Permalink Reply

Mirrored from scored.co

Continue thread...

BeefyBelisarius on scored.co

2 months ago 0 points (+0 / -0 )

And since reddit is mostly chatbots these days, you kinda listed the same point twice.

Permalink Reply

Mirrored from scored.co