TheMafia on scored.co
10 hours ago 8 points (+0 / -0 / +8Score on mirror ) 1 child
It's literally just a prompt.

It's inserted before your prompt.

If they let you use it without that it would just regurgitate the collective "wisdom" of the internet back at you. They literally have to tell it to "sound smart" in their "conditioning" prompt.

If you saw what it actually does under the hood you would see what a retarded toy this shit is.
el_hoovy on scored.co
9 hours ago 8 points (+0 / -0 / +8Score on mirror ) 3 children
nah it ain't literally JUST the prompt. it's two other things, back in the training data:

1. they are not scraping killallniggers.com for their data, they're scraping Reddit leftist subs. Reddit is a huge huge source of training data, probably Facebook and whatnot too, and that's basically pre-curated leftist training data with how aggressively jewish websites ban dissent.
2. big companies are trying a lot of unique training methods, including something that's basically inbreeding where they have one instance of an LLM judge the training output of another. image generators also do this by having LLMs caption images. at that point you are severely reinforcing what's already there and basically overfitting to what the judging LLM decides - which indeed is probably affected by a prompt to be gay and retarded, but also the training data mentioned before.

it's part of why you can often just *tell* something was written by an AI. it's one big habsburg family. image generators duck around this by having an immense gooner community that uses training data from all over the place and constantly intermix their models. lol, "genetic diversity" might be bullshit for humans, but not generative AI.
Maskurbator on scored.co
5 hours ago 2 points (+0 / -0 / +2Score on mirror )
Bingo. Grok does it too. In fact, you can see the sources it references for its "thinking". These things are not operating on logic. They'll fall into the same exact circular reasoning as any off-the-street leftist or redditor. There's no logic flow or process modeling going on. If there is, it takes a back seat.
TheMafia on scored.co
7 hours ago 1 point (+0 / -0 / +1Score on mirror ) 1 child
> they are not scraping killallniggers.com for their data

They're scraping everything they can get. They have a use for this data even if they're not presenting it to you.

> big companies are trying a lot of unique training methods

They're publishing a lot of white papers. It's not clear what they're actually doing.

> and constantly intermix their models

That's just another name for multiple generation and then hand stitching the pieces together.

> lol, "genetic diversity" might be bullshit for humans, but not generative AI.

You can't "intermix" models. Even if you could the problem is overfitting. Which is why they generate them separately and then hand edit it together.
el_hoovy on scored.co
6 hours ago 2 points (+0 / -0 / +2Score on mirror ) 1 child
> You can't "intermix" models.

absolutely can, and at least for image generation it's downright common, civit.ai is chock full of merges for people trying to generate their five millionth anime breast. likely not how LLMs are done, though.
TheMafia on scored.co
6 hours ago 0 points (+0 / -0 ) 1 child
> absolutely can

Ok. How?
el_hoovy on scored.co
6 hours ago 0 points (+0 / -0 ) 1 child
i don't merge models, i have no idea. [but here's a plebbit thread about it](https://www.reddit.com/r/StableDiffusion/comments/13dmeoq/checkpoint_merge/). i'm not sure why the idea is so contentious, i'm just describing tech...
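
from what i understand of threads like that one, the simplest kind of checkpoint merge is just a weighted average of the matching weights from two models with the same architecture. rough sketch below, not how any particular tool actually does it: `merge_checkpoints` and `alpha` are names i made up, and plain Python lists stand in for real tensors.

```python
def merge_checkpoints(state_a, state_b, alpha=0.5):
    """Blend two checkpoints by linear interpolation of their weights.

    Hypothetical helper for illustration only. Each checkpoint is a dict
    mapping a parameter name to a flat list of floats (a stand-in for a
    tensor). alpha=0.0 returns model A's weights, alpha=1.0 returns model B's.
    """
    if state_a.keys() != state_b.keys():
        raise ValueError("checkpoints must have the same parameter names")

    merged = {}
    for name, weights_a in state_a.items():
        weights_b = state_b[name]
        if len(weights_a) != len(weights_b):
            raise ValueError(f"size mismatch for parameter {name!r}")
        # Element-wise weighted average of the two parameter vectors.
        merged[name] = [
            (1.0 - alpha) * a + alpha * b
            for a, b in zip(weights_a, weights_b)
        ]
    return merged


model_a = {"layer.weight": [0.0, 2.0]}
model_b = {"layer.weight": [4.0, 6.0]}
print(merge_checkpoints(model_a, model_b, alpha=0.25))
```

this only makes sense when both checkpoints share the same parameter names and shapes, which is presumably why merges are usually between fine-tunes of the same base model.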
BeefyBelisarius on scored.co
21 minutes ago 0 points (+0 / -0 )
And since reddit is mostly chatbots these days, you kinda listed the same point twice.