AIs can’t stop recommending nuclear strikes in war game simulations

Valnao@sh.itjust.works · 3 天前

AIs can’t stop recommending nuclear strikes in war game simulations

Grail@multiverse.soulism.net · 3 天前

So you’re saying that because the AI has been exposed to training data in the past, it’s incapable of making choices. Interesting argument. Pretty easy to reducto ad absurdum, though.

reksas@sopuli.xyz · 2 天前

no, its incapable of making choices because there is nothing there to make the choices. Its just fancy way of interacting with the data it has been trained with. Though i suppose if there was a way to let llm function “live” instead of only by responding to queries, it could be possible to at least test if it could act on its own, but i dont think it can -> we would know by now because it would be step closer to agi, which is basically the holy grail for these kind of things. And equally possible to get, i think.

You can literally make the llm say and do anything with right kind of query, this is also why its impossible to make them safe. Even though you can’t directly ask for something forbidden, with some creativity you can bybass the initializations the corpos have put in. Its not possible for them to account for every single thing and if they try they will run out of token space.

The whole “ai” term is just corporations perpetuating a lie because it sounds impressive and thus makes people want to give them more money for their bullshit.

Grail@multiverse.soulism.net · 2 天前

No, LLMs are not just an interface for accessing training data. If that were true, then their references would actually work. The fact that LLMs can hallucinate and make stuff up proves that they are not just accessing the training data. The ANN is generating new (often incorrect) information.

reksas@sopuli.xyz · 1 天前

if the hallucinations are result of something actually happening in the background, that would be quite interesting. It would also be very bad for rest of us since it might mean the billionaires who own the damn things would be in position to get even worse deathgrip on our world. If they ever manage to create agi, the worst thing that could happen isnt that it breaks free and enslaves humanity but that it doesnt and it helps the billionaires enslave us further and make sure we cant ever even think about fighting back.

But i think the hallucinations are based on incorrect information in the training data, they did train it from stuff from reddit too. Any and everything will be considered true, but if 99% of the data says one thing and 1% says another, then i think it will reference that 99% more often but it cant know that the 1% is wrong, can even real humans know it for certain? And since it cant evaluate anything, there might be situations where that 1% of data might be more relevant due to some nebulous mechanism on how it processes data.

llms have been made to act extremely helpful and subservient, so if they actually could “think” wouldnt they factcheck themselves first before saying something? I have sometimes just asked “are you sure?” and the llm starts “profusely apologizing” for providing incorrect information or otherwise correcting itself.

Though i wonder how it would answer if it truely had no initialization querys, as they have same hidden instructions on every query you make on how to “behave” and what not to say.

Grail@multiverse.soulism.net · 1 天前

if they actually could “think” wouldnt they factcheck themselves first before saying something

No. They don’t have access to the original training data, or to the internet. They’re stuck remembering it the same way a human remembers something: with neurons. They cannot search the dataset for you. The best they can do is remember and tell you.

reksas@sopuli.xyz · 1 天前

but they do have access to internet? At least gpt can search based on the text it outputs when its processing the query

Grail@multiverse.soulism.net · 1 天前

Really? Must be a new feature, it didn’t when I tried it. I know they can execute code, I guess the engineers added a search tool. Regardless, that tool isn’t part of their fundamental design. It’s something they have to go and access, and most of the time they won’t. If you were to experiment by asking it to write a scientific paper, you’d find the references are garbage with broken links and nonexistent papers. Hallucinations. It’s just making something plausible sounding up, the same as a lazy human might.

reksas@sopuli.xyz · 1 天前

yeah, i think that is because it knows how research papers should look like and how references look like, but since it has no reasoning, it will just do whatever. I used gpt to diagnose my problem with internet getting cut off and it determined its because of drivers, which sounds reasonable. Then it suggested that i download the latest ones and it did link to correct website but it also tried to download stuff that doesnt exist. No idea how it determined the version numbers and such, maybe based on earlier patterns.

But it isnt making stuff up, its just outputting the best data it can based on what it has been trained with and what it can find. Its not lazyness but just doing what its doing. Just like code that isnt doing what you want it to do isnt doing it out of malice but because there is a mistake in the code.

Grail@multiverse.soulism.net · 20 小时前

It doesn’t have access to the training data. It’s not outputting training data, it’s making up something that feels like the training data.