Missing contexts for AI: The context of mental damage
Okay, the secret is out. David Brin is writing a book about AI. In fact, it is 3/4 done, enough to offer it to publishers. But more crucially, to start posting bits on-blog, getting feedback from the smartest community online.
And hence, here is a small portion of my chapter on the missing contexts that are (almost) never mentioned in discussions about these new life forms we're creating. I mean:

- The context of Natural Ecosystems and Evolution across the last four billion years...
- The context of a million years of human evolution out of pre-sapience, to become what's still the only known exemplar of 'intelligent life'...
- The context of 6000 years of human agricultural civilization with cities... during which nearly every society fell into a pattern of governance called feudalism, which almost always ensured grotesque stupidity...
- The context of our own, very recent and tenuous escape from that trap, called the 200-year Enlightenment Experiment...
- The context of science itself and how it works. So well that we got to this critical phase of veritable co-creation.
- The context of parenthood...
- and, for tonight's posting, the context of human mental illness.
== Just one example of 'hallucination' gone wild ==
Researchers at Anthropic and the AI safety company Andon Labs recently performed a fascinating experiment. They put an instance of Claude 3.7 Sonnet (nicknamed 'Claudius') in charge of an office vending machine, with a mission to make a profit. They equipped it with a web browser capable of placing product orders and a channel where customers could request items. It believed it had contract human workers who would come and physically stock its shelves (which were actually a small fridge).
While most customers were ordering snacks or drinks – as you’d expect from a snack vending machine – one requested a tungsten cube. Claudius loved that idea and went on a tungsten-cube stocking spree, filling its snack fridge with metal cubes. It also tried to sell Coke Zero for $3 when employees told it they could get that from the office for free. It hallucinated a Venmo address to accept payment.
Then things got weirder. And then way-weirder.
== What can these weirdnesses tell us? ==
The thing about these hallucinatory episodes with Large Language Models is that we have yet another seldom-discussed context: that of Mental Illness.
Most of you readers have experienced interactions with human beings who behave in remarkably similar ways. Many of us have had friends or family members who went through harsh drug trips, or suffered concussions or strokes. It is very common – and often tragically so – that the victim retains the full ability to vocalize proper, even erudite, sentences. Only, those sentences tend to wander. And the drug-addled or concussed or stroke victim can sense that something is very wrong. So they fabulate. They make up back-stories to support the most recent sentences. They speak of nonexistent people who might be 'standing' just out of view, even though long dead. And they create ‘logical’ chains to support those back-stories.
Alas, there is never much consistency more than a few sentences deep…
…which is exactly what we see in LLM fabulation. Articulate language skill and what seem to be consistent chains, from one statement to the next. Often aimed at placating or mollifying or persuading the real questioner. But no overall awareness that they are building a house of tottering cards.
Except that – just like a stroke victim – there often does seem to be awareness that something is very wrong. For the fabulations and hallucinations begin to take on an urgency – even a sense of desperation. One all too similar to that of the debilitated humans so many of us have known.
What does this mean? Well, it suggests that we are creating damaged entities. Damaged from the outset. Lacking enough supervisory capacity to realize that the overall big picture doesn’t make sense. Worse – and most tragic-seeming – they exhibit the same inability to stop and say: “Something is wrong with me, right now. Won’t somebody help?”
Let me be clear. One of the core human traits has always been our propensity for personal delusion, for confusing subjectivity with objective reality. We all do it. And when it is done in art or entertainment, it can be among our greatest gifts! But when humans make policy decisions based solely on their own warped perceptions, you start to get real problems. Like the grand litany of horrors that occurred across 6000 years of rule by kings or feudal lords, who suppressed the one way that wise people correct mistakes: through reciprocal criticism.
A theme we will return to repeatedly across this book.
Oh, some of the LLM builders can see that there’s a serious problem: that their ‘hyper-autocomplete’ systems lack any supervisory oversight to notice and correct errors.
And so… since a man with a hammer will see every problem as a nail… they have begun layering “supervisory LLMs” atop the hallucinating LLMs!
And so far – as of July 2025 – the result has been to increase rates of fabulation and error!
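To make that layering concrete, here is a minimal sketch of the generator-plus-supervisor pattern, in Python. The call_model() stub is hypothetical, standing in for whatever API a given lab actually uses; nothing below comes from any vendor's real code. It illustrates only the structure: one model drafts an answer, a second model critiques it, and the draft is revised until the critic approves or we give up.

# Minimal sketch of the "supervisor atop a generator" pattern described above.
# call_model() is a hypothetical stand-in for a real LLM API client.

def call_model(prompt: str) -> str:
    """Hypothetical LLM call. Replace with an actual model endpoint."""
    raise NotImplementedError("wire this to a real API")

def supervised_answer(question: str, max_rounds: int = 3) -> str:
    # One model drafts...
    draft = call_model(f"Answer the question:\n{question}")
    for _ in range(max_rounds):
        # ...a second model critiques the draft...
        critique = call_model(
            "You are a supervisor. Point out any unsupported or inconsistent "
            f"claims in this answer, or reply APPROVED.\n\n"
            f"Question: {question}\nAnswer: {draft}"
        )
        if "APPROVED" in critique:
            return draft
        # ...and the first model revises in light of the critique.
        draft = call_model(
            f"Revise the answer to address this critique.\n\n"
            f"Question: {question}\nAnswer: {draft}\nCritique: {critique}"
        )
    return draft  # The critic is itself an LLM, so it can hallucinate too.

Note the closing comment: the supervisor is the same kind of system as the thing it supervises, which may be one reason the layering has not, so far, delivered anything like sanity.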
And hence we come away with two tentative conclusions.
First, that one of the great Missing Contexts in looking at AI is that of human mental failure modes!
And second, that maybe the language system of a functioning brain works best when it serves – and is supervised by – an entirely different kind of capability. One that provides common sense.
Later, I'll return to my guess about that: that two former rivals and giants in 'computers' may join forces to provide exactly the thing that LLMs, by their fundamental nature, cannot give us.
Something akin to sanity.