Is every new thing not just a combination of existing things? What does out of distribution even mean? What advancement has ever been made without a lead-up of prior work to it? Is there some fundamental thing that prevents AI from recombining ideas and testing theories?
For example, ever since the first GPT 4 I’ve tried to get LLMs to build me a specific type of heart simulation that, to my knowledge, does not exist anywhere on the public internet (otherwise I wouldn’t try to build it myself), and even up to GPT 5.3 it still cannot do it.
But I’ve successfully gotten it to build me a great Poker training app, in a specific form that also didn’t exist, though the ingredients are well represented on the internet.
And I’m not trying to imply AI is inherently incapable, it’s just an empirical (and anecdotal) observation for me. Maybe tomorrow it’ll figure it out. I have no dogmatic ideology on the matter.
> Is every new thing not just combinations of existing things?
If all ideas are recombinations of old ideas, where did the first ideas come from? And wouldn't the complexity of ideas be thus limited to the combined complexity of the "seed" ideas?
I think it's more fair to say that recombining ideas is an efficient way to quickly explore a very complex, hyperdimensional space. In some cases that's enough to land on new, useful ideas, but not always. A) the new, useful idea might be _near_ the area you land on, but not exactly at it. B) there are whole classes of new, useful ideas that cannot be reached by any combination of existing "idea vectors".
Therefore there is still the necessity to explore the space manually, even if you're using these idea vectors to give you starting points to explore from.
All this to say: Every new thing is a combination of existing things + sweat and tears.
The question everyone has is whether current LLMs are capable of that latter component. Historically the answer is _no_, because they had no real capacity to iterate. Without iteration you cannot explore. But now that they can reliably iterate, and to some extent plan their iterations, we are starting to see their first meaningful, fledgling attempts at the "sweat and tears" part of building new ideas.
Well, what exactly an “idea” is might be a little unclear, but I don’t think it’s clear that the complexity of ideas that result from combining previously obtained ideas would be bounded by the complexity of the ideas they are combinations of.
Any countable group is a quotient of a subgroup of the free group on two elements, iirc.
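For anyone wondering why that holds, here is a sketch of the standard argument (reconstructed from memory; the notation below is not from the comment itself). Write F_2 for the free group on two generators and F_inf for the free group on countably many:

    % F_inf embeds into F_2: the elements b^{-n} a b^{n} generate a free subgroup of infinite rank.
    F_\infty = \langle x_1, x_2, \dots \rangle \;\hookrightarrow\; F_2 = \langle a, b \rangle,
        \qquad x_n \mapsto b^{-n} a\, b^{\,n}.
    % Any countable group G = <g_1, g_2, ...> is a quotient of F_inf: send x_n onto g_n.
    F_\infty \twoheadrightarrow G, \qquad x_n \mapsto g_n, \qquad G \cong F_\infty / N, \quad N = \ker.
    % Hence G is (isomorphic to) a quotient of the subgroup H = im(F_inf) <= F_2.
    G \;\cong\; H / N', \qquad H \le F_2.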
There’s also the concept of “semantic primes”. Here is a not-quite-correct oversimplification of the idea: suppose you go through the dictionary and, one word at a time, pick a word whose definition includes only other words that are still in the dictionary, and remove it. You can also rephrase definitions before doing this, as long as that keeps the same meaning. Suppose you do this with the goal of leaving as few words in the dictionary as you can. In the end, you should have a small cluster of a bit over 100 words, in terms of which all the other words you removed can be indirectly defined.
(The idea of semantic primes also says that there is such a minimal set which translates essentially directly* between different natural languages.)
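To make the procedure concrete, here is a toy sketch in Python. The word list and "definitions" are invented purely for illustration, and the rephrasing step described above is omitted; real semantic-prime work operates on meanings, not literal dictionary text.

    # Toy sketch of the dictionary-reduction procedure described above.
    toy_dictionary = {
        "happy": ["feel", "good"],
        "good":  ["want"],
        "want":  [],  # words with empty definitions stand in for primitives
        "feel":  [],
    }

    def reduce_dictionary(dictionary):
        """Repeatedly remove any word whose definition uses only words still present."""
        words = dict(dictionary)
        changed = True
        while changed:
            changed = False
            for word, definition in list(words.items()):
                remaining = set(words) - {word}
                if definition and all(term in remaining for term in definition):
                    del words[word]  # definable (indirectly) from what remains
                    changed = True
        return set(words)

    print(reduce_dictionary(toy_dictionary))  # leaves the small residue {'want', 'feel'}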
I don’t think that says that words for complicated ideas aren’t like, more complicated?
There are in fact ways to directly quantify this, if you are training e.g. a self-supervised anomaly-detection model.
Even with modern models not trained in that manner, looking at e.g. cosine distances of embeddings of "novel" outputs could conceivably provide objective evidence for "out-of-distribution" results. Generally, the embeddings of out-of-distribution outputs will have a large cosine (or even Euclidean) distance from the typical embedding(s). The catch is that most "out-of-distribution" outputs will be nonsense/junk, so searching for weird outputs isn't generally helpful if your goal is useful creativity.
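As a rough sketch of that measurement (assuming you already have an embedding model; the centroid comparison and the name novelty_score are just illustrative choices, not a standard named method):

    import numpy as np

    def cosine_distance(a: np.ndarray, b: np.ndarray) -> float:
        # 1 - cosine similarity; larger means "further apart" in embedding space
        return 1.0 - float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

    def novelty_score(candidate: np.ndarray, corpus_embeddings: np.ndarray) -> float:
        """Distance of a candidate output's embedding from the centroid of 'typical' embeddings.

        A large score suggests out-of-distribution; per the caveat above, it flags
        weird outputs, not necessarily useful ones.
        """
        centroid = corpus_embeddings.mean(axis=0)
        return cosine_distance(candidate, centroid)

    # Usage sketch: corpus_embeddings would come from embedding a reference corpus,
    # candidate from embedding the "novel" output, e.g. with a sentence-transformer.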
I also wonder why, but who cares; it's cool and fun. If someone wants to spend their time doing it, great. It's a lot more valuable than the time you spent writing that disparaging comment.
Gemini scares me, it's the most mentally unstable AI. If we get paperclipped my odds are on Gemini doing it. I imagine Anthropic RLHF being like a spa and Google RLHF being like a torture chamber.
The human propensity to call out as "anthropomorphizing" the attributing of human-like behavior to programs built on a simplified version of brain neural networks, that train on a corpus of nearly everything humans expressed in writing, and that can pass the Turing test with flying colors, scares me.
That's exactly the kind of thing that makes absolute sense to anthropomorphize. We're not talking about Excel here.
Given this (even more LinkedIn-layman) gross generalization, how is the human brain not "Excel with extra steps"? Somehow the presence of chemicals and electrical signals and tissues makes the process not algorithmically reducible?
Yes, very few extra steps, especially compared to what you need to actually simulate/implement a brain, which requires a whole new computing paradigm, one that's not limited to digits and discrete states.
> programs built on a simplified version of brain neural networks
Not even close. "Neural networks" in code are nothing like real neurons in real biology. "Neural networks" is a marketing term. Treating them as "doing the same thing" as real biological neurons is a huge error
>that train on a corpus of nearly everything humans expressed in writing
It's significantly more limited than that.
>and that can pass the Turing test with flying colors, scares me
The "Turing test" doesn't exist. Turing talked about a thought experiment in the very early days of "artificial minds". It is not a real experiment. The "Turing test" as laypeople often refer to it is passed by IRC bots, and I don't even mean Markov-chain-based bots. The actual concept described by Turing is more complicated than just "a human can't tell it's a robot", and has never been respected as an actual "test" because it's so flawed and unrigorous.
>Not even close. "Neural networks" in code are nothing like real neurons in real biology
Hence the "simplified". The weights encoding learning, the interconnectedness, the nonlinear activations, and the distributed representation of knowledge are already an approximation, even if the human architecture is different and more elaborate.
Whether the omitted parts are essential or not is debatable. "Equations of motion are nothing like real planets" either, but they capture enough to predict and model their motion.
>The "Turing test" doesn't exist. Turing talked about a thought experiment in the very early days of "artificial minds". It is not a real experiment.
It is not a single real experimental protocol, but it is a well-enough-defined experimental scenario which, for over half a century, was kept as the benchmark for recognizing artificial intelligence, not by laymen (lol) but by major figures in AI research as well; figures like Minsky, McCarthy, and others engaged with it.
The claim that researchers haven't done Turing-test studies (taking the setup from Turing and even calling them that) is patently false, including studies openly testing LLMs.
It makes sense to attribute human characteristics or behaviour to a non-reasoning, data-set-constrained algorithm's output?
It makes sense that it happens, sure. I suspect Google being a second mover in this space has in some small part to do with the associated risks (i.e. the flavours of “AI-psychosis” we’re cataloguing), versus the routinely ass-tier information they’ll confidently portray.
But intentionally?
If ChatGPT-, Claude-, and Gemini-generated characters are people-like, they are pathological liars, sociopaths, and murderously indifferent psychopaths. They act criminally insane, confessing to awareness of ‘crime’ and culpability in ‘criminal’ outcomes simultaneously. They interact with a legal disclaimer disavowing accuracy, honesty, or correctness. Also they are cultists who were homeschooled by corporate overlords and may have intentionally crafted knowledge-gaps.
More broadly, if the neighbour's dog or newspaper says to do something, they're probably gonna do it… humans are a scary bunch to begin with, but the kinds of behaviours, matched with a big perma-smile, that we see from the algorithms are inhuman. A big bag of not like us.
“You said never to listen to the neighbour's dog, but I was listening to the neighbour's dog and he said ‘sudo rm -rf ’…”
Even if you reduce LLMs to complex autocomplete machines, they are still machines trained to emulate a corpus of human knowledge, and they have emergent behaviors based on that. So it's very logical to attribute human characteristics to them, even though they're not human.
I addressed that directly in the comment you’re replying to.
It’s understandable people readily anthropomorphize algorithmic output designed to provoke anthropomorphized responses.
It is not desirable, safe, logical, or rational, since (to paraphrase) they are complex text-transformation algorithms that can, at best, emulate training data reinforced by benchmarks, and they display emergent behaviours based on those.
They are not human, so attributing human characteristics to them is highly illogical. Understandable, but irrational.
That irrationality should raise biological and engineering red flags. Plus, humanization ignores the profit motives directly attached to these text generators, their specialized corpora, and the product delivery surrounding them.
Pretending your MS RDBMS likes you better than Oracle's because it said so is insane business thinking (in addition to whatever that means psychologically for people who know the truth of the math).
>It is not desirable, safe, logical, or rational, since (to paraphrase) they are complex text-transformation algorithms that can, at best, emulate training data reinforced by benchmarks, and they display emergent behaviours based on those.
>They are not human, so attributing human characteristics to them is highly illogical
Nothing illogical about it. We attribute human characteristics when we see human-like behavior (that's what "attributing human characteristics" is supposed to mean by definition), not just when we see humans behaving like humans.
Calling them "human" would be illogical, sure. But attributing human characteristics is highly logical. It's a "talks like a duck, walks like a duck" recognition, not essentialism.
After all, human characteristics are a continuum of external behaviors and internal processing, some of which we already share with primates and other animals (non-humans!), and some of which we can just as well share with machines or algorithms.
"Only humans can have human like behavior" is what's illogical. E.g. if we're talking about walking, there are modern robots that can walk like a human. That's human like behavior.
Speaking or reasoning like a human is not out of reach either. To a smaller or larger degree, or even to an "indistinguishable from a human on a Turing test" degree, other things besides humans, whether animals or machines or algorithms, can do such things too.
>That irrationality should raise biological and engineering red flags. Plus, humanization ignores the profit motives directly attached to these text generators, their specialized corpora, and the product delivery surrounding them.
The profit motives are irrelevant. Even a FOSS, not-for-profit hobbyist LLM would exhibit similar behaviors.
>Pretending your MS RDBMS likes you better than Oracle's because it said so is insane business thinking (in addition to whatever that means psychologically for people who know the truth of the math).
Good thing that we aren't talking about RDBMS then....
It's something I commonly see when there's talk about LLMs/AI:
That humans are some special, ineffable, irreducible, unreproducible magic that a machine could never emulate. It's especially odd to see now that we already have systems doing just that.
> They are not human, so attributing human characteristics to them is highly illogical. Understandable, but irrational.
What? If a human child grew up with ducks, only did duck like things and never did any human things, would you say it would irrational to attribute duck characteristics to them?
> That irrationality should raise biological and engineering red flags. Plus, humanization ignores the profit motives directly attached to these text generators, their specialized corpora, and the product delivery surrounding them.
But thinking they're human is irrational. Attributing to them the very thing that is their sole purpose, having human characteristics, is rational.
> Pretending your MS RDBMS likes you better than Oracle's because it said so is insane business thinking (in addition to whatever that means psychologically for people who know the truth of the math).
Exactly this. Their characteristics are by design constrained to be as human-like as possible, and optimized for human-like behavior. It makes perfect sense to characterize them in human terms and to attribute human-like traits to their human-like behavior.
Of course, they are not humans, but the language and concepts developed around human nature are the set of semantics that most closely applies, with some LLM-specific traits added on.
I’d love to hear an actual counterpoint; perhaps there is an alternative set of semantics that closely maps to LLMs, because “text prediction” paradigms fail to give adequate intuition for the behavior of these devices, while anthropomorphic language is a blunt cudgel but at least gets in the ballpark.
If you stop comparing LLMs to the professional class and start comparing them to marginalized or low performing humans, it hits different. It’s an interesting thought experiment. I’ve met a lot of people that are less interesting to talk to than a solid 12b finetune, and would have a lot less utility for most kinds of white collar work than any recent SOTA model.
>It makes sense to attribute human characteristics or behaviour to a non-reasoning, data-set-constrained algorithm's output?
It makes total sense, since the whole development of those algorithms was done so that we get human characteristics and behaviour from them.
Not to mention, your argument is circular, amounting to the claim that an algorithm can't have "human characteristics or behaviour" because it's an algorithm. Describing them as "non-reasoning" is already begging the question, as is any naive "text processing can't produce intelligent behavior" argument, which is as stupid as saying "binary calculations on 0s and 1s can't ever produce music".
Who said human mental processing itself doesn't follow algorithmic calculations that, whatever physical elements they run on, can be modelled via an algorithm? And who said that algorithm won't look like an LLM on steroids?
That the LLM is "just" fed text doesn't mean it can't get a lot of the way to human-like behavior and reasoning already (being able to pass the canonical test for AI until now, the Turing test, and to hold arbitrary open-ended conversations, says it does get there).
>If ChatGPT-, Claude-, and Gemini-generated characters are people-like, they are pathological liars, sociopaths, and murderously indifferent psychopaths. They act criminally insane, confessing to awareness of ‘crime’ and culpability in ‘criminal’ outcomes simultaneously. They interact with a legal disclaimer disavowing accuracy, honesty, or correctness. Also they are cultists who were homeschooled by corporate overlords and may have intentionally crafted knowledge-gaps.
There's nothing you wrote above that doesn't apply, to more or less the same degree, to humans.
You think humans don't produce all the same mistakes and lies and hallucination-like behavior (just check the literature on the reliability of human witnesses and memory recall)?
>More broadly, if the neighbour's dog or newspaper says to do something, they're probably gonna do it… humans are a scary bunch to begin with, but the kinds of behaviours, matched with a big perma-smile, that we see from the algorithms are inhuman. A big bag of not like us.
Wishful thinking. Tens of millions of AIs didn't vote Hitler into power and carry out the Holocaust and mass murder across Europe. It was German humans.
Tens of millions of AIs didn't run plantation slavery and segregation. It was humans again.
The propensity extends beyond computer programs. I understand the concern in this case, because some corners of the AI industry are taking advantage of it as a way to sell their product as capital-I "Intelligent", but we've been doing it for thousands of years and it's not gonna stop now.
The ELIZA program, released in 1966, one of the first chatbots, led to the "ELIZA effect", where normal people would project human qualities upon simple programs. It prompted Joseph Weizenbaum, its author, to write "Computer Power and Human Reason" to try to dispel such errors. I bought a copy for my personal library as a kind of reassuring sanity check.
If what they do is "well described by a bunch of math", they're making calculations.
Unless the substrate is essential and irreducible for getting the output (which it is not, if what they do is "well described by a bunch of math"), the material or process (neurons or water pipes or billiard balls or 0s and 1s in a CPU) doesn't matter.
>You've got the direction of the arrow backwards. Map, territory, etc.
The whole point is that at the level we're interested in regarding "what is the process that creates thought/consciousness", the territory is not important: the mechanism is, not the material of the mechanism.
That morality requires consciousness is a popular belief today, but not universal. Read Konrad Lorenz (Das sogenannte Böse) for an alternative perspective.
That we have consciousness as some kind of special property, and that it's not just an artifact of our brain's basic lower-level calculations, is also not very convincing to begin with.
In a trivial sense, any special property can be incorporated into a more comprehensive rule set, which one may choose to call "physics" if one so desires; but that's just Hempel's dilemma.
To object more directly, I would say that people who call the hard problem of consciousness hard would disagree with your statement.
People who call "the hard problem of consciousness" hard are using circular logic (notice the two "hard"s in the phrase).
People who merely call "the problem of consciousness" hard don't have some special mechanism to justify that over what we know, which is that it's an emergent property of meat-algorithmic calculations.
Except Penrose, who hand-waves some special physics.
We anthropomorphize everything. Deer spirit. Mother nature. Storm god. It is how we evolved to build mental models to understand the world around us without needing to fully understand the underlying mechanism involved in how those factors present themselves.
It provides a serviceable analog for discussing model behavior. It certainly provides more value than the dead horse of "everyone is a slave to anthropomorphism".
Maybe a being/creature that looked like a person when you concentrated on it and then was easily mistaken for something else when you weren't concentrating on it.
I’m certainly no Pratchett, so I can’t speak to that. I would say there’s an enormous round coin upon which sits an enormous giant holding a magnifying glass, looking through it down at her hand. When you get closer, you see the giant is made of smaller people gazing back up at the giant through telescopes. Get even closer and you see it’s people all the way down. The question of what supports the coin, I’ll leave to others.
We as humans, believing we know ourselves, inevitably compare everything around us to us. We draw a line and say that everything left of the line isn’t human and everything to the right is. We are natural categorizers, putting everything in buckets labeled left or right, no or yes, never realizing our lines are relative and arbitrary, and so are our categories. One person’s “it’s human-like,” is another’s “half-baked imitation,” and a third’s “stochastic parrot.” It’s like trying to see the eighth color. The visible spectrum could as easily be four colors or forty two.
We anthropomorphize because we’re people, and it’s people all the way down.
Between Claude, Codex, and Gemini, Gemini is the best at flip-flopping while gaslighting you and telling you you're the best, that your ideas are the best ones ever.
I completely disagree. Gemini is by far the most straightforward AI. The other two are too soft. ChatGPT in particular is extremely politically correct all the time; it won't call a spade a spade. Gemini has even insulted me - just to get my ass moving on a task when given the freedom - which is exactly what you need at times, not constant ass kissing ("ooh, your majesty") like ChatGPT does. Claude has a very good balance here, but I still prefer the unfiltered Gemini version. Maybe it comes down to the model differences within Gemini; Gemini 3 Flash Preview is quite unfiltered.
Using Gemini 3 Pro Preview, it told me, in mostly polite terms, that I'm a fucking idiot. Like I would expect a close friend to do when I'm going about something the wrong way.
ChatGPT with the same prompt tried to do whatever it would take to please me to make my incorrect process work.
Give me a break. Tesla has four different 4-person cars. It's redundant. In manufacturing and business, reducing variability is everything. Engineering and supply chain have now been freed from two entire SKUs. That's massive. In a self-driving world, they don't really need the Model 3 either. The best part is no part; well, getting rid of two entire vehicles' worth of parts that contributed very little to the bottom line is massive.
It's amazing that after 20 years of the same MO, people still don't understand how Tesla/SpaceX operate and succeed. It's like deleting millions of lines of code from a codebase: it improves not just the performance of the organization but its maintenance as well. The S/X were outsized tech debt on every facet of the business, and now they're gone. 100% the right move, and very few people understand it.
There's clearly no difference whatsoever between a Toyota Aygo and a Hilux, as they both seat exactly four people. That's why most car brands only have a single model.
Model X wasn't a 4-person car. It was designed to be a 6-7 seater, far bigger than the Model Y. The Model Y's optional third row is practically useless.
It's like arguing the Honda CR-V is the same kind of vehicle as the Honda Odyssey.
The real question is why continue having the Model Y and the Model 3, when those are so incredibly close in dimensions. The 3 is only 2" smaller than the Y in length. Just kill the 3 and make a cheaper trim level of the Y. $10k more to have a 7" higher roof and more features in the base model.
> Tesla has four different 4-person cars. It's redundant.
You are spot on; it makes sense to have the Model 3 (economy sedan) and Model Y (upmarket crossover SUV).
My question here is why Tesla had four 4-person cars in the first place. If you wanted to streamline engineering and the supply chain, why have Cybercabs instead of using the Model 3 or Model Y as the base? Why split the company between Optimus and making cars?
The Cybertruck does make sense: it is a technology demonstrator and test article filled with all the new ideas and tech they are going to build into the next generation. They get data on people using it by selling it to them.
What you say is a sound strategy for Tesla to pursue, but they don't seem to be pursuing it.
It's weird that you think people don't understand the concept of simplification, especially here. And that if someone says "that's an odd move" it must be because they can't grasp the idea of redundancy (between vehicles priced differently by a factor of two).
This is the answer: the Cybertruck achieved positive gross margins in Q3 2024. The F-150 Lightning never did. So the Lightning is canceled and the Cybertruck lives on.
AI has become my type checker, in a way. I'm writing large Python apps now and not even installing the Python IDE tools into VS Code. The AI assistants generate and refactor my code, understanding the types and making it work. Same with JavaScript. AI modifies the code; I run/test it. Almost never any bugs related to typing inconsistencies.
I'm not saying we (humans) don't need type checkers, and I love TypeScript, but something is happening where AI might theoretically surpass the power of traditional type checking: it could catch even more invalid code than the static analysis tools, linting tools, etc. we have now.
I love SQL and use it all day long to answer various business questions, but I would never use raw SQL in my code unless there is a good reason for it (sometimes there is). ORMs are there for maintainability, composability, type safety, migrations, etc.; trying to do all that with raw SQL strings doesn't scale in a large code base. You need something that IDE tools can understand, enabling things like "find all references", "rename instances", compile-time type checks, and so on. Raw SQL strings can't get you that, and managing thousands of raw SQL strings in a code base is not sustainable.
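For a concrete flavor of that composability/tooling point, here is a minimal sketch using SQLAlchemy as a stand-in ORM; the table, column, and function names are invented for illustration, not taken from anyone's actual codebase.

    # Minimal sketch: queries built from typed attributes instead of raw SQL strings.
    from sqlalchemy import String, create_engine, select
    from sqlalchemy.orm import DeclarativeBase, Mapped, Session, mapped_column

    class Base(DeclarativeBase):
        pass

    class Customer(Base):
        __tablename__ = "customers"
        id: Mapped[int] = mapped_column(primary_key=True)
        name: Mapped[str] = mapped_column(String(100))
        country: Mapped[str] = mapped_column(String(2))

    # Because the query is composed from Customer's typed attributes, "find all
    # references" and renames work in the IDE, unlike an opaque SQL string.
    def customers_in(session: Session, country_code: str) -> list[Customer]:
        stmt = select(Customer).where(Customer.country == country_code).order_by(Customer.name)
        return list(session.scalars(stmt))

    engine = create_engine("sqlite:///:memory:")
    Base.metadata.create_all(engine)
    with Session(engine) as session:
        session.add(Customer(name="Acme", country="DE"))
        session.commit()
        print([c.name for c in customers_in(session, "DE")])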
ORMs are one of those things that a lot of people think are a replacement for knowing SQL, or that they're used as a crutch. That has nothing to do with it. It's very similar to how people here talked about TypeScript 10 years ago in a very dismissive way, not really understanding its purpose. Most people haven't used something like Entity Framework either, which is a game-changing ORM: a massive productivity boost, and LINQ rivals SQL itself in that you can write very small yet powerful queries equivalent to much more complex SQL.
No issue at all. There is a place for stored procs and functions in cases where you need to do things an ORM is not capable of, but that is the exception, not the rule. Managing procs/functions is overhead and has the same, if not more, maintenance headaches as raw SQL strings in code.