I wonder whether this was intentional or a coincidence, but for others (and maybe you) the "Lisp Machine" was a real hardware architecture unrelated to emacs: https://en.wikipedia.org/wiki/Lisp_machine
This is like arguing that we shouldn't try to regulate drugs because some people might "want" the heroin that ruins their lives.
The existing "personalities" of LLMs are dangerous, full stop. They are trained to generate text with an air of authority and to tend to agree with anything you tell them. It is irresponsible to allow this to continue while not at least deliberately improving education around their use. This is why we're seeing people "falling in love" with LLMs, or seeking mental health assistance from LLMs that they are unqualified to render, or plotting attacks on other people that LLMs are not sufficiently prepared to detect and thwart, and so on. I think it's a terrible position to take to argue that we should allow this behavior (and training) to continue unrestrained because some people might "want" it.
There aren't many major labs, and they each claim to want AI to benefit humanity. They cannot entirely control how others use their APIs, but I would like their mainline chatbots not to be overly sycophantic and generally not to try to foster human-AI friendships. I can't imagine any realistic legislation, but it would be nice if the few labs just did this of their own accord (or were at least shamed more for not doing so).
Unfortunately, I think a lot of the people at the top of the AI pyramid have a definition of "humanity" that may not exactly align with the definition we commoners have in mind when they say they want AI to "benefit humanity".
I agree that I don't know what regulation would look like, but I think we should at least try to figure it out. I would rather hamper AI development needlessly while we fumble around with too much regulation for a bit and eventually decide it's not worth it than let AI run rampant without any oversight while it causes people to kill themselves or harm others, among plenty of other things.
At the very least, I think there is a need for oversight of how companies building LLMs market and train their models. It's not enough to cross our fingers that they'll add "safeguards" to try to detect certain phrases/topics and hope that that's enough to prevent misuse/danger — there's not sufficient financial incentive for them to do that of their own accord beyond the absolute bare minimum to give the appearance of caring, and that's simply not good enough.
Yes. My position is that it was irresponsible to publish these tools before figuring out safety first, and it is irresponsible to continue to offer LLMs that have been trained in an authoritative voice and to not actively seek to educate people on their shortcomings.
But, of course, such action would almost certainly result in a hit to the finances, so we can't have that.
Alternative take: these are incredibly complex nondeterministic systems and it is impossible to validate perfection in a lab environment because 1) sample sizes are too small, and 2) perfection isn’t possible anyway.
All products ship with defects. We can argue about too much or too little or whatever, but there is no world where a new technology or vehicle or really anything is developed to perfect safety before release.
Yeah, profits (or at least revenue) too. But all of these AI systems are losing money hand over fist. Revenue is a signal of market fit, so if there are companies out there burning billions of dollars perfecting a perfectly safe AI system before release, they have no idea whether it's what people want.
Releasing a chatbot that confidently states wrong information is bad enough on its own — we know people are easily susceptible to such things. (I mean, c'mon, we had people falling for ELIZA in the '60s!)
But to then immediately position these tools as replacements for search engines, or as study tutors, or as substitutes for professionals in mental health? These aren't "products that shipped with defects"; they are products that were intentionally shipped despite full knowledge that they were harmful in fairly obvious ways, and that's morally reprehensible.
Pretty sure most of the current problems we see re drug use are a direct result of the nanny state trying to tell people how to live their lives. Forcing your views on people doesn’t work and has lots of negative consequences.
I don't know if this is what the parent commenter was getting at, but the existence of multi-billion-dollar drug cartels in Mexico is an empirical failure of US policy. Prohibition didn't work a century ago and it doesn't work now.
All the War on Drugs has accomplished is granting an extremely lucrative oligopoly to violent criminals. If someone is going to do heroin, ideally they'd get it from a corporation that follows strict pharmaceutical regulations and invests its revenue into R&D, not one that cuts it with even worse poison and invests its revenue into mass atrocities.
Who is it all even for? We're subsidizing criminal empires via US markets and hurting the people we supposedly want to protect. Instead of kicking people while they're down and treating them like criminals over poor health choices, we could have invested all those countless billions of dollars into actually trying to help them.
I'm not sure which parent comment you're referring to, but what you're saying aligns with my point a couple levels up: reasonable regulation of the companies building these tools is a way to mitigate harm without directly encroaching on people's individual freedoms or dignities, but regulation is necessary to help people. Without regulation, corporations will seek to maximize profit to whatever degree is possible, even if it means causing direct harm to people along the way.
I'm not saying they're equivalent; I'm saying that they're both dangerous, and I think taking the position that we shouldn't take any steps to prevent the danger because some people may end up thinking they "want" it is unreasonable.
No one sane uses the baseline web UI 'personality'. People use LLMs through specific, custom APIs, and more often than not they use fine-tuned models that _assume a personality_ defined by someone (be it the user or the service provider).
Look up TavernAI character cards.
I think you're fundamentally mistaken.
I agree that for some users, using specific LLMs for specific use cases might be harmful, but saying that the web UI's default AI 'personality' is dangerous is laughable.
I don't know how to interpret this. Are you suggesting I'm, like, an agent of some organization? Or is "activist" meant only as a pejorative?
I can't say that I identify as any sort of AI "activist" per se, whatever that word means to you, but I am vocally opposed to (the current incarnation of) LLMs to a pretty strong degree. Since this is a community forum and I am a member of the community, I think I am afforded some degree of voicing my opinions here when I feel like it.
Disincentivizing something undesirable will not necessarily lead to better results, because it wrongly assumes that you can foresee all consequences of an action or inaction.
Someone who now falls in love with an LLM might instead fall for some seductress who hurts him more. Someone who now receives bad mental health assistance might receive none whatsoever.
I disagree with your premise entirely and, frankly, I think it's ridiculous. I don't think you need to foresee all possible consequences to take action against what is likely, especially when you have evidence of active harm ready at hand. I also think you're failing to take into account the nature of LLMs as agents of harm: so far it has been very difficult for people to legally hold LLMs accountable for anything, even when those LLMs have encouraged suicidal ideation or physical harm of others, among other obviously bad things.
I believe there is a moral burden on the companies training these models to not deliberately train them to be sycophantic and to speak in an authoritative voice, and I think it would be reasonable to attempt to establish some regulations in that regard in an effort to protect those most prone to predation of this style. And I think we need to clarify the manner in which people can hold LLM-operating companies responsible for things their LLMs say — and, preferably, we should err on the side of more accountability rather than less.
---
Also, I think in the case of "Someone who now receives bad mental health assistance might receive none whatsoever", any psychiatrist (any doctor, really) will point out that this is an incredibly flawed argument. It is often the case that bad mental health assistance is, in fact, worse than none. It's that whole "first, do no harm" thing, you know?
...nobody? I didn't determine any such thing. What I was saying was that LLMs are dangerous and we should treat them as such, even if that means not giving them some functionality that some people "want". This has nothing to do with playing god and everything to do with building a positive society where we look out for people who may be unable or unwilling to do so themselves.
And, to be clear, I'm not saying we necessarily need to outlaw or ban these technologies, in the same way I don't advocate for criminalization of drugs. But I think companies managing these technologies have an onus to take steps to properly educate people about how LLMs work, and I think they also have a responsibility not to deliberately train their models to be sycophantic in nature. Regulations should go on the manufacturers and distributors of the dangers, not on the people consuming them.
Here's something I noticed: if you yell at them (all caps, cursing them out, etc.), they perform worse, similar to a human. So if you believe that some degree of "personable answering" might contribute to better correctness (given that some degree of disagreeable interaction seems to produce worse correctness), then you might have to accept some personality.
I'm desperately looking forward to, like, 5-10 years from now when all the "LLMs are going to change everything!!1!" comments have all but completely abated (not unlike the blockchain stuff of ~10 years ago).
No, LLMs are not going to replace compiler engineers. Compilers are probably one of the least likely areas to profit from extensive LLM usage in the way that you are thinking, because they are principally concerned with correctness, and LLMs cannot reason about whether something is correct — they only can predict whether their training data would be likely to claim that it is correct.
Additionally, each compiler differs significantly in the minute details. I simply wouldn't trust the output of an LLM to be correct, and the time wasted on determining whether it's correct is just not worth it.
Stop eating pre-chewed food. Think for yourself, and write your own code.
I bet you could use LLMs to turn stupid comments about LLMs into insightful comments that people want to read. I wonder if there’s a startup working on that?
A system outputting correct facts tells you nothing about that system's ability to prove the correctness of facts. You cannot assert that property of a system by treating it as a black box. If you are able to treat LLMs as a white box and prove correctness properties about their internal states, you should tell that to some very important people; that is an insight worth a lot of money.
As usual, my argument brought all the people out of the woodwork who have some obsession about an argument that's tangential. Sorry to touch your tangent, bud.
It's not a "belief"; that's what computability is. This definition is the whole point of the work by Church and Turing that resulted in the lambda calculus and the Turing machine, respectively.
Research takes some time, both to do and to publish. In my area (programming languages), we have 4 major conferences a year, each with something like a 6-to-8-month lag time between submission and publication, assuming the submission is accepted by a double-blind peer review process.
I don't work in this area (I have a very unfavorable view of LLMs broadly), but I have colleagues who are working on various aspects of what you ask about, e.g., developing testing frameworks to help ensure output is valid or having the LLMs generate easily-checkable tests for their own generated code, developing alternate means of constraining output (think of, like, a special kind of type system), using LLMs in a way similar to program synthesis, etc. If there is fruit to be borne from this, I would expect to start seeing more publications about it at high-profile venues in the next year or two (or next week, which is when ICFP and SPLASH and their colocated workshops will convene this year, but I haven't seen the publications list to know if there's anything LLM-related yet).
(I have a pretty unfavorable view of LLMs myself, but) a quick search for "LLM" does find four sessions of the colocated LMPL workshop that are explicitly about LLMs and AI agents, plus a spread of other work across the schedule. ("LMPL" stands for "Language Models and Programming Languages", so I guess that's no surprise.)
If you hang around certain spaces of the internet (here, Reddit, etc), any sufficiently popular post about a Rust-related project is bound to accrue some comments along the lines of "Rust is actually not that great", "rewriting is bound to gain new bugs that won't be caught", "the borrow checker is practically byzantine and not worth the trouble", "C is perfectly adequate", "more advanced type systems are less useful than writing more tests", etc. I would even say that comments of this nature constituted a good chunk of the discussion around the US government's C-to-Rust initiative (a very popular post with a lot of comments).
The Rust blowback had some reasons: people arbitrarily opening bug tickets on various projects saying they should be "rewritten in Rust" for safety. This became a meme.
In my experience, the majority of negative responses are rooted in fundamental misunderstandings of type systems and expressive power. The vast majority of everyday programmers are pretty much only familiar with Java or TypeScript when it comes to statically typed languages, and it can be hard to get people with that background to understand or appreciate the substantial increase in capability that systems like Rust's provide.
(The issue is further exacerbated, in my opinion, by the prevailing notion that test-driven development is superior to — or at least generally more than adequate for — anything and everything that could be desired. Years ago there was a tense Twitter exchange on this topic between Bob Martin [of "Clean Code" fame] and Shriram Krishnamurthi [a prominent programming languages researcher and professor at Brown University], with Martin seemingly unwilling to move past a TDD-oriented worldview at the time.)
By "it" do you mean delimited continuations? If so, my understanding is that they are the standard mode of interruption in Racket — all error-raising/handling and other continuation forms (call/cc, etc) are implemented in terms of delimited continuations because they're a more general abstraction than the older primitives. You can read about Racket's continuation model and where they're used in §10.4 of the Racket Reference [1].
I may be incorrect, but isn't call/cc + state equivalent to delimited continuations?
I.e., one can be implemented in terms of the other (and IIRC Racket does have state).
Oleg covers this on his page "An argument against call/cc" (which is linked). The long and short of it is: only in a toy-like way; in practice, even call/cc + state is an insufficient set of primitives.
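To make the distinction concrete, here's a minimal sketch in Racket (using racket/control's shift/reset; the numbers are just illustrative, and this only shows what each primitive captures, not Oleg's full argument):

    #lang racket
    (require racket/control)

    ;; reset installs a delimiter; shift captures only the "(+ 1 _)" context
    ;; up to that delimiter, as an ordinary composable function k.
    (reset (+ 1 (shift k (k (k 10)))))
    ;; => 12, since k behaves like (lambda (v) (+ 1 v)) and we apply it twice

    ;; call/cc captures the *entire* rest of the program, so invoking k jumps
    ;; instead of returning: the outer application of k is never reached.
    (+ 1 (call/cc (lambda (k) (k (k 10)))))
    ;; => 11

The delimited version lets you call the captured continuation like a function and keep computing with its result; the undelimited one can only abandon the current computation, which is part of why simulating the former with call/cc + state only works in toy settings.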
So worth the watch. The list of novel contributions outlined in the demonstration is incredibly impressive, but you also have to include the developments that went into making the presentation itself. For example, high-speed modems were developed and a microwave transmission system engineered so that they could run the software and teleconference remotely from miles away — no mean feat in 1968!