More

redhale · 2026-05-27T10:40:03 1779878403

This is "Feature Factory" thinking, and it is usually not ideal. Feature count is a poor metric to optimize against. Instead, ROI and delivered customer value should be the focus of product development investments.

redhale · 2026-05-27T10:36:43 1779878203

I think this is clearly be design. They don't want to provide support, they want you to give up and let your issue go.

jimkleiber · 2026-05-27T18:04:02 1779905042

It might be. It also lets my loyalty go slowly by slowly. Monopolies work until they collapse.

If you can't tell, it frustrates me so much. I wonder how the internal culture of Uber changed when it went from almost zero interest rates to now trying to make lots of profit.

My friend said he realized Uber can just rely on a steady stream of people either growing up or getting laid off and trying to make a quick buck, so they can treat their drivers poorly as well.

I'm not sure what's happening, just know support may be a lot simpler and cheaper to address than nothing, or at least in the medium to long term, but maybe not?

redhale · 2026-05-27T10:22:42 1779877362

I understand and sympathize with this point of view.

I would just say this: there is a difference between advice for using a product, and for _optimizing_ your use of a product. Between a user and a power user.

I think devs probably disproportionately like to see themselves as power users of any given tool, and thus with coding agents, there are 1000 "systems" being thrown out on GitHub on any given day. Generally speaking, it is safe to avoid these, especially if you're new to the tool.

But saying the fact that people are into optimizing their setups indicates some fundamental deficiency of the tool misses the point, I think.

Claude Code and Codex CLI (and OpenCode, and I'm sure many others) are _remarkably_ effective right out of the box. The teams behind these tools must make them _generically_ useful so that they are accessible to as many people, and as many use cases, as possible. That is part of why, when you become familiar with the tool, there is typically going to be a level of customization you can apply to it to optimize it for _your_ use cases, beyond the generic out of the box configuration.

Similarly, I don't think it would be fair to critique VS Code simply because most power users augment it with a suite of extensions. In fact, it's customizability/extensibility is part of what makes it great.

sandrello · 2026-05-27T12:31:15 1779885075

I absolutely understand the power user perspective. The point is not that, and maybe I wasn't clear enough in pointing it out.

Here, something different is going on instead of the usual "base tool is ok for 90% of use cases, remaining 10% is covered by plugins and extensions". A lot of developers are finding it difficult to commit to agentic coding workflows, feeling a stretch on a lot of different aspects.

Companies, with the help of a very prominent and vocal part of the web and social media community, are addressing every issue by simply blaming the users, saying it's their fault if they're not keeping up with all the alleged advancements in prompt strategies. See the whole "maybe you haven't tried it in the last two months, everything's changed now". While it's true that things have been moving very fast, the fundamental idea behind the technology is the same, and some concerns about it simply cannot be wiped away by scaling some factors.

redhale · 2026-05-27T10:09:48 1779876588

Just use a fallback, like Codex CLI. Takes a little effort upfront to ensure your configuration is wired correctly for both harnesses, but it is pretty easy to get them 90% identical (there will almost always be some experimental / edge case features that differ across harnesses, but in my experience those are negligible in practice).

new_account_101 · 2026-05-27T10:53:23 1779879203

[flagged]

redhale · 2026-05-27T12:11:57 1779883917

I more meant feature-level differences. For instance, Claude Code has agent teams, and Codex CLI does not. Or for a while, Codex had "/goal" and Claude Code did not (though now Claude Code has it too). To your point, it is usually possible to polyfill these gaps either with custom code/skills/hooks or with third party plugins.

2026-05-27T15:31:39 1779895899

[dead]

esafak · 2026-05-27T21:44:20 1779918260

https://gail.wharton.upenn.edu/research-and-insights/techrep...

redhale · 2026-05-23T00:17:43 1779495463

Came here to say this -- humans are not always rational actors. I get asked questions all the time, which I have no special knowledge of, and which the asker could have easily Googled or ChatGPTed. And yet...

zephen · 2026-05-23T00:24:47 1779495887

Humans. Can't live with 'em, can't serve 'em with some fava beans and a nice Chianti.

redhale · 2026-05-12T23:55:29 1778630129

I feel like caching should be mentioned in tradeoffs, right? If you change the tool list frequently, that's a cache bust. In long sessions that seems like it could significantly affect costs.

azurewraith · 2026-05-13T00:18:49 1778631529

Great question... and there are two answers depending on what you were originally referring to:

re: Claude Code... we actually don't filter or modify the tool list so all tools stay visible -- disallowed calls get blocked at execution time with an error message. No cache busts on transitions, the model sees the full tool sets. The cost there is prompt caching dollars not latency I suppose

re: The research (Rust agent + Ollama) the model only receives tool schemas for the current states' allowed tools. Ollama does have a KV cache reuse facility so changing the tool list busts that cache. Depending on your workflow this can happen as many times as you expect your states to transition until completion. For simple workflows this is 3-5x. Within each state the tool list is stable and cache operates normally. Presenting fewer tools instead of dozens on every agent processing step reduces input tokens and decision complexity, which is where the measurable gains come from.

Both enforce the same constraints depending on the execution interface. The schema level filtering in the research is the S-tier approach. Adding tools/list filtering to the MCP gateway would be beneficial if possible (it looks like we could only filter MCP tools not core ones, which could provide tangible benefit. I've added this evaluation to the roadmap.

redhale · 2026-05-13T02:27:13 1778639233

Nice, thanks for the detailed answer!

redhale · 2026-05-11T10:47:44 1778496464

I came to the comments to say basically this.

I have read and heard takes similar to OP's probably 50+ times from different people in the last few months (and years, now), and I agree mostly.

But I can't get over the myopic nature of this perspective. Technological advances often change the nature of work, and therefore change the nature (or location) of the "struggle".

I can imagine some hunter-gatherers probably admonishing early farmers at the dawn of agriculture for losing the "struggle" of hunting and foraging for their own food. It's much easier to drive a car than tame, train, and ride a horse. And so on throughout time.

So now with AI, some things that were hard before are now easy. So we move on to the next hard thing that maybe before was impossible or unimaginable. There is still hard work to be done.

redhale · 2026-05-08T11:09:19 1778238559

This (from the September 2025 post) now evokes the Curb theme:

> Like you, we have seen numerous reports that more and more firms are capping their total headcount in favor of leaning on more AI tools, leading to downsizing their intern and new-graduate hiring. This is resulting in increased sidelining of new college graduates. But we think this misreads the moment completely, so we’re heading in the opposite direction.

> While we are excited about what AI tools can help do, we have a different philosophy about their role. AI tools make great team members even better, and allow firms to set more ambitious goals. They are not replacements for new hires — but ways to multiply how new hires can contribute to a team.

gk1 · 2026-05-08T12:00:54 1778241654

> AI tools make great team members even better

This is the predominant (public) talking point. And it’s true.

But along with that: when you have effective people becoming even more effective with AI, it becomes glaringly obvious who the INeffective people are. At which point it becomes hard to justify keeping those people around.

(That often includes people who are otherwise effective but aren’t utilizing agents and are therefore losing their edge.)

jurgenburgen · 2026-05-08T12:51:38 1778244698

Before AI, it was impossible to measure productivity. Some tried with misguided metrics like lines of code added but that just incentivized writing obtuse code.

What has changed?

MITSardine · 2026-05-08T22:40:15 1778280015

Stuff just gets done, I guess? Projects move faster, people onboard faster with less intervention, etc. The speedup seems noticeable enough that it doesn’t need precise measuring.

jurgenburgen · 2026-05-10T01:04:11 1778375051

If the speed up is noticeable enough then coming up with a metric should be easy?

I haven’t noticed a speed up in my own org though the feeling of engineers rushing to implementation has become more pronounced. Team members no longer understand what others are doing and siloing has become intense even within my team.

matharmin · 2026-05-08T20:13:05 1778271185

Now you can ask "Is it easier to ask an AI agent to do X than asking my employee?"

Good metrics is difficult, but sometimes a simple comparison like that is enough.

htrp · 2026-05-08T13:11:49 1778245909

vibes maybe?

If effective AI enhanced SWEs can ship features in a week, the guys who ship 1 feature a quarter stand out more?

bartread · 2026-05-08T19:53:35 1778270015

Quality matters as well as speed though: reworking comes at a cost, so you really need to be tracking more than one metric. A lot of problems are caused by optimising for one metric above all else.

hackable_sand · 2026-05-08T20:20:37 1778271637

If it takes 1 quarter to develop a feature and a developer ships a feature in 1 quarter then that makes sense.

If it takes 2 weeks to ship a feature and a developer ships in 1 then yeah, I'm highly suspicious of that.

dboreham · 2026-05-08T13:11:45 1778245905

Impossible to measure in absolute terms but I think it's clear productivity increases relatively when LLMs are used. At least that's my strong experience.

the_af · 2026-05-08T13:35:11 1778247311

I know you're arguing a more general point, but it's worth pointing out in their layoff announcement, CloudFlare is claiming:

- This is NOT performance related.

- This is NOT a cost cutting exercise.

They say both things explicitly. What they don't say very clearly is what the layoffs ARE about.

toast0 · 2026-05-08T15:51:57 1778255517

> This is NOT performance related.

It's important to say a large layoff isn't performance related, because it helps those who got laid off find new work. Even if it was all performance related, you want someone else to hire your former employees.

And, in a large layoff, it's likely to be at least partially true. Large layoffs work better when they're done quickly, when there's signs of layoffs but no information, many people will head for the exits themselves... which helps your headcount numbers, but ideally you want to keep people who are good at figuring stuff out and taking appropriate action and instead they've left. So... lay off people who are 'known performance issues', but also lay off some whole teams that have a mix of performance, and then do some random assignment and catch a mix of performance, because getting direct managers involved to pick who goes means having too many people know about the lay offs.

> This is NOT a cost cutting exercise.

Yeah, this one isn't credible. If it was about something other than costs, like pivoting to a new market, you would offer first choice of jobs for the new market. Even if it's look at our productivity, 20% of our employees have nothing to do, it takes a lot of spin to say not paying them to twiddle their thumbs is something other than cost cutting.

ThrowawayR2 · 2026-05-09T04:50:20 1778302220

Didn't a few large tech companies fail even that low bar of decency? I seem to recall news of layoffs in the not too distant past where the employer openly let it be known those chosen were chosen for performance reasons, e.g. https://www.cnbc.com/2025/01/14/meta-targeting-lowest-perfor...

acdha · 2026-05-08T14:51:13 1778251873

That to me is a pretty clear reason to question the accuracy of those two claims. Insiders are saying that even people who were performing well in very profitable groups are being cut, which is hard to square with the stated motivations.

the_af · 2026-05-08T16:06:11 1778256371

Agreed. One of the two things must be false. But that's what they are saying (not saying I buy it).

apple4ever · 2026-05-09T06:01:36 1778306496

They are lying about it not being a cost cutting exercise

aaa_aaa · 2026-05-08T12:34:01 1778243641

Losing what edge?

zarzavat · 2026-05-08T13:24:44 1778246684

It's weird to fire people instead of giving them training.

alphabeta3r56 · 2026-05-08T19:13:42 1778267622

Training can be socialized by asking people to take govt loans on further education and then letting the people default on them. Why should company spend their profits on it? /s

redhale · 2026-05-07T09:43:23 1778147003

I want to agree, I do. But this point is plainly wrong in my observations:

> The enterprise version of that is I don’t want a CRM unless at least two other giant enterprises have successfully used that CRM for six months. [...] You want solutions that are proven to work before you take a risk on them.

Perhaps not for every category of software and every company. But in practice, any SaaS app that is just CRUD with some business logic + workflows is, imo, absolutely vulnerable to losing customers because people within their customers' orgs vibe coded a replacement.

They are perhaps even more at risk because would-be new customers don't ever even bother searching to find them as an option because they just vibe code a competitor in-house.

The vulnerability lies primarily in the fact that most of these SaaS apps were talking about are _wrong_ to some meaningful degree. They don't fully fit how your company works, and they never did. There is something about them that you are forced to work around in some way. This is true because it is impossible to build a universally perfect product, to perfectly fit it to every business requirement of every user in every company.

But now it is relatively cheap to build the perfect version for your company in-house. Or maybe even just for YOU.

I think medium/long-term this will mean a redistribution of technical talent from SaaS companies to industry companies. Instead of paying millions for SaaS subscriptions, industry companies will spend fewer millions building precisely what they need in-house with the help of AI. Not every SaaS and not every company, but I already see this happening at my company right now.

redhale · 2026-04-15T12:29:57 1776256197

But it is working primarily because of the Max subscription model. If I could use my Max subscription to get $5000 worth of tokens for only $200 via OpenCode or Pi, I would drop Claude Code today. I think a lot of people (and enterprises) are of a similar opinion. Not saying Claude Code would have no users, but its dominance would be greatly diminished.

adampunk · 2026-04-15T22:10:59 1776291059

But you can’t and the reason that you care also has to do with the same production process.

It’s not like a separate company made the terminal app versus the model. If we think that the desktop app is bad, but the model is good then that’s still an endorsement of the software process.

If we think the model doesn’t matter at all, then that’s an even bigger endorsement. Is the model has no content worth talking about over the nearest competitors or an open source alternative, then the remainder is marketing and polish.

I just don’t understand how people can look at a company that is capacity constrained in this market and think that they’re doing things poorly.

allan_s · 2026-04-15T23:57:45 1776297465

Its the same for a lot of bundled things, mac hardware + os or windows pre-installed, chrome being pushed by google.com etc.