This company predicts software development is a dead occupation, yet ships a mobile chat UI that seems perpetually full of bugs and has had a number of high-profile incidents.
"This company predicts software development is a dead occupation"
Citation needed?
Closest I've seen to that was Dario saying AI would write 90% of the code, but that's very different from declaring the death of software development as an occupation.
The clear disdain he has for the profession is evident in any interview he gives. His "90% of the code" line wasn't a signal to us; it was directed at his fellow execs: that they can soon get rid of 90% of the engineers and some other related professions.
I think it's pretty clear that Anthropic was the main AI lab pushing code automation right from the start. Their blog posts, everything, just targeted code generation. Even their headings for new models in articles would be "code". My view is that if they weren't around, even if it would have happened eventually, code would have been solved at a cadence similar to other use cases (i.e. gradually, as per general demand).
AI engineers aren't actually SWEs per se; they use code but see it as tedious non-main work, IMO. They are happy to automate their complement and rise in status vs. SWEs, who before all of this typically had more employment opportunities and more practical ways to show value.
I dislike the idea of coupling my workflow to SaaS platforms like GitHub or CodeRabbit. The fact that you still have to create local tools is a selling point for just doing it all “locally”.
I’ve been doing game development, and it starts to hallucinate more rapidly when it doesn’t understand things like the direction it’s placing things in or which way the camera is oriented.
Gemini models are a little bit better at spatial reasoning, but we’re still not there yet, because these models were not designed to do spatial reasoning; they were designed to process text.
In my development, I also use the ascii matrix technique.
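To make the idea concrete, here is a minimal sketch of what an "ASCII matrix" for spatial context might look like. The function, tile symbols, and map contents are my own illustration, not any standard API: the point is just to serialize 2D positions into a character grid the model can read row by row.

```python
# Hypothetical sketch: serialize a 2D tile map as an ASCII matrix so the
# model can "see" spatial relationships instead of parsing raw coordinates.
# Symbols ('P', 'E', '#') and the map below are made up for illustration.

def to_ascii_matrix(width, height, objects):
    """Render object positions onto a character grid.

    objects: dict mapping (x, y) -> single-char symbol, e.g. 'P' for player.
    Row 0 is the top of the map; '.' marks empty tiles.
    """
    grid = [['.' for _ in range(width)] for _ in range(height)]
    for (x, y), symbol in objects.items():
        grid[y][x] = symbol
    return '\n'.join(''.join(row) for row in grid)

# Example: a 6x4 map with a wall tile, the player, and an enemy.
layout = to_ascii_matrix(6, 4, {
    (3, 0): '#',   # wall
    (1, 1): 'P',   # player
    (4, 2): 'E',   # enemy
})
print(layout)
# ...#..
# .P....
# ....E.
# ......
```

Pasting a grid like this into the prompt tends to anchor "left/right/above/below" far better than listing coordinates does.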
Spatial awareness was also a huge limitation to Claude playing pokemon.
It really seems to me that the first AI company getting to implement "spatial awareness" vector tokens and integrating them neatly with the other conventional text, image and sound tokens will be reaping huge rewards.
Some are already partnering with robot companies, it's only a matter of time before one of those gets there.
I disagree. With Opus I'll screenshot an app, draw all over it like a child with MS Paint, and paste it into the chat; it seems to reasonably understand what I'm asking from my chicken scratch and dimensions.
As far as 3D goes I don't have experience; it could be quite awful at that.
Yeah at least for 2D, Opus 4.5 seems decent. It can struggle with finer details, so sometimes I’ll grab a highlighter tool in Photoshop and mark the points of interest.
I wonder if they could integrate a secondary "world model" trained/fine-tuned on Rollercoaster Tycoon to just do the layout reasoning, and have the main agent offload tasks to it.
I expect that adding instructions that attempt to undo training produces worse results than not including the overbroad generalization in the training in the first place. I think the author isn’t making a complaint; they’re documenting a tradeoff.
I have been using an open-source program, “handy”: a cross-platform Rust/Tauri app that does speech recognition and handles inputting text into programs. It works by piggybacking off the OS’s text input or copy-and-paste features.
You could fork this, and shell out to an LLM before finally pasting the response.
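A rough sketch of that fork, purely as an assumption about how it could be wired up (handy itself is Rust; this just illustrates the "shell out before pasting" step, and the `llm` CLI name and prompt are placeholders):

```python
# Hypothetical sketch: intercept the transcribed text, shell out to an LLM
# command to clean it up, and hand the result to the paste step. The "llm"
# CLI and the prompt here are assumptions, not part of handy itself.
import subprocess

def polish_transcript(raw_text: str, llm_cmd=None) -> str:
    """Pipe dictated text through an external LLM command before pasting."""
    cmd = llm_cmd or ["llm", "Fix punctuation; return only the corrected text:"]
    result = subprocess.run(cmd, input=raw_text, capture_output=True, text=True)
    # Fall back to the raw transcript if the LLM call fails for any reason.
    return result.stdout.strip() if result.returncode == 0 else raw_text
```

The fallback matters: dictation should still paste something even when the LLM hop is slow or down.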
The definition of 'vibe code' is somewhat nebulous at the moment. For many it means "only look at the end product (website) and use prompts to fix it" but for others it means "mostly don't hand-code anything, but check the diffs".
> If you’re not reading your output, then why does skill level even matter?
Few thoughts here.
Experience helps you "check" faster that what you asked for is actually what was delivered. You "know" what to check for. You know what a happy path is, and where it might fail. You're more likely to test outside the happy path. You've seen dozens of failure modes already; you know where to look.
Experience also allows you to better define stuff. If you see that the output is mangled, you can make an educated guess that it's from css. And you can tell the model to check the css integration.
Experience gives you faster/better error parsing. You've seen thousands of them already. You probably know what the error means. You can c/p the error but you can also "guide" the model with something like "check that x is done before y". And so on.
Last, but not least, the "experience" in actually using the tools gives you a better understanding of their capabilities and failure modes. You learn where you can let it vibe away, or where you need to specify more stuff. You get a feeling for what it did from a quick glance. You learn when to prompt more and where to go with generic stuff like "fix this".
Now roast the horrendous ugly colors on this horrendous ugly website that is frying my eyeballs. Did you prompt your LLM to create the biggest pile of garbage possible? Or is that just how you talk about other people's work?