Vibe Coding in the Physical World: Robotics, Circuits, and Dangerous Permissions
Greg Sadetsky on driving laptop-refurbishing robots with AI vision, the near-100x speedups he sees in well-tested code — and what AI still can’t do, including vibe-coding physics.
Show notes
Dan and Greg discuss Revise Robotics, where Greg serves as founding engineer building robotic systems that refurbish discarded corporate laptops for donation. The episode opens with a description of how AI vision models allow robots to navigate unfamiliar BIOS screens and unpredictable laptop states dynamically — a capability that wasn’t feasible a few years ago. Greg reflects on how LLM-powered vision surprised even him as a “second gift,” enabling a kind of general adaptability that previously would have required exhaustively pre-coded state machines.
The conversation digs into Greg’s hands-on experience using Claude for hardware projects, most vividly illustrated by an Arduino RPC library he built on a Raspberry Pi in under five minutes — a task he estimates would have taken a full day by hand. Greg draws a sharp distinction between projects where AI delivers near-100x speedup (well-defined problems with existing patterns and a testable harness) versus cases where it gets confidently stuck in loops. His Minivac 601 circuit simulator project becomes the central cautionary example: months of fruitless AI-assisted attempts to simulate relay circuits collapsed once he realized he needed a real physics engine rather than asking the AI to re-derive Kirchhoff’s laws from scratch.
A recurring theme is the tension between speed and trust. Greg describes his journey from clicking “yes” to every Claude permission prompt, to briefly trying sandboxing tools like Nono, to ultimately running Claude with dangerously-skip-permissions locally — partly out of pragmatism, partly because he concluded the permission theater wasn’t actually catching anything. He shares his “committee of elders” technique, routing important decisions through Claude, Gemini, and ChatGPT simultaneously and only proceeding when all three agree. Dan shares his MMI hook tool, which intercepts Claude’s bash calls to enforce conventions like always using uv instead of raw Python.
The episode closes with a candid discussion of the emotional and societal costs of this pace. Greg describes a new kind of frustration — distinct from normal debugging — when an AI tool fails after drawing you deep into a rabbit hole. He and Dan also address broader concerns: the acceleration of security vulnerabilities, the environmental cost of GPU compute, and AI-driven job displacement. Both acknowledge they can’t stop using these tools even as they see the harms compounding, and end on a cautiously hopeful note about open-source and local models eventually offering more control.
Chapters
From this episode — Greg Sadetsky
When I bring it to the three elders — Claude, Gemini, and ChatGPT — and they all go, yeah, this is fine, there's no hole, I'm kind of like: there's probably no hole.
How can you not call that a drug? It's literally like you've walked all your life and someone was like, You want to try a bicycle? A bicycle going downhill.
The people writing malware with LLMs right now must have so much fun. It must be an incredible time for them. We are unprepared.
I turn around, we maybe talk about one thing — it's five minutes, it's done. The library is perfect, it's tested, it self-tested itself.
Mentioned
- Revise Robotics
- Arduino
- Raspberry Pi
- uv
- MMI (Mother May I)
- nono
- MikroTik
- Recurse Center
- Fly.io
- sprites.dev
- 10X Management
- Claude Shannon
- Minivac 601
- Verilog
- The Field Guide to Understanding 'Human Error'
Transcript
I’m Dan Gerlanc and welcome to Agents and Engineers, the podcast for engineers building and building with agents. Today I have Greg Sedetsky with us. He is currently the founding engineer at Revise Robotics. Prior to that, he was a fractional CTO with 10X Management for 11 years, where I was also a client.
He before that was founder, CTO, and CEO of at least three other startups, including a web mapping startup that was acquired by Apple. During Hurricane Harvey, he created a map that helped rescue 1700 people. And he was awarded by both the US Coast Guard and the Canadian government for that work.
Well, welcome Greg. Great to have you on today.
Welcome.
Thank you so much.
Well, you said
impressive background, so
No, no, no. You
you’re very kind. yeah. you have to tell everybody about the bourbon company started. Okay. I haven’t started this one. That’s a dream. yeah, so yeah, Revise is super, super fun. startup that I joined quite recently here in New York City. The short of it is that there’s a ton of laptops that get kind of discarded after they’re used in corporate, you know, the corporate world, right? So huge company, thousands of employees.
yeah.
Yep. Yep.
Mm-hmm.
So wow.
Yeah. Yep.
Yes, it is.
Yeah.
Yeah, no, a hundred percent. Yeah, a hundred percent. Yeah.
Right.
Right. It’s not like can this run Crysis it’s like can I use Chrome to get my email.
Yeah, can I go on a Google form? You know, I I’m even like, you know, telling my girlfriend it’s like you know, she was like, we embed a Google form like on a website we’re doing. And I was like, you can’t like the mobile like like a mobile Google form form is broken enough. Anyway, laptops are good. and yeah, so it’s very, very nice. But one of the sort like one of the many twists that are kind of interesting about what we’re doing is that there’s a big robotic component because this kind of work right now is done by humans who basically stare at a Windows installation screen.
and they just stare at that on like multiple laps, which is a crazy insane thing that we are forcing humans to do.
Wa
like watching the installation process happen? Okay. Yeah. Yes.
Watching yeah, as interesting as watching grass grow, really. And that’s
on one side. And then the other side, which is really interesting and kind of like novel and wouldn’t have been possible a few years ago, is that we do use AI. But the AI, one of the things that the AI does is that it makes decisions as to what to do next. So it is basically watching the screen with a webcam. And then it’s like, okay, like the BIOS
Right.
Yeah.
Yeah.
Yeah.
Right. Like how many
different models would you need to at each step to get to the next step there? Yeah. Yeah.
Yeah, and exactly, and how many states a laptop can be in. And
you and we’re really talking about, you know, laptops that maybe, you know, the battery is like for just faulty enough that it will boot, but then kind of like mid operation shut down, but the AI can still like figure that out or try another strategies, or sometimes it has to press F four, but like more quickly, and it like will direct the robot to like we have like our video on our website is very like you kind of see the robot like just jam F four and that’s what it does. so it’s it’s it’s kind of yeah, it’s kind of wild.
Yeah.
Yeah.
Rip.
Right. Yeah.
So how did had you worked in robotics before this? How did you get into doing this? Okay. Uh-huh. Okay.
No, but they knew that. Yeah. They knew that before hiring me. I did not hide my non robotic
background. I did have a very old GitHub repo where I listed all the robotic arms that one day I wish to have. And this was my claim to like I’ve been thinking about it for a while, which is completely true. But yeah, all all my like little smart like I have like little art experiments and things that I release on the web.
Yeah.
The robotic arms. Okay.
Right.
Right.
Right. You don’t you gotta be careful. Yeah.
Right.
Right. You have you have to make
sure it actually doesn’t go back to the old way and smash the laptop. Yeah.
Yes, yeah, yeah. I I can i e ev even hurting the laptop in the sense I’m like, you know,
it’s still much better than hurting one of my colleagues. so we try to get away for that, but knock on wood it hasn’t happened, but it there’s close calls and then we’re like, all right, how do we We try to learn we’re try learn from the close calls to our yeah, our our exact our post mortems are hopefully not mortems. Haha. So it’s but but that’s the spirit, yeah, that’s the spirit.
Yeah, so yes.
So how mu how much of this has changed in the last year or two years with LLMs, vision models, or how much can be done now that couldn’t be done before? What like what does that transition kind of look like?
Mm-hmm.
Yeah. I’d say yeah, I mean there’s two aspects of it. I I think on one hand there’s like you know, there’s like the company and I junced semi recently, so I I’m also not gonna like opine necessarily what was going on years ago. I think, yeah, from the company’s point of view, as I understand kind of its history, specifically vision models, which I have to say, like in a sense that this is a very dumb thing to say, but like I think it’s kind of intrigu intriguing that when we got LLMs and we’re like, all right, we have this like rock from that
Yeah. Yeah.
Yeah.
Yeah. Yeah.
Ramp.
Mm-hmm.
Right. Yeah, yeah.
Yeah.
Right.
Yeah. Sure. Yeah.
Sure. Yeah.
so we w we do still like validate them one by one, we
read them. But who, how, where? Like when does that so yeah, so the vision models, again, back to the company, that made a huge, huge impact because the robot can see the laptop it’s working on and make take decisions. the AI LM component that I think we’ll talk about a lot in terms of the coding, I mean, in the sense it’s just a multiplicator of speed, right? It’s like you can have a smaller team, you can go faster, you can make more projects.
Yeah.
Yeah.
Yeah.
In c in Claude or in GitHub?
okay. Hm. Yeah, yeah, the s where it has the session history for Claude. Yeah.
Yeah.
Yeah.
Yeah. Yeah.
No,
I I know lots of folks who’ve gone there and
It’s
it’s it’s fantastic. It it’s just like yeah, the the best people and the best but when you’re there you feel like it’s a such sp such a special moment and it kinda gets the best out of you and there I maybe worked on twenty projects at one time. Now I work on twenty projects like all the time. which yeah, it’s like that’s novel. I don’t know what the old number is, but now it’s too late, it’s out of my mind. anyway, I don’t know if that’s
Yeah.
And do you think I mean, I like to think that everyone is now using these, but I mean empirically I can’t anecdotally I see it right on social media, but also there’s still some people who say, Hey, I use this and it’s not really doing that much for me. Do you think there’s something that separates developers who get a lot out of working with
Yeah. Hmm.
Yeah.
Yeah.
Yeah.
Uh-huh.
Mm-hmm.
Yeah.
Yeah.
Yeah.
Yeah, and
I think sometimes almost better and sometimes because you’re coming at it from what’s my problem versus how do I engineer this?
Right, yes.
Yes, exactly. Yeah, they did not care. They they cared for the thing to like exist and then hopefully work and then hopefully kind of like be okay. But they would I think were even very open to the thing like half working because they thought, well if I at least if I get it I can fix it. So the big thing that I saw there that I found really interesting was that they were it really felt like seeing maybe like whatever ourselves or myself like two, three years ago, they were going to cloud.ai with cloud like cloud.com
Yeah.
Yeah.
Yeah.
Right.
Yeah.
Mm-hmm.
Yeah.
Yeah.
Right.
Yeah. Right.
Yeah.
Mm-hmm.
I wrote a tool that actually you use it as a Claude hook and it lets you have rewrites where it will block things like Python three and rewrite it as uv run.
Wait, okay, what’s
a hook and what’s a rerun? I’ve never heard of those things.
that’s interesting.
I saw that. Yes, yes, yes. Okay.
Mm-hmm.
Mm-hmm. Mm-hmm. Mm-hmm.
Mm. Mm-hmm.
Yep, yep, yep, yep, yep.
Mm-hmm. Mm-hmm.
Mm.
tell
it what you what commands you don’t want it to run. And there’s a mode in the hooks where it it communicates it back to Claude or other harnesses have this ability where you send it a message and say, hey, don’t use Python 3, always use UBRun.
Right.
Mm-hmm.
Hmm.
But can it so so so this is like I don’t know if this is what part of my opinions. I sort of like, in a sense, because you see, you know, you fill up the context window, it remembers one thing and then it forgets it, it writes a markdown, found and it reads it back. In a sense, I like almost think it’s almost like a waste of time to write any markdown instructions. So to me, I’m interested, I mean, I’m interested and generally I I think people should check it out, and I know you do really good work. I’m not doubting that, but
Yeah.
Mm-hmm. Yeah.
It’s not more markdown. Yeah. Yeah.
Yeah.
Yeah.
So tell tell me more about the dark side.
I run cloud dangerous
skip permissions all the time.
Nope. Nope.
Yep.
I have been enabled so the the thing is I got a taste for it in a specific environment. So right. I think the history for me of like my pr my relationship with permissions was that you get the tool and you’re like, my god, yes, Cloud, to ask me every single question. Yes, you and I were doing this thing. And then you become, of course, you know, the whatever the Homer Simpson scene years ago where he like had a nuclear plant, he gets to like Woodpecker that just presses the button all the time.
Mm-hmm.
Yeah.
Yeah.
Yeah. Yeah.
Yeah.
Yeah.
Yeah.
Yeah.
Yeah.
Okay, yeah.
Yeah.
What is what is this tool? I haven’t used it. No no, no. Okay. Okay.
Yeah.
Right.
Yeah.
Red, yeah.
Right.
Yeah.
Yeah.
Right.
Yeah. Like deleting
writing all zeros to your hard drive. That kind of stuff. Yeah. All all sorts of fun, yeah.
Yeah, guess yeah, writing zeros, I didn’t think of that. Yeah but but like Yeah,
but it’s it’s like will it happen? Anyway, so so then no no promises.
I mean
some it would probably be unlikely to happen unless you were doing some task that yeah, right. I mean if you had something injected into your context that like sure, like you downloaded a website that then tells it to do this, sure. That’s I think the risk. But working on your own project probably not.
But but unless you ride the first LM virus.
Then no problem. Then no pro
Yeah, and it’s like and then Yeah, or like obvious No,
like in my right exact. I I I mean it makes mistakes, it does sometimes go like, let me delete this file ‘cause it takes up space and then I guess if I had clicked yes, yes, yes, yes, and then asked me to do and I would well know, I’d be like, Whew it it’s good that we’re anyway. So so I install Nono basically it stops working. Because of course Cloud Code is like releasing probably like fifty thousand updates a day
Yeah, yeah.
Yeah.
Yeah.
Yeah.
Yeah.
Mm-hmm.
tell me more.
Tell me more about that.
Well, right.
So so I ha I have a few hardware things that I did. this one, perfect example. This is like this is school textbook, school book, whatever. So my colleagues w have like a lot of Arduinos, they a lot of things. Basically, there’s a lot of like copy pasting, and we have like homegrown libraries. That’s all very normal. And I kind of go, Hey, I don’t have an Arduino library in the you know, like there’s like a the official libraries, whatever, or just like libraries that you’re like kind of like NPM install.
Okay.
Mm-hmm.
Yeah.
Yep. The G the G R P C of Arduino.
Yeah.
Well the
So when when was when was this? Okay. Very recent.
Ha ha.
He what
one of the one of the founders of the company or okay. Okay, cool.
Yes, he’s a CTO. Yeah, one of the founders, exactly.
And so we so basically I put the right resistors after like an hour. Again, Simpsons and the Simpsons at one point Bart is explaining something to Homer, and then he at the end he’s like using puppets and Homer still doesn’t get it. That’s how it felt. But I get it, I get it. And so I put the right resistors, and basically what I what it does is that the Arduino forget it LMs, forget anything. If you just ask it what is the voltage at this specific pin, it will always, because of Kershoff, because of physics, because of everything.
Mm-hmm.
Yeah.
Yeah.
Yeah.
Yeah.
Yeah.
Well, right, I mean
you instru you gave it the parameters and made sure it did what you want it to do, right?
Yes.
We use it. It
it’s it’s short, it’s very readable, and it’s beautiful C plus plus, I guess. I admittedly don’t have the C plus plus skills to like tell you whether it like use the right C out star star. It works. So that got me hooked. And then I again I have like more example of hardware things. Some gone
Here.
Yeah. And how long do you
think it would have taken you to build this by hand?
Yeah.
Yeah.
Yeah.
Yeah.
Yeah.
Right.
Yeah.
Yeah.
Yeah.
Yeah, s
so is what is that like a hundred X speed up relative to doing it by hand?
Right.
Right.
I mean that’s that’s like like how can you not call that a drug? Like it’s literally like you’ve walked all your life and someone was like, You want to try a bicycle? And it’s like, Yeah, that’s especially like a a bicycle going downhill. Which is also maybe that that that could be like a funny thing of like sometimes it’s still a bike. It’s just the battery’s out, it’s a s it’s a heavy city bike. You’re going uphill and there’s you know, and there’s like potholes. You still like the bike. You still like the idea of pushing this bike, but maybe like walking would have been less effort.
Yeah.
Yeah.
Right.
Yeah.
So well
so that well that that sounds like another scenario. Is like is that like where does that case come up that you found where yeah, that’s still easier than doing it by hand, but there’s some potholes or it’s a city bike that maybe fell in the East River. Yes. Yes.
But I for somehow I’m like so attached to it, or I’m paying $100 a month for it, so I’m not gonna
Sunko’s fallacy. The one example where it still did things right, but it was more like when it got things wrong, it like kind of like put us into trouble and stuff, is I’ve vibe set up the networking at our workplace. And we basically went from like having like a home router that like could have like a few ports.
Yeah.
Yeah.
So is this
like for everyone’s laptops at the office or is this also the hardware? Okay. It’s like also all the machine all all the robots, everything too. Okay.
Yes. For everything. Yeah, for everything.
Everything, right? And it’s
like when we got the space, there were like Ethernet jacks, but basically the previous tenant had like just cut all the cables. So one thing was hardware, which was really fun, and I’ve done it before. Some of it was a little bit new, but I got like a punch down tool if anybody has ever done that. Basically you get a bunch of cables. You know it’s like those panels with like all the Ethernet jacks. So I punched them one by one. I got the right tools. I tested that was fun. That was like the very Yes.
Okay.
Yeah. Yeah.
So you’ve got a ma so you’ve got things that are hardwired, you’ve got
Wi Fi and you’ve got things that need a good network connection. hard wire connection.
Well, right. It’s like yeah, and yeah, and it’s like exactly, yes,
that’s that’s correct. And we have FIOS and but but it’s more like, you know, at the end of the day, you really to speak in networking terms, like you just want like that basic firewall and you kind of want those wireless access points to kind of work. And there’s like some kind of magic when access points like use the same network name, computers and phones will mostly magically kind of like transfer blah blah. But I also
Yeah.
Right.
Right, so you so you
can switch between different access points based on which one is stronger or things like that.
It is it right
closer, but then like also Macs and other devices will like make their own decisions as to when to swap ‘cause there’s like whatever. So I’ve used and kinda like microtik equipment and maybe I don’t think it’s like masochistic, but it is a little bit like more complicated than it should be. And it but it’s kind of cheaper and it’s it’s actually like quite powerful. It’s been very stable. I’ve I’ve kind of used it for like many, many years now, like in my home apartments and stuff.
Right.
Mm-hmm.
Yeah.
Yeah.
Mm-hmm.
Ha ha.
Yeah.
Right.
Yeah.
Yeah.
Yeah.
Yeah.
Makes sense. Yeah.
Yeah.
Right. But in ish yeah. Like if you like your printer, right, you probably
always want at the same IP address. Yeah. Things like that. Yeah. Yeah.
The exactly. And especially like for us, like our robots. Like we run our robot
so you know, and other devices because we want to address them with a stable IP. So and then of course I can hear even from this record
Yeah.
Right. I mean you can
do fancy things like give it a name and do DNS resolution, r right, but that’s more work. Yeah, yeah.
Yes, you can do tail scale. Yeah, you can do telescale. There there’s like it’s
not it’s not like this is the only way, but it’s got it’s almost like also a like an easy, quick solution to just like bake those IPs. So you know, I was also like migrating that Yes, yes, yes, exactly. Which and and then it could Yes, yes, yes, exactly. And then you everybody has to use a DNS, but yeah. So I was kind of bringing all of those configs, you know, into the micro tick, but I let Claude like talk to the router directly. And
Yeah. Yeah. But first adding a lot of extra complexity of running separate DNS, you know.
Yeah.
Mm-hmm.
Yeah.
Mm-hmm. Yeah.
Yeah. Yeah.
Mm-hmm. Right.
Right.
Yeah.
So so you’re using
basically like a different agent adversarial verification.
Yes. Yeah, at the
material stuff is like I think I think there’s a th it’s like I almost want to say maybe this is one of the like almost most crucial thing. It’s this like and you know, I can see it in myself. It’s like it’s something about, you know, the difference between just having these thoughts and then when you write down and then you start kind of reading yourself or like even the rubber duck phenomenon where like even hearing yourself say say a thing, you can like react to it, which is sometimes different than if you just have those thoughts bouncing around. And in a certain sense, like
Yeah.
Yeah.
Yeah.
Mm.
Yeah.
Yeah.
all like
just like arguing against each other until there is sort of a consensus. And of course, that to me that still feels like well, if you got a bunch of six year olds arguing, the consensus will still be the consensus of a six year old mind. No one will be like, you know what, we should be d reasonable and bury the hatchet and you know, go have a coffee with the spirit. It will still be like we should put glue on their seat and call them a bad
Yeah.
Sure. Yes. Yeah.
you know, a bad
whatever, and then we should still like pull their hair or something. So I think that’s still that still is the case, right? Like these agents still they’re kind of like some limit it it’s it’s kind of like a dice, like it doesn’t matter if you throw it a hundred times or but the probabilities are the same of them getting to any number. And in the same sense, you know, if it’s wrong twenty percent of time, I’m not a stats person, don’t come at me, but there’s something that feels like more eighty percent just gives you eighty percent or less than eighty percent. It doesn’t
increase the probability of it not make Right, or worse. Yeah, the Right. I know it should, but it I also don’t want to be like yeah, I it’s like but yes, yes. It’s like yeah, get a room of, you know, reasonable people is there gonna be a breakthrough? I I don’t know. It’s it’s hard.
Probably much worse than eighty percent. So it’s as a compound because those compound at each step, I guess, right?
But
how’d you feel where it is now? Is it 80% or like 95%? Because I felt like pre-Opus 4.5 and into November that we were kind of at 80% a lot of times. And it was just very hard to I didn’t really trust Claude Code to be editing my code because it was too 80%.
Uhhuh, uhhuh. Uhhuh.
Yeah, yeah, yeah, yeah, yeah.
Yes, yes.
Mm-hmm. Mm-hmm.
Mm-hmm. Mm-hmm.
Mm-hmm.
Mm-hmm.
Mm-hmm. I I think I think that the number you know, I like I’m I I’m able to make averages, but I almost kind of like almost, you know, like like like air bars or whatever. Like I want to look at the you know, the shape of these errors. I think that I have a lot of one hundred percent. A ton, right? I look at my list of projects again, these like twenty projects, like many of them were like entire full on
Yeah.
Yeah.
Right.
Uh-huh.
Yeah, you guessed.
Sure, you’re right.
Yeah, but that’s
a that’s the worst way to look at it. I mean, that’s the the book argues that that’s a bad way to look at at a thing because y you have to put yourself in the the head and the circumstances of the person at every step before that, leading them that path. Why did they take all those decisions? And also why did the system let them make those decisions? And in a sense, that’s exactly maybe like what we’re talking about, right? It’s like now if you told me if you told me, hey, I have a more reliable claw, it’s a better percentage, it’s a new model.
Mm-hmm.
Yeah.
Yes.
Yeah.
Yeah.
Yeah.
Yeah.
Ha ha
Yeah.
Right.
Yeah.
Yeah.
Yeah.
Yeah.
Yeah. Mm-hmm. Right. Yeah.
Yeah. Right. Yeah.
Yeah.
No, or right. Or it might
run but not do the thing you want it to do. Yeah.
What exactly, of course. And it’s like we were never like,
my god, like I feel so bad, I’m not No, y it’s not like Linus Torvalds when he wrote Linus’s code, run no. It’s like we already have this like I mean the iteration is one thing, the testing is one thing. And of course there’s still an ethereal human whatever this is, still kind of like an excited eleven year old who just learned to code, kinda, and reaches for Python three even though we beg it not to.
Yeah.
Yeah.
Yeah.
Right. Yeah, no, for sure.
Yes.
That’s right. Yeah, yeah,
it’s really the that kind of generalization. I’ll I’ll give two very specific examples where it totally failed me. never got out of trouble, was never helpful, and brought and this is truly the like AI psychosis feeling of like you’re going crazy with it, it keeps convincing you it’s gonna get it right, or it almost got it right, or next time, and it never does. And it can also, even more fundamentally, never actually
Mm-hmm.
Yeah.
Yeah.
Mm-hmm.
Yeah.
Yeah. Yeah.
Mm-hmm.
Mm-hmm.
Like and what is what is a relay for
Yeah.
Yeah.
Yeah.
Yeah.
Right. It’s like But wait. Yeah. Mm-hmm.
But turns out he builds like a tic-tac-toe machine. Yeah, he builds
like a thing that actually does recognizes characters like OCR and can like like pretend to be an elevator. So it’s really surprising. Anyway, so I don’t have one. It’s expensive. There’s not a lot of them left. I want to build a JavaScript version of that. And it’s kind of like a long project. I work on it sort of like on and off. Also very like pre-AI, post-AI, kind of like over over two years in the last two years. And the thing that I don’t get, which is really my fault, is that
Mm-hmm.
Yeah.
Okay. Mm-hmm.
Right.
Yeah.
This like what
languages like Verilog and things like that have
Yeah, yeah, the very
exactly. Or y or or basically like if I gave you if I just like if you know somebody hands you like a circuit diagram, like yes, you can read it, you can be like, this is this component. But if if you were to go how does this circuit work and you were like stepping it step by step, you would have to do the math that again, nineteenth century Kirchhoff, you would have to go, Well, this voltage will drop by this much with this amp, and then you multiply some numbers, this resistance, and you have to do that for every single and it that’s called like
Yeah.
Right.
Yeah.
Right.
Something you’d need a sim essentially you need a simulator that does this.
Yes. Yes. And a valid
one that’s actually mathematically the valid enough. So I for a long time reject that idea and kind of write my own really bad version of that with like graphs and nodes. And you know, it works until it like it’s kinda like, you know, Newton laws work until you go really fast and then it they don’t work at all. So not that I was reinventing and I I just compared myself to Newton, I really didn’t, but like my stuff would work and then immediately if you put like two components in a different way, it would break and I’d be like,
Yeah.
Yeah.
Yeah. Yeah.
Yeah.
Mm-hmm.
Right.
Mm-hmm.
Yeah.
So you use
that as the the kind of kind of the core engine to run this and then built on top of that.
Yes. That’s exactly right. Yes. And yeah, and on top
of that, no problem. It can figure out components, it can place them, it can call the engine. But it still felt like I don’t know, maybe it’s a silly gap and everyone is laughing at me right now and pointing at fingers and taking screenshots and putting it on TikTok. This guy thought Kershop’s laws could be vibe coded. And I didn’t think that, but I anyway, there was like a disconnect between what I thought was possible and of course what the AI was convincing me was almost about to get right. So that’s like one example.
The other example that I won’t go in as much, but it was yeah. Right.
Well I guess it didn’t have the specification. Like if you had the
once you had that, you essentially had a specification of the system.
Yeah.
Right.
Yeah.
Yeah.
Right.
Yeah, right.
Yeah.
Yeah. Yeah, I agree
that if you’re not in at the correct level of abstraction, like if you knew you needed a database, say, right? Like similar to if you need a package manager or virtual environment, like maybe Claude will start trying to build the database, but
okay, sorry.
Yes, yes, yes, yes, yes. Correct.
Yep. Yep.
Yeah.
Like
historically it would take yeah team like ten years to write a database and that’s with knowing here’s the proper like, what are the properties you need in a database? Like it’s probably not gonna get to that.
That’s right. That’s right. That’s right.
Right. Yeah, that’s a great ex yeah. Yeah yes, exactly. Yeah, it wouldn’t
event sequel. Yeah, and and that’s like I think I think we are in that state right now. It’s like imag that’s that’s perfect. I I was I thought at first I was thinking you were kind of saying like you you didn’t specify like no, but if it didn’t know what a database is, which is exactly the case for a lot of things that it doesn’t know, it’s not a good like it’s just not there yet. That the six the percentage three. Like one.
Yeah.
Yeah, you don’t want it to build the database as part of the project. As part of
building like your circuit designer, it’s like, we need a database. Let’s build a database. No, that’s not gonna work. Right, yeah. Yeah.
Yes. Yeah, no. No, it should just stop itself. It should be like, Well,
I hey you’re you know, it’s like this is of course like you you see to happen. It’s like it’s kind of if you’re like, Hey, look let’s invent let’s invent teleportation and it goes, No problem, let’s get started and it’s like and you know, because it knows that teleportation is hard, it will not do it. But the things that it thinks is like, Okay, the other example, I’ll just mention it just in case it can scare somebody or like i i you know, it get somebody excited about solving it.
Yeah.
Yeah. Yeah.
Yeah.
Yeah.
Mm-hmm. That’s cool. That’s cool
because that it’s not something I have any experience working with. So cool to hear about. Mm-hmm.
Yeah. Right. And and again, I of course am also coming at it very, very
new. I’m like new to cat, I’m new to this. We’re 3D printing. We’re doing a lot of robotic, like optic like recognition. It is really fascinatingly like also convinced, writing a lot of code, running it in simulators, not seeing what it does. And you know, as someone very smart, like actually my opposing right now, very good friend, he was like, Well, it doesn’t it doesn’t have like a cube in its head when it thinks about a cube, right? And so it’s like
Yeah.
Right.
Rare.
Yeah, I mean that’s interesting in the
sense of do you need to how do you present that world to it? Like do you need to show it s images of the rotation, things like that, right? ‘Cause otherwise I mean it makes sense, it can’t iterate. Like I th yeah.
Screenshot Yeah. But is that enough?
No, it can’t iterate. Exactly.
And and it’s mm-hmm.
I mean I think that’s part of
why it’s successful in code and a lot of things is because it can iterate and t like the tests tell it when it’s wrong. So then it can iterate. yeah.
Yes. Yep. Yep.
Yep. Yeah, it’s a it’s a very closed loop. Loops are good. Loops are good, you
know, dangerously skip dangerously do more loops is good.
Yeah.
Which I still I I still don’t run dangerously skip permissions locally. I know there is a new met there is a new kind of somewhere in between method where Claude actually will evaluate. You can have Claude they added it as an option and it will actually run each command through a prompt to see if it’s safe.
Mm.
Mm.
Interesting.
that’s funny. I mean that just you know
I feel even more validated by that. I’m like everybody every the kids Dan Jules’s kit permissions, you know, and I’m I think I don’t know if it came from React. React has dangerously set inner HTML, like has one permission, not one per one attribute. It was kind of the joke, they invented this really funny name where they were like, Look, you’re just breaking React, you’re not even using it properly, things are gonna be terrible. Sure, if you want to.
Yeah.
No.
Right.
Yeah.
Yeah.
Yeah.
Yeah, I mean I’ve been
testing with doing things like using fly.io has a service, sprites.dev where you can run VMs that have this. And like you could say here’s only right, yeah, here’s only like a git token for this repo, right? So limit what can go wrong, but it takes more time. Yeah.
Yes, that’s right. That’s right.
Yeah, exe.dev. Yep.
A VM. Yep. Yes. I I yeah. That’s right. Takes more time. And I fully
this is the thing. If I’m responsible or somebody was like, looked me in the eye and said, What do you really want? You just want to run a piece of code on a piece of compute. You’re not desperate to give it your SSH keys. I’m I don’t want it to delete my like family pictures. It’s just, yeah, convenience wins and blah blah blah. But a hundred percent. I I think of course if we can only do that.
Yeah.
Rip.
Yeah.
and run
these autonomously. I think we really get like we get our lunch and to eat it too or a cake or whatever. I th I think it’s that. But clock code is a thing you install on your development computer. So are they gonna give us compute in the cloud for free? Is this part of the next plan?
Yeah.
Right.
Well no, yeah, it
is I mean, it adds another cost to it because you have another computer running and I mean you need enough power on these to be able to actually run the code and the tests. So it’s not like a small computer, yeah, too. So it is I think that adds to the challenge of it is you need a real machine that can do this. Yeah.
That’s right. The thing. That’s right. Yeah. And my code talks to my robot. Yep. That’s right. So I’m
Yeah, maybe I should
just, you know, move off my pictures and my SSH keys and just kinda consider my comp like I should have another computer. It’s a whole computer. I mean again this is so absurd.
but I think when you asked me how long ago that happened, that’s a scary question. It’s like, yeah, this just just happened. Just just just just just happened. So the the like living whatever living through this day by day has a very, very special feeling where we’re all discovering this kind of thing that we’re
Yeah.
Yeah.
Yeah.
Yeah, for sure.
Yeah, one if you have to go. But yeah, one th so this one part that I actu have some talked to some people about, which I do think is I mean I know I’ve felt this as just there’s more context switching if I’m working on like twenty different projects you could work on at once, right? Before it just wasn’t feasible.
No, no, no, it’s all good. No, no, please, please, please. Yeah, yeah.
Mm-hmm.
Mm-hmm.
Yeah.
Yeah.
W it’s true. I was like asking my girlfriend, why can’t I sleep well in why why am I waking up at seven AM in fright? Yeah, it’s it’s totally
What is it?
It yeah, it feels so different and so happening right now that I don’t have the perspective. Is it more tiring? I think ri yeah, the the part that I really, really th so I I think when I would feel that my emotions are kinda like you know, like I bring work to home sort of deal, like I f I I remember it as well, if I was still trying to solve a thing.
Yeah.
Yeah.
Yeah.
Yeah.
Yeah.
Yeah.
like those anch those anchors. Yeah, yeah, yeah, yeah.
Red.
Yeah.
Right.
Yeah. Yeah.
Yeah.
Yeah. Right.
Yeah.
Do you think it’s more
frustrating when Claude gets something wrong than if you got it wrong personally? Yeah.
Yeah, because you trusted it and then I I
I I do think, yeah, I do think it’s like it’s less pure. It’s more like you ate a bunch of nachos, your tummy hurts, and then you’re like, Why am I doing this to myself? Like I know that I should be eating veggies and I I had veggies in the fridge, but I I felt a little like nachos are kind of it’s like there’s a poisonous comfort and like laziness and a little bit of a fast thing that is like addictive.
Yeah.
Yeah.
Right now? Is this no. No. No. We’re
Yeah, yeah. No, we’re just waiting for the next model to solve all of our problems.
I’m just waiting for the next update. I’m just trying to out trick it. And I’m trying to also think that I’m getting better. And this is like almost like somebody who has like an addiction I’m not to compare, of course, but like there’s something about the lying to yourself, which is like, is this what I’m doing? I’m just being like, No. It’s like kind of yes, back to the people who are like they have a hard time with it, which I I’m sure is true.
And then other people are like, I you’re not using it right. I have those thoughts sometimes, but those are crazy thoughts. Like not using it right. Like, what is it? What is right? And again, it did I go from seventy percent to seventy three percent, and I feel superior to the people who feel that it is seventy percent, and they’re right anyway. So so it’s like not not not great, not the best. I think when Tel Scale came out, it was a net positive for the entire world.
Yeah.
What
does that mean for us going forward though? Like what if we’re doing this, I g I d I don’t know. But
But how could we know? How could we How could we Hindsight is twenty twenty, but we we’re
we’re in the middle of a crash and it’s gonna be human error at the end, and it was like I have no idea. I th I th I think it’s obvious that it’s gonna lead to some really, really bad stuff. I think huge leaks, huge systems that go haywire in a way that we’re not like people have always there’s always been bad code around, but this is gonna like be like dramatically
Yeah.
Red.
Right. Yeah. Yeah. Yeah. Well, I guess that
was the the latest lovable breach where I think they were saying that well I don’t I just I think when I was reading the reports on it, the speed with which it happened, they were
was it?
Yeah, yeah, yeah.
Yeah.
huh. Mm-hmm.
Mm. Yep. Yep.
Right. Takes time.
Yeah, yeah. I have I have I have to read that. Yeah, it’s an accelerator. Exactly.
Yeah, it’s an accelerator. It’s like it’s an accelerator. Yeah. So so I don’t think we’re all gonna be like this was a net positive for the entire world. It was so f like this is this is this is definitely like the invention of gasoline or some some other like substance that we’re like we’re kinda stuck with and it does a lot for us, but
Yeah.
Yeah, I think pro automobiles is probably a good a analogue, right? Like there’s how many people are killed or injured in automobiles every year, right? Yeah.
Yeah, yeah. Yeah.
Yes. Yeah. All transit systems being destroyed because now
everyone will have a car. Of course we don’t need those like, you know, those rail cars, those to be rail cars. Let’s get rid of all of them. let’s build highways for everybody. Of course everybody then like just the other pollution. Yeah, it’s like I I suppose the world is like whatever, worlding better and I don’t know, you know, hearts can be transplanted faster because we can transport them not on on the back of a donkey. Like I suppose. Like there’s some argument for cars that is like
Yeah.
Yeah.
Right. Would we have any of like would we even have the internet, any of these other things without that? Probably not.
Yeah. Yeah, no, if we’re still
y you know, but but yeah, but being part of a technological revolution doesn’t feel like yeah, like e let’s go even faster. It just feels like, yeah, this this will have like terrible harms, intended and unintended. Because intended, of course, like people who are like really looking forward to like firing a bunch of people are like so excited about this, of course. and replacing them with AI. I think it’s also like obviously the like
Yeah.
Yeah.
Yeah. Yeah.
Well.
Right, it’s invested here versus other I mean, there’s
always choices in how we as a society invest our resources, right?
T then Yeah.
Yeah,
and Tom and and you know, it’s like all the nuclear plants coming back online and like, you know, e every company being like, we had a climate pledge, but that that was before we needed like all the GPUs to compute more tokens. It’s like I I don’t find it funny. I find it like it’s like a cynically disgusting kind of thing of like, yeah, the other words were words and now we we’ll have new words. It doesn’t matter, we know what we’re really after. and yeah, and it’s not the betterment of like it’s not about endangered species, not
Yeah.
Yeah.
Yeah.
Yeah.
Yeah. Well may maybe this
the GPUs will push us push us to full solar power ‘cause that’s I think is the cheapest installation to do these days now.
Hey, look
at that. Finally. If it’s yeah I d mm-hmm.
Yeah, I th I I’m pretty sure that
it’s if you’re adding a marginal unit of power today, the cheapest way to do it is solar. So
Hmm. Right, right, right, right, right.
Solar. Well
that’s it. Then, you know, I don’t know. Something something good to look to to to look to look up to to look forward to. but yeah, I I have a hard time saying that I’m an optimist and at the same time I’m such a complete devoted user of it that it’s like it’s it’s it’s a it’s a it’s a strange and I know that my guilt will not do anything to solve anything, so I guess I’m stuck.
Yes, yes, yes, yes. Yep, yep, yep. Yep.
Yeah, that’s huge. Of course. Yep. We’ll we’ll get there. We’ll get we’ll get somewhere. We’ll get somewhere.
Mm-hmm.
Yeah, yeah, yeah. We’ll see what the world is by then. By that time by that time. Yeah, once we have our little
By the by then when when that is, yeah. When I get my
I’m waiting to get my next I still have a twenty twenty Mac Mini, so I can’t really run any serious LLMs on that, but the next Mac I get, hoping can run a decently good local model.
Yeah.
What’s a good model do you recommend? Just curious, just to like
The Qwen, yeah. People like Qwen. That’s pretty impressive, yeah.
Yeah, yeah, yeah.
Yeah, that’s
interesting. Yeah. Yeah. Yeah, what I I’m I’m yeah, I’m just trying to like yeah, sort of follow along by reading like one thread a every few weeks on H N. But yeah, yeah, I I I’ve seen that that’s that that’s interesting. Something to consider for sure. Yeah, yeah, yeah.

