More

mpaepper · 2026-01-27T14:44:21 1769525061

Maybe, however, they might not care as the API is freely available anyways

chromehearts · 2026-01-27T14:46:10 1769525170

You're right.. well hope it doesn't come to that

mpaepper · 2026-01-27T14:43:57 1769525037

For example this post here becomes: https://news.gcombinator.com/item?id=46780583

mpaepper · 2026-01-25T12:33:13 1769344393

How much memory do you need locally? Is a rtx 3090 with 24gb enough?

divyaprakash · 2026-01-25T12:36:48 1769344608

Yes, more than enough. I have rtx4080 laptop gpu with 12gb vram.

mpaepper · 2026-01-23T21:50:23 1769205023

You should look into the new Nvidia model: https://research.nvidia.com/labs/adlr/personaplex/

It has dual channel input / output and a very permissible license

cbrews · 2026-01-23T23:49:06 1769212146

Thanks for sharing this! I'm going to put this on my list to play around with. I'm not really an expert in this tech, I come from the audio background, but recently was playing around with streaming Speech-to-Text (using Whisper) / Text-to-Speech (using Kokoro at the time) on a local machine.

The most challenging part in my build was tuning the inference batch sizing here. I was able to get it working well for Speech-to-Text down to batch sizes of 200ms. I even implement a basic local agreement algorithm and it was still very fast (inferencing time, I think, was around 10-20ms?). You're basically limited by the minimum batch size, NOT inference time. Maybe that's a missing "secret sauce" suggested in the original post?

In the use case listed above, the TTS probably isn't a bottleneck as long as OP can generate tokens quickly.

All this being said a wrapped model like this that is able to handle hand-offs between these parts of the process sounds really useful and I'll definitely be interested in seeing how it performs.

Let me know if you guys play with this and find success.

zaken · 2026-01-24T05:12:37 1769231557

Oh man that space emergency example had me rolling

albert_e · 2026-01-24T08:57:12 1769245032

Ha --

and the "Customer Service - Banking" scenario claims that it demos "accent control" and the prompt gives the agent a definitely non-indian name, yet the agents sounds 100% Indian - I found that hilarious but also isn't it a bad example given they are claiming accent control as a feature?

mikkupikku · 2026-01-24T12:21:48 1769257308

"Sanni Virtanen", I guess it was meant to be Finnish? Maybe the "bank customer support" part threw the AI off, lmao.

adabyron · 2026-01-24T14:04:46 1769263486

Changing my title to "Astronaut" right now... I'll be using that line as well anytime someone asks me to do something.

hnlmorg · 2026-01-24T10:58:51 1769252331

Oh wow. Thats definitely something…

dsrtslnd23 · 2026-01-23T21:58:56 1769205536

oh - very interesting indeed! thanks

mpaepper · 2026-01-23T21:47:52 1769204872

Free generator for e-invoices here: https://www.e-rechnung-online-erstellen.de/kostenlos/e-rechn...

mpaepper · 2026-01-22T19:47:45 1769111265

You mentioned needing 40k tiles and renting a H100 for 3$/hour at 200tiles/hour, so am I right to assume that you spend 200*3=600$ for running the inference? That also means letting it run 25 nights a 8 hours or so?

Cool project!

cannoneyed · 2026-01-22T21:43:24 1769118204

Yup back of the napkin is probably about there - also spent a fair bit on the oxen.ai fine-tuning service (worth every penny)... paint ain't free, so to speak

mpaepper · 2026-01-21T22:09:26 1769033366

Hi everyone, inspired by Alex' cool article about ASCII rendering (https://alexharri.com/blog/ascii-rendering), I revamped my hero section. It now consists of a matrix style rain of characters which transitions into a pseudo random ASCII rendering of an image of myself.

mpaepper · 2025-11-18T12:17:39 1763468259

Seems like the merging with Replit didn't work so well :p

mpaepper · 2025-11-08T10:33:45 1762598025

He might already have enough money, so doesn't care if he gets the absolute best resources.

mpaepper · 2025-10-18T19:44:45 1760816685

What about the security aspects, can it run anything?

d4rkp4ttern · 2025-10-19T11:04:16 1760871856

I assume by “it” you mean Claude code or codex-cli — that depends on how you launched them or how you modified the permissions within the CLI chat; that’s orthogonal to my CLI tools.