Linux coreutils has supported this since 2018 (coreutils 8.30); amusingly, it is the same release that added `cp --reflink`. AFAIK you have to opt out by setting `POSIXLY_CORRECT=1` or `POSIX_ME_HARDER=1` in your environment, or passing `--pedantic`. [1]
If anything the opposite has occurred: HDD scaling has largely flattened. From 1986 to 2014, HDD capacity increased by 10x every 5.3 years [1]. Had that scaling continued, we should have 100TB+ drives by now. I say this not as a complaint, but because it has had direct implications for ZFS.
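To sanity-check that extrapolation (assuming flagship drives were roughly 8 TB in 2014, and treating the 10x/5.3yr rate as exact):

```latex
8\,\mathrm{TB} \times 10^{(2025-2014)/5.3}
  \approx 8\,\mathrm{TB} \times 10^{2.08}
  \approx 950\,\mathrm{TB}
```

Even with generous slack in the starting point, naive extrapolation lands far above 100 TB, while actual flagship HDDs today sit at only around 30 TB.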
All this data is stuck behind an interface whose speed (realistically, once a file system & kernel are involved) is hard-limited to 200-300MiB/s. Recovery times skyrocket, as you simply cannot rebuild parity or copy data any faster. The whole reason things like draid [2] were created is so larger pools can recover in less than a day, by resilvering sequentially and by distributing hot spares so that every drive already holds 1/N of the spare capacity ahead of time.
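For the curious, a minimal sketch of what that looks like in practice (device names and geometry are made up for illustration; see zpoolconcepts(7) for the draid syntax):

```sh
# Hypothetical 14-disk dRAID vdev: double parity, 4 data disks per
# redundancy group, 2 distributed spares.
zpool create tank draid2:4d:14c:2s /dev/sd[a-n]

# On failure, the rebuild writes into spare capacity spread across
# all 14 disks, so it is not bottlenecked by a single replacement
# drive's ~200-300MiB/s write speed.
zpool status tank
```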
Not quite that level, but you can get 8TB NVMe drives. You'll pay $500 a pop though... [0]. Weirdly, that's the cheapest NewEgg lists for anything 8TB and above, and even SATA SSDs are more expensive. It's a Gen4 PCIe M.2, yet a SATA SSD costs more? Prices are better one bracket down, but it's still surprising to me that the cheapest 4TB SATA SSD is just $20 cheaper than the cheapest 4TB NVMe [1] (and for a little more you're getting recognizable names, too!)
It kinda sucks that things have flatlined a bit, but it's still cool that a lot of this has become way cheaper. I think NVMe drives at these prices and sizes really make caching a reasonable thing to do for consumer-grade storage.
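For example, a sketch assuming a ZFS pool named `tank` and a spare NVMe (lvmcache or bcache would be the non-ZFS equivalents):

```sh
# Use a cheap NVMe as an L2ARC read cache in front of an HDD pool.
zpool add tank cache /dev/nvme0n1

# Or dedicate it to the intent log to speed up synchronous writes:
# zpool add tank log /dev/nvme0n1
```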
In terms of production, the flash chips that go into SATA and NVMe drives can be pretty much the same: only the external interface differs.
The biggest cost driver for flash chips is not the burst speed they can be read or written at, but how resilient they are (how many times they can be overwritten) and their sustained speed (both depend on the tech in use: TLC, SLC, MLC, 3D NAND, wear-levelling logic...). Even at SATA speeds, you need the very best chips for sustained throughput.
Still, SATA SSDs make sense since they can use the full SATA bandwidth and have low latency compared to HDDs.
So the (lack of) price difference is not really surprising.
I find this unconvincing. The actual discussion of LLM generation is very lacking.
The original link [1] cites a discussion of the per-query cost of GPT-4o at 0.3 Wh [2]. When you read the document [2] itself, you see 0.3 Wh is the lower bound and 40 Wh is the upper bound. The paper [2] is actually pretty solid; I recommend it. It uses public metrics from other LLM APIs to derive a likely distribution for the context size of the average GPT-4o query, which is a reasonable approach given that data isn't public, then factors in GPU power per FLOP, average utilization during inference, and cloud/renting overhead. It admits this likely has non-trivial error bars, concluding the average is between 1-4 Wh per query.
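Roughly, that methodology boils down to something like this (every specific number below is my own illustrative assumption, not the paper's):

```latex
E_{\text{query}} \approx \frac{2 \, N \, T}{R \, u} \cdot P \cdot o
```

With N ≈ 2×10^11 active parameters, T ≈ 1000 tokens, R ≈ 10^15 FLOP/s per GPU, utilization u ≈ 0.4, GPU power P ≈ 700 W, and datacenter overhead o ≈ 1.5, that works out to about 1 s of GPU time and ~1050 J ≈ 0.3 Wh. Longer contexts and lower utilization push it up into the 1-4 Wh range, which is exactly why the error bars are wide.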
This is disappointing to me, as the original link [1] brings in this source [2] to disprove the 3 Wh "myth" created by another paper [3], yet that 3 Wh figure lies directly within the error bars their new source [2] arrives at.
The methodology is inherently flawed: it assumes all the infrastructure, training, etc. is going to exist with or without individual queries, while trying to answer a different question, namely the impact of AI on the environment. It's like arguing the environmental impact of solar electricity is zero because the panels would exist either way.
Thus the results inherently fail to analyze the underlying question.
A more realistic estimate is to take their total spending and assume X% of their expenses go to electricity, directly or indirectly, since all of that embodied energy adds up; a back-of-the-envelope version is sketched below. Even that ignores the energy costs incurred on 3rd-party servers when they download their training data.
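Something like this, where every number is a hypothetical placeholder, purely to show the shape of the estimate:

```latex
E \approx \frac{S \cdot x}{p}
  = \frac{\$10^{9} \times 0.10}{\$0.10/\mathrm{kWh}}
  = 10^{9}\ \mathrm{kWh}
  = 1\ \mathrm{TWh}
```

Here S is total annual spend, x the fraction going to electricity (directly or via suppliers), and p the average electricity price. Crude, but it at least counts the infrastructure that the per-query framing writes off.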
This is really just a variant of the classic "pretend you're somebody else, reply as {{char}}", which has been around for 4+ years and, despite its age, continues to be somewhat effective.
Modern Skeleton Key attacks are far more effective.
I think the Policy Puppetry attack is a type of Skeleton Key attack. Since it was just released, that makes it a modern Skeleton Key attack.
Can you give a comparison of the Policy Puppetry attack to other modern Skeleton Key attacks, and explain how the other modern Skeleton Key attacks are much more effective?
Seems to me "Skeleton Key" relies on a sort of logical judo - you ask the model to update its own rules with a reasonable-sounding request. Once it has agreed, the history of the chat leaves the user with a lot of freedom.
Policy Puppetry feels more like an injection attack - you're trying to trick the model into incorporating an attacker-supplied policy before answering. Then they layer two tricks on top: "it's just a script! From a show about people doing bad things!" And they ask for things in leetspeak, which I presume is to get around keyword filtering at the API level.
This is an ad. It’s a pretty good ad, but I don’t think the attack mechanism is super interesting on reflection.
Given the Chronostrife will occur in around 40_000 years (give or take 2_000), I somewhat doubt that </humor>