Hacker News | whyever's comments

Note that N=1 for the memory safety vulnerabilities they had with Rust, so the error of the estimated average number of vulnerabilities per LOC is quite large.
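For a sense of scale: treating the one observed vulnerability as a Poisson count, the exact 95% interval around the rate is very wide. A minimal sketch, assuming Poisson arrivals and a made-up exposure of 1 MLOC purely for illustration:

    # Exact (Garwood) 95% CI for a Poisson rate with k observed events.
    # Assumes vulnerabilities arrive as a Poisson process; the MLOC figure
    # is a placeholder, not the actual number from the report.
    from scipy.stats import chi2

    def poisson_ci(k, alpha=0.05):
        lower = chi2.ppf(alpha / 2, 2 * k) / 2 if k > 0 else 0.0
        upper = chi2.ppf(1 - alpha / 2, 2 * (k + 1)) / 2
        return lower, upper

    k, mloc = 1, 1.0
    lo, hi = poisson_ci(k)
    print(f"observed: {k / mloc:.2f} per MLOC, 95% CI: {lo / mloc:.3f} to {hi / mloc:.2f} per MLOC")
    # With k = 1 the interval runs from about 0.025 to 5.6 times the observed rate,
    # so the true rate could plausibly be several times what was seen.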


Yes. I would make a guess of 10 or less memory-safety vulnerabilities per MLOC, which is still a hundredfold reduction.


Your best guess is that the true rate is 20x higher than the observed rate? This seems unlikely to me given the number of samples (outside of systematic biases towards certain types of memory safety bugs that probably apply to C++ code too). 10 per hundred MLOC is closer to what I would have guessed too, but that is because I've historically been very conservative with my assumptions about the memory unsafety rate per unsafe LOC being similar to that of C++. The evidence here suggests that the true rate is probably much lower than that.


I'm making a conservative guess, which is why I said 10 or less (10 or fewer??). So the improvement is at least a hundredfold. I might say 5 or less instead. I think the exact rate is not so important; either way, it's clear that Rust is a boon.


It's missing which point?


That you should be very careful about what you install. Cut&pasting some line from a website is the exact opposite of that. This is mostly about psychology, not technology. But there are also other issues with this, e.g. many independent failure points at different levels, no transparency, no audit chain, etc. The counter model we tried to teach people in the past is that they select a Linux distribution, independently verify the fingerprints of the installation media, and then only install packages from a curated list. A lot of effort went into making this safe and closing the remaining issues.


None of that has anything to do with curl|bash.

Be careful who you trust when installing software is a fine thing to teach. But that doesn't mean the only people you can trust are Linux distro packagers.


I think it has a lot to do with "curl|bash". Cut&pasting a curl|bash command line disables all the inherent mechanisms and stumbling blocks that would ensure trust is properly established. It was basically invented to make it easy to install software by circumventing all the protection a Linux distribution would traditionally provide. It also eliminates any possibility of independently verifying what was installed or done on the machine.


Downloading and installing a `.deb` or `.rpm` is going to be no more secure. They can run arbitrary scripts too.


Downloading a deb via a package manager is more secure. Downloading a deb and comparing the hash (or at least noting down the hash) would also already be more secure.

But yes, that they run arbitrary scripts is also a known issue, but it is not the main point, as most code you download will be run at some point anyway (and fixing this properly needs sandboxing of applications).
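For what it's worth, the hash check is easy to script. A minimal sketch (the file name and expected digest below are placeholders, not real values) of verifying a downloaded deb against a SHA-256 published out-of-band, before handing it to dpkg:

    # Verify a downloaded package against a published SHA-256 digest.
    # File name and expected hash are hypothetical placeholders.
    import hashlib
    import sys

    def sha256_of(path, chunk_size=1 << 20):
        h = hashlib.sha256()
        with open(path, "rb") as f:
            for chunk in iter(lambda: f.read(chunk_size), b""):
                h.update(chunk)
        return h.hexdigest()

    expected = "0123abcd..."  # digest published by the author over a separate channel
    path = "example-tool_1.2.3_amd64.deb"

    digest = sha256_of(path)
    if digest != expected:
        sys.exit(f"hash mismatch: got {digest}, refusing to install")
    print("hash matches; OK to pass to dpkg -i")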


> Downloading a deb via a package manager is more secure.

Not what I meant. Getting software into 5 different distros and waiting years for it to be available to users is not really viable for most software authors.


I think it would be quite viable if there were any willingness to work with the distributions in the interest of security.


Well, distros haven't really put any effort into making it viable as far as I know. They really should! Why isn't there a standard Linux package format that all distros support? Flatpak is fine for user GUI apps but I don't think it would be feasible to e.g. distribute Rust via a Flatpak.

(And when I say fine, I haven't actually used it successfully yet.)

I think distros don't want this though. They all want everyone to use their format, and spend time uploading software into their repo. Which just means that people don't.


I agree, but https://www.pcg-random.org/ still advertises PCG as "challenging" to predict, and criticizes other RNGs as predictable and insecure.


Right, that's a problem, because nobody that cares about this should be using PCG.


> Predictable — after 624 outputs, we can completely predict its output.

> we recovers[sic] all the secret information using 512 consecutive output bytes

oof
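For context, the "624 outputs" line is about the Mersenne Twister, which Python's random module uses, and the attack really is that direct. A rough sketch of the standard state-recovery trick (untemper 624 consecutive 32-bit outputs and load them as the state of a clone); it assumes the outputs are aligned with the generator's internal block, which holds for a freshly seeded instance:

    # Clone Python's Mersenne Twister (MT19937) from 624 consecutive outputs.
    import random

    def undo_xor_rshift(y, shift):
        # Invert y ^= y >> shift by fixed-point iteration.
        result = y
        for _ in range(32):
            result = y ^ (result >> shift)
        return result

    def undo_xor_lshift_mask(y, shift, mask):
        # Invert y ^= (y << shift) & mask by fixed-point iteration.
        result = y
        for _ in range(32):
            result = y ^ ((result << shift) & mask)
        return result & 0xFFFFFFFF

    def untemper(y):
        # Reverse MT19937's output tempering to recover a raw state word.
        y = undo_xor_rshift(y, 18)
        y = undo_xor_lshift_mask(y, 15, 0xEFC60000)
        y = undo_xor_lshift_mask(y, 7, 0x9D2C5680)
        return undo_xor_rshift(y, 11)

    victim = random.Random()  # freshly seeded, so outputs start at a block boundary
    observed = [victim.getrandbits(32) for _ in range(624)]

    clone = random.Random()
    clone.setstate((3, tuple(untemper(o) for o in observed) + (624,), None))

    # The clone now predicts the victim exactly.
    assert all(clone.getrandbits(32) == victim.getrandbits(32) for _ in range(1000))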


They are synonyms.


No, they are not. Statistical mechanics is a theory, statistical physics is a field.


Physics and Mechanics are not synonyms. The latter is a small subset of the former.


Yes, but this relation does not apply to statistical mechanics and statistical physics; they mean the same thing: https://en.wikipedia.org/wiki/Statistical_mechanics

What is included in "statistical physics" that is not included in "statistical mechanics"?


Kinetic theory stuff for one, like deposition, growth, sandpile-type things. Complex networks and lots of dynamics stuff fall under the statistical physics umbrella but not statistical mechanics. Stat mech's amazingly wide applicability makes it easy to think it's THE approach to treating things statistically, but it's not. The broad, encompassing approach has a name: statistical physics.


There is a distinction. Usually statistical mechanics means the ensemble theory and partition functions that connect microscopic systems to macroscopic ones from a materials point of view. However, statistical physics is a bit more generic; for example, complex networks may not use ensemble theory or partition functions and could use only statistics on the network, such as the average neighbourhood or similar.


People have also used “statistical physics” to refer to the former concept since forever. For example Landau.

“Statistical mechanics” is also used in a broad sense, just like “quantum mechanics” is often used for anything “quantum”.


What I'm getting from this discussion is that we use Statistical Physics to refer to anything covered by Statistical Physics AND Statistical Mechanics, while we use Statistical Mechanics in a narrower context, but it is also possible that some use SM loosely.


> it is also possible that some use SM loosely

I think it’s frequent. For example: https://teach-me-codes.github.io/computational-physics/the_p...


Signal asks you to repeat the key immediately before even enabling backups. It cannot fail much later unless you modify a digit after the check.


A longer key would make typing all those characters back into the phone much less usable.


That's a good question! Especially after Frank McSherry's COST paper [1], it's hard to imagine where the sweet spot for Spark is. I guess for Databricks it makes sense to push Spark, since they are the ones who created it. In a way, it's their competitive advantage.

[1]: https://www.usenix.org/system/files/conference/hotos15/hotos...


It's a quantitative problem. How big is the error introduced by the simplification?


I know some people who do trunk-based development with pair programming: You write the code together, and once you are satisfied, you merge it to the main branch, from where it is deployed to production if the tests pass. It works well for them.


It would require a lot more memory, because you have to remember every generated UUID. And how would you do the partial match? You are not going to observe any collisions.
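To put a number on "not going to observe any collisions": random v4 UUIDs carry 122 random bits, so by the birthday approximation even an implausibly large batch stays collision-free. A quick sketch (the billion-UUID count is made up for illustration):

    # Birthday-bound collision probability for random (v4) UUIDs (122 random bits).
    # The generation count is a hypothetical example.
    n = 10**9                    # one billion UUIDs
    space = 2**122               # possible v4 UUIDs
    p = n * (n - 1) / 2 / space  # birthday approximation, valid while p << 1
    print(f"P(any collision among {n} UUIDs) ≈ {p:.1e}")  # ≈ 9.4e-20
    # So a table remembering every UUID would grow without bound and never match.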


Doesn't the clustering make collisions strictly more likely?

