> No credentials. No insider knowledge. And no human-in-the-loop. Just a domain name and a dream. ... Within 2 hours, the agent had full read and write access to the entire production database.
Having seen firsthand how insecure some enterprise systems are, I'm not exactly surprised. Decision makers at the top are focused first and foremost on corporate and personal exposure to liability, also known as CYA in corporate-speak. The nitty-gritty details of security are always left to people far down the corporate chain who are supposed to know what they're doing.
Am I supposed to take away from these plots that we are all good since it's been steadily above the prior record in 1940 since 2020? Or is everything okay since it was really going up and it did course correct to a bit more of a straight line recently?
The article seems to be communicating that this rate of spending is not sustainable.
Not to say that it's okay, and Japan's economy certainly has issues with stagnation due to the debt load, but it's also not a "we have imminent hyperinflation" kind of thing either.
The concern with the past five months isn't so much the level of debt, it's the rate of change - we're increasing it faster than in the past... and this isn't a COVID-level crisis or a 2008-style deep recession either where Keynesian logic might make more sense.
This is old information. Japan's borrowing costs have spiked and are ~2.18% as of this comment. Yields are surging due to their debt load (currently ~240% of GDP).
The real question is what percentage of GDP is directly created (or continues to exist) because of the increased debt.
When this metric was created the GDP was more authentic and not debt driven.
> Economists aren’t necessarily worried by the total level of debt (in fact, government debt is a necessary foundation of global markets). Rather it’s the debt-to-GDP ratio, which measures a nation’s borrowing against its growth
The article has more details than just the headline. For example:
> Maya MacGuineas, president of the Committee for a Responsible Federal Budget (CRFB), said that interest payments on the debt are expected to exceed $1 trillion this year, and will surpass $2 trillion by 2036.
That’s very concerning. There’s no plan to run balanced budgets and stop deficits. And no plan to reduce debt. And no plan on economic competitiveness against China. American politics is mostly dominated by irrelevant things that won’t fix the fundamental problems that will come to affect us in the future.
TL;DR: The authors found current-generation AI agents are too unreliable, too untrustworthy, and too unsafe for real-world use.
Quoting from the abstract:
"We report an exploratory red-teaming study of autonomous language-model–powered agents deployed in a live laboratory environment with persistent memory, email accounts, Discord access, file systems, and shell execution. Over a two-week period, twenty AI researchers interacted with the agents under benign and adversarial conditions."
"Observed behaviors include unauthorized compliance with non-owners, disclosure of sensitive information, execution of destructive system-level actions, denial-of-service conditions, uncontrolled resource consumption, identity spoofing vulnerabilities, cross-agent propagation of unsafe practices, and partial system takeover."
It used to take years, decades, or centuries before a system could grow and evolve to be so complex and unwieldy, and so full of internal contradictions, that the whole thing becomes an incomprehensible tangle of hairballs. An example is the patchwork system of international, national, regional, and local laws we have at present, which has grown and evolved over centuries.
It's a worthwhile effort. If successful, Woxi can enable a large mass of scientists and engineers who don't have access to Mathematica to run legacy code written for it. Also, Woxi would give those scientists and engineers who regularly use Mathematica a non-proprietary, less restrictive alternative, which many of them would welcome.
How does Woxi compare to other "clean-room implementations"[a] of the same language?
--
[a] Please check with a lawyer to make sure you won't run into legal or copyright issues.
> No credentials. No insider knowledge. And no human-in-the-loop. Just a domain name and a dream. ... Within 2 hours, the agent had full read and write access to the entire production database.
Having seen firsthand how insecure some enterprise systems are, I'm not exactly surprised. Decision makers at the top are focused first and foremost on corporate and personal exposure to liability, also known as CYA in corporate-speak. The nitty-gritty details of security are always left to people far down the corporate chain who are supposed to know what they're doing.
reply