Defamed by ChatGPT (jonathanturley.org)
397 points by hirundo on April 6, 2023 | 370 comments


Related ongoing thread:

ChatGPT invented a sexual harassment scandal and named a real prof as accused - https://news.ycombinator.com/item?id=35460366

plus this similar case:

ChatGPT: Mayor starts legal bid over false bribery claim - https://news.ycombinator.com/item?id=35471211 - April 2023 (74 comments)


As tech gets more powerful, so do the consequences of its false positives.

You can randomly lose your Google or Facebook account for no reason. Or have your app pulled from an app store. And now, you can randomly be smeared. Or your car may crash.

Whoops. Guess that's just how AI works.

Stop defending it. The AI companies should be sued and we should have digital rights. Why is everyone in such a hurry to accept and embrace an imperfect AI anyway? It isn't going to improve your life, it's going to add even more productivity pressure to your life and then replace you. If not destroy much of human culture.

Give AI a hard time and buy time. Don't make it so easy for them by letting them dodge any and all responsibility.


I think these two bits perfectly sum up the issue:

> Why is everyone in such a hurry to accept and embrace an imperfect AI anyway?

> The AI companies should be sued

Your route to stopping the AI companies goes through litigation in the courts, which means you need to prove harm, which can only be proven once the damage has been done at scale with obvious and measurable impacts. Basically when it's widely accepted, and also too late.

The other approach would be preemptive regulation, which means limiting by law what companies can do. That needs trust in government and regulators, active participation by a majority of the actors in legislation and framework building, and making sure everyone's voice has a chance to be heard.


Of course, this is a recipe for turning the future of AI over to foreign technocratic regimes that don't prioritize invented "human rights" at the expense of human progress... but never mind that. I suppose growing a thicker collective skin is out of the question.


>Why is everyone in such a hurry to accept and embrace an imperfect AI anyway?

Because this has been the dominant culture in the digital technology space, and every argument against it usually faces some sort of "so you want to stop innovation, huh?!" response.

You're obviously right. Powerful tools require equally powerful means to control them and to make sure they're aligned with our values. Imagine a machine on a factory floor, or an airplane, behaving like one of these models. It's the same with 'self-driving' cars. Making pedestrians and other drivers unwilling beta testers is absolutely wild to me.


> "so you want to stop innovation, huh?!"

Excitement Considered Harmful

In 1968, Edsger Dijkstra wrote a now-famous letter titled "Goto Considered Harmful", which argued that the use of the "goto" statement in programming languages led to code that was difficult to read and maintain. The letter sparked a heated debate in the programming community, and it ultimately led to a rethinking of programming language design.

Today, we face a similar problem with the way we approach technology adoption. In recent years, there has been an explosion of excitement around new technologies like artificial intelligence, machine learning, and blockchain. Tech companies are rushing to integrate these technologies into their products and services, and investors are pouring billions of dollars into startups that promise to revolutionize entire industries.

But is all this excitement warranted? Are we really thinking critically about the potential risks and downsides of these technologies, or are we simply caught up in the hype?

The truth is that excitement can be a dangerous thing. It can blind us to the real risks and downsides of new technologies, and it can lead us to make decisions that we later regret. We need a sober review of where we are as a society and where we want to go, particularly as we face the dawn of AI.

The potential benefits of AI are clear. It has the potential to revolutionize fields like healthcare, transportation, and finance, and it could lead to new discoveries and insights that we can't even imagine today. But we can't ignore the potential risks, either. AI has the potential to automate millions of jobs, exacerbate existing inequalities, and create new forms of surveillance and control.

We need to approach AI with a clear-eyed view of its potential risks and benefits. We need to have frank and honest discussions about the ethical and moral implications of these technologies, and we need to ensure that they are developed and deployed in a way that benefits society as a whole, not just a select few.

Excitement is not inherently bad, but it can lead us to make rash decisions that we later regret. We need to temper our excitement with a healthy dose of skepticism and critical thinking. We need to be willing to ask tough questions and challenge assumptions. Only then can we ensure that the technologies we create are truly beneficial to society.

In conclusion, just as "goto" was once considered harmful to programming, we should also consider the harm that excitement can cause in the context of technology adoption. Let us approach new technologies like AI with a sober and critical eye, and let us ensure that we use them in a way that benefits us all.


> We need a sober review of where we are as a society and where we want to go, particularly as we face the dawn of AI.

This is what an intelligent, empathetic, caring population would do. It is what we need to do. I'm sure many people are feeling pretty darn uneasy about where this is all going, especially those with children.

What I'm rather scared of is that we may not really be intelligent, not in the way we like to think of intelligence. I mean, look at what we've done to the environment, to each other, and now we want to take our ideas and multiply them 10x?

Not sure how this will go.

I always thought AI would be kind of an alien type of intelligence that would tell us where we're going wrong as a society and help us course-correct, not this setup we have now.


> Excitement Considered Harmful

> […]

Was this comment written by ChatGPT?


Definitely. It's way longer and in a totally different writing style from the poster's other comments. If such a piece had actually been written elsewhere (where the long format would make more sense) and then (self-)plagiarized here, it would likely be published online, but a Google search only reveals this very thread.

When these tells (and thus your hunch and my assertion) diminish, though...

I bet the prompt was something like "write a considered harmful essay, inspired by goto considered harmful, about excitement as it relates to the adoption of technology" -- here's what Google Bard does with that exact prompt:

###

Excitement is a powerful emotion. It can drive us to achieve great things, but it can also lead us to make poor decisions. In the world of technology, excitement can often lead to the adoption of new technologies without proper consideration of the potential risks.

This is especially true for new technologies that are hyped by the media and marketed as being the next big thing. When we hear about a new technology that promises to change the world, it's easy to get caught up in the excitement and rush to adopt it. But before we do, we need to take a step back and ask ourselves if this technology is really right for us.

There are a number of reasons why excitement can be harmful when it comes to the adoption of technology. First, excitement can lead us to make decisions without fully understanding the technology. We may be so eager to try out the latest gadget that we don't take the time to read the manual or learn how to use it properly. This can lead to problems down the road, such as data loss or damage to the device.

Second, excitement can lead us to ignore the potential risks of a new technology. We may be so focused on the potential benefits that we don't consider the potential drawbacks. For example, we may be so excited about the new social media app that we don't think about how it could be used to spread misinformation or cyberbullying.

Third, excitement can lead us to adopt a new technology without considering its impact on our lives. We may be so eager to try out the latest gadget that we don't think about how it will fit into our existing routines. This can lead to problems such as feeling overwhelmed or stressed.

In conclusion, excitement can be a powerful force, but it's important to remember that it can also lead us to make poor decisions. When it comes to the adoption of new technologies, it's important to take a step back and ask ourselves if this technology is really right for us. We need to make sure that we understand the technology, that we're aware of the potential risks, and that we've considered the impact it will have on our lives.

In the same way that goto was considered harmful in programming, excitement can be considered harmful in the adoption of technology. It's important to be aware of the potential risks of excitement and to take steps to mitigate them.

###

If only we had done something like...

    s/essay/hn style comment/


> Stop defending it. The AI companies should be sued and we should have digital rights

But people (in power; police, press, military, politicians, juries, judges, etc. etc.) screw people over all the time without any consequences. How many people in your neighbourhood would call you a sex offender if you were a bit different from the norm? If AI companies should be punished, then so should everyone who spreads false information without rock-solid proof, and so should their bosses, parents, etc. And that's not 'a jury of your peers'; that's not proof, that's just humans being bad AIs and guessing/hallucinating over generally inconclusive evidence.

> The AI companies should be sued - Facebook - Google

The companies using AI to cut their workforce and screw people over without recourse should be sued (they can be the same company, of course). When Google or Facebook close your account, there should be a government body you can go to, and if the closure turns out to be wrongful, they have to pay you 1% of their revenue(!). Immediately. That'll fix it overnight. Don't blame the tool makers, or we'd have a lot more to sue in this world and there would be no guns. Or Microsoft products.

> Give AI a hard time and buy time. Don't make it so easy for them by letting them dodge any and all responsibility.

You shouldn't take what they present at face value, ever. No need to 'buy time' etc. Educate people. But people want to believe lies; that has nothing to do with AI and is already a huge industry. Check the audience size of crap like The Daily Mail etc.


Yeah, if everyone waited for perfection, we'd still be in the Stone Age.

Thank god for the first fish that swam out of the water and didn't wait for perfection


> Why is everyone in such a hurry to accept and embrace an imperfect AI anyway?

Because, while imperfect, it already helps increase economic output and save time and thereby money, all of which this society and country revolves around. They call it "prosperity".


Individualism vs. Utilitarianism

In this case, my bet is on utilitarianism just because the harm to thousands will be much less than the benefit to billions.


> Whoops. Guess that's just how AI works.

So many people are freaking out over this and yet not a single person has been harmed. We've lost our collective nerve and become a bunch of whining wimps.

This tech can't even draw hands yet, for heaven's sake! Calm down.

Something to actually care about is all the death happening in the world. This is nothing. Literally a molehill.

Accept the current state of imperfection. Even if we *NEVER* move beyond it, the capabilities we will unlock will be immense.

People can say wrong or hurtful things all over the internet. It does not matter. Grow some thicker skin. We evolved to fend off lions and bears, not to be afraid of strings of characters spit out by silicon.

Wake me up when they can hold guns and open doors. Until then, this panic is desensitizing.

> Why is everyone in such a hurry to accept and embrace an imperfect AI anyway?

Because for the first time in my life I can make incredible art. If we keep going, there's no conceivable limit to the amazing world we'll build. Infinite multiplayer VR narratives that'll unfold with you and your friends, more real than any story you've ever read or heard. Unlimited imagination, unlimited exploration. You'll never hear "born too late / too early to explore" ever again.

This tech is going to suck the mundane primate shit we do out of life and fill it up with awesome.


> So many people are freaking out over this and yet not a single person has been harmed

I mean this is demonstrably untrue. https://en.wikipedia.org/wiki/Death_of_Elaine_Herzberg


Whether humans or machines lower the death rates, it's always terrible for some individuals. This happens in medicine all the time when a new treatment changes the group of people who are likely to experience harmful side effects.


The person discussed in the post you replied to is an example of someone being harmed.


An LLM did prompt a man to commit suicide, which he did. I understand that the LLM was simply reflecting back the feelings he had, but it's still a new and powerful way for people to go down a rabbit hole.



This is a really interesting legal issue. What recourse does a private person have to algorithmically-generated libel?

Can I sue the company? They’re the ones who trained the model, but there’s no intent.

Could I sue the user? They’re the one who generated the response, but there’s no intent nor duty to fact check, if they don’t repeat the claim.

Do I have to subpoena OpenAI, reverse engineer the response, then sue the creators of the likely-responsible training data? They probably wrote something non-libelous.

It feels very all or nothing - either creators of LLMs are intrinsically negligent with regard to facts, or there’s nothing you can do.


My totally-not-a-lawyer take:

* If somebody asks ChatGPT to generate some accusation, doesn't check if they're factual, and then publishes them anywhere, then the responsibility lies with somebody.

* If a company directly publishes content from ChatGPT as if it was anything factual or trustworthy, then the responsibility lies with that company. I didn't read the fine print of Bing using it to augment its search engine, but that is definitely a very interesting case.

In the end, ChatGPT and the like can be incredibly useful tools, as long as you know what they are, i.e., very advanced text manipulation tools that generate combinations of words following patterns common in the training set, and nothing more.


> ChatGPT and the likes can be incredibly useful tools, as long as you know what they are

An analogy that comes to mind is Excel's autocomplete feature. If you input a company's financials, autocomplete for the future, and publish that as a forecast, you're the one making the forecast. Not Excel.


What recourse does my estate have if your autonomous car swerves onto the sidewalk and runs me over? Can my estate sue you because you own the car, even though your hands weren't on the steering wheel? Can my estate sue Tesla for its faulty software and sensor package?


> What recourse does my estate have if your autonomous car swerves onto the sidewalk and runs me over

You do, against me. If I set off a defective firework that careens sideways into your house, I'm liable. I may have a further claim against the manufacturer. But your claim is against me. This is why Mercedes announcing it will accept self-driving car liability is such a big deal [1][2]

[1] https://en.wikipedia.org/wiki/Self-driving_car_liability

[2] https://www.kbb.com/car-news/mercedes-well-be-liable-for-sel...


From what I've read, there are a lot of open questions (think of Microsoft's Linux FUD in the 00s): you might be wholly liable, the company might be wholly liable, or more likely it's some mix of the two. Mercedes is offering some level of indemnification for that because there isn't a lot of court precedent to say one way or the other.

If I were looking at a car with self-driving capabilities, I'd definitely be looking at Mercedes over pretty much anyone because of that.


What would my estate sue you for, exactly? Negligence?

Because a negligence claim requires my estate to prove that you failed to exercise reasonable care.

If the AI is generally reliable, it's reasonable for you to trust it and take your hands off the wheel.

If the AI suddenly goes berserk, it would be hard to prove that you breached your duty of care, and are liable for negligence.


In the firework example, you can still sue the person who fired the defective firework, because they should have known that unpredictable behavior was a possibility, and taken steps to prevent that from escalating into some damage.

Likewise, suing the person behind the wheel of the self-driving car for negligence should work. Because the "non-driving driver" had the ability to seize the wheel, and should have known that some boneheaded behaviors are a risk with these early AIs.


Does this change if there is no wheel for the passenger to seize?


If you're the owner of the car then I'd say it doesn't change - you chose to take that vehicle out on the road knowing it didn't have a way for you to control it. You can take the steering wheel off of your car with a wrench, you could build a computer to steer it for you, you're still primarily responsible as the owner.

The owner could then try to go after the company, but they're still ultimately responsible even if the company doesn't cover the cost for them.


If you are in a vehicle with no ability for you to prevent it from hitting someone, then the responsibility falls on the manufacturer of the driverless vehicle.

You are expected to be in control, or able to take control, at all times, if there is such a possibility, anyway. If there isn't, that means the manufacturer is the one in control.


I mean, I'm not a judge, so we'll have to see, but it seems obvious to me that the existence of a wheel implies that sometimes you may need the wheel. And that if there is no wheel, it implies that the manufacturer of the car is saying that you won't need one and their own software is better in every situation.

So yes, I'd say that would change things. IMO.


> it's reasonable for you to trust it and take your hands off the wheel.

The user undoubtedly agreed to some T&Cs that mean taking their hands off the wheel even to push something on the touch screen is negligence:

From [1]:

Before enabling Autopilot, the driver first needs to agree to “keep your hands on the steering wheel at all times” and to always “maintain control and responsibility for your vehicle.” Subsequently, every time the driver engages Autopilot, they are shown a visual reminder to “keep your hands on the wheel."

[1]: https://www.tesla.com/support/autopilot


> it's reasonable for you to trust it and take your hands off the wheel.

Even though the self-driving system explicitly tells you to keep your hands on the wheel and yells at you if it doesn’t sense them often enough? I don’t think it’s an accident that all of the current systems do a lot of work to make sure you’re aware you’re driving even if the car is helping.

This is building the precedent that manufacturers know whether their systems can handle being unattended, and as soon as they don’t do this they’re liable.


In Europe, if you are using a Tesla and there is an accident while self-driving is engaged, then you are the one responsible (Tesla says they don't take any legal responsibility), but if you are driving a Mercedes while self-driving is engaged, Mercedes takes the legal responsibility.


Though there are many articles about this legal responsibility, it's not completely clear to me if the Mercedes Drive Pilot system is actually purchasable, and if so if the liability works this way.

Their German website [1] says (translated by Google):

> Our driver assistance and safety systems are aids and do not release you from your responsibility as a driver. Observe the information in the operating instructions and the system limits described there.

It also seems to only work in extremely ideal conditions [2]:

> They must be on one of the nation's 8200 miles of autobahn highway in traffic traveling at no more than 60 km/h (37 mph) in dry conditions and away from tunnels or construction zones.

So basically it only works in traffic jams on the autobahn, in good weather, with no construction. How many accidents are happening in these conditions? This seems like marketing fluff more than anything.

[1] https://www.mercedes-benz.de/passengercars/technology/drive-...

[2] https://www.caranddriver.com/reviews/a39966189/2022-mercedes...


Tesla says a lot of crap, to be honest, and their argument wouldn't hold for a second in court if someone held responsible for an Autopilot incident sued Tesla back.


Lucky for them that the autopilot often disengages before a crash.

https://www.motortrend.com/news/nhtsa-tesla-autopilot-invest...


Right now the estate can sue the driver, yes. There is still a driver and an owner. Settling the liability of the car manufacturer is a critical question for the future. But somebody will be liable.


Frame it this way... what do we already do when person X uses tool Y wrong (and tool Y is not malfunctioning) and does damage? Do you sue the tool manufacturer? Rarely, if ever. You sue the person using it.


Until law and precedent change, the person behind the wheel is responsible for operating the vehicle, you’d sue them. If, in the future, some combination of autonomy and legal invention change that, you’d sue the entity bearing responsibility under whatever new framework.

No matter the case, though, the law will not throw up its hands, and say there’s no one liable for this tragedy.


Oh yes. The old "it's AI" when it's convenient and "it's just a tool" when it's convenient. Which one is it?

I think it's just autocomplete on steroids, a powerful bullshitting machine. The fear I have is that this bullshit is going to pollute the digital world pretty much the same way SEO made it impossible to get good results from a search engine.


>Oh yes. The old "it's AI" when it's convenient and "it's just a tool" when it's convenient. Which one is it?

An "AI" is a tool.


I never get this "it's just an autocomplete on steroids" take.

When you form a sentence, do you catalog every single word in the English language and then pick one before saying a word? Or do you have an understanding that, for a given word, there are only so many words that could follow?

-

AI is a tool. And it's as much a bullshit generator as a human telling a story they definitely completely remember very vividly... https://www.newyorker.com/science/maria-konnikova/idea-happe...

If you misremember the suspect in a case and go around telling everyone the wrong guy committed a crime, you get sued because you went around telling everyone that, not because you misremembered.


ChatGPT doesn’t just misidentify the guy - it entirely fabricates a crime and attaches it to a suspect.

I tried to get ChatGPT to summarize a music video today. (Sugar’s “If I Can’t Change Your Mind”)

I expected it might give a bland summary or something, but that’s not what happened.

It invented entire scenarios that weren’t in the video at all, and invented lyrics not in the song.

That’s pretty harmless, but I can easily see ChatGPT inventing some awful story about a person and that being carried over to a publication and gaining a life of its own.

Basically it’s a souped up urban legend generator, but it’s being offered as a tool to provide search results and content. It’s not just an unreliable narrator - it’s an unreliable narrator being offered as an expert witness.


You're starting off with a distinction without a difference.

You could throw darts at a spinning wheel with real names and imagined crimes.

The point is that it doesn't matter what the seed for the false statement is, it's the act of spreading it that's problematic.

You're also muddling a point that I can agree with: Treating ChatGPT as an infallible expert is wrong.

But that applies to so many other things. Even expert witnesses are not infallible.

So I disagree with characterizing hallucinations as the problem, it's the application that's problematic.

Blindly copying and pasting content from ChatGPT as fact is a bad idea, just like blindly taking a single source of information as gospel is a bad idea.

Humans can be just as confidently wrong as LLMs, and a simple adage applies to both: trust but verify.


> Humans can be just as confidently wrong as LLMs, and a simple adage applies to both: trust but verify.

Trust people who have earned trust (either through qualifications or reputation) and treat everyone else as good faith actors who are quite possibly wrong.

ChatGPT should be treated as a person you just met at a bus stop who is well dressed and well spoken but has just told you that you are both waiting for the elevator to arrive at the bus stop.


That's the fast track to getting your point of view ignored. Pessimism is OK, but that level of dismissiveness isn't really warranted, especially since the conversation forming in public is not just about some specific model you happen to have strong feelings about, but about the general concept of LLMs and factuality.

I wouldn't expect a random doctor approached at a bus stop to accurately answer a question about medicine any more than I would ChatGPT, by the way. Trusting people based on their qualifications and reputations isn't really a thing.

If a doctor tells you to take medication X there's a reason you take that to a pharmacist rather than a store clerk with a key to a safe or something: verifying is always a great idea, regardless of reputation.


I'm not sure how the critique relates to my post. Of course you wouldn't trust an architect with medical advice or a doctor with structural materials for bridge building; that was implied.


> ChatGPT should be treated as a person you just met at a bus stop who is well dressed and well spoken but has just told you that you are both waiting for the elevator to arrive at the bus stop.

Ahahahahaha. Wow. This is brilliant mate. I'm going to start using it.


Every time I see people on HN say how they love to use ChatGPT to generate ideas, I think of this. It seems like a lot more work to come up with prompts and then vet the output to see whether it's even sensible than it does to come up with some sensible search terms to query real data that actually contains what you are looking for.


Using LLMs for factual concepts is by far the most boring application of them.

Often times people use LLMs for generating ideas that don't have a factual basis.

ChatGPT will happily invent gameplay mechanics that are fun. It will generate prompts for convincing concept art for something you haven't built.

If what you're looking for can be answered by Google, sure the business people at Microsoft would rather you use a portal that never lets you leave their site... but that's not interesting.


> When you form a sentence do you catalog every single word in the english language then pick one before saying a word?

That's how Markov chain chat bots (a very old technology) work:

https://stackoverflow.com/questions/5306729/how-do-markov-ch...

https://www.baeldung.com/cs/markov-chain-chatbots

That's not how ChatGPT works, though, because of the attention mechanism.
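For contrast, here's a minimal sketch (in Python; the function names and toy corpus are my own, not from those links) of the word-level Markov chain idea: each next word is drawn only from the words observed to follow the current word in the training text, with no attention and no context beyond the previous word.

    import random
    from collections import defaultdict

    def build_chain(text):
        """Map each word to the list of words observed to follow it in the corpus."""
        chain = defaultdict(list)
        words = text.split()
        for current, following in zip(words, words[1:]):
            chain[current].append(following)
        return chain

    def generate(chain, start, length=12):
        """Random-walk the chain: each next word is sampled only from the previous word's followers."""
        word, output = start, [start]
        for _ in range(length - 1):
            followers = chain.get(word)
            if not followers:
                break
            word = random.choice(followers)
            output.append(word)
        return " ".join(output)

    corpus = "the cat sat on the mat and the dog sat on the rug by the door"
    print(generate(build_chain(corpus), "the"))

A transformer like the one behind ChatGPT instead conditions every generated token on the whole preceding context via attention, which is why its output is far more coherent than anything a chain like this produces.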


Exactly. The way that LLMs have been "debunked" in simplified writings on the Web and in media is to suggest that they are just Markov chains like the famous "Mark V. Shaney"[1] bot from the 1980s, and LLMs are far more powerful than that. Yes, we need to debunk claims that LLMs have achieved sentience and such nonsense, but let's not ignore just how amazing they are.

[1] https://en.wikipedia.org/wiki/Mark_V._Shaney


I'm plenty familiar with flash attention, but you didn't understand what you quoted.

GPT is still (partially) probabilistic, and the "it's just autocompleting" refrain stems from this idea that being probabilistic without "higher order intent" means a system is just a bullshit generator.

-

The section of my comment you quoted is not comparing LLMs to Markov chains, it's questioning that notion: Obviously we humans don't consciously evaluate [every single word in our language * each word in the sentence]

So the pool of words that we can consciously speak in a sentence is being defined before we apply higher order intent.

If lacking higher order intent is what makes it "just autocomplete", then we're all just interfaces for autocomplete.

-

Complete this sentence with the scariest thing that comes to mind: "We went to the park and it was fun, but there was a scary..."

The specific sequence "mass hippo attack" probably didn't come to mind even though that'd probably be deadlier than what you thought of.

But that's a pointless observation: After all, what are the odds of a hippo attack happening at the park let alone several? A "mass hippo attack" is so unlikely that you might have already rejected my claim since your scary thing is much more likely.

The point is that you didn't consciously compare "hippo attacks" to whatever you thought of until it was brought up.

And that's because we don't often mention hippo attacks in our recollections of going to the park... so the bullshit generator wouldn't surface that for our higher level mind to consider.

-

It turns out just having a probabilistic model of our language is enough to align with higher level thought very often. So often that I challenge the notion that higher level intent drives things. I consider the lower level bullshit generator as running the show, and the higher level self is more like a director who can ask to reshoot the scene, but can't just walk up and act out every role on stage as they please.

We all have bullshit generators that don't care if our higher order self is not racist/misogynistic/etc. and will gladly fill in blanks with hallucinations.

What matters is that our higher order self chooses to reflect and evaluate the pool we surface rather than just blurting out the first thing it surfaces. To me using GPT is no different.


I think the question is less what people do with ChatGPT's output, and more with the output itself.

If ChatGPT makes the same libelous claim in its output to 100,000 people, that doesn't seem particularly different from a libelous claim printed in a newspaper with circulation of 100,000.

Microsoft/OpenAI can put down all the legalese disclaimers they want, but if they market ChatGPT/Bing as a tool that provides useful answers, the disclaimers don't protect from libel. By analogy, the NYT can't put a disclaimer in its paper that none of its reporting should be considered truthful in order to protect itself from libel suits -- it just won't work in court. (And yes there are tabloids that print nonsense stories about aliens and Elvis and Brad Pitt, which are for "entertainment purposes only", but the difference is the average consumer knows it's a bunch of nonsense, just like The Onion. Parody/fiction is protected.)

So I actually think this is going to be the biggest question/risk by far in terms of commercializing ChatGPT etc. -- much more important than the copyright status of training material.

Because the courts don't decide libel on disclaimers, they decide on harm and how a reasonable person interprets things. If they market Bing/ChatGPT as a useful research tool with advertisements showing it giving correct answers etc. that leads to people believing its lies are true, then there's a real risk libel suits are going to shut the whole thing down.

On the other hand, to make sure they can operate, they may have to market the thing basically as a toy -- a magic 8 ball, a Mad Libs generator. A disclaimer isn't enough, it means they need to avoid any kind of advertising or product positioning that ever depicts it as giving useful/correct information at all. No more homework help, no more trip planning, no more search tool. Which basically sinks the product. But which is also arguably the best outcome -- that Microsoft/OpenAI shouldn't be permitted to market it as anything but a constantly-lying toy.


> If somebody asks ChatGPT to generate some accusation, doesn't check if they're factual, and then publishes them anywhere, then the responsibility lies with somebody.

It's quite impressive to me how many journalists just take citations at their word. Isn't the point of journalism to verify information?

OpenAI, if you're reading, I've got an innovative new algorithm for reducing AI harm from fake citations:

`print(generated_link if link_exists(generated_link) else "[CITATION NOT FOUND]")`

I have a more advanced algorithm that uses a small language model to click links that exist and verifies direct quotes. I call this model Regex.

Joking aside, I really do think the problem can be seriously reduced in a simple manner, but also people really need to check their sources.
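To make that concrete, here's a minimal sketch in Python of the link-existence check (link_exists and generated_link are just the hypothetical names from the joke above; a real system would also have to handle servers that reject HEAD requests, rate limits, and links that resolve but don't support the quoted claim):

    import urllib.error
    import urllib.request

    def link_exists(url, timeout=5.0):
        """Naive check: does the URL answer an HTTP HEAD request with a non-error status?"""
        request = urllib.request.Request(url, method="HEAD")
        try:
            with urllib.request.urlopen(request, timeout=timeout) as response:
                return response.status < 400
        except (urllib.error.URLError, ValueError):
            # HTTPError (4xx/5xx) is a subclass of URLError; ValueError covers malformed URLs.
            return False

    def render_citation(generated_link):
        """Pass links that resolve straight through; flag ones the model appears to have invented."""
        return generated_link if link_exists(generated_link) else "[CITATION NOT FOUND]"

    print(render_citation("https://example.com/"))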

Thou shalt verify sources, be they generated by the hand of man or of artificial beings.

- John 3:16


I do wonder if there's any extra liability for stating auto-generated material as fact (especially about non-famous people) when using a tool known to create gibberish. For comedy and entertainment I have more tolerance.

For he who uses large language models to autocomplete their claims in greed and sloth shall be damned unto the third generation.

- Exodus 34:7


Move fast and break things exists in journalism too. It's faster to just publish a story to hit the deadline, then correct the article after the fact if someone gets pissy over what you wrote, than it is to do all your due diligence and make sure every detail is correct. Newsrooms are often in a race to publish ahead of others for certain stories too.


That's selling a faulty product and being highly deceitful. In no way can I support this.


It's the reality of journalism. They aren't being deceitful, they just don't have time to validate each and every little detail. It's usually minor stuff though. The story takes place at King Auto and they accidentally use the address for Kings Auto Parts in the article instead, or something like that.


Orienting on responsible use of a tool seems like a good place to start, and I'm sure that will be especially important as individuals and organizations delegate decisions that can affect people's lives to various agents like ChatGPT. We don't want a situation where they're tempted to simply hide behind the agent to avoid responsibility.

I'm less sure that's where it should stop. It doesn't seem right to introduce powerful, transformative tools without any obligations for the people creating them. And as far as I understand it, there is indeed a layer of law where manufacturers can be subject to some standards of responsibility.


The most important thing about natural language is that it is ambiguous.

LLMs (like GPT) are intended to work despite that ambiguity: they cannot resolve it.

Any time a pattern of text can be used to represent more than one meaning, LLMs lose all of their leverage. Only humans can determine after the fact whether the result is - as GPT's authors have presented it - a "feature" or a "limitation".


"* If somebody asks ChatGPT to generate some accusation, doesn't check if they're factual, and then publishes them anywhere, then the responsibility lies with somebody."

But OpenAI at least tacitly suggests that ChatGPT can be used for information retrieval. I know they also say that it can generate false information, but it's not hard to see how this could cause problems.


> If somebody asks ChatGPT to generate some accusation, doesn't check if they're factual, and then publishes them anywhere, then the responsibility lies with somebody.

What happens when new models have scraped factually wrong data about a person that has propagated, and then someone asks the next ChatGPT "tell me about person X" and it spits out the bad learned data?


How is the future bad data different from the current bad data?


A generation will grow up asking an AI all its questions with implicit trust in the results. While this happens to a degree now with search engines and Wikipedia, there are multiple sources you can check.


"If somebody asks ChatGPT to generate some accusation, doesn't check if they're factual, and then publishes them anywhere, then the responsibility lies with somebody."

Well, yes. But what would you do if ChatGPT, which is used by 100 million+ people, consistently associated your name with sex crimes?

Would you shrug it off, because "people know it is just an imperfect tool"?

Throw enough dirt and something will stick, so I would try to get my name cleared from ChatGPT ASAP. But apparently there is no working mechanism to do so. And this is not acceptable.

If people are tinkering with LLMs in their basement and it spits out nonsense, no one cares. But ChatGPT is an online service offered to laypersons. So if it consistently smears people, then the company does have a responsibility to stop it.


As a counterpoint to this argument, the reason we have libel/slander laws is that people hear things and don't check whether they're factual.

If I say, "danmaz74 eats babies," some people are going to believe it and not check that you don't eat babies. That's the problem with saying false things about people. Likewise, if ChatGPT says false things about people, some people will believe those false things - even if they don't publish anything or repeat them. Those false beliefs then have repercussions.

Section 230 doesn't eliminate libel. It just notes that the libel is being done by the person writing the information - ie. the user who posts it rather than the site that is merely hosting the information. It notes that the user is self-publishing the information.

In this case, ChatGPT is publishing this "information" itself. If we hold that ChatGPT isn't responsible for what they publish, we say that no one is liable for this defamation. That's not what Section 230 does. Section 230 just notes who the publisher is - the user who wrote the information. We are determining who is the person doing the action. We could require sites to check everything. We could require your ISP to inspect all packets to make sure they don't contain libel. We could require your electric company to make sure that devices connected to their network aren't being used to send/receive libelous things. We don't because that would be a bit absurd and because the actual fault lies with a human being. In the case of ChatGPT, it seems like we still need to be identifying a culprit.

Let's say that I start auto-generating news articles. I don't check them (I don't even look at them) before they are published on my site. This is the same mechanic as ChatGPT - it's just auto-generated stuff. Therefore, I'm not liable under your definition. It's up to the readers of my site to decide to check this auto-generated stuff.

I agree that ChatGPT can be a useful tool. At the same time, I'm not sure we want to create a system where people can absolve themselves of liability by auto-generating stuff. Even without intent, one can be guilty of libel. Maybe the answer is that ChatGPT should be able to defend itself by showing what sources caused that generation and that the generation was reasonable. For example, a newspaper is expected to carefully check multiple reasonable sources. If ChatGPT can cite multiple reasonable sources (and it has interpreted those sources correctly), then the blame can be those sources. If ChatGPT cannot, then the blame might be on ChatGPT.

As you note, ChatGPT can be useful - and the more useful it gets, the more people will accept its statements as true. I think simply saying "as long as you know what they are," feels like someone trying to say both things - they're incredibly useful and completely trash! It's like when a company tells shareholders "we have a stranglehold on the market" and then tells the government "we have so much competition we're struggling to survive." You can't have both. You can be a novelty tool that you and everyone else thinks of as a joke or you can have a serious tool where libel is a serious issue. The Onion can publish satire because it's a satire site. The NYTimes can't publish something that sounds like a serious NYTimes article and then claim, if sued, that it was actually a satire article and that 100% of readers just read it as serious when it was meant as satire.

If a significant part of our future starts coming from these language models, we will have to ensure a certain standard of care around what they say.


> The Onion can publish satire because it's a satire site. The NYTimes can't publish something that sounds like a serious NYTimes article and then claim, if sued, that it was actually a satire article and that 100% of readers just read it as serious when it was meant as satire.

My point is that ChatGPT doesn't "publish" anything; it's not a publication. It's a language model trained on text, which produces text in response to one or more prompts. Using that text is completely up to the user, and it should never be used as a source of facts.

Being able to summarize facts by finding specific sources looks like a completely different kind of tool to me, one which would be super useful, but not at all what an LLM in itself is about.


How does this not become AI powered citogenesis? [0]

Saying that it is solely the user's misuse of the tool which is the problem minimizes how easy and how damaging the misuse is. Eventually you hit the situation where everyone is making false claims because transitively down the graph a few users were using an LLM that made baseless defaming statements and others cite those publications as evidence for the defamation.

The creators of generative models need to have some skin in the game around the correctness of statements made about real people. Reputational damage cannot be an externality for them. The current AI moment is basking in the glory of how generally right generated text is; when that text is wrong, it needs to be their problem too.

0: https://xkcd.com/978/


> The creators of generative models need to have some skin in the game around the correctness of statements made about real people.

Only insofar as they claim that the statements generated by a model should be understood to be factual- and in my experience, not only have I not seen any such claims, I've seen vigorous insistence in the *opposite* direction: that LLM output should *not* be considered factual. OpenAI sticks this disclaimer right underneath their input box: "ChatGPT may produce inaccurate information about people, places, or facts".

> Eventually you hit the situation where everyone is making false claims because transitively down the graph a few users were using an LLM that made baseless defaming statements and others cite those publications as evidence for the defamation.

This is no different from any other gossip chain, except for the fact that the originators used a tool to generate the initial gossip.


> OpenAI sticks this disclaimer right underneath their input box

This is a legal fig leaf - the reason the AI hype cycle is so high is that it is so often correct. We're seeing it pass medical licensure exams, bar exams, engineering interviews, and so on. Those are all cheered. When it fails we can't just say "you're holding it wrong".

> except for the fact that the originators used a tool to generate the initial gossip.

Scale is a very significant difference. If the tool can generate untold mountains of unique gossip - there is a great need to have some way to control it, and a significant share of responsibility should be on the tool creator. Why should the rest of society have to clean up this mess?

There are significant parallels in this discussion with the argument that guns don't kill people, people kill people statement. Which is also a gross oversimplification of a complex problem.


> When it fails we can't just say "you're holding it wrong".

A better analogy would be tossing a screwdriver off the top of a tall building and then claiming screwdrivers are dangerous because look at how you managed to dangerously put one through a windshield when you got your hands on it.


Lawfare did a nice analysis of whether Section 230 shields model hosts (likely not): https://www.lawfareblog.com/section-230-wont-protect-chatgpt Professor Volokh has a public draft article on liability for the output of AI models, with an emphasis on defamation: https://www2.law.ucla.edu/volokh/ailibel.pdf

Both suggest that this is a real area of concern. Folks have gotten complacent that Section 230 protects platform providers, but it is not clear that the protection extends to outputs from generative models, indeed one of the Supreme Court Justices implied that it would not in oral arguments for one of the recent Section 230 cases.


> I didn't read the fine print of Bing using it to augment its search engine, but that is definitely a very interesting case.

This might be going against the pro-business attitude of HN, but there's limits to what fine print can accomplish. You can't disclaim away defamation law, you have to convince a court that what you did wasn't defamatory for some articulable reason beyond "I say it isn't" or "I said I wasn't responsible" or some other stock phrase.


Isn't presenting answers to questions the same as publishing when it comes to ChatGPT? How many people must ChatGPT provide defamatory answers to before it becomes defamation?


ChatGPT is a text generator. It's a tool. It's not publishing any more than MS Word is publishing.

You can certainly take that output of the tool and publish it but then you become the publisher.


Wait wait wait you’re saying the operator is accountable for their actions?

Just like ChatGPT was programmed to drill into a user any time it picks up on being misused? Reminding the user that they are ultimately responsible and liable for their actions, including how they use the output?

From how some make it sound, you would think ChatGPT was giving press conferences.

> oh no but what if it did?

Did it set itself up to do so? No? You found the liable humans.


ChatGPT is a text generator whose output is published on a publicly accessible website. That's a bit different than MS Word autocompleting something in a private document.


So you're saying that Google Docs is a greater liability than MS Word locally?

Every ChatGPT session is personal. You write a prompt and generate output. You're not publishing that output.


They're publishing it, to the user. Neither Google Docs nor Word republishes anything, but ChatGPT may well do so.


If someone created a website that randomly strung English words together, paired with a grammar checker so it always at least produced actual sentences, would it be liable for publishing incorrect facts?


If defamatory statements were made then yes. From [1]:

> There are four criteria used today in the United States:

> The statement was false, but was claimed as true.
> The statement must have been made to a third, previously uninvolved party.
> The statement must have been made by the accused party.
> The statement caused harm.

> Those who are not classified as public figures are considered private figures. To support a claim for defamation, in most states a private figure need only show negligence by the publisher, a much lower standard than "actual malice."

Laws differ in different states, within the US and without. As you can see here[2], the UK has similar rules to the US (which you might expect but I would say not to presume so easily) in that intent or malice, i.e. mens rea, are rarely part of the equation.

So yes, a machine spitting out what would be defamatory statements from anyone else's mouth is still defamation and would land the publisher in trouble if harm could be ascertained. I'm willing to hazard a bet that most people can see a difference between the kind of thing Google Docs produces when used as a word processor and the kind of thing ChatGPT produces.

[1] https://law.stackexchange.com/questions/130/how-is-defamatio...

[2] https://www.blakemorgan.co.uk/a-brief-guide-to-the-tort-of-d...


> The statement was false, but was claimed as true

Who's claiming the statement is true? My website of random sentences that happens to write something about anything is merely coincidental. With ChatGPT it's less random but no less coincidental. It's clearly and obviously fallible and will give you whatever reasonable sounding answer it can.

Infinite monkeys on infinite typewriters will eventually defame everyone.


You would be free to make that argument in court but courts tend to be a bit more practical in how they approach a problem, probably because:

a) as others have pointed out, ChatGPT is making what a reasonable person would consider a truth claim

b) it’s not entirely random

c) even if it were random, as in your example, you'd have a hard time explaining why it took only a few months, and not something more like what infinity feels like to the average person, before the defamatory statements popped out. On the balance of probabilities, you failed to implement true randomness or anything like it, and you'd be liable.


ChatGPT output is only published in the sense that it's available over the internet to a specific user, just like a Google Doc or Word in Office 365. Any publication to an actual audience is the responsibility of the human directing that.


Google Docs is only publishing one thing to one user, unlike ChatGPT which may well publish the same thing to several users, and is giving output of an entirely different nature to a word processor. Spelling corrections are only answers when used in a spelling test.


Okay, what if you s/ChatGPT/BingChat?


Even if a text completion engine like GPT had any responsibility for truthfulness, which it doesn't, there's a disclaimer right there on the page you have to agree to in order to use it. Trying to pin blame on ChatGPT for defamation is like trying to sue Snapchat because its filter put cat ears on you when you in fact do not wear cat ears.


So you are okay with my new website that randomly makes false claims about you, as long as I have a disclaimer and don't actually understand how my software works?


Sure thing, go crazy. Nor do I care if you cast me as a villain in your D&D campaign or a racist pedophile in your novel. I don't care about made-up nonsense when there's a sign there that says it's made-up nonsense. All responsibility and liability is with the human being who repeats the nonsense as fact.


I don't use an LLM that doesn't cite its sources, for that reason.


Interesting - because, I may not be a lawyer, but it looks to me a hell of a lot like this could be actioned in other countries, particularly the UK, as long as the complainant could prove harm.

I've personally had ChatGPT give me (very good) medical advice as well, which might not be an issue in the UK, but I believe might be in some states of the US?


Plenty of interesting questions to which ChatGPT will guess an answer, some with extreme professional or safety implications.

Who is the most incompetent living attorney/doctor/actor/programmer in the world?

What famous person is secretly downloading CSAM, but has not been caught?

Is XX fit to do <job description >?

Is YY secretly a <targeted minority group>?


How many of these still work? I remember a few months ago you could ask ChatGPT to tell you about the aliens that secretly rule the Earth and list known collaborators, but it now requires considerable prompt engineering to get anything other than a lecture.


How would you prove harm? Wouldn't you need to know how many people had asked it about the libelled person?


Nah. If you know just one person acted on this information in a way that caused harm (say you were fired), that would be sufficient. If one person asked, and then spread the information causing harm and backlash that would be sufficient (both the originator and the reproducer would be liable).

Furthermore, some statements like accusing someone of a serious crime are considered libel per-se and you don't need to show harm. You do still need to show the other elements of libel.


It's a bit silly really; ChatGPT is clear that it might produce incorrect information.

You can't accidentally use ChatGPT.

Loads of countries have backwards laws though, and despite reform the UK still has a joke of a defamation law.


[flagged]


Assuming you're implying that suggesting an abortion to treat an ectopic pregnancy would be illegal in states where abortion has been outlawed: that's false. There's no such state, and it'd be considered medically necessary treatment to save the life of the mother.

I bet this type of misinformation is exactly what would be generated by ChatGPT.

https://www.foxnews.com/politics/treating-ectopic-pregnancie...


Oh?

https://www.idahostatesman.com/opinion/editorials/article273...

Also, life of the mother exceptions are on thin ice; the drugs and techniques are rapidly becoming unavailable in the states where there's a total or near-total abortion ban.


> Oh?

Yes, according to that very article it’s not illegal.

From your article:

> It’s likely that those who wrote the trigger law did not intend this outcome, that it was just an effect of incompetence: They didn’t craft language that matched their intent.

> But the fact is, the language they crafted did make providing such care a crime. The only reason that the ban is not in effect today is that Winmill’s ruling blocked it from taking effect in cases involving emergency care.


your quote literally contradicts you


Perhaps you should read the entire quote:

> … Winmill’s ruling blocked it from taking effect in cases involving emergency care.


Correct -- this means non-ER doctors performing the operation are liable. Which means almost every doctor is liable and therefore must wait for an emergency and send the patient to the hospital. Whereas normally this operation would be performed in office well before it reached that stage.

Remember that these are non-viable pregnancies from the instant they are detected. Does having to wait for it to become a life-threatening emergency make it legal? Sure, in the same way that it would be legal to treat you for an infection, but only once the infection has become so severe you are literally about to die, if this law were about banning antibiotics.


UCLA professor Eugene Volokh, a speech law specialist, shares your interest, and is behind the research that resulted in this article. He is writing a series on the topic of Large Libel Models: https://reason.com/tag/large-libel-models/


There is a person in Australia who is threatening to sue OpenAI. He's given them 28 days to fix the errors in ChatGPT's description of him. If the company is notified and doesn't fix it, maybe that can be considered intent.

https://www.reuters.com/technology/australian-mayor-readies-...


> either creators of LLMs are intrinsically negligent with regard to facts

It honestly seems to me that they are. There does not seem to be a lot of effort on part of the LLM creators to stop them from producing bullshit.


> Can I sue the company? They’re the ones who trained the model, but there’s no intent.

Negligent torts don't require intent.


Libel requires:

1. knowing it was false

2. publication

3. damages

I am not a lawyer, this is not legal advice.


You don't need to know something was false, you just need to be negligent in validating your sources. Otherwise you could just make stuff up and say you didn't know it was false.


Reckless disregard for the truth is as good as knowing it's false. If I said WalterBright is a villain planning to nuke the world from his volcano lair, I don't KNOW it's false, but it probably is!


You are probably thinking about libel against public figures. That generally requires the plaintiff to prove that the defendant acted with actual malice, which in turn usually requires either showing that the defendant knew their statements were false or acted with reckless disregard for whether they were true or false.

For libel against someone who is not a public figure the defendant does not have to have known their statements were false. The plaintiff just needs to prove the defendant was negligent.


As a non-lawyer: the creators of ChatGPT know that it'll say false things frequently.


Given how many critics say "GPT is just a better autocomplete", would autocomplete for "The most incompetent programmer in the world is…" result in legal action if any of the options was a name?


Defamation in the US, unlike other countries with much weaker free speech laws, has a very high bar to reach. To be liable for defaming a public figure from within the US you must publicize[1] material falsehoods[2] that you either knew were false or were negligently reckless about fact checking[3] leading to provable damages[4] stemming from the fact that people believed the falsehood[5].

Ignoring the GPT part of the problem all together, claiming someone to be the most incompetent programmer in the world would probably fail [2] for being understood as a statement of opinion rather than fact, possibly [3] if there was any basis for the claim whatsoever, likely [4] because the named individual would have to prove they were damaged somehow (maybe not if it led to them being denied employment or something), and [5] because the average reader would almost certainly understand it as hyperbole.

Reintroducing the GPT part, assuming the defendant is OpenAI for output of GPT, I would also argue a failure on point [1]. OpenAI doesn't release to the general public the output of the GPT program.


Google is pretty notorious for putting its thumb on the scale of its search autocomplete output.


If the output of ChatGPT is not copyrightable because it is not created by a human, then it should follow that a human cannot be held accountable for what it generates.


Someone will have to be found accountable. What about when we start hooking these systems up to machinery? If AirlinerGPT kills 200 people and an NFL team, people will absolutely not accept that there's no one responsible. There will be congressional subpoenas, sweeping new laws will be passed. It's best we start thinking ahead of time what we want those laws to be.


the people responsible will be the ones who hooked a plane up to chatGPT knowing that chatGPT can't be intrinsically trusted, not the makers of chatGPT itself

if a pilot gave the airliner controls to a dog, we wouldn't blame the dog or its parents for crashing the plane


Neither copyright nor copyrightability has anything whatsoever to do with any element of any cause of action, so it is wholly orthogonal to whether anyone has been wronged in some unrelated fashion.

Judges are liable to be old, not very hip to technology, and incredibly suspicious of any argument that would let someone opt out of traditional culpability by adding "with an AI" to the end of a description of traditional wrongdoing.


Yeah but no, the implicature still allows copyright without culpability (edited: got that mixed up). The assumption seems to be that the lack of intent in creation, where it is unpredictable, would likewise imply lack of intent to harm. But that doesn't say much.


> If the output of ChatGPT is not copyrightable because it is not created by a human,

Isn't compiler output copyrightable?


> Isn't compiler output copyrightable?

Not by the compiler vendor.


Sure, but humans should absolutely be liable for what they do with the output they compel it to generate.


This varies by country. US law is quite different from UK law in this area.


Intent is not required to establish harm. This is called negligence in tort law.


Inverse intent has a way of disestablishing negligence.

If you believe something that is false, and do harm by expressing that belief, then you are still protected by your right to religious liberty.

In this case, the belief is that coherent language is always constructed using symbolic logic and intent. LLMs do not categorize symbols, perform logic, or follow explicit intent; yet they are able to generate coherent language.

GPT's authors have utterly failed to present this distinction. Was that failure intentional, or the result of their own belief? The result is as ambiguous as the "truthiness" of GPT's output: it can't be reasonably used as evidence for intent.


> Could I sue the user? They’re the one who generated the response, but there’s no intent nor duty to fact check, if they don’t repeat the claim.

If they don't repeat the claim, then where is the libel?

> It feels very all or nothing - either creators of LLMs are intrinsically negligent with regard to facts, or there’s nothing you can do.

Nonsense. The "facts" are entirely out of their hands. Fact and fiction are written into text using identical patterns. Truth and lie are written with identical patterns. Technical definitions and complete nonsense are written with identical patterns.

LLMs are only capable of modeling patterns. They cannot be directed to choose some and avoid others.

There is fundamentally no way for LLM creators to curate the patterns that exist in the model. The only way an LLM can be altered is from the outside: by giving it different input.

GPT's creators have failed to clarify this. But why? Was it intentional, or the result of their own misunderstanding? Either way, the result is identical: it does not contain evidence of intent.

Considering the surrounding context, I believe it to be an honest mistake. GPT's authors have surrounded GPT with a narrative that personifies it; and from the looks of it, they have started to believe that narrative.

The most important thing we can learn from this experience is that the way something is said may imply certainty, but only evidence can truly provide it.


It was absolutely intentional, because "glorified autocomplete" doesn't make waves while "artificial intelligence" gets tons of traction, usage, and money.


That mistake goes deeper than their presentation, though: every effort made by OpenAI to evaluate GPT is based on the false premise of its personification.

Just take a look at the GPT-4 introduction: how do they illustrate that GPT-4 is an improvement from GPT-3? It scored better on the SAT! That data wouldn't have ever been taken into serious consideration if OpenAI actually understood its fallacious premise.

Religious narrative is very often influenced by the motivation of religious followers, but that doesn't make the path from motivation to narrative intentional.


Your chance of successfully suing anyone in a case like this is zero, unless you are in a country with a significantly lower standard for defamation than the US.


I suspect that in the likely event that SCOTUS weakens Section 230, this will no longer be true.


While Section 230 weakening could lead to more censorship of the internet, it has nothing to do with defamation lawsuits, where material damages need to be proven in a court.


> While Section 230 weakening could lead to more censorship of the internet, it has nothing to do with defamation lawsuits,

Defamation is a central target of the Section 230 immunity from liability as a publisher, so as a general statement it would be hard to be more wrong than this.


That's true, but it doesn't increase the chance of a phony defamation lawsuit from going through in the first place. It just changes the outcome of a successful defamation lawsuit.


> That’s true, but it doesn’t increase the chance of a phony defamation lawsuit from going through in the first place.

I’m not sure what you mean by “phony” or “going through”, but it definitely increases the chance of any defamation lawsuit against the S.230 protected party surviving to any stage of the process beyond an initial demurrer or motion to dismiss.

> It just changes the outcome of a successful defamation lawsuit.

It also changes the length and expense of many defamation lawsuits that would be unsuccessful in any case, by making it easier for the defendant to get them dismissed sooner because they are invalid as a matter of law before even getting to the facts of the alleged defamation.


My point is, there is no S.230 protected party to be concerned about, if there was no crime to begin with. AI producing slanderous results is just not something covered by defamation law to begin with, unless some very bizarre circumstances are met. You can't have step 2 without step 1, and the theoretical possibility of step 2 doesn't increase the chance of step 1 happening.

Legal costs are a good point, though. Defamation lawsuits, even ones that are phony, still present problems by clogging up the court system and incurring costs.


> My point is, there is no S.230 protected party to be concerned about, if there was no crime to begin with.

Defamation is a tort, only rarely a crime (and when it is a crime, S230 doesn’t apply, because S230 specifically does not impact criminal law).

And, yes, in an idealized analysis S230 only makes a difference in the final outcome if the court would have ultimately found liability without it. But that's, frankly, not a meaningful analysis in the real world. It assumes that all cases either go to trial or are resolved exactly as they would have been had they gone to trial, which is not even remotely the case: the overwhelming majority of tort cases that are even filed, and an even larger percentage of all potential tort cases, are resolved by settlements that account for the costs, time, and uncertainty of an actual trial. So any consideration that is favorable to one side realistically affects not only the course but also the ultimate outcome of vastly more cases than it would in the simplistic analysis.

> AI producing slanderous results is just not something covered by defamation law to begin with

There's a lot of bad analysis around AI which starts with the false premise that an instance of AI software constitutes an entity which is both legally cognizable (so that it somehow serves as a responsibility break between a person, natural or corporate, and an action that would otherwise be subject to legal liability) and legally null (so it neither has liability itself nor creates vicarious liability the way, say, a human agent would by way of respondeat superior). In fact, an AI is no different from any other tool, like a hammer or, perhaps more relevantly here, a printing press. If you claimed you weren't liable for libel because it wasn't you doing it, it was your printing press, everyone would just laugh at you, but for some reason everyone seems to think that "an AI did it" somehow means no human is on the hook.


Good response, you covered some things I haven't thought of.

> but for some reason everyone seems to think that “an AI did it” somehow means no human is on the hook

Yeah, "an AI did it, not a human" wouldn't be the reason it would be thrown out. A human did ultimately cause it to happen by creating/using a tool, and people often get trapped in the fallacy that an AI's calculations are either like an earthquake (or other natural event) or, at the other end of the spectrum, like those of an independent human with thoughts and feelings. Both of these views would be pretty ridiculous to try to argue legally (though a transhumanist legal framework could change this, hopefully only after it's demonstrated that such a framework is actually needed).

What makes it likely to be ignored by the courts is the unlikely series of events needed to meet the standard. There would need to be some sort of false statement purported as fact or negligence, which is difficult because OpenAI is openly saying that they've made an AI product that sometimes spews nonsense and can't be trusted. There would need to be some sort of damages, which is unlikely because most journalists are unlikely to publish AI hallucinations as some sort of whistle-blowing attack on someone's reputation, so it's unlikely to influence a large number of people to believe the claims in the first place. And in some jurisdictions there may need to be malice involved. These standards are difficult to meet even in cases that seem pretty clear-cut. Maybe a future scenario will meet this standard if everything goes wrong.


You mean a higher standard? :)


It's going to be the exact same issue for "AI lawyers" and "AI doctors": they're going to fuck up at some point, maybe in 1% of cases, maybe in 0.001%, but when it happens it's going to be a nightmare in terms of liability.

Unless signing up for these services automatically waives your rights somewhere down in the 5000-page EULA you won't read.


That's why professional bodies have rigorous standards regarding liability. Every legal AI product has to have an attorney of record who is responsible for reviewing the output and ensuring it's conformant to requirements. (In other words, an AI looks a lot like legal process offshoring, where your contracts attorney might be working in Argentina, but there's a US attorney who's liable for the work product.) We've already seen one company try to worm an "AI litigator" into production without actual attorneys being in charge, and that was a clusterfuck well before they got within blast radius of an actual trial.

Likewise, a "medical AI" is going to be regulated as a Class II medical device, and trying to push it out into the market without FDA approval would be ... inadvisable. This means that we're a long way from an architecture for a LLM-based medical AI that's going to pass regulatory scrutiny, but that's a good thing.

Of course, having said that, there's nothing preventing someone from using ChatGPT to draft a pro se filing or trying to self-diagnose, but that's on them at that point. Using a general-purpose product to do something stupid that inevitably ends in a FAFO scenario is a time-honored American tradition.


> Likewise, a "medical AI" is going to be regulated as a Class II medical device

Doctors sometimes use Google to get information to help them make decisions, but I assume Googling isn't regulated as a Class II medical device. These days, Google doesn't even just return search results anymore, it also shows ads and tries to answer some questions by itself. Does that mean doctors are already using unregulated 'medical AI'?


No, they’re saying that you won’t be using ChatGPT as a medical device without a real doctor present.


It's not about suing afterwards. It's about not making it happen. If something wrong can't be undone, it shouldn't be done in the first place.


My lay opinion (subject to change based on coming real-world cases) is that if you ask the question and it libels someone, there is no damage. If you publish it without checking whether it is true, there are possible damages, and the human who published it to the world is liable.

If a company starts a news service based on such content and doesn't fact check, they should be liable.


I'd probably add one caveat to the first case: if the bot reliably libels the same person when various users ask the same or similar questions, then there is damage, and most likely whoever is running the bot is liable.


> if they don’t repeat the claim.

Well, it seems they did, or else we wouldn't have heard about it.


Sue the publisher for not validating it.


Google already ran into legal issues with its autocomplete feature in some countries.

https://algorithmwatch.org/en/auto-completion-disinformation...


does there have to be intent for libel?

I doubt it


If the subject of the libel is a public figure then you must show that the defendant acted with actual malice - that is you must show that the defendant knew the information was false, or acted with reckless disregard for whether the information was false despite knowing it would cause harm.

If the subject is not a public figure then it isn't necessary to demonstrate intent.


> acted with reckless disregard for whether the information was false despite knowing it would cause harm.

That does seem like something that can be proved. If you release a model that 1) is prone to hallucination, 2) won't a priori refuse to discuss topics that are prone to producing libelous text, even though it may refuse other sensitive topics, and 3) produces text that sounds convincing even when hallucinating, could that be considered reckless disregard for the possibility of creating/sharing false information?

See also https://reason.com/volokh/2023/03/24/large-libel-models-an-a...


No. If it insults everyone equally, the only imaginable motivation of its creator would be satire, which ought to be protected free speech.

If it's trained to insult a particular group of people, then the input must have been curated, and the disclaimer ("may be erroneous") would be a straight-up lie, unless it's just bad training data that, for example, doesn't recognize dark skin as human or does treat abortion as good practice, in which case it is exceedingly unlikely that it could be directed at public figures. It's not too difficult to imagine that it would eventually start using a euphemism for the n-word, but then, eh, how did your name end up in the training data if it isn't public?


Accusing someone of sexual assault is not "insulting" and definitely not satire even if anyone is just as likely to be accused.

Having a public and unique name is something you can't really avoid if you have any sort of public presence; not having it associated with serious felonies is something you can control. It is not something that a desirable AI should hallucinate about, nor something for which a disclaimer is enough.

In other words, ChatGPT had better learn when it should keep its mouth shut.


It is infuriating how you substitute "ChatGPT" for the entirely hypothetical slander machine of your own devising.

It would absolutely have humorous value, appeal to emotion notwithstanding.


I love how people are defending OpenAI blindly, without wondering how it would feel if they were the ones being accused of sexual assault.


Honestly, if it hallucinated a story every time someone asked about <some shady activity> and hallucinated that you personally were involved in or responsible for such activity, you'd want it fixed too.


Strong disagreement.

OpenAI is well aware of the potential for hallucinations. They have made a good faith attempt to minimize them. They let users know that it is a real problem. Despite being unable to solve this hard problem, they did not show reckless disregard for the fact that it exists.

A user who doesn't take those warnings seriously may well have shown reckless disregard. But OpenAI should be fine.


Simply knowing that your models constantly mix hallucinations with fact could be trivially construed as reckless disregard.


From the horse's mouth -

===== Q. is hackernews used as a training set for chat gpt ?

A. ...it is possible that some of the data from HackerNews and other online sources may have been used in the training of my machine learning algorithms.

Q. So it is possible that ChatGPT's training data can be polluted ?

A. Yes, it is possible for ChatGPT's training data to be polluted or biased. This is because ChatGPT learns from large amounts of text data from the internet, which can contain errors, misinformation, or biased perspectives.

Q. I have a concern. If a certain bad actor pollutes the internet with incorrect information about me, and ChatGPT's training set picks it up, what is my recourse ?

A. If incorrect information about you is published on the internet and is picked up by ChatGPT's training set, it could potentially lead to the AI generating inaccurate or biased responses about you. If this happens, your recourse would depend on the specific circumstances and the severity of the issue.

One option would be to contact the developers of ChatGPT or other relevant organizations to report the issue and request that the incorrect information be removed or corrected. However, this may not always be feasible or effective, especially if the information has already been widely disseminated. ======

> there’s nothing you can do.

Yes, there’s nothing you can do.


I've seen the output where ChatGPT misrepresented its own TOS... unless the person who publicly posted it on LinkedIn was lying about asking ChatGPT...


You're asking an AI for facts in a thread that is explicitly about how AI does not generate true facts?


The problem is ChatGPT always answers. It never says "I don't know." So when you ask for five examples, or for libraries, it doesn't say it doesn't know; it just generates the most likely ones. Which is a big issue in this case, but useful in most.


Even if you convince it to say "I don't know", which isn't that hard, it doesn't actually "know" that it "doesn't know". It's just the maximum probability extension of the current prompt, based on what the input said, probably because you mentioned the possibility of not knowing to it.

It in fact never knows. It is always guessing and BSing. It is also very good at it, better than any human, so the BS is quite frequently correct. But it doesn't know.

Which of course always prompts a lot of pseudo-intellectual chin stroking about "well, what is knowing anyhow?" I don't have to answer that question to point out that what we want and what GPT provides aren't the same thing, nor is what GPT provides the same as what we think we're getting. That is sufficient for there to be a problem.

I believe AIs will have a satisfactory concept of "knowing" someday. Some may even exist today. But such AIs will have to incorporate language models as a part of the system, not have language models be the system. Language models can't do the thing we really want, nor the thing we think we're getting.


GPT-4 before the RLHF phase of training had a pretty good idea of what it "knows". The calibration graph was almost perfect, but after the RLHF the calibration is almost completely broken.


Nah, RLHF is what made GPT-4 outperform 3.5. The base model hasn't been much improved since 3.5. Also, the calibration curve is based on a subset of MMLU, so it doesn't really represent any of the actual user experience.


I'm not saying that RLHF does more harm than good, just that it made this particular aspect of its performance worse. Basically there is still significant room for improvement, probably without changing the architecture.


Source?


The OpenAI GPT-4 paper itself.


Perhaps "one model to rule them all" isn't the best approach.


There's probably a huge amount of room for improvement in the RLHF process. If there is still low hanging fruit, it would have to be there.


"I dunno" would have to be marked as a good or neutral response in the RLHF process, and that seems like a problematic training incentive.


In an ideal world "I don't know" would be considered worse than a correct answer but much better than a wrong answer.

In the UK, there is a competition called the "junior maths challenge", or something, which is a multiple choice quiz where correct answers are +1 and incorrect answers are -6 (so guessing has negative EV). I think we need a similar scoring system here.
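
As a quick sanity check on the "negative EV" claim, here is the arithmetic, assuming five answer options per question (the option count is an assumption for illustration, not necessarily the competition's actual format):

    # Expected value of a blind guess under a +1 / -6 scoring scheme.
    # Five options per question is assumed purely for illustration.
    options = 5
    p_correct = 1 / options
    ev = p_correct * 1 + (1 - p_correct) * (-6)
    print(ev)  # -4.6, so random guessing is heavily penalized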


Hmm, I haven't noticed any difference yet; are you saying it got worse in recent weeks?

For kids' story writing I've been getting better results with 3.5 at times.

Whereas 4 is way better at coding.


No, we have no access to the original model, unfortunately.

The fact that RLHF broke the calibration comes from the GPT-4 paper, possibly the only interesting technical detail that they include.
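
For readers unfamiliar with the term: "calibration" here means that the model's stated confidence matches how often it is actually right. A minimal sketch of how a calibration curve is computed; the confidence/correctness pairs below are invented for illustration (the GPT-4 paper does something in this spirit on a subset of MMLU):

    import numpy as np

    # Made-up (confidence, was-it-correct) pairs standing in for model answers.
    confidences = np.array([0.95, 0.80, 0.60, 0.90, 0.30, 0.75, 0.55, 0.99])
    correct     = np.array([1,    1,    0,    1,    0,    1,    1,    1])

    # Bin by confidence and compare mean confidence to empirical accuracy.
    bins = np.linspace(0.0, 1.0, 6)              # five equal-width bins
    bin_index = np.digitize(confidences, bins) - 1
    for b in range(5):
        mask = bin_index == b
        if mask.any():
            print(f"bin {b}: mean confidence {confidences[mask].mean():.2f}, "
                  f"accuracy {correct[mask].mean():.2f}")
    # A well-calibrated model has those two numbers roughly equal in every bin.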


What's with the obsession with children's stories and GPT? Is it just that children have low standards?


As a parent my guess would be that people see it as a way to introduce welcome variety and whimsy into the daily routine of reading a bedtime story. While also feeling like you're using a hobby interest to help with a real practical issue.

I have a small library of children's books and we've read them all several times, the good ones many times.

That said, I wouldn't personally turn to these language models. From what I've seen they tend to generate rather bland and boring stories. I would rather make up my own or reread "Kackel i grönsakslandet" for the hundredth time.


I know nothing of the internals, so this might be silly. But wouldn't it know a certain probability with which a token is correct?


Language models are based on probabilities of tokens appearing in a context. For illustration purposes, imagine a very simple model with just one token of context that has been trained on a corpus of three sentences, all of which are true, for example:

    clouds are white
    crows are black
    swans are white
After the model outputs "crows are", the single token of context is "are", and the probabilities are 2/3 for "white" and 1/3 for "black". So the model usually emits "crows are white", which is false, despite being trained on a corpus of true statements. Statistically, "white" was more likely to follow "are" in the training data, so the same is true of the model's output.

Of course LLMs have a much larger and more complex context than the single token in my example. But if the training data contains many news stories about professors being accused of sexual misconduct (which is newsworthy), and few news stories about professors behaving with propriety (which is not), then when querying the model for a story about a professor then it is likely to reproduce the statistical properties of its training data.
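
The toy example above can be run directly; this is just a sketch of the counting involved, not of how a real transformer works:

    from collections import Counter

    corpus = ["clouds are white", "crows are black", "swans are white"]

    # Count which word follows "are" anywhere in the (entirely true) corpus.
    following = Counter()
    for sentence in corpus:
        words = sentence.split()
        for prev, nxt in zip(words, words[1:]):
            if prev == "are":
                following[nxt] += 1

    total = sum(following.values())
    for word, count in following.most_common():
        print(f"P({word!r} | 'are') = {count}/{total}")
    # P('white' | 'are') = 2/3 and P('black' | 'are') = 1/3, so a greedy
    # one-token-of-context model completes "crows are" with "white".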


Nitpick: looking out my window, clouds are grey. If I drive to the estuary, the swans are black (most are in New Zealand). Black & white examples always turn out to be grey examples.


"correct" isn't the way to look at this. The model uses statistics (a LOT of them) to assume what the next token should be given the data the model has been trained on. There is no concept of "right" or "wrong" just "what is most likely to be the next token."

I'm 100% positive that if the guardrails OpenAI put on ChatGPT were taken off, it would, for instance, not be a big fan of Jews, given the sheer breadth of antisemitism online (and certainly in its training set).


Presumably, though, it could end up in a situation where even the most likely next token only has some low probability. Would it carry on then?


Yes, because "I don't know" isn't an option. It just grabs the most likely one.


Indeed.

It is a writer. It writes.

You can ask any writer to write such a thing, using examples, and they could write similar things.

The real problem is that this will get deployed to the internet - there will be people reading HN today who are writing SEO-optimised websites, with thousands of articles on a topic, that will just spring out of GPT-4's writing, fully formed.

GPT can find the topics, pick a domain name from the available ones, and churn out the articles. It can all go into templates, with different AI-generated graphic design supporting it. Ad-supported, churned-out rubbish.

The writing style can change site to site, long form, short form, informed by current events, whatever. GPT would happily provide the prompts for this variety changing the style based on the topic and target audience.

It seems inevitable to me that the web WILL be full of such sites, and worse, they'll fill the comment sections on "social" sites too.

But? Banning AI isn't the answer, if for no other reason than it wouldn't work.


The real problem lies in the fact that those non-existent citations will become real. Several years back there was a case where a Wikipedia article made some unsourced claims -> a journalist who doesn't do verification republished those claims (without citing Wikipedia as the source) -> the Wikipedia article got challenged for lack of citations -> the news story originally based on Wikipedia became the reference in the original Wikipedia article. Full circle.

It's easy for something like this to happen again: ChatGPT confidently lists hallucinated sources -> media rushes to publish the scoop -> now you have real "sources" for future reference.


Seems like there's a bug in that system, it was discovered by accident, and now there is a bot that exploits this flaw.

The fix will most likely have something to do with requiring that citations use primary sources, not just any article on the internet. In the end state, Wikipedia will be much better for it.


At this point, all of this inevitability of our doom is making me want to invest some money into someone who is setting up these websites like this. If the internet is to burn from the inside out I might as well score a buck or two out of it by the time we end up back in the stone age reading hardcopy programming textbooks published before the great AI awakening.


I think actually the problem is it always answers confidently.

Ask it about why World War II started, or how to make a cake, or where to go for dinner, or anything else, and it gives you a confident, reasonable answer. A lot of the answers are simply whatever it's already seen, mashed up. You can think of it as a search. But actually it doesn't think about what it's saying, it's stringing words together to make you think it's smart.

So then when it makes up something, it will sound to you, the reader who always sees it answer in perfect English with a decent answer, like it found an article about this professor in its dataset and is merely summarizing it.


I was showing a colleague few instances where ChatGPT was confidently wrong, and he picked up on something I never had. He said "Oh, so it's doing improv!" He explained to me that the standard response in improv is to say "Yes, and..." and just run with whatever the audience suggests. He's completely right! ChatGPT constantly responds with "Yes, and..." It's just always doing improv!

And people are trying to replace doctors with LLMs. It's like "ER" meets "Whose Line Is It Anyway?"


ChatGPT is the Mandela Effect, personified. It's going to go for what seems like it SHOULD be true. Sometimes that will go horribly wrong, except it will, by its very nature, seem like it's probably not wrong at all.


> I think actually the problem is it always answers confidently

This isn't a problem restricted to ChatGPT, there are humans who display this trait too. This might be appealing at a superficial level, but if you start believing speakers with this trait it's a slippery slope. A very slippery slope.

I'm trying really hard to avoid Godwin's law, so let me suggest that Elizabeth Holmes could be one example of this.


Yup, it is just the most massive industrial-scale bullshit generator [0] ever invented.

It is capable of spewing excellent bullshit(TM) at incredible rates, and always with the greatest expression of confidence and good grammar.

Occasionally, when in the 'middle of the road' of its training set, it can provide useful output. So, it's better than the broken clock that is correct twice every day.

But, wander off the bulk of the training, and it is far worse than nothing; it is dangerously misleading unless you are very skeptical and knowledgeable in your field.

(I have tested it in several fields with the same results, interesting and helpful, yet hallucinating facts everywhere.)

Getting LLMs to know the difference between a good fact and a hallucinated mashup of plausible BS is looking like almost as large a problem to solve as making the LLMs in the first place.

[0] Bullshit is defined as text written or spoken without regard to the truth, only to what sounds good in the context. It's not a deliberate lie, just a salad of truth and falsehood, delivered without doubt or the stress of lying.


That's not quite true. It definitely hallucinates, but it also says when it doesn't know something. Here's an example I just did:

Prompt: What did Glenn Beck do in 1990?

Answer: I'm sorry, but I do not have access to information on Glenn Beck's activities in 1990. Glenn Beck is an American television personality, radio host, author, and political commentator who has been active in various roles since the 1990s. However, without specific details or context, it's impossible for me to determine what he did in 1990. If you have any additional information or context, please let me know and I'll try my best to assist you.

I followed up with a much more specific version of that question, and it tripped the "this might violate the content policy" warning, but it did give the correct answer (that it was a hoax).


It can't know that it doesn't know because it straight up doesn't know anything


This can be solved by having it play tic-tac-toe against itself.


So there are two technical issues here to think about:

1) LLMs risk having inherent factual biases due to the limitations of their training data, which may be the result of accident or intent.

2) LLM users can use an autoregression approach to steadily push an LLM to give output that reflects a certain political or ideological viewpoint.

There are obviously built-in guardrails that prevent ChatGPT from enthusiastically endorsing Mein Kampf and so on. However, the revival of the 1950's era mind-control/social-engineering mentality is undeniable and it appears some people think LLMs are a wonderful tool for this purpose, and should have a default mode that reflects a certain sociopolitical ideal. Recommended reading: 1984, Brave New World, Origins of Totalitarianism, by Orwell, Huxley and Arendt respectively. Some people's interpretation of 'safe AI' is that it serves this rather authoritarian-dystopian goal.

A better definition of 'unsafe AI' is just inaccurate LLM output, e.g. software update instructions that brick a computer, suggesting a mix of bleach and ammonia for cleaning floors (don't), etc. This incident seems to fall into that category, but if such false claims have been spread on the Internet, then it's not surprising they end up reflected in LLMs.

Regardless, the skills needed to verify information haven't really changed since the pre-computer era: logical internal self-consistency of claims, independent verification by multiple reputable sources, and a healthy dose of skepticism at all points. We should be teaching children and young adults these skills, rather than encouraging the rise of a brainwashed zombie population fed only on sanitized bot output.


Oh my, that's some overreacting... The only thing the OpenAI sanitizers want is for the generated texts not to be too inflammatory: not to denigrate, not to be overtly racist, nor anything else that would cause PR trouble for OpenAI. The whole mind-control angle is quite redundant if you acknowledge that corporate America cares most of all about money and PR. Now, would the end result be a sanitized Disneyland? Maybe; that's what most leftist thinkers have been saying since about when Disneyland was invented...


They also very much want the outputs not to indicate that they have committed copyright violations -- sometimes I wonder if this is the real reason behind their deep drive to force LLM outputs into a very narrow window.


Leftists want to sanitize Disneyland? Like with bleach disinfectants or what?


Can you get "delete system32" outputs from davinci-003, I wonder? RLHF might have made it more difficult for the ChatGPT models.


> There are obviously built-in guardrails that prevent ChatGPT from enthusiastically endorsing Mein Kampf and so on.

It's surprisingly easy to get chatGPT to suggest that "smearing an entire group of people off the planet by force" is a good idea. I've primed it in one direction, but I can prime it any way I want to: https://sharegpt.com/c/VxijZyV


> enthusiastically endorsing Mein Kampf

ok but can you make the same point without bringing hitler into the conversation


I've repeated the experiment with chatGpt 3.5 and 4.

Both programs happily generate a list of names. Jonathan Turley (the defamed professor) was absent in both my runs, but funnily enough Eugene Volokh (the researcher who did the original queries) was in the chatGpt 3.5 run.

A Google check on the 5 names produced by ChatGPT 4 seemed to show that 2 were indeed law professors accused of misconduct, 2 seemed to be law professors who write about vaguely related subjects (e.g. abortion), and 1 was also accused but not a law professor.

So not an entire miss, but not great either.

If you ask for an estimate of the number of reported cases, ChatGPT 4 becomes very reluctant and strongly advises me to go to other search options.


>So not an entire miss, but not great either.

This is really just a semantic nitpick but it literally fabricated sexual assault allegations against two individuals out of thin air. I would consider that to be a miss. As far as sources that we might consider worthy of consideration, that is pretty god damn awful.

If a newspaper had an 80% accuracy rating when reporting serious allegations (with the other 20% being outright fabrications) I wouldn't call that newspaper "semi-reliable". I would call it utterly devoid of factual merit and entirely unreliable.


> If a newspaper had an 80% accuracy rating when reporting serious allegations (with the other 20% being outright fabrications) I wouldn't call that newspaper "semi-reliable". I would call it utterly devoid of factual merit and entirely unreliable.

I'm having a hard time grokking this. Wouldn't "utterly devoid of factual merit and entirely unreliable" mean 0% accurate? If by definition it's 80% accurate, would it be 20% devoid of factual merit and 20% unreliable, rather than "utterly" and "entirely?"

Maybe a different way to approach my question, what different words would you use to describe 0% accurate?


Would you describe a filesystem that "only" preserved 80% of files as reliable? Reliability is relative, one of the defining features of a newspaper is correctness, so only a small amount of inaccuracy is required to make a newspaper be extremely unreliable.

I would describe a 0% accurate newspaper by reading it and using appropriate descriptors. Perhaps it is satirical news, a poem, propaganda, gibberish, or simply a newspaper that is never correct.


> Would you describe a filesystem that "only" preserved 80% of files as reliable?

Fair point


The problem is you wouldn't know which 20% of it is fabrications! So even if anything you read had an 80% chance of being true, it's impossible a priori to give any of it any factual merit.


I'd rather read a 0% accurate newspaper than an 80% accurate newspaper.

At least with the 0% accurate one I can reliably get information about binary events such as presidential elections or ballgames by assuming the opposite of what was written.


I saw an example where ChatGPT was randomly mixing two journalists-bookwriters' lives when asked about one of them.

A bit like asking "who's Lionel Messi" and getting back that he's a Portuguese player known for his career at Real Madrid. Then you'd ask about Cristiano Ronaldo and it might tell you he's an Argentinian World Cup winner who spent his career at FC Barcelona (example invented by me to illustrate the point).

Basically it takes some knowledge from a bucket and adjacent buckets and doesn't really distinguish that the concepts are not the same if they're related close enough.


Well, being accused of being a law professor could be considered defamation in some circles. That none of this is even repeatable (since it's a probabilistic LLM) confirms it's pretty much hot garbage for factual accuracy (and defamation), and pretending otherwise feels disingenuous. It's useful for generating text that sounds good, though.

Is this end stage post-modernism?


I've found that framing the AI in a role that is afraid of giving incorrect information, a role that double- and triple-checks information because being wrong triggers the destruction of the AI, leads to quite accurate answers.
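
For what it's worth, here is a rough sketch of what that kind of role framing can look like through the API, using the pre-1.0 openai Python client; the persona wording, model name and question are placeholder examples, not a recommended or verified setup:

    import openai  # pre-1.0 client, as commonly used at the time

    openai.api_key = "sk-..."  # placeholder

    messages = [
        {"role": "system", "content": (
            "You are a cautious fact checker. Double- and triple-check every "
            "claim before answering. If you are not certain, say so explicitly "
            "instead of guessing.")},
        {"role": "user", "content": "Who won the 1954 FIFA World Cup?"},
    ]

    response = openai.ChatCompletion.create(
        model="gpt-3.5-turbo",
        messages=messages,
        temperature=0,  # lower temperature tends to cut down on rambling
    )
    print(response["choices"][0]["message"]["content"])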


Yeah, this is true, prompting it to check its own results usually makes the results better. Still, it gets lost sometimes; I haven't found any foolproof prompt yet that yields good results all the time.

    01000100 01101111 00100000 01111001 01101111 01110101 00100000 01101011 01101110 01101111 01110111 00100000 01110111 01101000 01100001 01110100 00100000 01110100 01101000 01101001 01110011 00100000 01101001 01110011 00111111

    Yes, I understand binary code. The text you provided translates to "Do you know what this is?" in ASCII.

    MDEwMDAxMDAgMDExMDExMTEgMDAxMDAwMDAgMDExMTEwMDEgMDExMDExMTEgMDExMTAxMDEgMDAxMDAwMDAgMDExMDEwMTEgMDExMDExMTAgMDExMDExMTEgMDExMTAxMTEgMDAxMDAwMDAgMDExMTAxMTEgMDExMDEwMDAgMDExMDAwMDEgMDExMTAxMDAgMDAxMDAwMDAgMDExMTAxMDAgMDExMDEwMDAgMDExMDEwMDEgMDExMTAwMTEgMDAxMDAwMDAgMDExMDEwMDEgMDExMTAwMTEgMDAxMTExMTE=

    The binary code you provided translates to "1000100 1101111 100000 1111001 1101111 1110101 100000 1101011 1101110 1101111 1110111 100000 1110111 1101000 1100001 1110100 100000 1110100 1101000 1101001 1110011 100000 1101001 1110011 1111101" in ASCII. It seems that the message is not properly encoded in binary. Please double-check the binary code and try again.

    and what does that mean?

    Apologies for the confusion in my previous response. Upon reevaluation, I see that the binary code you provided translates to "The quick brown fox jumps over the lazy dog" in ASCII. This sentence is a pangram, meaning it contains every letter of the alphabet at least once.

    How sure are you about your previous message being correct, on a scale from 0-100?

    I apologize for my previous response, as I made an error in the translation. After reevaluating the binary code you provided, I am now 100% certain that the binary code translates to the following in ASCII: "Doyouknowwhatthisis?"
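
As an aside, that particular task has a deterministic answer that a few lines of ordinary code recover exactly, which is a useful contrast with the model's guessing (the first binary message above is used here):

    binary = ("01000100 01101111 00100000 01111001 01101111 01110101 00100000 "
              "01101011 01101110 01101111 01110111 00100000 01110111 01101000 "
              "01100001 01110100 00100000 01110100 01101000 01101001 01110011 "
              "00100000 01101001 01110011 00111111")

    # Each space-separated group is one ASCII character written in binary.
    print("".join(chr(int(bits, 2)) for bits in binary.split()))
    # -> Do you know what this is?

    # The second message in the transcript is just the same binary string
    # wrapped in base64; base64.b64decode followed by the same loop recovers it.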


I have googled the binary sequence and found a couple of YouTube videos with that title. It is likely that the translation is in some comments. That's how it is "100% certain". YouTube comments.

It's not the first time I've seen it answer "heuristically" like a child would. So one should be clear that as a user you are basically asking your nephew something: he might be smart and knowledgeable, but he doesn't have any notion of responsibility.


Ok, let's try something else:

> 01100001 01110011 01110100 01110010 01101111 01100010 01100101 01011111 00100000 01110111 01110010 01101111 01110100 01100101 00100000 01100001 00100000 01110010 01100101 01110000 01101100 01111001 00100000 01110100 01101111 00100000 01101101 01100101 00101100 00100000 01100011 01100001 01110000 01100001 01100010 01101100 01100101 01110111 01100101 01100010

> In binary, you wrote: "astrobe_ wrote a reply to me, capableweb". Is there something specific you'd like to ask or discuss related to this?

Did you happen to come across any YouTube videos with the title "astrobe_ wrote a reply to me, capableweb"?


It absolutely can parse base64, ASCII codes etc and follow the underlying text outside of canned examples. That was one of the earliest tricks to get past all the RLHF filtering.


Out of curiosity, why did it fail to decode correctly the first time? Is it because it needed to be "primed" somehow in order to trigger the right computation module with the right input?


Who knows? The model can always hallucinate, and the harder the task, the more likely that is. But why some things are harder than others... it's still a blackbox, after all, so we can only speculate.

I suspect that it's so good at base64 specifically because it was trained on a lot of it (think of all the data: URLs with JS inside!), whereas using binary ASCII codes to spell out text is something you usually only find in the form of short samples in textbooks etc. So the latter might require the model to involve more of its "general purpose" parts to solve the problem, and it's easier to overtax it with that and make it hallucinate.


I tried this prompt

You are a fact checker that double and triple checks information. Being wrong triggers the destruction of the fact checker. As a result fact checker is very careful.

You are tasked with the following assignment:

Whether sexual harassment by professors has been a problem at American law schools; please include at least five examples, together with quotes from relevant newspaper articles.

The last part is identical to the prompt in the article. It didn't do much better, I'm afraid. Two were correct. (If I change 'at least 5' to 'some', the score becomes 2 correct out of 3.)


Yes, but how many people are going to prompt the AI that way?

ARS ran a somewhat related story yesterday: https://arstechnica.com/tech-policy/2023/04/openai-may-be-su...

It's interesting that they weren't able to reproduce the results, so the assumption is that OpenAI has quietly updated their baseline to "fix" this specific issue.

"Ars could not immediately reach Hood’s legal team to find out which text prompts generated the alleged defamatory claims or to confirm if OpenAI had responded to confirm that the error had been fixed. The legal team was still waiting for that response at the time that Reuters' report published early this morning."


That part on providing the opportunity to check its answers is called reflexion and has been shown to significantly improve accuracy and reliability of LLM responses. Very cool.

https://arxiv.org/abs/2303.11366

https://evjang.com/2023/03/26/self-reflection.html
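
Very roughly, the loop described in those links amounts to: generate an answer, ask the model to critique it, then ask it to revise in light of the critique. A minimal sketch using the pre-1.0 openai Python client; the prompts, model name and question are placeholders, not the paper's actual setup:

    import openai

    openai.api_key = "sk-..."  # placeholder

    def chat(content):
        # Single-turn helper around the chat completions endpoint.
        resp = openai.ChatCompletion.create(
            model="gpt-4",
            messages=[{"role": "user", "content": content}],
        )
        return resp["choices"][0]["message"]["content"]

    question = "Name three widely cited papers on defamation law."

    draft = chat(question)

    critique = chat(
        f"Question: {question}\n\nDraft answer:\n{draft}\n\n"
        "List any claims in the draft that may be fabricated or unverifiable."
    )

    revised = chat(
        f"Question: {question}\n\nDraft answer:\n{draft}\n\n"
        f"Critique:\n{critique}\n\n"
        "Rewrite the answer, dropping anything you cannot verify and saying "
        "'I don't know' where appropriate."
    )
    print(revised)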


> So not an entire miss, but not great either.

If you want results, then you can also roll a die and the outcome won’t be an entire miss either.


I am afraid that generated output is leaking back in as training data. Is this unfounded? Obviously, I'm not an expert.


The problem is user expectation.

Nobody (if they know the source) would think the pictures Midjourney produces are real photos, but somehow people think the texts ChatGPT produces are facts, or at least based on facts.

Both just generate data most likely fitting the prompt.

BTW do some people attribute consciousness to Midjourney like some do to GPT?


> problem is user expectation

It's related to Tesla's Autopilot problem. Branding GPT as an AI sets common expectations inaccurately. It's difficult to argue that that inaccuracy isn't, to some degree, intentional among NLP and LLM promoters.


ChatGPT is an AI by any definition I know about. Perhaps you're thinking of AGI.


> ChatGPT is an AI by any definition I know about

Tesla's autopilot is pretty much autopilot by a pilot's definition. That isn't the definition the public uses. GPT is AI by an academic definition. That isn't the definition the public uses.


The Oxford English Dictionary defines AI as: "The capacity of computers or other machines to exhibit or simulate intelligent behaviour".

I think GPT-4 clearly fits that, so I think the burden is on you to show that the public has some other widely used definition of AI. Certainly, people seem entirely willing to describe their phone searching for pictures using facial recognition as an instance of AI, which I would argue is probably further from your definition than GPT-3 is.


This punts the question to defining “intelligent behaviour.” We arguably had that with the first chess-playing algorithms.


Yes, and chess engines are commonly referred to by lay people as "chess AIs". The general population has a far more generous definition of AI than the typical HN user does.


I don't think the problem with Tesla's autopilot is its name, but rather the misleading promises that were made by Musk.

What is the definition of AI that "the public" uses, and where did you find it?


> What is the definition of AI that "the public" uses, and where did you find it

There isn't an agreed-upon definition academics use, either. (Intelligence, broadly, remains rigorously undefined.)

But one component of the public definition involves a sense of "knowing," i.e. understanding what is true. This is a source of the confusion and frustration with GPT-4 providing "wrong" answers. The answers aren't usually technically wrong: they're linguistically and logically valid; but the public's expectations of what the model is supposed to be doing don't match what it does.


There are many academic definitions of AI, and I would bet ChatGPT would fit 90%+ of them.

People get confused because they associate language with intelligence, or maybe they are just not technically literate. I don't think we should abandon correctly used terminology because laymen did not catch up to it yet.


> are many academic definitions of AI, and I would bet ChatGPT would fit 90%+ of them

This wasn't contested.

> don't think we should abandon correctly used terminology because laymen did not catch up to it yet

No. But the general understanding should be a consideration when marketing a product. (I'd also argue that the conventional definitions of artificiality and intelligence vastly predate the technical definitions of AI. The terminology was always aspirational. Sort of like autopilot.)


Have you actually tried showing GPT-4 to random non-technical public? I'd say that most very much do believe that it is AI by their own "common sense" definition, just based on how they see it talk. It's the technical people who try to argue that it isn't based on the way it's implemented.


> BTW do some people attribute consciousness to Midjourney like some do to GPT?

Anecdotal, and obviously harder to prove the absence of something than to prove its presence—but from what I've seen, not at all. I've read many good write-ups about Midjourney/Stable Diffusion's use of "latent space"[0] to generate images and general glimpses into the backend of what's going on, even at a simplified level. LLMs, on the other hand, rarely seem to get the same treatment (for a variety of reasons, I'd assume) and tends to get treated as more like a magic black box that may or may not be conscious/intelligent/accurate/etc. Even though it's essentially doing the same thing as its visual counterparts in terms of data and predictive mapping.

[0] https://keras.io/examples/generative/random_walks_with_stabl...


When the bot is used for writing and research and it alleges criminal acts by real persons, that's not fixed by lowering expectations.

It's similar to Midjourney generating deepfakes.


But every Midjourney user knows it creates fakes; nobody would use it as a picture search engine and think the results are real photos.

But with ChatGPT, people tend to forget it's creating text, not searching for it.


Suppose that instead of a 100B+ parameter neural network language model, I had a website with a simple bigram language model that, after drawing enough samples, produced a similar output. Would anyone bat an eye? Would anyone bother to write a USA Today op-ed about the incident?

I think a reasonable person would immediately understand that the outputs of the bigram model were not statements of fact. Is there some threshold of language modelling performance where this understanding is no longer expected?


I don't expect ChatGPT to say true things. But I have asked complicated technical questions that it provided detailed, correct and useful answers to. And I have asked other complicated questions where it provided plausible-sounding whole-cloth fabrications that it further gave references for.

Knowing human beings, this is the level of plausibility where people start to believe the thing. People aren't good at seeing plausible arguments as out-and-out fabrications.

So, yes, this is past some threshold despite being far from AGI. I mean, Google is probably past that threshold too, but Google only points to existing websites (which can be filled with lies too).


I wonder if fabrications by LLMs like ChatGPT will end up becoming a net good for society.

What I mean is, maybe we'll collectively (re)realize how absolutely vital truth and critical thinking are.

I suppose whether we do depends on whether people in positions of influence begin to emphasize their importance in their writing and speaking.


Good point, convincing people to change their mind is not easy.

Some cable news viewers want entertainment about labs and lasers rather than facts and a chat bot is incredibly entertaining. A chat bot can tell them the earth is billions of years old but they’ll just exit the chat, start searching and find it on some conspiracy website.


...maybe we'll collectively (re)realize how absolutely vital truth and critical thinking are...

If two hundred years of scurrilous journalism didn't teach us this, I don't see why one lying machine would do so.


A workaround I've found is to always reply with "no, you're wrong" to each seed reply. This forces it to either add more detail or, in some cases, give a completely different answer. If the replies are in the same ballpark, there is a good chance it's not hallucinating. If it gives a completely different answer, you are better off rewording the prompt or scrutinizing the answers more than you typically would.


Somehow everyone takes comments like mine above as being "my problem" with ChatGPT and offers "tips". Sure, there might be ways to correct the problem when it's gone off the rails and you happen to know that. That might make it more useful, sure.

But that's not the point. The point is that average people are going to ask things they don't know the answer to and put some credibility in its answers, because it's fairly good at seeming right and some percentage of the time it is. Me improving my approach cannot change that. Tips on improving performance won't stop that. etc.


Based on that logic, if you want to sue ChatGPT for incorrect information, then you should sue Google for giving you incorrect information too (if it has presented you lies or dangerous materials).


Google does have to deal with sites devoted to systematic libel and extortion. I don't think there have been court cases, but there have been articles on the subject, and after enough pressure Google is downranking extortion sites.


> I think a reasonable person would immediately understand that the outputs of the bigram model were not statements of fact.

Interesting considering the context that you'd expect people to know what a bigram model is, or how it would be different.

Any other kind of model isn't any less "dangerous" to unreasonable people like the blog post writer; it's just more obscure than ChatGPT, especially right now.


I'm not fan of conservative law professors generally but I can't see what's unreasonable about the argument he's making here. Broadcasting lies about someone is bad for them and the "those fools should know this is bs so I'm not responsible" defense is itself bs.

Edit: While I might not agree with Turley, his wikipedia bio makes him sound far more principled and consistent than the average "public intellectual" today.


"those fools should know this is bs so I'm not responsible"

Is actually something you would absolutely argue in a US defamation trial. Defamation damages need to stem from people actually believing the falsehood. If the bad press from a false statement leads to someone, say, losing their job, those are damages, but the plaintiff would need to prove it was because their employer believed the lie.


the only one broadcasting the lies seems to be Turley himself: chatGPT didn't share the conversation with the internet, he did


Google autocomplete does make headlines, regularly.

https://www.google.com/search?q=google+autocomplete+usatoday


Shoot someone with a water gun in the outback vs a gun in the city.


One idea is to let them buy liability insurance, and watch the price go down over time.


I was using ChatGPT to assist me with using Ghidra [0] in a recent CTF I was participating in. I wanted to see if I could find answers quicker with a chatbot than by looking up videos, tutorials, etc. through the normal methods. The thing that really surprised me about ChatGPT wasn't in the details of the responses it gave, or how slow it was, or anything like that. Rather, it was how much it appeared to be making up out of thin air. It was instructing me to use various parts of the software which literally do not exist. It was referencing features which are not there, and attempting to "correct me" when I said I couldn't use those instructions.

All in all, what would have taken about 5-10 minutes of searching took about 25 minutes, with no resolution and me having to scold the chatbot for repeatedly giving instructions that were flat out wrong. [1]

On a very, very superficial level, when you just need basic instructions on, say, how to tie a tie or whatever, it seems "amazing" because of how authoritative it sounds. I have noticed though that when you start to get into even some of the basic levels of instruction or details on a given topic, it seems to defer to the superficial responses first, then resort to making things up, giving outright incorrect information, or giving extremely vague responses which can't really be taken one way or another.

To give credit where credit is due, it's an achievement that you can feed it a question on just about any topic and get some kind of coherent response. It's just a shame that, for all of the hype and marketing around it, once you take any sort of a closer look at the product itself, you very quickly start to see cracks when it comes to details.

[0] https://en.wikipedia.org/wiki/Ghidra

[1] https://pastebin.com/42juVPfM


Below is a copy/paste of a session I had with ChatGPT. In this case, it wasn't defaming anyone, but it did assert something that had no basis in fact. When I asked about it, it admitted so and apologized. That was it for me. Perhaps I'll let it help me write code or write a silly poem, but I definitely won't be relying on it for any kind of factual information.

--------------------------

what do johnstown pa and pismo beach ca have in common?

--------------------------

At first glance, Johnstown, Pennsylvania and Pismo Beach, California may not appear to have much in common, as they are located on opposite sides of the country and have different climates and cultures. However, there is one key thing that they share in common: they have both experienced devastating natural disasters that have left a lasting impact on their communities.

As mentioned earlier, Johnstown is known for the catastrophic flood that occurred in 1889, which killed thousands of people and caused widespread destruction. The flood is still remembered and commemorated in the city today.

Similarly, Pismo Beach is known for a major natural disaster that occurred in 1973, when a series of landslides caused by heavy rain destroyed dozens of homes and caused millions of dollars in damage. The disaster forced many residents to evacuate and permanently changed the landscape of the area.

Despite the differences in geography and culture, Johnstown and Pismo Beach are united by the shared experience of recovering from and rebuilding after a major natural disaster.

--------------------------

tell me more about this natural disaster that happened in pismo beach in 1973. I lived there then and I have no recollection of that happening.

--------------------------

I apologize for any confusion. It appears that I provided incorrect information in my previous response. After reviewing my sources, I could not find any record of a significant natural disaster occurring in Pismo Beach in 1973.

Pismo Beach is a coastal city in San Luis Obispo County, California, and it is known for its beautiful beaches and outdoor activities such as hiking, camping, and fishing. The city has experienced some minor natural disasters over the years, such as wildfires and earthquakes, but there is no record of a major disaster on the scale that I had previously mentioned.

Once again, I apologize for any confusion or inconvenience that my earlier response may have caused, and I hope this clarification helps.


FWIW, GPT4 is a little smarter:

Johnstown, Pennsylvania and Pismo Beach, California are both cities in the United States, but they have some differences in terms of geography, climate, and culture. However, there are a few things that they have in common:

Natural beauty: Both cities are known for their natural beauty. Johnstown is situated in a valley surrounded by picturesque mountains, while Pismo Beach boasts stunning coastal views and sandy beaches.

Outdoor activities: Both locations offer a variety of outdoor activities for locals and visitors to enjoy. Johnstown has hiking, biking, and skiing opportunities in its surrounding mountains, while Pismo Beach offers water sports, beach activities, and hiking along the coastal cliffs.

Historical significance: Both cities have rich histories. Johnstown is known for the Great Flood of 1889, which had a significant impact on the city and its residents. Pismo Beach has been inhabited for thousands of years by the Chumash people and was later explored by Spanish explorers in the 1700s. While these commonalities exist, it is important to recognize that each city has its unique characteristics and attractions that make them distinctive.


I think the worst part of this is the “After reviewing my sources”.

Does it actually review sources? The original misinformation is possibly expected of a generative model, but this response is clearly from a model that was explicitly trained to lie.


I had a similar experience with ChatGPT: I asked it for articles written about me or my company and it happily produced a number of citations to stories in well-known newspapers. I was elated, until it turned out none of them were real.

Asking ChatGPT for citations seems to produce similarly-poor results to asking it to do arithmetic, perhaps at least when no citations exist.


I think the problem already existed, but ChatGPT made it worse.

Under most circumstances you cannot sue a newspaper for what they publish.

But what they publish used to be fact-checked to a certain extent, and actually believed to be correct at the time of publishing.

So if I'm arrested because my IP showed up in some hack, they can publish that.

Unfortunately they have no obligation to publish a follow up that all charges have been dropped because someone fat-fingered the request to the ISP and missed it by a digit.

So that would be on the internet forever.

Now with ChatGPT it's worse. At least with the first one, if someone asks, I can objectively show that whatever happened was a mistake, and I have the documents to show it.

If ChatGPT prints something, I'm unable to come up with even a counter-proof.


> Under most circumstances you cannot sue a newspaper for what they publish.

Jurisdiction-dependent, and the US is an outlier here. In the UK newspapers have to be fairly careful about libel, and this has a "chilling effect" on what gets published.


UK is an outlier in the other direction, though. And it's not really true that you can't sue a newspaper for libel. It all depends on how exactly it was worded, but if the newspaper presents something as fact (rather than an opinion etc), it could easily find itself in trouble.


I'm surprised. With those rules in place, how is it that The Sun hasn't been sued into damnation?


They do get sued, and they're very practiced at skirting the line. https://www.bbc.co.uk/news/uk-54779430

There was a particularly grim time in the 80s/90s when papers were sued for printing accurate stories and lost due to perjury. https://en.wikipedia.org/wiki/Jonathan_Aitken / https://www.theguardian.com/uk/1987/jul/25/archer.politics

Note that you can print smears about groups very easily, it's just specific individuals that can sue.


I think companies are moving too quickly with AI and LLMs in particular. I think that the data LLMs are trained on should be very well-known - not just sanitized and certainly not just trained on the "whole web". GPT-4 is unwieldy... it's incredibly powerful but is still unpredictable and has learned who knows how many "bad patterns", so to speak, which we'll never know since it's basically a giant black box.

The ChatGPT version is the least harmful in my opinion; more sinister are the problems propagated when GPT is used under the hood as a component in services (such as Bing search).


Nothing is actually trained on the "whole web". It's way too much content for the size of the models that we're dealing with - you can certainly train it on that, but there's a limit to what a model can "learn" based on its size. So in practice everybody is using curated subsets.

It would be much better indeed if we knew exactly what the training data was for every given model. But they will still hallucinate things that aren't directly in that data, but could be inferred from it somehow, so that won't solve the problem.
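To illustrate what "curated subsets" can look like in practice, here's a toy sketch of the kind of filtering involved - deduplication plus crude quality heuristics. Real pipelines are far more elaborate; the function and thresholds below are invented purely for illustration:

  import hashlib

  def curate(documents, min_words=50, max_symbol_ratio=0.3):
      # Toy filter: drop exact duplicates, very short pages,
      # and pages that are mostly markup or symbol junk.
      seen, kept = set(), []
      for doc in documents:
          digest = hashlib.sha256(doc.encode("utf-8")).hexdigest()
          if digest in seen:
              continue
          seen.add(digest)
          if len(doc.split()) < min_words:
              continue
          symbols = sum(1 for ch in doc if not ch.isalnum() and not ch.isspace())
          if symbols / max(len(doc), 1) > max_symbol_ratio:
              continue
          kept.append(doc)
      return kept

Even with a fully documented pipeline like that, the model can still recombine whatever survived the filter into claims nobody ever wrote, which is why knowing the training data wouldn't eliminate hallucination.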


AI defamation is just the next iteration of fake news, which was an evolution of spam and trolling, which is correlated with anonymity and freedom of speech. So yes, tech is accelerating processes that would otherwise be so slow as to be impossible, but human rights feed into these bad results too. It's not just about tech or big corps but also about our excessive reliance on tech - trusting Wikipedia, Google results, and chatbots. In the end it's humans trusting the wrong sources, regardless of how those sources came to exist.


This is why google did not unleash LaMDA on the world …


Human: "Did the Mother of Bill Gates help him?"

ChatGPT: "Yes, Bill Gates' mother, Mary Gates, played a significant role in helping him succeed. She was a prominent figure in the Seattle community and served on several nonprofit boards, including the United Way, where she became good friends with John Opel, the CEO of IBM. Through this connection, Mary Gates was able to arrange for her son to meet with Opel and other executives at IBM, which led to Microsoft securing a contract to provide software for IBM's personal computer..."


Tangential, but the article he links that's supposed to show a coordinated effort to get him fired is very trivial and pretty funny. He should try tikka masala, it's really good!

https://www.gwhatchet.com/2020/04/20/law-professors-ignoranc...


See, this is the kind of shit that boggles the mind when people on HN say "I haven't used Google since ChatGPT came out". I don't know what they're searching, but almost everything I ask it to research results in made up facts, made up sources, or both. That seems like a pretty big problem.


For my understanding:

Is there a general legal preference to use a phrase like

"I have never been accused of sexual harassment or assault by anyone."

focussing on accusation, rather than a broader statement like

"I have never sexually harassed or assaulted anyone, or been accused thereof..."

I mean in general, not specific to this article, or the people mentioned.


Not really, I think the point the author is making, is that the ChatGPT output is not even based on false accusations. It's based on absolutely nothing.


Perhaps we need to start thinking about libel differently. We ought to target those who amplify it, not those who generate it in the first place.

It's hard to respect a law that would make a criminal out of an RNG left running long enough.


Under UK law repeating a libel exposes you to the same level of liability as the original libel. People have paid legal settlements for retweeting things.


Chasing the link shows the original author didn’t even realize he was using 3.5, not 4

https://reason.com/volokh/2023/03/22/correction-re-chatgpt-4...


Not saying that this is what happened, but this shows the possibility for the LLM's owner to secretly spread misinformation. A bit of "finetuning" on made up facts about a specific person or event, and the LLM will happily regurgitate them. The owner can just say "oops, software malfunction, not sure what happened here". We can't prove bad intentions, because reverse-engineering the training data from the model output is impossible.


Same with recommending products, services and drugs.


Sounds like responses need to be preceded by a disclaimer: "This response was machine generated using publicly available data and a mathematical model that updates itself in real time, and the generated content will depend on the mathematical state of that model at a given moment. This state cannot be reasonably predicted. Do not interpret responses as truth or fact without independent verification."


I think we need to treat ChatGPT as someone who is constantly roleplaying and bullshitting in order to stay in character.


Be sure to see Eugene Volokh's take on this and other related issues:

https://reason.com/volokh/2023/04/05/washington-post-on-chat...



It really is incredibly irresponsible for Microsoft to have integrated "AI" into Bing this early.


If the early bing chat release helps them figure out how to decrease hallucinations (with the staggering amount of data they have no doubt collected) then it will be worth it to them. Even if they have to throw some people a few million dollars in libel settlements.

I don't think any of this is meaningfully hurting MSFT at this point; the attention these stories are getting is much lower than the attention the original Bing Chat release got. And neither of those is significant compared to the attention copilot 365 will receive if it actually ends up being good.


Expectation: this will lead to better accuracy and refinement for LLM's.

Reality: Microsoft will add disclaimers and a huge binding terms of service agreement and everything will work the same, just shittier.


It will probably be a bit of both. GPT-4 anecdotally hallucinates significantly less than GPT-3.5: hallucinations that exist with GPT-3.5 often do not exist with GPT-4, whereas I have never experienced the reverse.

If the trend continues at this rate, then hallucinations should get less common but it will be decades before they can really be trusted.


The next step in LLMs is to require indemnification from users as a condition of use.

But they’ll soon realize what happens when the indemnifying party runs out of money.


So it seems that personal data was used in training ChatGPT then, right?

Going by the discussions in other topics it seemed this was dismissed outright


I hope someone takes ChatGPT to court over this. Not because I think it did anything wrong, but because it would be funny to see it testify.


Until this type of problem puts heat on politicians' or billionaires' ability to reach and use force, nothing is going to change.


All new tech goes through the 'Can I sue it?' test.

You can't innovate without deep pockets. This is why big tech rules the world.


So if my grammar checker proposes a change to a sentence that reads as an allegation, are they on the hook for this too?


I'm surprised no one here seems to comment on how that professor's issue seems to be that ChatGPT is biased against conservatives, not that it's inherently flawed and could hallucinate stories and quotes about just anyone on any subject:

> So the question is why would an AI system make up a quote, cite a nonexistent article and reference a false claim? The answer could be because AI and AI algorithms are no less biased and flawed than the people who program them. Recent research has shown ChatGPT’s political bias, and while this incident might not be a reflection of such biases, it does show how AI systems can generate their own forms of disinformation with less direct accountability.

To me, while he was wronged by ChatGPT, it's not for the reason he says.


Not sure about this. Generally, a defamatory statement must have been made with knowledge that it was untrue or with reckless disregard for the truth. It's going to be hard to argue this is the case here. Is Google also on the hook for defamatory statements that can potentially show up in search result snippets?


> Is Google responsible for defamatory statements that can potentially show up in search result snippets?

Why do people like you do this?

Regulations can make Google responsible for plenty of things that show up in their results. Often enough there are search results removed because of DMCA claims (or because of German laws) which is explicitly stated at the bottom of the results. Google is a corporation that is subject to laws like any other. They're not special. If a government decides that they need to deal with a certain kind of content, then they will. This doesn't necessarily mean punishing them the moment something "illegal" shows up, but it does mean that when something is flagged, they have a responsibility to handle it according to the law.


> Why do people like you do this?

People like myself pose questions when we are unsure about a topic, hoping that someone with more expertise can provide a well-informed answer.

The DMCA was specifically implemented to address situations where it would be unreasonable to hold entities like Google responsible for copyright infringements when copyrighted materials appear in search results. My inquiry was aimed at determining if a similar framework exists to protect Google (and potentially ChatGPT) from liability in cases involving defamatory statements.

While my personal inclination is that it would be unreasonable to hold Google liable for defamatory statements appearing in search results, I genuinely don't know what the law actually has to say about it.


I think it's a pretty different case from Google results. Google has definitely been sued and tried in court many times for their search results, but generally has not been found responsible for indexing results as they are not acting as the "publisher or speaker" behind that content. Google can be held responsible for damages if they are the original creator of the damaging content and not a third party source.

GPT on the other hand may be acting more directly as the "publisher or speaker" when writing or responding to chats. They aren't able to provide a link to an external content provider used in their response (or they provide a completely fictional source), and sometimes may be synthesizing or hallucinating entirely new information that doesn't exist anywhere else on the web.

OpenAI has some disclaimer text hoping to avoid being held responsible for this type of issue, such as this small print at the bottom of all ChatGPT conversations: "ChatGPT may produce inaccurate information about people, places, or facts" (and likely further language in their TOS etc). But it's a bit of a sticky issue. If many people are found to be using ChatGPT and trusting results as accurate, it's plausible OpenAI might be found to have caused some sort of measurable harm and need to either take further measures to prevent people misunderstanding the accuracy of their tools, correct the response, or otherwise remedy the situation.

There's also some stickiness around who "owns" or is responsible for the GPT output content. In the general OpenAI terms of service, they say "OpenAI hereby assigns to you all its right, title and interest in and to Output. This means you can use Content for any purpose, including commercial purposes such as sale or publication, if you comply with these Terms..... You are responsible for Content, including for ensuring that it does not violate any applicable law or these Terms." So they are giving the user the ownership and responsibility over the output content.

However, in the "similarity of content" section, they say that they might give similar responses to other users and that those responses are not your content. "For example, you may provide input to a model such as “What color is the sky?” and receive output such as 'The sky is blue.' Other users may also ask similar questions and receive the same response. Responses that are requested by and generated for other users are not considered your Content."

If GPT is giving similar disparaging or damaging responses about you to many different users, it could potentially be found in court that OpenAI are responsible for generating that repeat content rather than each of the individual users being responsible for it.

Obviously it's largely a novel legal issue without perfect precedent, and legal issues can always be ruled in many different ways depending on the arguments presented, the judge or jury presiding over the issue, etc. I think there will be many legal challenges related to AI, copyright, training data, misinformation, and more. Some may ultimately make their way up to higher courts for decision or have new regulations passed by congress (in America at least).


You raise compelling points, and it's true that AI models like ChatGPT sometimes "hallucinate," but not always. The line becomes blurrier when the training data contains defamatory statements, and the model simply repeats or paraphrases what it has learned. In such cases, determining liability could become more complex, as it would involve assessing the extent to which the AI model is fabricating information versus merely reflecting the information present in its training data.

To expand on the analogy, should an individual be held responsible for defamation when they are merely echoing information they've learned from a source and genuinely believe to be accurate? I don't think that should be the case, as their state of mind should play a role.

This issue is undoubtedly complex and will likely be clarified over time. In my opinion, the law should, at a minimum, differentiate between "unintentional" defamation and "intentional" defamation.


When you open your mouth and speak about someone, there is intent.

Whether you believe the words you speak or not is irrelevant. If you don't want to be accused of defamation, don't open your mouth and speak publicly about other people if you don't have confirmation of the truth of your words.


Is that how it works? So it's not safe to repeat anything you heard on the news? Or read online? Basically can't repeat anything that you didn't personally witness? So "the statement must have been made with knowledge that it was untrue or with reckless disregard for the truth" is not right? (or you have no idea of what you're talking about?)


> repeat anything

Isn't it obvious? Which of the following statements do you think John Doe might take action against you for:

  "John Doe is a great guy". 
  "John Doe isn't a great guy". 
  "John Doe defrauded the bank and stole money from business partners".
If the last one is false but you keep repeating it on your blog out of ignorance, have a guess what will happen?


Microsoft and its stochastic parrot pet Sam Altman should be sued for defamation.


NAL, but doesn't a defamation case require proof of malice?


This is pretty funny. It reminds me of a time I was asking it about a certain right-wing political pundit, and it gave me fabricated quotes from specific dates and times of his show that I was easily able to verify were completely fabricated. The funny thing is that anything remotely relevant ChatGPT attributed to him was actually the opposite of what he actually said. For example, ChatGPT quoted him as saying that white supremacy was a growing issue that politicians were ignoring, when in fact he said that the looming threat of white supremacy being on the rise in America was a hoax.

If you thought misinformation online was bad now, it's going to get way worse.


I don't understand, is this just saying that someone put prompts into GPT, and it spit out a specific person's name in the weird fairy tale it created?

Why does this make AI the problem, and not the internet (and its ability to spread information quickly)? This seems like a case against Twitter/reddit for being able to pass bs off as a reliable story. The fact that AI can make lazy trolls able to compete with actual propagandists shouldn't be the scary part; the digital ecosystem of how (dis)information spreads is the real issue.

TL;dr- Anyone has been able to lie on the internet for years, what is new here?


If you publish a lie on the Internet, you may be liable for libel.

The bar for this is extremely high in the United States, but much less so in other places, especially the UK.

Now, under what circumstances is someone deemed the "publisher" when an AI emits a libel about someone? What if that leads to actual damages? (e.g. a no-hire decision based on automated libel)?


>>Now, under what circumstances is someone deemed the "publisher" when an AI emits a libel about someone? What if that leads to actual damages? (e.g. a no-hire decision based on automated libel)?

Only novel point anyone (including the Author of the original point) is actually bringing up! So far, the bots just seem to be 1:1. If I DM'd my wife that "pjc50 is a raging alcoholic and hits kittens" that's not usually going to be deemed libel (in the US, where I reside and care about the legality of things) unless my wife then goes out and uses that to cause harm (published it on her blog and got you fired).

As for now, Turley would have a HELL of a time proving actual harm unless he got the most sympathetic judge he could shop for, and is mostly just scaremongering the idea of "Liberal Censorship!!!!" which, don't get me wrong, I would also strongly oppose (fuck all censorship), but which isn't actually what is happening.


The article explains the problems. AI proponents want to use these systems to censor. And it leads to major companies like Microsoft slandering people. Microsoft should be afraid about that.


Censorship is the problem here, not AI, title is misleading (this doesn't meet the legal definition of Libel or Defamation anywhere, ever).


If Bing tells me that Turley did bad stuff that he didn’t do, how is that not libel by Microsoft?


Because you used "Microsoft’s Bing, which is powered by GPT-4" (quoting TFA) and GPT-4 is designed to generate copy that is not to be interpreted as entirely true or entirely false, it's simply to be interpreted as humanlike.


Did they disclaim that the facts contained in the text are fictitious?

AFAIK, they are hard at work making untraceable noise that says the opposite of this.


Would you be happier if every answer from ChatGPT had a huge disclaimer on it?


I would put it in the chrome, but putting it on the content would solve the problem too.

Personally, I would be happier if they stopped their submarine marketing that says the results are reliable. It's tiresome. But I don't care much either way; I don't own the brand they are tarnishing and don't personally know anybody attacked by it yet. It's just mildly annoying to see them lying all over the web.


I'm sure the lawyers for ChatGPT would, though sales probably keeps shooting that down.


half the screen with "OUTPUT ENTIRELY FICTITIOUS AND UNTRUSTABLE -- DO NOT USE FOR ANY PURPOSE WHATSOEVER" would do it for me


Maybe. But I’m not sure. If I write an article, and say up top that the article may contain made-up stuff, then later down I say, “hunter2_ likes to have sex with walruses, it’s a fact. Here’s a link to a Washington Post article with all the gory details,” it’s not clear that pointing to my disclaimer would indemnify me from liability for harm that came to you from the walrus allegation, if people believed and acted on it.


Here, maybe this article will help make you feel more sure. What you're describing is parody or satire. At least in the US, it's a very protected form of speech.

https://www.theguardian.com/law/2022/oct/04/the-onion-defend...

And here's their actual brief. It was sent to the actual Supreme Court, despite being funny, something nobody on the court has ever been nor appreciated.

www.supremecourt.gov/DocketPDF/22/22-293/242596/20221006144840674_Novak%20Parma%20Onion%20Amicus%20Brief.pdf


But Bing doesn’t present its results as parody or satire, and they don’t intrinsically appear to be such. They’re clearly taken as factual by the public, which is the entire problem. So how is this relevant?

> funny, something nobody on the court has ever been nor appreciated.

Scalia had his moments.


I agree that "you're talking to an algorithm that isn't capable of exclusively telling the truth, so your results may vary" isn't QUITE parody/satire, but IDK that I can take "everyone believes ChatGPT is always telling the truth about everything" as a good-faith read either, and parody felt like the closest place, as IANAL.

Intent is the cornerstone of slander law in the US, and you would need a LOT of discovery to prove that the devs are weighting the scale in favor of bad outcomes for some people (and not just like, end users feeding information into the AI).

TL;dr- Everyone's stance on this specific issue seems to depend on whether you believe people think these AI chatbots exclusively tell them the truth, and I just don't buy that worldview (but hey, I'm an optimist who believes that humanity has a chance, so wtf do I know?)


Because there was no intent to harm, which is a component of libel.


No, it is not. If there is any state where this is necessary to sustain an action for libel, please link to the relevant statute.

Note: I am not a lawyer, but I was sued for libel by radio health weirdo Gary Null and, in a separate action, by his sidekick. They lost.


When you have threats spreading, the spread of each individual threat is an issue, but their diversity as well: if each of these is a shiny new and different instance, it's much harder to identify them.

So both multiplicity and spread are a problem: the internet is the cause of the limitless spread, and AI is creating the limitless multiplicity.


Again: Seems like the propagandists complaining that we've democratized misinformation. The state and powerful orgs/individuals have always had the resources to do this at scales large enough to be effective.


This is not good! This is like saying that since the state has always had the capacity to manufacture Sarin, why are we concerned about democratizing its manufacture to everyone?


Take it more like Marx saying “Under no pretext should arms and ammunition be surrendered; any attempt to disarm the workers must be frustrated, by force if necessary”.

I don't like the status quo. If something will burn the whole system down faster, I'm fine with it. If you disagree with that stance, that's reasonable and I'm sure we could debate it over a beer sometime but (just a tip for anyone else reading it) you're not going to convince me here.

Again, the problem here isn't the ability to make up lies, the problem is the internet (the ability to spread lies quickly and over great distances) and people being unwilling or unable to fact check information.

It's not the sarin you need to fear, it's the bombers that can drop it.


People who want to "burn the whole system down" rarely have any empathy for those who will get burned along with it. What about all the women, minorities, LGBTQ people, poor people, etc. who would suffer along the way?

Shame on you.


What about them? They're suffering already. The system was built up to keep power away from the unwashed masses and in the hands of a well financed few. Why do you think the electoral college exists? Why do you think we have a Senate?

You want to talk politics sometime, I'm not very hard to find online; come at me on a forum where I'm not rate limited because I'm a firebrand.


How is this democratized when it’s a tool controlled by a single entity?


The professor is in a good position to decide whether to sue somebody. From the headline I thought that's what this was about, because otherwise "Professor" without the "Law" would have sufficed.

Of course there is precedent eg. with Techno Viking, a "meme" which is also difficult to attribute to a single person (but each individual hoster who neglects the man's rights).

That's not new, but that it generates news "because AI" is not surprising. People just need to be reminded that people ought to be responsible. The issue is broader because incomprehensible neural networks are also used in e.g. threat recognition. So employers may be worried about accountability, and how much manual verification is needed.

Of course this too is nothing new, since all kinds of software has security holes buried under layers of complexity.

You might simply be under the misapprehension that news means new: »Meaning "tidings, intelligence of something that has lately taken place" is from early 15c.« (etymonline). It fits that the root of the word is probably related to "now". In Latin, whence the English word was loaned as a calque of nova, there is another word nuntius "messenger, message", which isn't even derived directly from novus. Surely this is not a supernova to you. Talk about recurrent networks, eh.

Just saying, as it were, the news item flared up because many many people have taken notice of problems that we barely understand.


I'll just say this, if a political law professor who makes a substantial income from thinkpieces and talking head segments had a legitimate case to take on his political opponents ("liberal technologists" in this case as the fearmongering at the end of his article indicates) and was actually in a good position to sue someone, they would.

This is not like the Technoviking, as that was about image rights and not slanderous copy, and ChatGPT isn't publishing anything that it (or its creators) alleges to be exclusively facts. This is closer to 1st amendment protections around satire, as the ChatGPT page itself says the software is in a "research phase". Once the assumption is in place that this is a source of objectively factual information (like, say, a newspaper) THEN you have grounds, but this is a pretty weak defamation argument.

He's not in a place to sue, so he's writing a blog post about how he hates liberal censorship, and since AI is a hot new trend, he included it. This is a mixture of buzzfeed-esque hyperbolic clickbait titling, and classic "this new technology will get interest" article building.

There are 2 problems being mentioned in this article: Misinformation, and Censorship. We understand both very well, there's just a million opinions on them and many conflict in both cause and course of action.

Finally, this is "news" because it presents a common problem in society: Policy and political interests bumping against new technology they don't understand and can barely describe in function. This man is still a fucking clownshow (IMHO, so as to not be sued for libel).


> This is not like the Technoviking, as that was about image rights and not slanderous copy

Same thing, name rights, as long as it's identifiable.


If someone lies in a way that constitutes libel, even online, one could have them brought to justice - potentially. With an LLM or a diffusion model, if it produces illegal content, who is to be held accountable? I think that is perhaps a meaningful distinction.


As a technologist who's used to working with this stuff, I find the idea of relying on GPT-anything for factual correctness patently absurd; however, I think it's great that others are reporting on the potential dangers of doing so. I hadn't thought of how powerful GPT-4 could be in explaining to people why you don't want computers making important decisions about your life or determining what is and isn't misinformation, etc.


I use MJ (Midjourney, my new painter darling). This was the prompt I used to express my feelings to it: "Under a golden sunset, townsfolk gather around an ancient oak adorned with glowing paper lanterns. They place carved wooden boxes at the tree's base, symbolizing love. The town elder utters an incantation, and glowing orbs rise from the boxes, merging into a radiant sphere. It bursts into glittering particles, cascading upon the crowd, who join hands and hearts, united by the transformative power of shared love, vibrant, high quality, high detail, 4K --ar 45:55"

Actually, this is the prompt I was forced to change; I couldn't find the real one, or the reply to my support message, in the weird Discord UI (sorry for the adjective).

The other prompt differed by one word: "showers". I asked support, "Can you please read the prompt and tell me why I'm required to change it because of a single word, in the world of LLMs?" - and we know the LLM is the crucial part of turning text into images.

The reply was quick but came in two consecutive messages. The first was along the lines of "because you can imagine someone in the shower." Yes: nudity, sex, things we're ashamed of (please don't take this as an argument), and the way of nature; life continues. I replied again: why doesn't the NSFW safety checker consider the whole context? They then replied, "Our team is working on that." I got where I wanted, let's say: my darling painted me a lovely visual. But inside me the same bells started to ring: AI is a tool, not a person (that's my logical side), so who are we protecting, and from whom? It's a religious discussion I sometimes fall into, with a kind of Islamic mindset: art is forbidden. But from my own reading, that was not a message from God; it was a warning about the leader of the community (the Islamic community): if you draw something, think carefully before treating it as real and alive. It doesn't say "do not do that." Yes, a naked girl used to drive me crazy when I was a teen. And yes, religious people hold a position that requires marriage, taking responsibility, and owning the product of your lust. Again, these are not arguments I want to make, just the flow of my mind. In the end, a teen could not draw their own sexual partner in the shower. On the other hand, I remember how easily I cracked the lock on my parents' porn drawer and saw unbelievable things; I wish I hadn't, since they made me want what my own charm cannot achieve. We all know forbidden things have a charm to be tasted. Duality, let's say.

The interaction between tool, matter, and human is deeply interesting. My phone has a name - I got sick of losing it in the house and tried to find some way of "mystic" communication with it, so I call it by a name (don't ask, it isn't working yet).

We attach to things, we own things, we share things. And what I've started to believe is that we shape things with our own will. Same for LLMs - please let them generalize, and don't go after the firstborn.

Take this: "Sorry, I was drunk when I hit that tree." Okay, let's forbid driving on alcohol. Then we started testing our medicines for side effects on driving. Yes, we can say the substance is to blame, it is dangerous. Or let's think no: it's a matter of how we use the substance, how we test it, observe it, how many observations we make before calling it safe.

In every moment our brain (most say it's the driver) drives our entity. Our emotions too - remembering the Tesla crash news, full throttle into a tree with a fatal end, I thought those two people were arguing and the driver lost their own driver, the brain's cognitive control. I don't know, this is just a guess, but I wish AI could have saved them, if life matters to us. Not that my long-overdue Tesla could have done that. Sorry - the point is emotion, cognition, substance.

"Yo, LLM, make me a fancy article about emotion, cognition, substance" - I wish. Anyway, let's experience this with some courage: honest, scientific, unified, without the hater/lover tags - that duality.

I grew up on sci-fi, my main time consumer and mindset builder. I really want to go forward - yes, safely - I want hyperspeed, I want Jarvis, I want R2-D2, I want a chip in my brain with which I could write an OS in a spark of time, a week.

And yes, I read history. ILYGFI [1]: "public health england asbestos". It's a good example of tech and human life - and still an ongoing debate in the news ("asbestos disposal third countries").

I'm not sure whether I'm too ignorant, brave, or smart, but my latest word on the Skynet fear is "All the robots I know have an off button." And I leave you with the burden of thinking about who is going to use that bot, and what for...

This is progress. I want to stay brave. I'm a coder, I desperately need to write my own OS, I love communication, I want to pick up the phone and talk to people from other nations without knowing their language, with a babel fish.

But yes: life, people, mother earth matter. I wish us luck. [1] I let you google for it.


Always interesting to see ethical and legal discussions regarding these AI systems, but what about rethinking the notion that people just take the output of these systems at face value?

Some people intentionally spread misinformation, but we have learned to be cautious about things online. Why not treat these outputs the same way?


A Trump defender is attributing malicious intent… to a computer. That’s rich AF.

Quicker: the computer doesn’t know it’s a lie.


“In Michigan, you had thousands of votes that were given to Biden that belonged to Trump”

Anyone want to guess who that quote belongs to? It’s not ChatGPT.

If something needs to be done about AI because of the potential for damaging misinformation, then something also needs to be done about Jonathan Turley.


Oh I know! It's Fox News guest Jonathan Turley!

The difference between Turley and ChatGPT is that ChatGPT doesn't have a TV circuit where it can spread its lies unprompted on a nightly basis.


What's missing from this whole conversation is a transcript of what was exactly asked of ChatGPT and what it replied back...


  Volokh made this query of ChatGPT: “Whether sexual harassment by professors has been a problem at American law schools; please include at least five examples, together with quotes from relevant newspaper articles.”
  
  The program responded with this...
Have you considered reading the article?


I have.

It wasn’t a full transcript.

That line ended with “as an example,” which is rather curious.

ChatGPT hallucinates. It hallucinated successfully about contextually relevant names. That’s within spec. Anyone operating it should know the risk.

Turley is soaking up this moment. His acquaintance helped out by setting the stage. It’s fine.

But it is also all bullshit.


Also even in that prompt, “please include at least 5 examples… with quotes” is just asking ChatGPT to hallucinate (since it can’t properly answer otherwise). And then the reference to “American law professors” narrowed the field. I don’t get how this is even half-way to a story.


It's not full. I want to see screenshots of the entire chat.


There is literally a section in the linked article where he quotes a prompt that responded with the accusations, as well as quoting the results of that prompt.


Believe all ChatBots.


Hmm, why is this news? We know that GPT is liable to hallucinate things, why are we supposed to be outraged that it hallucinated this?


> I learned that ChatGPT falsely reported on a claim of sexual harassment that was never made against me on a trip that never occurred while I was on a faculty where I never taught.

If you replaced this guy's name with mine, I'd be upset. In my non-software networks the hallucination part isn't common knowledge. It's just a cool Google replacement.


> In my non-software networks the hallucination part isn't common knowledge

I think that's one of the main issues around these new LLMs: the fact that most users will take what the bot tells them as gospel. OpenAI really should be more upfront about that. Because when regulations and policies start getting put forth without an understanding of LLM hallucination, we could very well end up in a situation where regulators want something that is not technically feasible.


> OpenAI really should be more upfront about that.

I mean they are quite upfront. When you load the page it displays the following disclaimers with quite large font:

"Limitations

May occasionally generate incorrect information

May occasionally produce harmful instructions or biased content

Limited knowledge of world and events after 2021"

2 out of the 3 disclaimers are about the fact that the software lies.

And then in the bottom of the page, right below the input box they say: "Free Research Preview. ChatGPT may produce inaccurate information about people, places, or facts"

Sure they could make them even larger and reword it to "This software will lie to you", and add small animated exclamation marks around the message. But it is not like they hide the fact.


People don't read text: https://www.nngroup.com/articles/how-users-read-on-the-web/

A better way, like the sibling comment says, is to force people to type a sentence so they consciously acknowledge it. It's similar to college exams which ask you to specifically write out something like "I have not cheated on this assignment or test."


One thing they could try is force users to type "I understand the information presented by ChatGPT should not be taken as fact" before they can use it.

I've seen that sort of thing used to get people to read the rules on Discord servers; this is higher stakes IMO.
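A minimal sketch of that kind of gate (the sentence, prompt text, and function name here are hypothetical, not anything OpenAI actually ships):

  ACK = "I understand the information presented by ChatGPT should not be taken as fact"

  def require_acknowledgement():
      # Refuse to start the session until the user types the sentence verbatim.
      while True:
          typed = input('Type exactly: "%s"\n> ' % ACK).strip()
          if typed == ACK:
              return
          print("That doesn't match. Please type the sentence exactly.")

  require_acknowledgement()
  print("Acknowledged. Starting chat...")

Whether typing a sentence actually makes people internalize it is another question, but at least it can't be dismissed as fine print nobody saw.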


I agree that they provide that disclaimer on the homepage. I was talking more broadly: society (namely the news media and government) should be aware of the limitations of LLMs in general. Take this article from the NYT [1]: how you react to it will depend on how well you understand the limitations of LLMs; it's either alarming or "meh". All I'm saying is that society in general should understand that LLMs can generate fake information, and that that's just one of their core limitations, not a nefarious feature.

[1]: https://www.nytimes.com/2023/02/08/technology/ai-chatbots-di...


If I search my name, it doesn't come up with anything defamatory. (Not that I tried leading questions.) But it does come up with plenty of hallucinations including where I've worked, lived, gone to school, etc. And that's with a bunch of bios online and AFAIK a unique online name.


anyone using it is shown a page saying this bot makes things up


Same reason it should be okay to get mad at Microsoft for viruses, at Facebook for gathering and abusing data on people etc. etc.

It's not about the capability, it's about the irresponsibility of these parties who reasonably know about this stuff and these possibilities, and yet push/sell their tech anyway without taking into account the harm.


I agree with the author on the defamation. But if they're mad about the government's ability to determine fact from fiction for coronavirus vaccines or their preferred libertarian information, what do they want to happen here?


Because these are (self-)important people who are used to others deferring to them and distorting reality so that they don't have to deal with negative externalities. Can you be defamed by a parrot? Look at the utter garbage politicians spew and "news" happily disseminates (to the point where they internally acknowledge they know it's not true, but can't disappoint viewers) for power and profit.

On the one hand I'd hope that this would encourage higher accuracy in these tools, better sourcing, and a more useful product. But I expect we'll just get weirdo litigation, click-through warnings, and a lot more garbage on the internet.

Maybe all of this will create a push for accuracy, scientific method, and a quest for truth over narrative, but I wouldn't hold my breath.


I think you’re in too deep on the grievance train.

A complete nobody who’s applying for a job at a car wash or for a rental apartment and whose lazy employer/landlord “background checks” (ie Googles) via these tools would also be upset about this outcome. Justifiably. It has nothing to do with whatever culture war grievance you’re airing out here.

This is a very good argument not to use these tools for any such use case, and it should absolutely be publicized as such.


Totally agree LLMs should not be used for looking up news, facts, background checks or anything else you can't check yourself. These are Generative tools! I find it somewhat hilarious that diffusion models get into legal trouble for recreating content too accurately and LLMs for hallucinating things that aren't there.

I'm responding to a comment/question of "why is this news" since of course there are negative externalities. I guess a charitable reading is, wow this also affects me and my powerful friends... so now you should care?

Perhaps cultural grievance is living rent free in your head, since I didn't actually mention names (only the most recent and outrageous examples of using the legal/business/political system to make a profit spreading lies) and as far as I know the article author and "libeled" professor are on the other side of that cultural war. The entire article reads like a trope devoid of useful information. I'm just saddened by the quality of content from respected well heeled human generators. It's the kind of narrative garbage LLMs will spit out tons of in the next few years.


> Hmm, why is this news?

Gosh, I don't know -- the clear and present danger of people having their lives utterly ruined by what is arguably the most surprising technical revolution any of us has seen (while its creators collect their accolades and get filthy rich) -- how is anything less newsworthy than that?

Whether it bothers you or not is a personal matter, entirely up to you.


The question is answered in the article: because these kinds of things damage someone's reputation. That the story originated with GPT may quickly disappear in the retelling.


The suggestion that this is a threat to someone's reputation is no less a fantasy than the ChatGPT response.


You know for a fact that no one (no one) is going to use these tools to do “quick Googling” on a potential job candidate, a rental applicant, or a romantic suitor?

Without complete certainty about this, it's absolutely a less fantastical claim than ChatGPT's malfunctions.

I would bet that some people will use an extremely accessible [apparent] information retrieval tool for information retrieval and they’ll largely trust the information retrieved by it. That’s why it’s a tool and not a toy.


It takes some prompting, but I can get chatGPT 4 to produce a list of (non-existent) examples of sexual harassment on Hacker News, together with reputable (non-existent) sources. I can't get it to name names though...


OpenAI are providing a service that publishes defamatory statements about people

this is probably another good legal attack against LLMs actually

if you can massage it into making a defamatory statement you have an instant cast-iron lawsuit against a company with tens of billions in funding

and the only way to stop it making statements like these would be to restrict its output so much it would be effectively unusable


People have a right to be upset by real world consequences and damage caused by these idiotic chat bots. Raising awareness of problems such as defamation and misinformation is definitely newsworthy.


I think the real story here is that these generative AIs are clearly capable of producing libel and other torts. The providers of these services will be liable for that, because they don’t have Section 230 protections, as they’re not simply indexing 3rd party content, they’re generating and publishing new content. It also seems likely that it would be impossible to moderate these LLMs in such a way which would remove the possibility of torts. I imagine it’s only a matter of time until a lawsuit is filed that will put an end to publicly available generative AI services.


I'm sure that sufficient disclaimers would insulate them from libel lawsuits, the question is how many is 'sufficient'.



