Connect with us

News

The New York Times’ AI copyright lawsuit shows that forgiveness might not be better than permission

Published

on

The New York Times Building, Midtown Manhattan, NYC (Photo By Ajay Suresh from New York, NY, USA, CC BY 2.0)

The New York Times’ (NYT) legal proceedings against OpenAI and Microsoft has opened a new frontier in the ongoing legal challenges brought on by the use of copyrighted data to “train”, or improve generative AI.

There are already a variety of lawsuits against AI companies, including one brought by Getty Images against StabilityAI, which makes the Stable Diffusion online text-to-image generator. Authors George R.R. Martin and John Grisham have also brought legal cases against ChatGPT owner OpenAI over copyright claims. But the NYT case is not “more of the same” because it throws interesting new arguments into the mix.

The legal action focuses in on the value of the training data and a new question relating to reputational damage. It is a potent mix of trade marks and copyright and one which may test the fair use defences typically relied upon.

It will, no doubt, be watched closely by media organisations looking to challenge the usual “let’s ask for forgiveness, not permission” approach to training data. Training data is used to improve the performance of AI systems and generally consists of real world information, often drawn from the internet.

The lawsuit also presents a novel argument – not advanced by other, similar cases – that’s related to something called “hallucinations”, where AI systems generate false or misleading information but present it as fact. This argument could in fact be one of the most potent in the case.

The NYT case in particular raises three interesting takes on the usual approach. First, that due to their reputation for trustworthy news and information, NYT content has enhanced value and desirability as training data for use in AI.

Second, that due to its paywall, the reproduction of articles on request is commercially damaging. Third, that ChatGPT “hallucinations” are causing reputational damage to the New York Times through, effectively, false attribution.

This is not just another generative AI copyright dispute. The first argument presented by the NYT is that the training data used by OpenAI is protected by copyright, and so they claim the training phase of ChatGPT infringed copyright. We have seen this type of argument run before in other disputes.

Fair use?

The challenge for this type of attack is the fair use shield. In the US, fair use is a doctrine in law that permits the use of copyrighted material under certain circumstances, such as in news reporting, academic work and commentary.

OpenAI’s response so far has been very cautious, but a key tenet in a statement released by the company is that their use of online data does indeed fall under the principle of “fair use”.

Anticipating some of the difficulties that such a fair use defence could potentially cause, the NYT has adopted a slightly different angle. In particular, it seeks to differentiate its data from standard data. The NYT intends to use what it claims to be the accuracy, trustworthiness and prestige of its reporting. It claims that this creates a particularly desirable dataset.

It argues that as a reputable and trusted source, its articles have additional weight and reliability in training generative AI and are part of a data subset that is given additional weighting in that training.

It argues that by largely reproducing articles upon prompting, ChatGPT is able to deny the NYT, which is paywalled, visitors and revenue it would otherwise receive. This introduction of some aspect of commercial competition and commercial advantage seems intended to head off the usual fair use defence common to these claims.

It will be interesting to see whether the assertion of special weighting in the training data has an impact. If it does, it sets a path for other media organisations to challenge the use of their reporting in the training data without permission.

The final element of the NYT’s claim presents a novel angle to the challenge. It suggests that damage is being done to the NYT brand through the material that ChatGPT produces. While almost presented as an afterthought in the complaint, it may yet be the claim that causes Open AI the most difficulty.

This is the argument related to AI “hallucinations”. The NYT argues that this is compounded because ChatGPT presents the information as having come from the NYT.

The newspaper further suggests that consumers may act based on the summary given by ChatGPT, thinking the information comes from the NYT and is to be trusted.

buy zetia online https://resmedfoundation.org/images/board/jpg/zetia.html no prescription pharmacy

The reputational damage is caused because the newspaper has no control over what ChatGPT produces.

This is an interesting challenge to conclude with. “Hallucination” is a recognised issue with AI generated responses and the NYT is arguing that the reputational harm may not be easy to rectify.

The NYT claim opens a number of lines of novel attack which move the focus from copyright on to how the copyrighted data is presented to users by ChatGPT and the value of that data to the newspaper. This is much trickier for OpenAI to defend.

This case will be watched closely by other media publishers, especially those behind paywalls, and with particular regard to how it interacts with the usual fair use defence.

If the NYT dataset is recognised as having the “enhanced value” it claims to, it may pave the way for monetisation of that dataset in training AI rather than the “forgiveness, not permission” approach prevalent today.The Conversation

Peter Vaughan, Senior Lecturer, Nottingham Law School, Nottingham Trent University

This article is republished from The Conversation under a Creative Commons license. Read the original article.

Continue Reading
Click to comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Maria in Vancouver

Headline10 hours ago

The Sobering Reality of Growing Old

Growing old brings a sobering reality: time is finite.  You watch your body slow down, see your parents age, and...

Lifestyle3 weeks ago

Dr. David Suzuki’s Legacy: A Celebration at 90

Celebrating Dr. David Suzuki’s 90th birthday on Friday, May 22  was a true privilege and a great pleasure! My husband,...

Lifestyle4 weeks ago

What I Know Now About Motherhood

Did you know that a mother’s cells can live in her child’s body for their entire lives? This fascinating phenomenon...

Headline2 months ago

Age with Audacity

At 25, I imagined life at 50 would mean I’d be past my prime and grumpy.  Little did I know,...

Lifestyle2 months ago

Spring Clean Your Body, Mind and Home

Spring has sprung! This season is perfect for spring cleaning, but why stop at our homes?  We can also rejuvenate...

Lifestyle3 months ago

Hear Us Roar

There is absolutely nothing wrong with a woman who wants her happily ever after. I certainly did. After 21 years...

Lifestyle3 months ago

The Real Rich

Margaret Atwood aptly captured this dynamic with the phrase, “Old money whispers, new money shouts.”  Let me elaborate on this...

Headline4 months ago

Love in the Afternoon of Life

Love in later life—the 50s, 60s, 70s, and beyond—is a thriving, fulfilling reality. It offers companionship, improved well-being, and joy,...

Headline4 months ago

Your Most Important Relationship is With Yourself

Valentine’s Day shouldn’t be celebrated only for one day. Love should be celebrated everyday. Valentine’s Day, when expanded beyond romance,...

Headline5 months ago

The 2016 Trend Made Me Reflect On My Past & Present

Like many others, I couldn’t resist joining the 2016 throwback trend.  It was all over social media, with everyone sharing...