Tech

New York Times Blocks OpenAI’s GPTbot: Restricting Web Content Access And Opt-Out Solution

Published

9 months ago

August 23, 2023

(CTN NEWS) – Recently, there were revelations that the New York Times is contemplating the possibility of initiating legal measures against OpenAI, the creator of ChatGPT.

The allegation is centered around OpenAI’s AI models purportedly incorporating materials from the New York Times’ website, content that falls under the purview of the newspaper’s intellectual property.

Although no legal action has been taken thus far, the prominent news outlet has opted to proactively prohibit OpenAI’s web crawler from accessing its website’s content.

Consequently, this restriction effectively prevents the utilization of the website’s materials for the training of any of OpenAI’s fundamental AI models.

OpenAI

OpenAI Web Content Access and Opt-Out Initiatives

According to a recent article from The Verge, The New York Times (NYT) has taken steps to prevent OpenAI’s web crawler, known as GPTbot, from scanning and categorizing the content present on their website.

The report draws attention to a specific webpage on the publication’s site that explicitly confirms the bot’s restricted access.

Utilizing the Archive’s Wayback Machine, a tool enabling users to explore web content from previous dates, it has been revealed that the bot’s access was officially blocked on August 17th.

This action follows OpenAI’s implementation of an “opt-out” mechanism for website proprietors who do not wish their website content to be employed in training the company’s AI models.

On August 7th, OpenAI detailed that its GPTbot can be prevented from accessing content by making adjustments on the robot.txt page.

Concurrently, in its blog post, OpenAI underscored the utilization of such content, stating, “Web pages that are traversed by the GPTBot user agent might potentially contribute to enhancing future models.

These pages are carefully screened to eliminate sources demanding payment, collecting personally identifiable information (PII), or containing text that violates our guidelines.”

Web Crawlers in AI Training: Navigating Content Indexing and Social Media Responses

For those who might not be familiar, a web crawler, also referred to as a web spider, is essentially a computer program designed to autonomously explore and index the content present on websites.

It systematically navigates through all the URLs within a website, collecting data to compile its own informational database. In the current landscape, such web crawlers are extensively employed by AI enterprises to train their foundational models.

In recent times, several social media platforms have taken measures to address this trend.

Twitter, for instance, has implemented a temporary tweet rate limit aimed at preventing these web crawlers from extracting content from its platform. Similarly, Reddit has introduced a new API policy with the intent of discouraging the activities of web crawlers.

Nevertheless, OpenAI stands out as one of the rare AI companies providing a straightforward and uncomplicated means to exclude content from the reach of its GPTbot.

In a report by NPR published last week, it was disclosed that The New York Times (NYT) is considering the possibility of pursuing legal action against the creators of ChatGPT.

This development arises from the inability of both parties to come to terms on a licensing arrangement. The proposed agreement centered on OpenAI paying a predetermined sum for utilizing NYT’s articles to train their AI models.

RELATED CTN NEWS:

14-Hour Ordeal Ends: 8 Stranded Cable Car Passengers Rescued In Northwest Pakistan

BRICS Summit 2023: Developing Nations’ Leaders Address Expansion And Global Dynamics

Saudi Arabia Introduces New Family Visit Visa Process Enabling Umrah Pilgrimage For Foreign Residents

CTN News – Chiang Rai Times

New York Times Blocks OpenAI’s GPTbot: Restricting Web Content Access And Opt-Out Solution

Tech

New York Times Blocks OpenAI’s GPTbot: Restricting Web Content Access And Opt-Out Solution

OpenAI Web Content Access and Opt-Out Initiatives

Web Crawlers in AI Training: Navigating Content Indexing and Social Media Responses

CTN News App

Despite TikTok’s Political Turmoil, Tech Platforms Pitch For Ad Deals

Helldivers 2 Controversy Leads To Sony Making PSN FAQ Changes

Kentucky Derby Could Be Wet. Sierra Leone, One Of The Early Favorites, Won In Slop

Star Wars Day: What Is It? Why Is May 4 Celebrated? Here’s What You Need To Know

How many Exhale Wellness Delta 9 Gummies Should I Take?

Enhancing Urban Comfort: The Vital Role of Spray Foam Insulation in Chiang Rai

Interesting features of 1x ক্যাসিনো

Turkiye’s Inflation Reaches 70 Percent, Its Highest Level Since 2022

The Top Benefits of Knowing Live Scores of the Football Game

Russia in Favor of China’s 12-Point Peace Plan to End Ukraine War

“Watch Video” Woman Caught Vaping Aboard Airplane at Chiang Rai Airport

Watch Hareem Shah’s Latest video Leak Scandal

Police Take Down Illegal Gambling Sites With $1.3 Million in Circulation

“The Juice” OJ Simpson Dies After Long Battle with Cancer

Glasses as an Accessory: Expressing Yourself Through Frames

Police Arrest 272 People Sharing Child Abuse Materials on Social Media

WATCH: Iraqi TikTok Star “Om Fahad” Fatally Shot Outside Her Home

From Karachi to Chennai: 19-Year-Old Pakistani Receives Life-Saving Heart Transplant In India

Toyota Pilots EV Revo Pickup Baht Buses in Pattaya Thailand

Japan’s New F35 Aircraft Carrier Kaga Draws Ire from China

“Watch Video” Woman Caught Vaping Aboard Airplane at Chiang Rai Airport

LIVE VIDEO! Austrian Tourist Attacks Taxi Driver in Phuket Over Cigarette

Magnitude 7.2 Earthquake in Taiwan Claims 9 Lives, Injures Over1000

Wildlife Officials to End Wild Monkeys Overtaking Lopburi, Thailand

Recent News

BUY FC 24 COINS

Volunteering at Soi Dog

Find a Job

Free ibomma Movies

CTN News – Chiang Rai Times

New York Times Blocks OpenAI’s GPTbot: Restricting Web Content Access And Opt-Out Solution

OpenAI Web Content Access and Opt-Out Initiatives

Web Crawlers in AI Training: Navigating Content Indexing and Social Media Responses

You may like

CTN News App

Despite TikTok’s Political Turmoil, Tech Platforms Pitch For Ad Deals

Helldivers 2 Controversy Leads To Sony Making PSN FAQ Changes

Kentucky Derby Could Be Wet. Sierra Leone, One Of The Early Favorites, Won In Slop

Star Wars Day: What Is It? Why Is May 4 Celebrated? Here’s What You Need To Know

How many Exhale Wellness Delta 9 Gummies Should I Take?

Enhancing Urban Comfort: The Vital Role of Spray Foam Insulation in Chiang Rai

Interesting features of 1x ক্যাসিনো

Turkiye’s Inflation Reaches 70 Percent, Its Highest Level Since 2022

The Top Benefits of Knowing Live Scores of the Football Game

Russia in Favor of China’s 12-Point Peace Plan to End Ukraine War

“Watch Video” Woman Caught Vaping Aboard Airplane at Chiang Rai Airport

Watch Hareem Shah’s Latest video Leak Scandal

Police Take Down Illegal Gambling Sites With $1.3 Million in Circulation

“The Juice” OJ Simpson Dies After Long Battle with Cancer

Glasses as an Accessory: Expressing Yourself Through Frames

Police Arrest 272 People Sharing Child Abuse Materials on Social Media

WATCH: Iraqi TikTok Star “Om Fahad” Fatally Shot Outside Her Home

From Karachi to Chennai: 19-Year-Old Pakistani Receives Life-Saving Heart Transplant In India

Toyota Pilots EV Revo Pickup Baht Buses in Pattaya Thailand

Japan’s New F35 Aircraft Carrier Kaga Draws Ire from China

“Watch Video” Woman Caught Vaping Aboard Airplane at Chiang Rai Airport

LIVE VIDEO! Austrian Tourist Attacks Taxi Driver in Phuket Over Cigarette

Magnitude 7.2 Earthquake in Taiwan Claims 9 Lives, Injures Over1000

Wildlife Officials to End Wild Monkeys Overtaking Lopburi, Thailand

Recent News

BUY FC 24 COINS

Volunteering at Soi Dog

Find a Job

Free ibomma Movies