
Reddit to Update Web Standard to Block Automated Data Scraping From Its Website

Reddit said that researchers and organizations such as the Internet Archive will continue to have access to its content for non-commercial use.

Highlights
  • AI startups have reportedly been bypassing rules to gather content
  • Reddit said that it would update the Robots Exclusion Protocol
  • The platform will also block unknown bots and crawlers from data scraping

AI firms have been accused of plagiarizing content from publishers

Photo Credit: Reuters

Social media platform Reddit said on Tuesday it will update a Web standard used by the platform to block automated data scraping from its website, following reports that AI startups were bypassing the rule to gather content for their systems.

The move comes at a time when artificial intelligence firms have been accused of plagiarizing content from publishers to create AI-generated summaries without giving credit or asking for permission.

Reddit said that it would update the Robots Exclusion Protocol, or "robots.txt," a widely accepted standard meant to determine which parts of a site are allowed to be crawled.
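
As a rough illustration of how a well-behaved crawler consults such a file, the sketch below uses Python's standard `urllib.robotparser` module. The robots.txt contents, the bot names and the URL are all hypothetical and are not Reddit's actual rules. Compliance is voluntary: the file only states a site's wishes, which is why crawlers that ignore it can still fetch pages.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for illustration only; not Reddit's actual file.
ROBOTS_TXT = """\
User-agent: ExampleResearchBot
Allow: /

User-agent: *
Disallow: /
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if the given robots.txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "https://www.example.com/r/python/"
    print(is_allowed("ExampleResearchBot", page))  # named bot: allowed
    print(is_allowed("UnknownScraper", page))      # falls under "*": disallowed
```

In this sketch the named bot is granted access while every other user agent falls through to the catch-all `*` group and is refused.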

The company also said it will maintain rate limiting, a technique used to control the number of requests from one particular entity, and will block unknown bots and crawlers from data scraping (collecting and saving raw information) on its website.
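
Rate limiting of this kind can be sketched in a few lines. The example below is one common approach, a sliding-window counter per client; it is an assumption for illustration and says nothing about how Reddit actually implements its limits. The class name, parameters and client identifiers are all hypothetical.

```python
import time
from collections import defaultdict, deque


class SlidingWindowLimiter:
    """Allow at most max_requests per client within any window_seconds span."""

    def __init__(self, max_requests: int, window_seconds: float, clock=time.monotonic):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self.clock = clock  # injectable so the limiter can be tested without sleeping
        self._hits = defaultdict(deque)  # client id -> timestamps of accepted requests

    def allow(self, client_id: str) -> bool:
        now = self.clock()
        hits = self._hits[client_id]
        # Discard timestamps that have aged out of the window.
        while hits and now - hits[0] >= self.window_seconds:
            hits.popleft()
        if len(hits) < self.max_requests:
            hits.append(now)
            return True
        return False  # over the limit; a server would typically answer HTTP 429


if __name__ == "__main__":
    limiter = SlidingWindowLimiter(max_requests=2, window_seconds=10.0)
    for attempt in range(3):
        print(attempt, limiter.allow("crawler-1"))
```

Each client is tracked separately, so one entity exhausting its allowance does not affect requests from another.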

More recently, robots.txt has become a key tool that publishers employ to prevent tech companies from using their content free of charge to train AI algorithms and create summaries in response to some search queries.

Last week, a letter to publishers by the content licensing startup TollBit said that several AI firms were circumventing the web standard to scrape publisher sites.

This follows a Wired investigation which found that AI search startup Perplexity likely bypassed efforts to block its Web crawler via robots.txt.

Earlier in June, business media publisher Forbes accused Perplexity of plagiarizing its investigative stories for use in generative AI systems without giving credit.

Reddit said on Tuesday that researchers and organizations such as the Internet Archive will continue to have access to its content for non-commercial use.

© Thomson Reuters 2024



Further reading: Reddit, AI
 