Saturday, May 17, 2025
LBNN

Baidu restricts Google and Bing from scraping content for AI training

by Simon Osuji
August 28, 2024
in Artificial Intelligence


Chinese internet search provider Baidu has updated its Wikipedia-like Baike service to prevent Google and Microsoft Bing from scraping its content.

This change was observed in the latest update to the Baidu Baike robots.txt file, which denies access to Googlebot and Bingbot crawlers.

According to the Wayback Machine, the change took place on August 8. Previously, Google and Bing search engines were allowed to index Baidu Baike’s central repository, which includes almost 30 million entries, although some target subdomains on the website were restricted.
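For readers unfamiliar with the mechanism, a robots.txt file lists per-crawler rules that well-behaved bots consult before fetching a page. The sketch below uses Python's standard `urllib.robotparser` to show how such rules would be evaluated. The sample rules and the example URL path are assumptions written for illustration; they are not a copy of the actual Baidu Baike file.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules for illustration only; NOT the real Baidu Baike robots.txt.
SAMPLE_ROBOTS_TXT = """\
User-agent: Googlebot
Disallow: /

User-agent: Bingbot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given crawler may fetch the URL under these rules."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # The path below is a made-up example entry, not a real page.
    url = "https://baike.baidu.com/item/example"
    for agent in ("Googlebot", "Bingbot", "ExampleBot"):
        status = "allowed" if is_allowed(SAMPLE_ROBOTS_TXT, agent, url) else "blocked"
        print(f"{agent}: {status}")
```

Under these sample rules, the two named crawlers are denied everywhere while any other crawler falls through to the wildcard entry. Note that robots.txt is advisory: it only restricts crawlers that choose to honour it.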

This action by Baidu comes amid increasing demand for large datasets used in training artificial intelligence models and applications. It follows similar moves by other companies to protect their online content. In July, Reddit blocked various search engines, except Google, from indexing its posts and discussions. Google has a financial agreement with Reddit for data access to train its AI services.

According to sources, Microsoft considered restricting access to its internet-search data for rival search engine operators in the past year, particularly those that used the data for chatbots and generative AI services.

Meanwhile, the Chinese Wikipedia, with its 1.43 million entries, remains available to search engine crawlers. A survey conducted by the South China Morning Post found that entries from Baidu Baike still appear in both Bing and Google search results, possibly because the search engines continue to serve older cached content.

The move comes as developers of generative AI around the world increasingly work with content publishers to secure high-quality content for their projects. For instance, OpenAI recently signed an agreement with Time magazine to access its entire archive, dating back to the magazine’s first day of publication over a century ago. A similar partnership was signed with the Financial Times in April.

Baidu’s decision to restrict access to its Baidu Baike content for major search engines highlights the growing importance of data in the AI era. As companies invest heavily in AI development, the value of large, curated datasets has significantly increased. This has led to a shift in how online platforms manage access to their content, with many choosing to limit or monetise access to their data.

As the AI industry continues to evolve, it’s likely that more companies will reassess their data-sharing policies, potentially leading to further changes in how information is indexed and accessed across the internet.

(Photo by Kelli McClintock)


Tags: ai, content moderation, Google, microsoft, search engine


