<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:media="http://search.yahoo.com/mrss/" xmlns:content="http://purl.org/rss/1.0/modules/content/" version="2.0">
  <channel>
    <title>Aaj TV English News - Technology</title>
    <link>https://english.aaj.tv/</link>
    <description>Aaj TV English</description>
    <language>en-Us</language>
    <copyright>Copyright 2026</copyright>
    <pubDate>Wed, 22 Apr 2026 18:24:10 +0500</pubDate>
    <lastBuildDate>Wed, 22 Apr 2026 18:24:10 +0500</lastBuildDate>
    <ttl>60</ttl>
    <item xmlns:default="http://purl.org/rss/1.0/modules/content/">
      <title>Global news platform block GPTBot web crawler from accessing content</title>
      <link>https://english.aaj.tv/news/30331550/global-news-platform-block-gptbot-web-crawler-from-accessing-content</link>
      <description>&lt;p&gt;&lt;strong&gt;Global news outlets including &lt;em&gt;The New York Times&lt;/em&gt;, &lt;em&gt;CNN&lt;/em&gt;, &lt;em&gt;Reuters&lt;/em&gt; and the &lt;em&gt;Australian Broadcasting Corporation (ABC)&lt;/em&gt; have blocked the tool GPTBot from web crawlers to limit artificial intelligence accessing their original content, reported &lt;em&gt;The Guardian.&lt;/em&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;The New York Times&lt;/em&gt; first blocked the GPTBot on its website. Subsequently, other major news websites, including &lt;em&gt;CNN&lt;/em&gt;, &lt;em&gt;Reuters&lt;/em&gt;, the &lt;em&gt;Chicago Tribune&lt;/em&gt;, &lt;em&gt;ABC&lt;/em&gt; and Australian Community Media (ACM) brands such as the &lt;em&gt;Canberra Times&lt;/em&gt; and the &lt;em&gt;Newcastle Herald&lt;/em&gt;, also disallowed the web crawler.&lt;/p&gt;
&lt;p&gt;OpenAI said in a blog post: “Allowing GPTBot to access your site can help AI models become more accurate and improve their general capabilities and safety,” It has also given instructions on how to disallow the crawler.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;CNN&lt;/em&gt; confirmed to &lt;em&gt;The Guardian Australia&lt;/em&gt; that it has blocked GPTBot across its titles, but the company did not comment on whether the brand plans to take further action about the use of its content in AI systems.&lt;/p&gt;
&lt;p&gt;A &lt;em&gt;Reuters&lt;/em&gt; spokesperson said it regularly reviews its robots.txt and site terms and conditions. “Because intellectual property is the lifeblood of our business, it is imperative that we protect the copyright of our content,” she said.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;The New York Times&lt;/em&gt;’ terms of service were recently updated to make the prohibition against “the scraping of our content for AI training and development … even more clear,” according to a spokesperson.&lt;/p&gt;
&lt;p&gt;As of August 3, its website rules explicitly prohibit the publisher’s content from being used for “the development of any software program, including, but not limited to, training a machine learning or artificial intelligence (AI) system” without consent.&lt;/p&gt;
&lt;p&gt;International news platforms were faced with decisions about whether to use AI as part of news gathering and also how to deal with their content potentially being sucked into training pools by companies developing AI systems.&lt;/p&gt;
&lt;p&gt;Research from OriginalityAI, a company that checks for the presence of AI content, shared this week found that major websites including Amazon and Shutterstock had also blocked GPTBot.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;The Guardian&lt;/em&gt;’s robot.txt file does not disallow GPTBot.&lt;/p&gt;
</description>
      <content:encoded xmlns="http://purl.org/rss/1.0/modules/content/"><![CDATA[<p><strong>Global news outlets including <em>The New York Times</em>, <em>CNN</em>, <em>Reuters</em> and the <em>Australian Broadcasting Corporation (ABC)</em> have blocked the tool GPTBot from web crawlers to limit artificial intelligence accessing their original content, reported <em>The Guardian.</em></strong></p>
<p><em>The New York Times</em> first blocked the GPTBot on its website. Subsequently, other major news websites, including <em>CNN</em>, <em>Reuters</em>, the <em>Chicago Tribune</em>, <em>ABC</em> and Australian Community Media (ACM) brands such as the <em>Canberra Times</em> and the <em>Newcastle Herald</em>, also disallowed the web crawler.</p>
<p>OpenAI said in a blog post: “Allowing GPTBot to access your site can help AI models become more accurate and improve their general capabilities and safety,” It has also given instructions on how to disallow the crawler.</p>
<p><em>CNN</em> confirmed to <em>The Guardian Australia</em> that it has blocked GPTBot across its titles, but the company did not comment on whether the brand plans to take further action about the use of its content in AI systems.</p>
<p>A <em>Reuters</em> spokesperson said it regularly reviews its robots.txt and site terms and conditions. “Because intellectual property is the lifeblood of our business, it is imperative that we protect the copyright of our content,” she said.</p>
<p><em>The New York Times</em>’ terms of service were recently updated to make the prohibition against “the scraping of our content for AI training and development … even more clear,” according to a spokesperson.</p>
<p>As of August 3, its website rules explicitly prohibit the publisher’s content from being used for “the development of any software program, including, but not limited to, training a machine learning or artificial intelligence (AI) system” without consent.</p>
<p>International news platforms were faced with decisions about whether to use AI as part of news gathering and also how to deal with their content potentially being sucked into training pools by companies developing AI systems.</p>
<p>Research from OriginalityAI, a company that checks for the presence of AI content, shared this week found that major websites including Amazon and Shutterstock had also blocked GPTBot.</p>
<p><em>The Guardian</em>’s robot.txt file does not disallow GPTBot.</p>
]]></content:encoded>
      <category>Technology</category>
      <guid>https://english.aaj.tv/news/30331550</guid>
      <pubDate>Fri, 25 Aug 2023 15:13:57 +0500</pubDate>
      <author>none@none.com (Web Desk)</author>
      <media:content url="https://i.aaj.tv/large/2023/08/25104146777e90b.jpg?r=151357" type="image/jpeg" medium="image" height="480" width="800">
        <media:thumbnail url="https://i.aaj.tv/thumbnail/2023/08/25104146777e90b.jpg?r=151357"/>
        <media:title>A response by ChatGPT, an AI chatbot developed by OpenAI, is seen on its website in this illustration picture taken February 9, 2023. Reuters
</media:title>
      </media:content>
    </item>
  </channel>
</rss>
