2024-07-19 13:28:52
Technology
Artificial Intelligence

The Data That Powers A.I. Is Disappearing Fast

A study by the M.I.T.-led Data Provenance Initiative reveals that A.I. models heavily rely on web data, but a significant portion of it is now restricted due to websites using the Robots Exclusion Protocol and terms of service to prevent data harvesting.

This poses challenges for A.I. companies, researchers, and academics.

The study found that 5% of data and 25% of high-quality data in three major datasets are now restricted. The dwindling availability of web data highlights the need for alternative sources and approaches in A.I. development. Additionally, the article discusses the environmental impact of artificial intelligence, with the energy and water consumption of data centers supporting A.I. contributing to significant emissions. Efforts to decentralize A.I. through blockchain technology are also highlighted as a potential solution to privacy concerns and regulatory barriers.

EL PAÍS
July 19, 2024 at 03:30

The giants of AI advise our governments, but they are judge and party

Technology
Politics
Countries are asking the major Artificial Intelligence companies to enlighten us, which represents a conflict of interest. We are in an era where artificial intelligence is ubiquitous. Nothing else is talked about and there is not a day that goes by without some spectacular announcement. In the last 10 years, these increasingly sophisti..
EL PAÍS
July 19, 2024 at 03:00

The Natural Footprint of Artificial Intelligence

Technology
Environment
Artificial intelligence (AI) has a considerable physical and environmental footprint; the data centers that support it absorb monumental energy resources, and the developing companies have doubled their energy and water consumption, as well as their carbon emissions, in recent years.
The Defiant
July 19, 2024 at 07:53

AI Systems Are Headed for Major Roadblocks Unless Decentralization Is Adopted

Technology
Cryptocurrencies & blockchain
AI development dates back to the 1950s, when John McCarthy coined the term and invented the LISP programming language. Today, personalized AI systems like Apple's are ubiquitous, but privacy concerns, such as data leaks and the bans on ChatGPT imposed by Samsung and Italy, threaten this technological revolution. Decentralizing AI through blockchain could overcome these hurdles and ensure continued growth.
New York Times - Technology
July 19, 2024 at 14:38

The Data That Powers A.I. Is Disappearing Fast

Technology
A study by the M.I.T.-led Data Provenance Initiative found that the web data A.I. models rely on is rapidly becoming unavailable: 5% of all data and 25% of high-quality data in three major datasets (C4, RefinedWeb, Dolma) are now restricted. Websites use the Robots Exclusion Protocol and terms of service to prevent data harvesting, and up to 45% of the C4 dataset is restricted. This poses challenges for A.I. companies, researchers, and academics, according to the study's lead author, Shayne Longpre.
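
For readers unfamiliar with the Robots Exclusion Protocol mentioned above, the following is a minimal sketch of how a crawler might check a site's robots.txt rules before collecting a page. It uses Python's standard urllib.robotparser module. The robots.txt content, the crawler names ("ExampleAIBot", "SomeOtherBot"), and the URL are hypothetical and chosen only for illustration; they are not taken from the study.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt (an assumption for illustration): it blocks one
# made-up AI crawler from the whole site and allows every other crawler.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt rules permit user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() accepts the robots.txt content as an iterable of lines,
    # so no network request is needed for this sketch.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "https://example.com/articles/1"  # hypothetical URL
    for agent in ("ExampleAIBot", "SomeOtherBot"):
        verdict = "allowed" if is_allowed(ROBOTS_TXT, agent, page) else "restricted"
        print(f"{agent}: {verdict}")
```

Compliance with robots.txt is voluntary on the crawler's side, which is why, as the summary notes, websites also rely on terms of service to restrict data harvesting.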

InfoBud.ai

infobud.ai is an AI-driven news aggregator that simplifies global news, offering customizable feeds in all languages for tailored insights into tech, finance, politics, and more. Drawing on a diverse range of news sources, it provides precise, relevant news updates that overcome the limitations of conventional search tools, focusing entirely on the facts without influencing opinion.

Your World, Tailored News: Navigate The News Jungle With AI-Powered Precision!