AI Tech Giants Actively Looking to Acquire AI Training Data

He shared insights into the high demand for Photobucket's content, with some potential buyers expressing interest in quantities surpassing the platform's available inventory. (Credits: Photobucket)

During its heyday in the early 2000s, Photobucket reigned as the premier image-hosting platform globally.

Serving as the media backbone for once-popular platforms like Myspace and Friendster, it amassed 70 million users and held nearly half of the U.S. online photo market share.

However, its user base has dwindled significantly over time, with only 2 million users remaining today, as reported by analytics tracker Similarweb.

The emergence of generative AI technology presents a potential revival opportunity for Photobucket. (Credits: Future Publishing)

CEO Ted Leonard, overseeing the 40-strong company from Edwards, Colorado, disclosed to Reuters that discussions are underway with various tech firms regarding licensing Photobucket’s vast repository of 13 billion photos and videos.

These assets would be utilized to train generative AI models capable of generating new content based on text prompts.

Leonard indicated that negotiations involve pricing structures ranging from 5 cents to $1 per photo and over $1 per video.

Prices are subject to significant variation depending on the buyer’s requirements and the types of imagery sought.

Biggest Tech Players (Credits: Digital Information World)

While Photobucket refrained from divulging the identities of potential buyers, citing commercial confidentiality, these ongoing negotiations shed light on the burgeoning data market that accompanies the race to dominate generative AI technology.

Nonetheless, they face legal challenges from copyright holders disputing the use of their content.

Major tech players such as Google, Meta, and Microsoft-backed OpenAI initially relied on freely available internet data to train generative AI models like ChatGPT, asserting that such practices are lawful and ethical. (Credits: Huzaifa Abedeen)

Simultaneously, these tech giants engage in discreet transactions to acquire content typically inaccessible behind paywalls or login screens.

This clandestine trade encompasses various data sources, including chat logs and forgotten personal photos from obsolete social media platforms, underscoring the complexity and secrecy surrounding the data economy fueling generative AI advancements.

Nate O'Hara
Nathan is a seasoned commerce writer with a passion for unraveling the intricacies of the business world and distilling them into engaging narratives. During his academic journey, he delved deep into subjects like economics, marketing, and entrepreneurship, honing his analytical skills and developing a keen understanding of market dynamics.