Warren Buffett's "Mystery Stock"

Billionaires are plowing millions of dollars into a new technology that early estimates say could generate more wealth than A.I. -- and determine the future of companies like Microsoft. The problem? Most investors have no idea it's happening, or how to profit on it. This expert just went public with all the details, including which stocks to jump on immediately.

AI 'gold rush' for chatbot training data could run out of human-written text

MATT O'BRIEN
June 06, 2024

Artificial intelligence systems like ChatGPT could soon run out of what keeps making them smarter -- the tens of trillions of words people have written and shared online.

A new study released Thursday by research group Epoch AI projects that tech companies will exhaust the supply of publicly available training data for AI language models by roughly the turn of the decade -- sometime between 2026 and 2032.

Comparing it to a "literal gold rush" that depletes finite natural resources, Tamay Besiroglu, an author of the study, said the AI field might face challenges in maintaining its current pace of progress once it drains the reserves of human-generated writing.

In the short term, tech companies like ChatGPT-maker OpenAI and Google are racing to secure and sometimes pay for high-quality data sources to train their AI large language models - for instance, by signing deals to tap into the steady flow of sentences coming out of Reddit forums and news media outlets.

In the longer term, there won't be enough new blogs, news articles and social media commentary to sustain the current trajectory of AI development, putting pressure on companies to tap into sensitive data now considered private -- such as emails or text messages -- or relying on less-reliable "synthetic data" spit out by the chatbots themselves.

"There is a serious bottleneck here," Besiroglu said. "If you start hitting those constraints about how much data you have, then you can't really scale up your models efficiently anymore. And scaling up models has been probably the most important way of expanding their capabilities and improving the quality of their output."

The researchers first made their projections two years ago -- shortly before ChatGPT's debut -- in a working paper that forecast a more imminent 2026 cutoff of high-quality text data. Much has changed since then, including new techniques that enabled AI researchers to make better use of the data they already have and sometimes "overtrain" on the same sources multiple times.

But there are limits, and after further research, Epoch now foresees running out of public text data sometime in the next two to eight years.

The team's latest study is peer-reviewed and due to be presented at this summer's International Conference on Machine Learning in Vienna, Austria. Epoch is a nonprofit institute hosted by San Francisco-based Rethink Priorities and funded by proponents of effective altruism -- a philanthropic movement that has poured money into mitigating AI's worst-case risks.

Besiroglu said AI researchers realized more than a decade ago that aggressively expanding two key ingredients -- computing power and vast stores of internet data -- could significantly improve the performance of AI systems.

The amount of text data fed into AI language models has been growing about 2.5 times per year, while computing has grown about 4 times per year, according to the Epoch study. Facebook parent company Meta Platforms recently claimed the largest version of their upcoming Llama 3 model -- which has not yet been released -- has been trained on up to 15 trillion tokens, each of which can represent a piece of a word.

But how much it's worth worrying about the data bottleneck is debatable.

"I think it's important to keep in mind that we don't necessarily need to train larger and larger models," said Nicolas Papernot, an assistant professor of computer engineering at the University of Toronto and researcher at the nonprofit Vector Institute for Artificial Intelligence.

Papernot, who was not involved in the Epoch study, said building more skilled AI systems can also come from training models that are more specialized for specific tasks. But he has concerns about training generative AI systems on the same outputs they're producing, leading to degraded performance known as "model collapse."

Training on AI-generated data is "like what happens when you photocopy a piece of paper and then you photocopy the photocopy. You lose some of the information," Papernot said. Not only that, but Papernot's research has also found it can further encode the mistakes, bias and unfairness that's already baked into the information ecosystem.

If real human-crafted sentences remain a critical AI data source, those who are stewards of the most sought-after troves -- websites like Reddit and Wikipedia, as well as news and book publishers -- have been forced to think hard about how they're being used.

"Maybe you don't lop off the tops of every mountain," jokes Selena Deckelmann, chief product and technology officer at the Wikimedia Foundation, which runs Wikipedia. "It's an interesting problem right now that we're having natural resource conversations about human-created data. I shouldn't laugh about it, but I do find it kind of amazing."

While some have sought to close off their data from AI training -- often after it's already been taken without compensation -- Wikipedia has placed few restrictions on how AI companies use its volunteer-written entries. Still, Deckelmann said she hopes there continue to be incentives for people to keep contributing, especially as a flood of cheap and automatically generated "garbage content" starts polluting the internet.

AI companies should be "concerned about how human-generated content continues to exist and continues to be accessible," she said.

From the perspective of AI developers, Epoch's study says paying millions of humans to generate the text that AI models will need "is unlikely to be an economical way" to drive better technical performance.

As OpenAI begins work on training the next generation of its GPT large language models, CEO Sam Altman told the audience at a United Nations event last month that the company has already experimented with "generating lots of synthetic data" for training.

"I think what you need is high-quality data. There is low-quality synthetic data. There's low-quality human data," Altman said. But he also expressed reservations about relying too heavily on synthetic data over other technical methods to improve AI models.

"There'd be something very strange if the best way to train a model was to just generate, like, a quadrillion tokens of synthetic data and feed that back in," Altman said. "Somehow that seems inefficient."

------------

The Associated Press and OpenAI have a licensing and technology agreement that allows OpenAI access to part of AP's text archives.

Continue Reading...

Popular
Top 3 Financial Stocks That May Crash This Month

Top 3 Financial Stocks That May Crash This Month

As of June 24, 2024, three stocks in the financial sector could be flashing a real warning to investors who value momentum as a key criteria in their trading decisions.

Ex-Colleagues Of Judge In Donald Trump's Hush-Money Case Say Former President Will Most Likely Face This Sentence

Ex-Colleagues Of Judge In Donald Trump's Hush-Money Case Say Former President Will Most Likely Face This Sentence

Former New York City judges weigh in on potential sentencing outcomes for former President Donald Trump.

Collect 10%+ Dividends From AI's Explosive Growth? - Ad

This one-of-a-kind fund is set to pay out a tsunami of profits to shareholders... Giving you the chance to collect 10%+ dividend yields, year after year! In his brand-new "AI Income Playbook," Marc Lichtenfeld gives you the full scoop on this breakthrough fund... Including its name, ticker symbol, and profit potential.

Putin's Daughters Enter The Limelight As Russian President Considers His Legacy: 'Young Daughters Represent Vitality For Him'

Putin's Daughters Enter The Limelight As Russian President Considers His Legacy: 'Young Daughters Represent Vitality For Him'

Vladimir Putin's daughters are making public appearances as the Russian president contemplates his legacy.

Trump Vs. Biden: Latest Poll Reveals Clear Impact Of Ex-President's Conviction On Voters In Tightly Contested Race

Trump Vs. Biden: Latest Poll Reveals Clear Impact Of Ex-President's Conviction On Voters In Tightly Contested Race

Ahead of the 2024 presidential election, former President Donald Trump has seen a decline in support following his recent conviction.

Where Buffett, Gates, and Bezos Are Investing Now - Ad

Billionaires are plowing millions of dollars into a new technology that early estimates say could generate more wealth than A.I. -- and determine the future of companies like Microsoft. The problem? Most investors have no idea it's happening, or how to profit on it. This expert just went public with all the details, including which stocks to jump on immediately.

Trump's Niece Says Insane To Pardon 'Convicted Criminal Who's Trying To Destroy America': 'Absolutely Terrible Idea'

Trump's Niece Says Insane To Pardon 'Convicted Criminal Who's Trying To Destroy America': 'Absolutely Terrible Idea'

Donald Trump's niece Mary Trump recently slammed the idea of pardoning the ex-president after he was found guilty in the first of his four criminal trials.

Donald Trump Isn't A Convicted Felon Just Yet And Can Still Pursue These Options To Fight Damning Verdict, Says Yale Law Professor

Donald Trump Isn't A Convicted Felon Just Yet And Can Still Pursue These Options To Fight Damning Verdict, Says Yale Law Professor

A Yale Law professor suggested an alternative strategy for former President Donald Trump's legal team following his recent guilty verdict.

"America's No. 1 Retirement Stock" (Name Inside) - Ad

According to the former Goldman Sachs VP -- who wrote a best-selling book on retirement -- one single stock stands head-and-shoulders above all others. And it should be the cornerstone of your portfolio. Have you heard of it?

Dogecoin Trader Who Made $250K Spends It All On Donations, Drugs, Concerts, Tattoos...And Takes Her Last $4K To Buy More

Dogecoin Trader Who Made $250K Spends It All On Donations, Drugs, Concerts, Tattoos...And Takes Her Last $4K To Buy More

A pseudonymous cryptocurrency trader shared a cautionary tale about the dangers of sudden wealth from cryptocurrency and meme coin trading.

States bet on boosting taxes for online sports betting companies like DraftKings, FanDuel

States bet on boosting taxes for online sports betting companies like DraftKings, FanDuel

NEW YORK (AP) — States are looking to increase their take from the $16 billion online sports gambling industry as it expands across the country with big partnerships.

AI Alert: Wealth Window Update - Ad

I'm James Altucher, AI expert at Paradigm Press. On June 25, tech giants like Nvidia, Dell, and Alibaba could reveal "AI 2.0." This limited wealth-building window could turn $10k into $1 million with strategies I've perfected over 40 years. Act fast before everyone catches on!

Singapore Airlines offers compensation to passengers on flight that hit extreme turbulence

Singapore Airlines offers compensation to passengers on flight that hit extreme turbulence

KUALA LUMPUR, Malaysia (AP) — Singapore Airlines said Tuesday it has offered compensation to passengers of a flight that hit extreme turbulence last month, in a rare case that killed one passenger and injured dozens.

Why Is AMD Stock Trading Lower On Monday?

Why Is AMD Stock Trading Lower On Monday?

Morgan Stanley downgrades AMD due to high valuations and competition from Nvidia. However, analyst believes AMD will continue to gain market share.

Ex-CIA Insider Exposes How Dems Could Rig the Election - Ad

In 2016, surveys were giving Hillary Clinton more than 99% chance of winning right up until election night. But right before the election... Former advisor to the CIA, Jim Rickards predicted Trump would win. You won't believe what he's predicting now. And it could have huge implications for the financial markets.

Future of Elon Musk and Tesla are on the line as shareholders vote on massive pay package

Future of Elon Musk and Tesla are on the line as shareholders vote on massive pay package

DETROIT (AP) — If Tesla shareholders approve an all-stock compensation package for CEO Elon Musk , it would almost guarantee he would remain at the company he grew to be the world leader in electric vehicles, shifting to AI and robotics including autonomous vehicles, which Musk says is Tesla's future.

Missile attacks by Yemen's Houthi rebels strike 2 ships in the Gulf of Aden, US military says

Missile attacks by Yemen's Houthi rebels strike 2 ships in the Gulf of Aden, US military says

MANAMA, Bahrain (AP) — Missile attacks by Yemen's Houthi rebels struck two ships in the Gulf of Aden, authorities said Sunday, the latest assaults on shipping in the region.

Seven Unknown AI Stocks That Could Dominate the Next Six Years - Ad

The original "Magnificent Seven" stocks generated 16,800% over the last 20 years. But now a new set of AI stocks is set to take over. Alex Green dubs them "The Next Magnificent Seven." And he's arguing that just $1,000 in each could turn into more than $1 million in less than six years.

Elon Musk Hails 'Diablo' As A 'Hall-Of-Fame' Game After Player Count Surpasses 100 Million

Elon Musk Hails 'Diablo' As A 'Hall-Of-Fame' Game After Player Count Surpasses 100 Million

Elon Musk praised Blizzard Entertainment's "Diablo" series for surpassing 100 million players over its 27-year history.

Congressional Budget Office raises this year’s federal budget deficit projection by $400 billion

Congressional Budget Office raises this year’s federal budget deficit projection by $400 billion

WASHINGTON (AP) — The Congressional Budget Office said Tuesday that it projects this year’s federal budget deficit to be $400 billion higher, a 27% increase compared to its .

The Mysterious Tale of "America's No. 1 Retirement Stock" - Ad

This factory of 53,000 employees, in Burbank, CA, was camouflaged to look like a sleepy suburb (with the help of artists, set designers, and painters from nearby Hollywood movie studios). The mysterious company behind this disappearing act is now being called "America's No. 1 Retirement Stock".

Shiba Inu, Ethereum, Chainlink Flash This 'Long-Term Bullish Signal' Despite Recent Drop

Shiba Inu, Ethereum, Chainlink Flash This 'Long-Term Bullish Signal' Despite Recent Drop

While the cryptocurrency market edged lower Thursday, several of the most popular coins showed a strong bullish indication. What Happened: Ethereum, Shiba Inu, and Chainlink endured significant losses during the day, as indicated below. 

Musk Urges Tesla Shareholders To Vote, Says Too Much Of The Stock Market Is Controlled by ISS And Glass Lewis: 'Zero Economic Alignment With Actual Shareholders'

Musk Urges Tesla Shareholders To Vote, Says Too Much Of The Stock Market Is Controlled by ISS And Glass Lewis: 'Zero Economic Alignment With Actual Shareholders'

Tesla Inc (NASDAQ: TSLA) CEO Elon Musk is urging shareholders to cast their votes on their shares by offering them an opportunity to win a personal tour of the company's Texas facility.

Nvidia Is Pivoting to Solve Big Tech's $1 Trillion Problem - Ad

Nvidia is the hottest company in the world thanks to its chip business. But here's the thing: Nvidia is making a massive $1 trillion pivot ... To solve AI's biggest problem. But it's not making this move by itself. A new set of companies are partnering with Nvidia in this trillion-dollar venture.

Tesla 'Not Going To Grow This Year,' Says Bernstein Analyst: Maintains 'Underweight' Rating While Noting That Elon Musk's Pay Package Approval Will Bring 'Relief Rally'

Tesla 'Not Going To Grow This Year,' Says Bernstein Analyst: Maintains 'Underweight' Rating While Noting That Elon Musk's Pay Package Approval Will Bring 'Relief Rally'

Bernstein analyst Toni Sacconaghi shared on CNBC's Squawk Box that while Elon Musk's new pay package approval led to a positive "relief rally" for Tesla, he maintains an underperform rating on the stock due to expected declines in unit growth and earnings.

What's Going On With BioRestorative Therapies Stock?

What's Going On With BioRestorative Therapies Stock?

BioRestorative Therapies shares are trading higher Monday after the company announced it has received notice from the Nasdaq Stock Market that it has regained compliance with listing requirements.

Don't Pay a Dime for Marc Lichtenfeld's Top AI Picks - Ad

Marc Lichtenfeld's has a brand-new "AI Income Playbook", absolutely FREE! Inside, you'll find Marc's favorite AI dividend stocks... Poised to profit from the fastest-growing technology in history... and pay you bigger and bigger cash dividends along the way! And it's all yours, free of charge.

Massachusetts on verge of becoming second-to-last state to outlaw "revenge porn"

Massachusetts on verge of becoming second-to-last state to outlaw "revenge porn"

BOSTON (AP) — A bill aimed at outlawing “revenge porn” has been approved by lawmakers in the Massachusetts House and Senate and shipped to Democratic Gov. Maura Healey, a move advocates say was long overdue.

General Motors Reveals $6B Stock Buyback - What's Going On?

General Motors Reveals $6B Stock Buyback - What's Going On?

General Motors announces a $6 billion share repurchase plan, aiming to boost shareholder value amidst positive financial performance.

Big Tech Is Spending Billions Each Month on This AI Superproject - Ad

Big Tech is committing billions each month to the construction of these AI mega data centers. This is where Nvidia, comes in. It's quietly leaning on a set of what we call Silent Partners to get the job done. These companies could benefit greatly from this next wave of the AI boom.

What's New In the Consumer Tech World Last Week? News That You Should Know (June 2-June 8, 2024)

What's New In the Consumer Tech World Last Week? News That You Should Know (June 2-June 8, 2024)

Earlier this week, former President Donald Trump made a surprising move by joining TikTok, the social media platform he once sought to ban.

Walmart offers new perks for workers, from a new bonus plan to opportunities in skilled trade jobs

Walmart offers new perks for workers, from a new bonus plan to opportunities in skilled trade jobs

NEW YORK (AP) — is offering new perks for its hourly U.S. workers, ranging from a new bonus plan to opportunities to move into skilled trade jobs within the company.

What Is Nvidia's New $1 Trillion Superproject? - Ad

While most are still focused on Nvidia's recent performance, they're missing this massive, $1 trillion pivot Nvidia's making right now. But it's what's happening behind the scenes that should be most exciting for investors. We've identified three companies that Nvidia needs to lean on to help get the job done.

Boeing Insists On Adherence To Legal Agreement Post-737 MAX Crashes: Report

Boeing Insists On Adherence To Legal Agreement Post-737 MAX Crashes: Report

Boeing reportedly maintains it adhered to a 2021 agreement aimed at avoiding prosecution after the 737 MAX incidents, countering U.S. claims of non-compliance. The company's commitment to transparent engagement highlights its efforts to navigate legal complexities ahead of the Justice Department's crucial July 7 decision on extending prosecution terms.

Does 'Sell In May And Go Away' Work For Bitcoin? This Researcher Crunched The Numbers

Does 'Sell In May And Go Away' Work For Bitcoin? This Researcher Crunched The Numbers

Crypto researcher TradeTheFlow challenged the prevailing notion that summer is a dull period for cryptocurrency markets, particularly Bitcoin (CRYPTO:

This Election Shocker Could Be Worse Than Trump's Conviction - Ad

Former advisor to the CIA, the Pentagon and the White House Jim Rickards just dropped this Trump election bombshell. For the sake of our country...I hope he's wrong. Because if he's right, you need to prepare now.

Edward Snowden Takes A Jab At Elizabeth Warren's Anti-Bitcoin Stance Through The Infamous 'We're All Going To Die' Remark Broadcast On China-Controlled TV

Edward Snowden Takes A Jab At Elizabeth Warren's Anti-Bitcoin Stance Through The Infamous 'We're All Going To Die' Remark Broadcast On China-Controlled TV

Renowned whistleblower and privacy advocate Edward Snowden poked fun at Senator Elizabeth Warren (D-Mass.) for her anti-Bitcoin (CRYPTO: BTC) stance by comparing it with the bizarre perspective of the Chinese Communist Party.

Elliott Management Reportedly Builds $2B Stake In Southwest Airlines, Plans To Push For Change

Elliott Management Reportedly Builds $2B Stake In Southwest Airlines, Plans To Push For Change

Elliott Investment Management, an activist investor, has built a nearly $2 billion stake in Southwest Airlines, and is set to advocate for changes to enhance the airline's underperforming shares.

Could This New Project Top $1T? - Ad

This billionaire has predicted that his new venture - something I'm calling "X-9840" - could become "a trillion dollar company." Today, only three companies in the US are worth more than $2 trillion. Microsoft... Apple...and Nvidia. This has nothing to do with electric vehicles... Self-driving cars, rockets, brain chips, or satellites.

Faking an honest woman: Why Russia, China and Big Tech all use faux females to get clicks

Faking an honest woman: Why Russia, China and Big Tech all use faux females to get clicks

WASHINGTON (AP) — When disinformation researcher Wen-Ping Liu looked into China's efforts to influence using fake social media accounts, something unusual stood out about the most successful profiles.

Trending Now

Information, charts or examples are for illustration and educational purposes only and not for individualized investment management This message contains commercial elements, such as advertising. We only send these offers to those who have opted in to our newsletter. Past performance is not indicative of future results. For these reasons we strongly suggest trading in a DEMO/Simulated account. The information provided by us is for educational and informational purposes only. We make no representations or warranties concerning the products, practices or procedures of any company or entity mentioned or recommended and have not determined if the statements and opinions of the advertiser are accurate, correct or truthful. If you use, act upon or make decisions in reliance on information contained or any external source linked within it, you do so at your own peril and agree to hold us, our officers, directors, shareholders, affiliates and agents without fault.

Copyright finstrategist.com
Privacy Policy | Terms of Service