The End Of The Internet As We Know It?
The End of the Open and Free Internet
In this section, the speaker discusses how Reddit's decision to charge for their API is just the beginning of a trend that will lead to the end of an open and free internet. This is due to large language models requiring vast amounts of high-quality data, which companies are becoming increasingly protective of.
The Cost of APIs
- Reddit's decision to increase the cost of their API from free to paid has caused third-party developers like Apollo to shut down.
- Twitter also implemented a similar pricing model, and more companies are expected to follow suit.
- Companies with proprietary data are becoming highly siloed as they realize the value of their unique data sets.
Large Language Models
- Large language models require vast amounts of high-quality and structured data for training.
- Unique data sets give companies like OpenAI a competitive advantage in building better models.
- Artificial intelligence companies are scrambling to get their hands on better and more unique data.
Shutting Down APIs
- Websites with vast amounts of unique data like Reddit, Stack Overflow, and Twitter are shutting down their APIs and becoming highly isolated.
- This is different from 15 years ago when companies realized that opening up their APIs would create incredible ecosystems built by third-party developers.
Negative Impact on Users
- As apps shut down their APIs, functionality built by third-party engineers will decrease rapidly, hurting consumers.
- Users contribute valuable content but do not benefit from it as companies own the data completely.
A Better Model?
- There is an opportunity for a better model where end-users get paid for their contributions to data sets.
- Companies are not incentivized to legally allow users to own the data completely.
- Elon Musk plans on sharing revenue with content creators on Twitter, and this could be a great first step.
Open AI and Reddit's Data Set
In this section, the speaker discusses the theory that Open AI may be using Reddit's data set as a competitive advantage against other large language models. The speaker also talks about the potential consequences of companies becoming less incentivized to share their valuable data sets with third-party developers.
Open AI and Reddit's Data Set
- Open AI is founded and run by Sam Altman who is deep in the Y combinator Community.
- Dimitri theorizes that Reddit might be already sharing their data with open Ai and Sam Altman has convinced reddit's Founders to close down their API so that only open AI has access to that incredibly valuable data set.
- The open and free internet that we've all enjoyed over the last 10 15 maybe 20 years is starting to close down as data sets become more valuable.
- Companies are less incentivized to share those data sets with third-party developers to build incredible functionality on top of it.
Future of the Internet
In this section, the speaker talks about what needs to be done in order for entrepreneurs to come along and build business models that reward users for their content contributions. The speaker also discusses how having a thriving open source ecosystem will be critical in competing against closed-source artificial intelligence companies.
Future of the Internet
- There is a ripe opportunity for entrepreneurs to come along and build business models that reward users for their content contributions.
- The Reddit blackout is great but it may not do much.
- Having a thriving open source ecosystem to compete against closed-source artificial intelligence companies is going to be critical.