Perplexity CEO Aravind Srinivas On Comet, Search, & The Future Of AI | Semafor Tech

Perplexity CEO Aravind Srinivas On Comet, Search, & The Future Of AI | Semafor Tech

Introduction to the Interview

Overview of Reed Alberg and the Purpose of the Video

  • Reed Alberg introduces himself as the technology editor at Semaphore, with over 10 years of experience in tech journalism.
  • He explains his initiative to record interviews for YouTube, sharing insights that often get left out of print articles.
  • The video features Arvin Sinovas, co-founder and CEO of Perplexity, discussing their AI search engine and its ambitious goals.

The Comet Browser: A New Operating System for AI?

Features and Capabilities

  • Alberg shares his positive experiences using the Comet Browser during vacation, highlighting its ability to manage tasks like booking flights through AI assistance.
  • Sinovas elaborates on how Comet aims to function as an operating system by handling asynchronous processes without real-time updates.

Technical Insights

  • Sinovas describes essential characteristics of an operating system: state management, background processes, memory management, and user visibility into operations.
  • He emphasizes that a browser can evolve into a mini-computer by connecting seamlessly with other applications on a user's device.

Context Engineering in AI

Integration Across Platforms

  • Sinovas discusses how Comet allows users to interact across various platforms (e.g., iMessage, email), ensuring comprehensive context retrieval for tasks.
  • He highlights the importance of context engineering—AI's ability to orchestrate information from different sources without user blame for platform choices.

Personalization and Intelligence

  • The conversation touches on two critical axes: intelligence (AI capabilities like drafting emails or automating tasks) and personalization (tailoring responses based on user context).

Future Directions in AI Browsers

Evolution Towards Advanced Contextual Understanding

  • Sinovas predicts future iterations (like GPT5 or GPT6), where browsers will serve as ultimate contexts due to their integration into daily life.

Discussion on AI Model Limitations and Future Potential

The Role of the Model and Surrounding Infrastructure

  • The speaker emphasizes that both the model itself and the infrastructure built around it are crucial, noting limitations in current models regarding task capabilities.
  • A comparison is made to earlier versions of products like Perplexity, which utilized GPT-3.5, highlighting how far AI technology has progressed since then.

Evolution of AI Capabilities

  • The current capabilities of Comet are likened to early iterations of Perplexity before the advent of GPT-4, indicating a gradual improvement in model performance.
  • Anticipation for more capable models with better long-context understanding and instruction-following abilities is discussed, suggesting that these advancements will lead to more scalable and affordable solutions.

Strategic Positioning in a Competitive Market

  • The speaker argues against waiting for perfect models before launching products, advocating for proactive development based on expected future improvements in AI technology.
  • They acknowledge competition from major players like Google and OpenAI but express confidence in their ability to release a product that may not be fully refined yet.

User Adoption and Market Dynamics

  • There is an increase in users adopting Comet as their default browser since launch, indicating its readiness to capture market share from established browsers like Chrome and Edge.
  • The focus is on creating a new category of "agentic browsers" rather than competing directly with legacy systems, reflecting a shift away from traditional market dynamics.

Future Vision for Browsers

  • As user interest grows (approaching one million), there’s optimism about the potential for broader acceptance despite initial imperfections.
  • With larger companies entering this space, awareness about agentic queries will likely rise among users, leading them to automate tasks through browser platforms.

Reimagining Browser Functionality

  • The speaker envisions a future where browsers serve as essential operating systems orchestrating various life processes rather than merely facilitating internet consumption.
  • This transformation reflects an aspiration for Comet to become integral to users' daily workflows by streamlining operations traditionally handled manually.

Historical Context and User Habits

  • A parallel is drawn between past internet usage habits (like typing full URLs versus using search engines), suggesting that similar shifts are needed in how people interact with AI today.

The Future of Skills in the Age of AI

The Decline of Traditional Skills

  • Albergardi reflects on the diminishing importance of certain skills, such as mental arithmetic, which are now easily performed by calculators and AI.
  • He notes that recalling sports statistics or trivia used to be impressive but has lost its value due to easy access to information online.
  • The speaker compares current generations' reliance on technology to how previous generations viewed knowledge retention, suggesting a shift in what is considered "cool."

Adapting to New Technologies

  • Albergardi discusses his struggles with outdated methods for finding specific content in videos, emphasizing the need for more efficient tools like AI assistants.
  • He illustrates how AI can streamline tasks such as recruiting by automatically gathering relevant information from LinkedIn profiles.

The Role of Universal AI Assistants

  • The conversation highlights the potential for universal AI agents that can operate across various platforms without needing separate interfaces or custom solutions.
  • Albergardi envisions an integrated experience where users can interact with their work environments seamlessly while receiving assistance from AI.

Proactive Assistance and User Experience

  • There’s a discussion about creating AIs that observe user behavior and suggest improvements without feeling intrusive or spammy.
  • Albergardi emphasizes the challenge of making proactive suggestions feel magical rather than just promotional, which could lead users to ignore future alerts.

Trusting AI with Personal Information

  • Concerns arise regarding trust in AIs handling sensitive information; Albergardi mentions Comet's approach to not requiring passwords directly.

Browser Architecture vs. Traditional AI Agents

Advantages of Browser Architecture

  • The browser architecture allows the AI to interact with the logged-in version of a site without taking over user accounts, enhancing security compared to traditional agents like ChatGPT.
  • Users can utilize password management extensions like 1Password, although there are still some limitations in functionality.

Mobile Compatibility Challenges

  • The discussion highlights that iOS requires WebKit for browsers, posing challenges for implementing certain features on mobile devices.
  • A multiplatform approach using Kotlin could allow code reuse across iOS and Android, but may not be optimal for performance.

Background Process Limitations on iOS

  • On iOS, background processes face restrictions; if an app is in the background, it cannot perform tasks that require constant server-client communication.
  • If tasks exceed a 30-second window, users must keep the app open, which presents usability challenges.

Architectural Solutions and Apple Policies

  • Potential solutions include running a virtual browser in the cloud or advocating for more flexible Apple policies regarding background processes.
  • Apple's concerns about security risks from background apps complicate these discussions; they worry about potential hacking vulnerabilities.

Hardware Considerations and Future Directions

  • The conversation shifts to hardware development as a response to software limitations imposed by platforms like Apple.

Exploring the Limitations of Hardware in AI Contextualization

The Role of Hardware vs. Software

  • The discussion highlights that while various hardware form factors (like earbuds) can provide context about users' daily activities, they do not inherently solve the problem of controlling third-party applications.
  • It is emphasized that hardware merely addresses one aspect—gathering contextual information—while a browser or software solution would be more effective for comprehensive control and interaction with apps.

Privacy Concerns with Contextual Listening

  • There are significant privacy concerns regarding devices that listen to private conversations, which feels intrusive compared to browsing in incognito mode where user controls exist.
  • Browsers offer options for zero retention and query deletion, making them a safer alternative than audio recordings sent to servers, which raises fears about data security.

Future Applications of AI Devices

  • The speaker expresses openness to future AI devices that could assist in professional settings, such as transcribing meetings and centralizing medical records without relying on third-party applications.
  • A concern is raised about why these functionalities cannot simply be integrated into existing smartphone apps rather than requiring separate hardware solutions.

Limitations of Current Mobile Technology

  • The conversation touches on how traditional recording devices have become obsolete due to smartphones, yet some still use them out of nostalgia rather than utility.
  • The speaker clarifies they are not pursuing hardware projects but believes software innovations will enhance the functionality of existing devices.

Challenges in Accessing Third-party Apps

  • Criticism is directed at both Apple and Android ecosystems; while Apple provides SDK access to its native apps, Android's restrictions complicate interactions with essential services like Uber or DoorDash.
  • There’s an acknowledgment that many developers focus solely on app development rather than maintaining mobile web interfaces, which poses challenges for creating effective agentic AI systems.

Vision for Future Development

  • Emphasis is placed on improving mobile browsers and developing hybrid architectures that prioritize privacy and security over new hardware distractions.

Cloud-Based Solutions and Future Directions

Exploring Cloud Integration and Flexibility

  • The discussion begins with the idea of utilizing cloud services, where tasks may be executed in virtual environments or containers, accessed via web interfaces.
  • There is a vision for third-party applications to abstract themselves as backend APIs or MCP servers, indicating a flexible approach rather than adhering to a single operational model.
  • The speaker expresses skepticism towards rigid models like MCP, emphasizing compatibility with various alternatives and the importance of supporting diverse protocols.

User Permissions and Third-Party Interactions

  • Emphasis is placed on user permissions; actions taken by the service are based on explicit user consent, ensuring that third-party servers perceive interactions as if performed directly by users.
  • The speaker suggests that issues faced by third-party servers often stem from their business models rather than limitations in serving users effectively.

Infrastructure Challenges and Cost Considerations

  • Acknowledgment of significant infrastructure rewrites needed to support agents natively due to complex client-server communications required for task execution.
  • Detailed explanation of the intricate processes involved in executing tasks such as placing orders online, highlighting the need for stable infrastructure to manage numerous connection requests.

Evaluation and Progress Tracking

  • Discussion on creating rigorous evaluation suites specifically for agents, which differ from traditional answer evaluations due to the non-deterministic nature of browsing tasks.
  • Importance of automated evaluations is stressed; tracking progress requires adapting infrastructure to new models while forecasting costs per user based on agent queries.

Gradual Rollout Strategy

  • The strategy involves slowly increasing access to new features while monitoring usage patterns among power users to avoid pitfalls experienced by other companies.

Understanding the Role of AI Research in Product Development

The Importance of a Strong AI Research Background

  • The discussion begins with the consideration of whether cheaper models can perform similar tasks, emphasizing ongoing efforts to enhance reliability and scalability.
  • Acknowledgment of the speaker's strong academic background in AI, highlighting that they could have pursued a career as an AI professor, which adds credibility to their insights on model progression and product offerings.
  • The co-founders' backgrounds are noted, particularly one who has experience at Microsoft Bing, contributing valuable knowledge about scalable backend systems and recommendation algorithms.

Problem-Solving Skills Over Technical Expertise

  • The speaker emphasizes that while technical skills in AI research are beneficial, problem-solving and reasoning skills developed during their PhD are more crucial for understanding complex issues.
  • They admit to not being up-to-date with every detail of recent model training techniques but maintain a solid grasp on general principles and user experience challenges.

Triage Process for Bug Resolution

  • The ability to quickly identify the source of issues—whether related to search indexing or query formulation—is highlighted as essential for maintaining product quality.
  • Engaging directly with users helps inform the development process; personal experiences with bugs lead to a deeper understanding of user frustrations.

User-Centric Approach in Product Development

  • Emphasizes the importance of using their own product regularly to identify bugs and improve user satisfaction. Feedback from family members is also considered valuable.
  • Acknowledges that while encountering bugs is acceptable initially, persistent issues after two years indicate a need for improvement.

Challenges Faced by Established Companies like Google

  • Discusses how established companies face unique challenges due to legacy systems filled with numerous unresolved bugs compared to newer competitors focused on innovation.

Sundar's Leadership and Google's Business Strategy

Sundar Pichai's Performance

  • The speaker believes Sundar is performing well in his role, acknowledging the challenges he faces.
  • Credit is given to Thomas Kurian for leading Google Cloud with exceptional efficiency, likening his management style to that of top CEOs.

Financial Overview of Google’s Revenue Streams

  • Google Cloud generates approximately $50 billion annually, while YouTube contributes around $40 billion. Subscriptions across platforms add another $5 to $10 billion.
  • Despite significant revenue, margins are lower compared to search advertising, presenting a risk for the company.

Diversification and Market Strategy

  • The speaker compares Google’s future potential to Microsoft’s diversified business model, which includes various services beyond Windows.
  • Acknowledges that AI-driven search will prioritize user loyalty over advertiser interests, impacting revenue models.

Addressing Margin Losses

  • To counteract declining margins from search advertising, Google must develop additional businesses with reasonable profit margins.
  • Emphasizes that Google's strategy focuses on diversification rather than solely high-margin products.

Insights on Business Management

  • The speaker reflects on the complexities of managing people and organizations as akin to distributed programming in natural language.
  • Effective business management requires authenticity and problem-solving skills rather than just charisma or marketing prowess.

Constraints in Startup Environments

  • Discusses the constraints faced by startups compared to established companies like Google regarding brand recognition and distribution capabilities.
  • Highlights how scaling operations introduces both advantages (credibility with users) and disadvantages (risk of bugs affecting reputation).

Passion for Business Development

  • The speaker expresses a passion for business development acquired through experiences at Google and studying its operational strategies deeply.

User Experience Optimization in Technology

Enhancing User Interaction

  • The discussion highlights the importance of backend optimization, allowing users to access emails while typing passwords, showcasing a seamless experience.
  • Emphasis is placed on anticipating user needs, such as initiating server connection requests when hovering over the assistant button, reflecting a proactive approach to user engagement.

Brand Advertising Inspiration

  • The effectiveness of brand advertising is discussed, particularly referencing a successful ad featuring Lee Jung J from "Squid Games," which received widespread acclaim and millions of views on Instagram.
  • Despite low production costs for the ad, its impact was significant; it resonated well with audiences outside Google’s sphere.

Creative Process Involvement

  • The speaker describes their active involvement in the creative process alongside other executives and team members to ensure the ad felt engaging and game-like.
  • Collaboration led to unique settings inspired by popular shows like "Severance" and "Squid Games," enhancing relatability through familiar cultural references.

Audience Engagement Strategies

  • A humorous reference was included in the ad regarding AI mistakes (e.g., using glue), aiming for relatability while providing correct information.
  • Insights from influencer Mr. Beast emphasized simplifying content for mass appeal; understanding audience IQ levels is crucial for effective communication.

Lessons Learned from Previous Campaigns

  • The speaker reflects on past advertising efforts that were too subtle and failed to engage viewers effectively, learning that clarity and simplicity are key.
  • Simple questions resonate more with audiences; examples include everyday issues rather than complex queries that may alienate potential users.

Social Media Dynamics

  • The speaker discusses their natural engagement style on Twitter without forcing confrontational or meme-driven content, highlighting authenticity as essential.
  • Insights into social media algorithms suggest that tweets with mixed accuracy (50% right/wrong) tend to generate higher engagement compared to purely factual statements.

Journalism vs. Engagement Tactics

M&A Activity and Company Strategy

Clarification on M&A Interests

  • The speaker emphasizes that there has been no leak regarding M&A activity to Mark, stating clearly that they are not aware of any such activities. This is reinforced by an official statement in a related article.
  • The speaker reiterates their disinterest in M&A, highlighting their commitment to ensuring smaller tech companies succeed and continue progressing.

Product Development Insights

  • Following the launch of Comet, the speaker notes a significant increase in user engagement with Perplexity, indicating a multiplier effect where logged-in users perform more queries.
  • The distribution strategy for Comet is described as simpler compared to Perplexity, allowing for various commerce-related functionalities which excite the speaker about this new chapter for their company.

Distribution Strategies Compared

  • A comparison is made between Google's early distribution methods (like Google Toolbar) and current strategies. Google paid desktop software companies to install their toolbar, leading to increased query volumes.
  • The importance of effective distribution is emphasized; traditional methods are deemed outdated against competitors like Google who have established ad networks.

Competitive Landscape Analysis

  • The speaker discusses the competitive risks posed by larger tech companies like Meta and Google. They express concern over how these giants overshadow smaller players in the browser market.
  • Despite acknowledging Meta's potential interest in building a browser, the speaker maintains that they are not currently interested in any M&A opportunities due to perceived risks involved.

Future Directions and AI Integration

  • The conversation shifts towards AI's role within product development. The speaker suggests that competition isn't just about individual companies but also involves navigating powerful incumbents like Google.
  • A discussion on AI applications reveals two main areas: self-driving cars and search engines. A new category emerges involving agents operating within browsers.

User Experience Considerations

  • As technology evolves, there's speculation about user preferences remaining tied to traditional browsing experiences despite advancements in AI-driven automation.
  • Users may still desire manual interaction with digital platforms even when agents can handle tasks efficiently, reflecting a need for human connection alongside technological convenience.

AI and the Future of Work: Augmentation or Replacement?

The Role of Technology in Labor

  • Discussion on how technology is evolving from merely facilitating movement (point A to point B) to performing complex physical labor and digital tasks.
  • Emphasis on AI as an augmentation tool rather than a replacement for human workers, aiming to enhance productivity and free up time for personal connections.

Work-Life Balance with AI

  • The potential for a reduced workweek (3-4 days) due to increased efficiency from AI assistance, allowing more time for family and social interactions.

Business Models and Monetization Strategies

  • Exploration of various monetization strategies for AI products, including subscription models versus ad-supported services.
  • Recognition of the underestimated potential of subscription markets, citing OpenAI's success with millions of paying subscribers generating significant revenue.

Revenue Potential in AI Services

  • Insights into the scalability of AI services, predicting that millions could pay varying subscription fees leading to substantial annual revenues without relying on advertisements.
  • Comparison with existing high-revenue companies like Bloomberg, suggesting that similar models could apply to AI-driven businesses.

Future Trends in Monetization

  • Speculation about new forms of monetization based on task completion by AI agents, potentially rationalizing higher payments compared to traditional human labor costs.
  • Discussion on alternative revenue streams where businesses might pay agents directly for driving commerce, akin to existing platforms like DoorDash or Expedia.

Integration Challenges and Opportunities

  • Mention of ongoing integration efforts with payment providers like PayPal to facilitate commerce through AI platforms such as Perplexity.

Broader Implications for Commerce

Discussion on AI Tools and Market Dynamics

Driving Commerce through AI Tools

  • The conversation begins with the potential of AI tools, like United, to drive additional commerce by offering incentives such as points for flight bookings.
  • There is speculation about users possibly subscribing to multiple AI services (e.g., OpenAI and Perplexity), indicating a trend towards microtransactions in this space.

Subscription Models and Differentiation

  • The speaker notes that while some users may subscribe to various tools, it’s likely that they will eventually choose specific tools based on their unique offerings.
  • Claude is highlighted as a coding tool, while Perplexity aims to be a daily workflow assistant. This differentiation suggests a strategic focus on niche markets within the AI landscape.

Value Proposition of Major Players

  • ChatGPT is described as the default AI tool known widely, similar to how Google operates. The discussion emphasizes the need for value propositions beyond just basic functionality.
  • Over time, subscription models may consolidate around core elements where certain tools excel compared to others.

Future Opportunities in AI

  • The speaker expresses skepticism about any single tool being able to do everything exceptionally well but acknowledges significant revenue opportunities in search and web-based actions.
  • A multi-hundred billion dollar business model is anticipated from subscriptions alone, suggesting immense growth potential in the sector.

Trillion Dollar Aspirations

  • There's an assertion that building a trillion-dollar company is feasible if one can capture substantial market share; historical context indicates it's been years since a new company reached this milestone.
  • OpenAI's founding year (2015) is mentioned, clarifying misconceptions about its age relative to other tech giants.

Nonprofit vs. For-Profit Dynamics

  • Discussion touches upon OpenAI's transition from nonprofit origins and its implications for future operations alongside Microsoft.
  • The importance of maintaining brand association with "AI" is emphasized as crucial for ongoing success in the field.

Impressive Developments by Competitors

  • Grock's rapid development trajectory under Elon Musk's leadership showcases efficiency and urgency in creating competitive models without excessive spending.
  • Incremental improvements across Grock versions are noted as key factors contributing to their growing reputation in the industry.

Who Will Dominate the AI Landscape?

Key Players in AI Development

  • The discussion begins with speculation on which companies will lead in AI, mentioning Google XAI and Meta as significant players trying to catch up.
  • Main contenders identified include OpenAI, Anthropic, Google XAI, and others. The speaker emphasizes that these are the primary players shaping the future of AI.
  • Notably, models like GPT and Gemini are highlighted for their developer mind share; however, XAI is noted for its growing capabilities despite lower API usage compared to other platforms.

Emerging Competitors

  • Alibaba's models (referred to as "Queen") and Deep Seek are acknowledged as underrated competitors in the AI space due to their rapid development and multiple model offerings.
  • The speaker points out that Chinese models are strong contenders because they are open source and can be utilized effectively by developers.

Market Dynamics and User Adoption

  • Insights into user behavior reveal that market dynamics dictate which models gain traction; if a model isn't effective, users will switch to alternatives.
  • The company has visibility into model usage trends among users, allowing them to focus on supporting popular models rather than those with low adoption rates.

Challenges of Competing with Established Models

  • Users often become accustomed to existing applications like ChatGPT, making it challenging for new entrants to attract them unless they offer exceptional value or unique features.
  • Differentiation is crucial; new apps must provide compelling use cases beyond novelty to compete against established leaders like OpenAI.

Future Directions in AI Strategy

  • The conversation touches on the need for innovation beyond chatbots—emphasizing broader applications such as browsers—to enhance user experience significantly.
Video description

Perplexity co-founder and CEO Aravind Srinivas sat down with Semafor technology editor Reed Albergotti for a wide-ranging conversation on the company's new Comet browser, the state of the AI race, and more. Read more: https://www.semafor.com/article/08/06/2025/why-perplexitys-ceo-couldnt-wait-for-perfection-to-launch-an-operating-system-for-the-ai-ecouldnt-wait-for-perfection-to-launch-an-operating-system-for-the-ai-era Sign up to get Semafor Tech in your inbox for free, twice every week: https://www.semafor.com/newsletters/tech Sample the latest edition: https://semafor.com/newsletters/technology/latest 0:00 Welcome to Semafor Tech 1:05 Comet browser first impressions 11:20 Old habits in the AI age 22:26 Will Perplexity get into hardware? 36:35 Benefits of a background in AI research 49:43 How Perplexity approaches marketing 1:02:45 Subscription vs. ad-supported service 1:10:34 "Why would we sell the company now?" #Technology #AI #Perplexity