Claude Cowork: a small taste of AGI

Claude Cowork: a small taste of AGI

Cloud Code: A Personal Confession

Personal Use Cases of Cloud Code

  • The speaker confesses to using Cloud Code daily, primarily for personal tasks rather than traditional coding.
  • They utilize it for various activities such as analyzing iMessage history and organizing files on their computer.
  • The speaker appreciates how Cloud Code enhances productivity beyond just codebase modifications.

Anthropic's New Product Release

  • The introduction of a new product by Anthropic is highlighted, featuring a question UI and to-do list functionalities.
  • Notably, the product can take over Chrome and generate presentations, despite Anthropic not having released an image model yet.
  • The speaker expresses optimism about the product but shares frustrations regarding installation issues.

Hiring Solutions with G2I

Efficient Hiring Process

  • G2I is introduced as a sponsor that simplifies hiring processes for tech teams.
  • Their network consists of 8,000 experienced engineers familiar with modern AI tools, ensuring quick onboarding.
  • G2I promises rapid results, aiming to have candidates ready to file pull requests within seven days.

Installation Challenges with Anthropic's App

User Experience Issues

  • The speaker recounts difficulties encountered while installing the app, including browser compatibility problems.
  • Frustration arises from multiple web views opening during the sign-in process without proper redirects or usability features.

Critique of Anthropic's Engineering Capabilities

  • Despite acknowledging the effectiveness of Claude and Opus, the speaker criticizes Anthropic’s engineering skills based on their app performance.
  • They highlight ongoing issues with signing in and general app functionality that detracts from user experience.

Final Thoughts on App Performance

Overall Impressions

  • The speaker expresses disappointment in the app's design and functionality despite wanting to appreciate its potential value.
  • They note technical glitches like clipping icons which further diminish their enthusiasm for the product.

Cloud Code and Co-Work: A Critical Review

Initial Impressions of Co-Work

  • The speaker expresses frustration with the user interface (UI) of Co-Work, noting that it reloads the entire page when accessed.
  • Concerns are raised about the quality assurance (QA) process for Cloud Code, questioning whether it was adequately tested before release.
  • The speaker criticizes the engineering team for lacking a proper understanding of user experience, suggesting a need for better leadership and ownership in app development.

Use Cases and Functionality

  • The speaker discusses potential use cases for Co-Work, expressing relief that it does not focus on travel planning—a common but ineffective AI application.
  • A specific demo is highlighted where Co-Work organizes desktop files automatically, which could be beneficial for many users.

Security Concerns

  • The speaker raises concerns about granting desktop apps access to terminal commands without users fully understanding the risks involved.
  • An interesting find is mentioned regarding using Cloud Code to reverse engineer an Electron app, revealing insights into its sandboxing mechanisms.

Technical Insights

  • It’s noted that Cloud Code utilizes Apple's virtualization framework similar to Docker, indicating that installing the cloud app also downloads a full Ubuntu virtual machine (VM).
  • The discussion includes details about network isolation features in this setup, which prevent unauthorized access to devices on the user's network.

Vulnerabilities and Future Directions

  • Despite good security practices like isolation, vulnerabilities remain in Cloud Code's execution environment related to file exfiltration attacks due to unresolved issues.
  • The introduction of Cloud Co-Work as a research preview is discussed; it's currently available only to higher-tier subscribers but has gained significant attention online.

General Observations on Cloud Code

  • Simon's perspective is shared that Cloud Code functions more as a general agent than merely a developer tool; its capabilities extend beyond coding tasks.
  • There’s an emphasis on needing a more user-friendly interface and terminology that appeals to non-developers while maintaining functionality.

Co-Work Tab in Cloud Desktop App

Overview of Co-Work Interface

  • The co-work tab is integrated into the cloud desktop app, positioned alongside chat and code tabs, resembling the regular cloud code interface.
  • Users can start with a prompt and attach a folder of files; it processes these to assist with tasks like checking unpublished blog drafts against an external website.

Use Case Example

  • A user tested the feature by querying drafts from the last three months to identify which were not published on their site, showcasing practical application.
  • The system's ability to reference external sources through its VM for file access highlights its functionality but also reveals limitations when accessing certain data types.

Anthropic vs. OpenAI: Product Development

Comparison of User-Facing Products

  • Anthropic has historically focused less on end-user products compared to OpenAI, which has released numerous consumer-facing applications and features.
  • OpenAI's extensive product offerings contribute significantly to its higher subscription revenue compared to Anthropic.

Introduction of Labs at Anthropic

  • Anthropic launched "Labs" as an initiative to foster internal product development similar to successful projects like cloud code.
  • This shift indicates a strategic move towards enhancing user experiences by creating more accessible tools.

Limitations and User Experience Challenges

Functionality Testing

  • During testing, co-work struggled with specific queries (e.g., counting iMessage messages), revealing gaps in expected capabilities despite having file access.

User Understanding of File Systems

  • Many users may not understand how to utilize file systems effectively, particularly younger individuals accustomed to devices without traditional file management (e.g., Chromebooks).

Cultural Implications of Technology Use

Generational Differences in Tech Literacy

  • There is a growing concern that new users lack familiarity with conventional computing concepts due to reliance on simplified interfaces found in mobile devices.

Misconceptions About AI Capabilities

  • Users may overestimate AI's abilities based on demonstrations or use cases they observe without understanding underlying limitations related to data accessibility and processing complexity.

Exploring AI Tools for File Management

Personal Use Case of AI in Organizing Files

  • The speaker shares a personal anecdote about a friend who successfully organized her messy desktop using an AI tool, highlighting its practical utility.
  • Currently, the tool is only compatible with Mac systems, and its future development remains uncertain. It integrates with various platforms like Google Drive and Chrome.
  • The speaker demonstrates the tool's ability to interact with their browser by requesting feedback on their Twitter profile through the AI assistant.
  • A permissions issue arises when trying to navigate to a specific website, illustrating some usability challenges faced during the demonstration.
  • The speaker realizes they accessed the wrong Twitter profile, prompting a discussion about assumptions made regarding user identities.

Challenges and Opportunities in File Organization

  • The speaker expresses frustration over navigating multiple screens while using the AI tool but acknowledges its potential benefits for file management tasks.
  • They reflect on their cluttered downloads folder post-system wipe, emphasizing that even after cleanup, it still contains numerous files needing organization.
  • A genuine use case emerges as they consider how this tool could help summarize investment documents stored as PDFs in their downloads folder.
  • The speaker highlights that leveraging AI tools can significantly reduce time spent on repetitive tasks like organizing files or executing scripts.
  • They encourage users to think creatively about how these tools can automate mundane computer tasks beyond traditional coding applications.

Potential Impact on Various Professions

  • The summary of files reveals a diverse collection including images, PDFs, videos, and legal documents; showcasing the need for effective organization strategies.
  • While acknowledging that not everyone has extensive files to manage, the speaker notes that those in roles requiring file orchestration will find this tool particularly beneficial.
  • Executive assistants are identified as key beneficiaries of such tools due to their responsibilities involving extensive file management and organization tasks.

Technical Insights into Tool Functionality

  • Discussion shifts towards technical aspects of how the AI operates within virtual environments and manages access permissions securely.
  • There’s mention of session management indicating each thread operates independently within its own secure environment.

Plug-in Upgrade Guide and Artifacts

Overview of Artifacts

  • The speaker discusses a plug-in upgrade guide that provided the necessary information, although it was not intended for their blog.
  • Artifacts are described as a feature in Claude that allows users to create mini web apps within the sidebar, which can be utilized for various tasks.
  • Users have built games, apps, platforms, and tools using artifacts; the speaker prefers using CLI folders for personal projects.

Functionality and Potential of Artifacts

  • The speaker notes difficulties with closing the right sidebar in artifacts but acknowledges its functionality despite display issues.
  • Artifacts are compared to cloud code but with added UI tools; there is potential for further development in this area.
  • Open-source alternatives to Cloud Co-work were released simultaneously by two different groups, highlighting community engagement.

Security Concerns

  • The speaker raises concerns about prompt injection risks associated with artifacts and how they could potentially compromise user data or system integrity.
  • A quote from Anthropic emphasizes ongoing efforts to defend against prompt injections while acknowledging that security remains an active area of development.

Industry Responsibility and User Precautions

  • The speaker critiques Anthropic's approach to communicating security risks, suggesting they should take more ownership of potential vulnerabilities.
  • Recommendations include avoiding access to sensitive files when using cloud services and monitoring AI actions for suspicious behavior.

User Experience and Future Implications

  • Simon expresses skepticism about non-programmer users being able to recognize signs of prompt injection threats effectively.
  • A summarization process is discussed where context from the web is filtered through another model before reaching the main model, reducing risk exposure.
  • This technology represents a significant shift towards AI performing tasks beyond simple text generation, indicating broader implications for everyday users.

Exploring the Limitations and Potential of AI Tools

Current Capabilities of AI Tools

  • Users can draft emails and attach files with AI tools, but they lack the ability to send emails or find files autonomously. This highlights a gap in functionality that limits their practical application for real work.
  • The speaker expresses excitement about using Cloud Code, emphasizing its potential to enhance user experience (UX) despite some limitations, such as issues with accessing iMessage.

Access Challenges and User Experience

  • The speaker encounters difficulties with Mac OS blocking access to certain folders, which complicates the use of AI tools for tasks like retrieving photos or messages.
  • There is concern that many users will struggle to understand various technical components like connectors and MCPs (Managed Connector Protocol), which could lead to confusion.

Diverse Functionalities in AI Tools

  • The discussion introduces multiple functionalities available within these tools: skills, connectors, MCPs, plugins, and desktop extensions. Each has different expectations and controls that may overwhelm users.
  • The speaker compares current offerings with Claudebot—a tool that runs similar CLI commands remotely—indicating a preference for its capabilities over those currently available in other platforms.

Practical Applications of Cloudbot

  • Claudebot allows remote interaction with a computer via messaging apps like Telegram or WhatsApp. This setup enables users to perform complex tasks without being physically present at their machines.
  • The speaker describes how they utilize a Mac Mini running Cloudbot to manage applications and troubleshoot services effectively through remote commands.

Open Source Benefits and Recommendations

  • Claudebot is highlighted as an open-source project that has improved significantly since its initial release. However, it requires subscriptions to other services for optimal performance.
  • The speaker emphasizes the innovative potential of controlling computers remotely through tools like Claudebot while recommending it due to its advanced capabilities compared to existing options.

Future Considerations for Development

  • There is speculation that if Claudebot hadn't gained popularity quickly, alternative projects like co-work might not have been developed by teams inspired by its success.
  • A call for open-sourcing Claude code is made as the speaker believes it's time for broader accessibility now that integrated features are becoming commonplace in various applications.
Video description

They made another Claude Code, except this one's not for code... Thank you G2i for sponsoring! Check them out at: https://soydev.link/g2i SOURCES https://x.com/claudeai/status/2010805682434666759 https://simonwillison.net/2026/Jan/12/claude-cowork/ https://clawd.bot/ Want to sponsor a video? Learn more here: https://soydev.link/sponsor-me Check out my Twitch, Twitter, Discord more at https://t3.gg S/O @Ph4seon3 for the awesome edit 🙏