Apple Pulled This... I Run AI On It

Apple Pulled This... I Run AI On It

The Rise of Local AI: A Fortress Against Cloud Dependency

Introduction to the Concept

  • The speaker expresses a strong sentiment about the increasing control over cloud-based AI services, suggesting that they are being throttled and monetized.
  • They emphasize their personal investment in local intelligence, claiming it cannot be taken away by external forces.

Exploring Frontier AI

  • Mike Russell introduces himself and discusses his experience with advanced AI models like Claude Opus 4.7 and GPT 5.5, highlighting their capabilities as coding assistants.
  • He raises concerns about the costs and dependencies associated with using cloud-based AI systems, prompting him to consider running powerful AI locally on his own hardware.

Apple’s Hardware Changes: A Catalyst for Local AI Exploration

Disappearance of Key Products

  • Russell notes that Apple has quietly removed certain configurations of its Mac Mini and Mac Studio from its website without any announcement, hinting at a potential strategy shift.
  • He suggests that this removal is not coincidental but rather indicative of a larger trend affecting consumer access to powerful computing resources for local AI applications.

Setting Up the Ultimate Local AI Machine

Specifications of the New Machine

  • The speaker describes the impressive specifications of his new machine: 256 GB unified memory, 819 GB/s memory bandwidth, a 28-core CPU, and a 60-core GPU capable of running complex models efficiently at low power consumption (80 watts).

Initial Setup Steps

  • He begins setting up the machine by connecting necessary cables and creating a new local user account instead of signing in with an Apple ID to maintain privacy and independence from cloud services.
  • Important settings adjustments include disabling automatic sleep mode and enabling remote login for easy access to the machine from other devices on his network.

Installing Essential Software for Local AI Operations

Homebrew Installation

  • Russell installs Homebrew as a package manager essential for managing software installations needed for local AI operations on macOS.

Key Software Installations

  • He proceeds to install Olama as an inference server alternative to OpenAI's API along with LM Studio for exploring cutting-edge AI models available for download.

Testing Local Models: Performance Insights

Model Downloads and Initial Tests

  • After downloading various models including Quen 3.6B, he tests their performance against tasks typically handled by cloud-based AIs such as summarizing articles or classifying messages into categories like tech support or feature requests.

Performance Comparison

  • During testing, Quen demonstrates impressive speed (73 tokens per second), while also comparing it against GPTOSS which shows even faster response times in similar tasks indicating high efficiency in local processing capabilities.(644)(712)

Creative Applications: Beyond Routine Tasks

Generating Creative Content

  • Russell explores creative uses by asking local models to generate YouTube titles or write Python scripts that create visual effects like colorful rainbows on screen—showcasing versatility beyond standard operational tasks.(912)(940)

Conclusion on Local vs Cloud Intelligence

  • Ultimately, he concludes that while local machines can effectively handle routine tasks competently without reliance on cloud services, more complex challenges may still require access to frontier-level intelligence provided by companies like OpenAI or Anthropic.(1142)(1183)
Video description

🔗 Get the full build, every command, every config inside the Creator Magic community → https://mrc.fm/closetai 🧠⚡ I went fully off-grid with AI in this one. Frontier models like Claude Opus 4.7 and GPT-5.5 are incredible, but every call my OpenClaw and Hermes agents make is hitting someone else's server and paying someone else's bill — so I asked the dangerous question: can I get near-frontier intelligence running entirely on a machine in my own house that nobody can throttle, rate limit, or take away? Right before Apple quietly pulled the 256GB Mac mini M4 and the 256GB RAM Mac Studio M3 Ultra from their website, I managed to grab the exact config I needed, and in this video I set it up from scratch step by step — power settings, headless SSH, static hostname, Homebrew, Ollama, LM Studio, MLX — so you can copy exactly what I did. Then I throw real agent prompts at Qwen3.6-35B-A3B, GPT-OSS-120B, Gemma 4 and MiniMax M2.7 to find out which ones actually replace frontier work, before plugging the whole thing directly into my OpenClaw and Hermes stack so my agents are now running 100% local — silent, sub-100W, in a closet, forever. They tried to take away my cloud intelligence. They will never take away the intelligence I own. 00:00 Intro: Going Off-Grid with AI 01:21 Apple Secretly Removed This Mac 02:07 The Ultimate Local AI Machine Specs 03:14 macOS Headless Server Setup 06:17 Installing Ollama & LM Studio 07:04 Downloading AI Models & Screen Sharing 08:53 Test 1: Summarization & Logic (Qwen) 11:44 Test 2: Data Extraction (GPT-OSS 120B) 13:38 Test 3: Creative Writing & YouTube Titles 14:57 Test 4: Python Coding Test 18:09 Running AI Agents Locally 18:49 Conclusion: Can it Replace Cloud AI?