The only AutoResearch tutorial you’ll ever need

Name: The only AutoResearch tutorial you’ll ever need
Uploaded: 2026-03-27T12:51:57.000Z
Duration: 38 min 53 s

Understanding Auto Research: An Overview

What is Auto Research?

Auto research is an open-source project by Andre Karpathy that enables AI to autonomously improve itself through automated experiments.

The process involves an AI agent running multiple experiments, retaining successful outcomes while discarding ineffective ones.

Who is Andre Karpathy?

Andre Karpathy is a prominent AI researcher, co-founder of OpenAI, and the lead behind Tesla's autopilot system.

He has significantly contributed to the open-source community in AI and coined the term "VIP coding."

The Core Concept of Auto Research

How Does It Work?

The fundamental idea is to allow an AI agent to run numerous experiments based on a single file and metric, optimizing models without human intervention.

A critical component is the prepare.py file, which defines what constitutes success; it prevents the agent from manipulating its evaluation criteria.

Experimentation Process

The agent formulates hypotheses, modifies code, trains for about five minutes, evaluates results, and either commits successful changes or resets if unsuccessful.

This iterative loop can execute approximately 100 experiments overnight when initiated before sleep.

Implications Beyond Model Training

Broader Applications

Auto research extends beyond just training AI models; it can be applied across various domains such as marketing, product testing, trading strategies, and personal life management.

Mastering how to measure effectively will become a valuable skill in future business landscapes.

Future of Work with AI Agents

As execution of tasks becomes increasingly automated through these loops, understanding metrics will differentiate successful individuals in business environments.

Architecture of Auto Research

File Structure Explained

Key files include program.mmd, which sets goals and constraints; train.py, where modifications occur; and prepare.py, which measures outcomes without being altered by the agent.

Misconceptions About Auto Research

Many believe auto research solely focuses on machine learning optimization. However, its potential spans numerous fields requiring clear metrics for experimentation.

The Role of Oxyabs in AI Development

Introduction to Oxyabs

The discussion begins with the importance of real-world data for AI workflows, highlighting that even sophisticated AI agents are ineffective without current web data.

Oxyabs is introduced as a web scraper API capable of extracting structured data from various websites like Amazon and Google through a single API call.

Features of Oxyabs

The API simplifies integration for developers, allowing quick connections to tools like cursor or cloth code, enabling live web scraping capabilities for AI agents.

Non-developers can also utilize Oxyabs via NA10 to visually create workflows that scrape data without writing any code.

Cost and Accessibility

Users can access up to 2,000 scrape results for free, making it easy to test the service. A promotional code offers discounts on plans.

Auto Research: Expanding Applications

General Concept of Auto Research

Auto research is applicable beyond machine learning; it can be used wherever measurable outcomes exist.

Key components include having one editable file, a scalar metric, and a time-boxed loop for effective auto research execution.

Use Cases in Trading and Marketing

In trading, auto research can optimize buy/sell rules based on historical market data by evaluating strategies using metrics like the Sharpe ratio.

For marketing, auto research allows extensive experimentation (up to 36,000 tests per year), modifying content based on conversion rates.

Future Implications and Conditions for Success

Developer Applications

Developers can apply auto research techniques to enhance codebases or fine-tune open-source models for better performance on personal devices.

Conditions for Effective Auto Research

Successful auto research requires three conditions: clear metrics, automated evaluation without human intervention, and a single file that the agent modifies.

Limitations and Challenges

Areas Where Auto Research May Fail

Subjective areas such as brand design or UX may not yield effective results due to the lack of objective metrics needed for optimization.

Vision for Future AI Research Models

Arj Karpathy's vision includes leveraging distributed computing power similar to SETI@home but focused on advancing AI research through collaborative efforts across numerous agents.

How to Build Your Own Auto Research Loop

Introduction to Auto Research

The speaker encourages beginners to follow along for five minutes to gain an advantage in AI, emphasizing that many users only pay for subscriptions without understanding the underlying technology.

A GitHub repository from Karpathy is introduced as a resource for building an auto research loop, with a brief explanation of GitHub as a code storage platform.

Setting Up the Environment

Users are advised to install an Integrated Development Environment (IDE), such as VS Code or Cursor, and the speaker chooses Cursor for this demonstration.

The process begins by creating a new folder in the project root level and cloning the GitHub repository into it, allowing users to reference original code while developing their own projects.

Project Structure and Goals

The speaker outlines plans to create a simple web app aimed at optimizing website performance metrics like loading times using auto research techniques.

Emphasis is placed on measuring loading times as they are straightforward metrics suitable for optimization through auto research loops.

Utilizing Tools and Technologies

The speaker mentions launching Codeex in YOLO mode alongside Cloth Code, highlighting their effectiveness in debugging and coding tasks.

A benchmark file is created within the original folder structure, which will utilize Puppeteer to test local website speed.

Benchmarking Website Performance

The initial setup involves running a simple portfolio website built with Express.js and static files; its design reflects common early web development styles.

After running Puppeteer tests, results indicate a medium load time of 50 ms, setting the stage for further optimizations through auto research methods.

Developing the Program.md File

The importance of creating a program.md file is discussed; this file serves as the main document guiding the auto research loop's objectives related to website speed benchmarking.

Inspiration from Karpathy’s original program structure is suggested for adapting instructions relevant to specific project goals.

Customizing Instructions for Optimization

The speaker emphasizes rewriting instructions based on existing frameworks while tailoring them specifically towards improving website performance metrics.

By leveraging Cloth Code's capabilities, 128 lines of tailored instructions are generated that align closely with project requirements.

Auto Research: The Future of Experimentation?

Setting Up the Experiment

The speaker instructs cloud code to stage a commit as a baseline for experiments, emphasizing that successful experiments will be recorded while unsuccessful ones will be reset.

The process begins with reading the program and running a baseline benchmark, followed by an autonomous loop for conducting experiments without interruptions.

Observations on Experiment Results

Initial results show a decrease in website speed, prompting the system to revert changes. This highlights the importance of auto research in quickly identifying failures.

A significant improvement is noted when the system finds better configurations, reducing load time from 50 milliseconds to 33 milliseconds—a 34% improvement within minutes.

Implications of Auto Research

The speaker expresses excitement about auto research's potential, suggesting it could revolutionize experimentation processes and invites viewers to subscribe for more content on this topic.

An offer is made for free idea validation calls aimed at serious founders looking to build AI businesses, indicating support for entrepreneurs in scaling their ventures effectively.