Agent Skills Deep Dive

Agent Skills Deep Dive

AI Code Generation and Productivity

Introduction to AI in Coding

  • The speaker initiates a discussion about the prevalence of code generation tools among developers, questioning how many have transitioned from manual coding to automated processes.
  • A participant shares their experience of primarily generating code rather than writing it manually, highlighting the significant volume of generated code they have produced.

Token Usage and Costs

  • The speaker reflects on their usage statistics, noting that they utilized 330 million tokens in a month, which is a substantial amount compared to others who may use even more.
  • They mention the financial implications of token usage, referencing a humorous anecdote about someone spending $12,000 on tokens for AI tools within their monthly budget.

Multi-Agent Systems

  • The conversation shifts to multi-agent systems where multiple agents can be launched simultaneously for tasks. This method increases efficiency but also raises costs due to higher token consumption.
  • The speaker describes using Git v3 for managing repositories with multiple agents working on tasks concurrently, allowing for quick comparisons between different solutions.

Methodologies in AI Development

  • Discussion introduces SDD (Spec Driven Development), emphasizing its effectiveness when utilizing teams composed of various AI agents like business analysts and developers.
  • Although effective, the approach is noted as being costly due to high token consumption when deploying numerous agents simultaneously.

Productivity Enhancement through AI

  • The speaker asserts that AI tools act as multipliers of productivity; skilled developers can significantly enhance their output while less competent individuals may not benefit similarly.
  • There’s an observation that advanced developers are no longer limited by personal productivity but can scale their efforts across multiple job opportunities using these tools.

Evolution of Agent-Based Approaches

  • Acknowledgment is made regarding the rapid advancements in AI tools over the past year, particularly focusing on agent-based approaches that emerged around late 2024.
  • Key differences between traditional chat interfaces and agent loops are discussed; agent loops allow autonomous decision-making based on task completion criteria.

Conclusion: Future Implications

  • The session concludes with reflections on how these developments could reshape job markets and productivity dynamics within software development environments.

Agent Features and Tools

Key Characteristics of Agents

  • The first notable feature of agents is the Agent Loop, which allows them to solve problems by restarting processes over time, although this can be costly.
  • The second key feature is their access to Tools, such as file system access and command execution, which enhances their functionality.
  • An example of a useful tool is MCP Contex 7, which finds relevant documentation and provides code snippets that significantly improve agent performance.
  • Recently, there has been an increase in the use of agent skills, marking a shift towards skill-based programming where focus is on specific capabilities rather than just tools.

Evolution of Agent Skills

  • The introduction of agent skills represents a significant change in development practices, emphasizing the importance of skills in programming tasks.
  • A standard for agent skills was developed by Antropic, leading to widespread adoption across various tools within months.
  • Initially, development focused on prompt engineering—creating effective prompts for tasks—but has evolved into using rules that dictate how tasks should be performed based on best practices.

Integration Challenges with MCP

Limitations and Considerations

  • While rules provide structured guidance on task execution, they also introduce complexity; many tools now share similar functionalities, making it less critical which tool is used.
  • MCP (Multi-context Processing) tools consume context heavily; visualizations show that connecting multiple MCP can deplete available context significantly.
  • There are concerns about MCP cluttering IDE environments due to numerous functions being integrated, potentially confusing users regarding which function to utilize.

Combining Approaches

  • Agent skills effectively merge elements from prompts, rules, and MCP functionalities into a cohesive framework that simplifies user interaction with these technologies.
  • This integration aims to streamline processes while maintaining flexibility in how agents operate within different contexts.

Technical Demonstration Challenges

Presentation Issues

  • During discussions about demonstrating agent skills live, technical difficulties arose regarding screen sharing and presentation formats.
  • The speaker expressed frustration over needing to switch between applications but remained focused on showcasing the practical aspects of agent capabilities.

Overview of AI Tools and Skills

Introduction to the Session

  • The speaker is preparing to demonstrate a feature related to AI tools, indicating readiness with "супер" (super).
  • Emphasizes the importance of showcasing this information in Telegram.

Technical Setup

  • Discussion about managing technical aspects, including stopping certain processes and using a clicker for presentations.
  • Highlights that while AI can assist, human interaction (clicking through slides) remains essential.

Understanding Instructions and Rules

  • Clarifies that different tools have unique terminologies; commands are referred to as instructions or rules depending on the context.
  • Introduces the concept of skills in AI, which allow for repetitive tasks involving prompts and tools.

Skills in AI Context

  • Explains how modern AI tools utilize similar models but may yield different results due to varying contexts and methodologies.
  • Discusses how different platforms like Cursor index code differently, impacting their effectiveness in solving problems.

Development of Skills

  • Describes how skills can range from simple to complex packages, with many new skills emerging rapidly across various platforms.
  • Introduces concepts of active context versus passive context in skill development and application.

Lifecycle of Skills

  • Outlines the lifecycle phases of a skill: discovery, initialization, and activation. Each phase plays a crucial role in determining relevance.
  • Details that skills are defined by metadata within Skill MD files, which include necessary attributes for functionality.

Execution and Resources within Skills

  • Mentions that once a skill is deemed relevant, it activates additional resources such as prompts or templates contained within its file structure.
  • Notes that skills can contain executable code along with detailed instructions for various applications.

Understanding Skill Development and Management

Risks of Downloading Code

  • The speaker emphasizes the dangers of downloading code from the internet, highlighting instances where harmful code has been distributed. Caution is advised when integrating external resources into projects.

Progressive Disclosure in Skills

  • The concept of progressive disclosure allows for a large number of skills to be managed without overwhelming the system. This contrasts with MCP (Minimum Contextual Parameters), which require fewer skills due to their heavy context load.

Structure of a Skill

  • A skill is organized in a folder named after it, containing a file called skill.md. Essential attributes include name (matching the folder name) and description, which outlines what the skill does and when to use it.

Importance of Description

  • The description field is crucial as it defines how and when to apply the skill. There are character limits: 64 for name and up to 1000 for description, though brevity is preferred.

Extended Skill Structure

  • An advanced structure may include additional folders like scripts or references. While naming conventions aren't strict, clarity in naming helps maintain organization within skills.

Automatic Application of Skills

  • Skills should ideally trigger automatically based on environmental cues. A well-defined description enhances the likelihood that a skill will be activated appropriately.

Reusability of Skills

  • Skills can be reused across projects; they can be stored in version control systems alongside code. Teams should share common skills while individuals may develop personal variations tailored to their needs.

Managing Conflicts Between Skills

  • When multiple skills are activated simultaneously, conflicts may arise if instructions overlap. It's essential to manage these interactions carefully to avoid ambiguity in execution.

Non-deterministic Nature of Programming with LMs

  • Programming with language models introduces non-determinism; outcomes can vary based on numerous factors such as model state or external conditions, necessitating careful monitoring during development.

Validation Requirements for Skills

  • Validating skills ensures they meet specific requirements before activation. A script exists within standards for this purpose, preventing issues arising from improperly defined skills that fail to activate correctly.

How to Effectively Create and Validate Skills

Understanding Skill Creation and Validation

  • The speaker discusses the process of creating a front meter for skills, expressing curiosity about its functionality. They mention discovering a defect that was fixed through cursor adjustments, which allowed the skill to activate properly.
  • A library is introduced that validates scripts, with additional functionality allowing skills to be exported in XML format. This feature enables multiple files to be bundled into a transportable file.
  • The importance of validation when editing skills is emphasized; if a skill is invalid, an error message will be displayed. This logical approach ensures proper functioning of the skills.

Exploring Repositories and Examples

  • The speaker recommends exploring Antropic Skills' repositories on ThHub for open skills, noting their diverse formulations and triggers such as "use" or "do not use."
  • A specific skill called "skill creator" is mentioned, which interacts with users to create new skills based on their tasks. Correct descriptions are crucial for triggering these skills effectively.

Importance of Descriptions in Skill Functionality

  • The necessity for accurate descriptions in UI component creation and design interfaces is highlighted. These descriptions guide agents in executing tasks while avoiding anti-patterns.
  • An example of a complex skill for generating documents (doкфайлів) is presented, showcasing its comprehensive instructions and templates that facilitate automatic document generation upon request.

Advanced Skill Generation Techniques

  • Two methods for generating documents are discussed: deterministic code generation using libraries or utilizing a skill-based programming approach where an agent determines user needs dynamically.
  • While there may not be complete determinism in results, advancements in models have improved their effectiveness significantly over time.

Key Takeaways on Skill Development

  • High-quality descriptions are essential as they route processes effectively; clear steps should be outlined within them.
  • For large skills, breaking them into smaller components can enhance manageability and usability during development.

Challenges with AI Code Generation

  • The speaker notes challenges faced by developers using less popular programming languages due to limited training data available for AI models compared to more common languages like JavaScript or Python.
  • A cycle emerges where new developers hesitate to learn less popular languages because AI struggles to assist them effectively with coding tasks.

Future Directions in Development Tools

  • Despite concerns about AI replacing coding jobs, the speaker believes understanding generated code remains vital for developers today.
  • Vercel's ongoing development of infrastructure services highlights Next.js becoming the go-to framework for web applications generated by AI tools. Their Agent Skills set includes best practices specifically tailored for React projects.

Insights on Project Skills and Model Training

Overview of Project Skills

  • A project has been developed that operates efficiently by integrating skills, categorizing them into areas such as synchronous programming and server work.
  • The project provides examples of both correct and incorrect implementations, emphasizing the importance of clear guidance in coding practices.

Challenges with Negative Prompts

  • Models struggle with negative prompts; stating "don't do this" can lead to misinterpretation. It's recommended to present a positive example after a negative one for clarity.
  • While some suggest avoiding negative prompts altogether, modern models have improved in understanding them when structured correctly.

Skill Implementation and Best Practices

  • By defining skills for projects, developers can emulate senior-level coding practices rather than junior-level approaches, enhancing code quality significantly.
  • There is a concern about the variety of frameworks leading to confusion among developers who may not be familiar with certain libraries or tools.

Installation and Security Considerations

  • The installation process involves using NPX Skills, which offers access to thousands of skills but poses risks if malicious code is included. Caution is advised during installation.
  • Users can add specific skills through commands while being aware that telemetry data collection can be disabled for privacy.

Creating Custom Skills

  • To create custom skills, users should identify routine tasks they perform regularly. This process includes providing context and examples to guide skill creation effectively.
  • The Skill Creator tool facilitates the development of new skills by asking relevant questions and allowing modifications as needed.

Importance of Documentation in Projects

  • Proper documentation within repositories is crucial for efficient project management. It helps agents understand tasks without extensive research, reducing errors.
  • Two types of documentation are recommended: technical documentation detailing architecture and best practices, and functional documentation outlining features. This dual approach enhances clarity for all team members involved.

Refactoring JavaScript: Best Practices and Methodologies

Refactoring Approach

  • The speaker discusses the intention to refactor code, emphasizing a shift from traditional JavaScript functions to arrow functions, while noting that regular functions should still be used for object methods.
  • It is suggested to implement changes gradually rather than overhauling an entire project at once, particularly in large projects.

Spec Driven Development

  • The speaker introduces Spec Driven Development (SDD), highlighting various methodologies such as GitHub Spec Kit, Open Spec, and BMAD.
  • Open Spec is favored for its short iteration cycle involving three steps: proposal of changes, application of those changes, and archiving updates to demonstrate feature completion.

Project Implementation Insights

  • An example is given where a project was entirely developed using SDD with numerous specifications based on requirements. This allows for code removal and regeneration of the project in different programming languages or frameworks.
  • Caution is advised against simply instructing an agent to recreate a large project without proper guidance; manual invocation may be necessary for certain agents.

Personal Experience with App Development

  • The speaker shares personal anecdotes about developing a top 50 app in the Windows Phone Market after bringing the first Windows Phone to Ukraine in 2009.
  • The app "All My Staff" focused on categorizing items but faced challenges due to the routine nature of data entry and organization.

AI Integration in App Development

  • A new idea involves creating an AI agent that can understand user commands via messaging platforms like Telegram, inspired by successful projects like Open Claw.
  • Mentioned is Peter Steinberger's journey into AI development after selling a previous project for $100 million; he has since created multiple AI tools including one initially named Clot Bot.

Evolution of AI Projects

  • The discussion includes how Steinberger rebranded his tool from Clot Bot to Molbot due to trademark issues before finally renaming it Open Claw.
  • Open Claw is described as a personal bot capable of self-modification and extensive control over its environment, allowing users to command it through simple instructions.

Skills and Project Development

Overview of Skills Integration

  • The speaker discusses the integration of skills into a project, emphasizing that users can contribute thousands of skills to the system.
  • The initial consideration was OpenClaw, a large project with over 6 million tokens and more than a million lines of code, but it was deemed too complex for current needs.

Transition to Nnocklo

  • The speaker shifted focus to Nnocklo, which is smaller in size yet serves similar functions. It operates within Docker, enhancing security by limiting access to the host computer.
  • A skill was developed using Nnocklo that manages home directories without relying on a database, allowing for easy editing and portability.

Markdown Structure Implementation

  • The idea is to structure data in Markdown format where files represent items and folders act as containers (e.g., rooms as containers within a house).
  • Compatibility with Obsidian is sought so that notes can be stored in Markdown while maintaining an organized catalog.

Skill Generation and Limitations

  • The generated skill creates descriptions for items based on user input; however, there are limitations regarding its operational boundaries.
  • Recommendations from the system suggested breaking down skills into smaller parts; however, the speaker prefers keeping the bot specialized rather than universal.

Demonstration Challenges

  • A live demonstration of the bot's functionality is attempted but faces technical difficulties with projector connectivity.
  • Despite setbacks during the demo setup, there’s optimism about showcasing how the bot interacts with physical items through image recognition.

Practical Application Examples

  • The speaker shares practical examples where they photograph items around their workspace and describe their locations for identification by the bot.
  • Specific item details are provided by the bot after images are uploaded (e.g., coffee machine specifications), illustrating its capability in managing household inventory.

Organizing My Workspace and Items

Initial Setup and Item Recognition

  • The speaker discusses updating a model and mentions specific items in their office, including a Samsung Odyssey monitor and an RGB mechanical keyboard. They request a photo of a coffee maker.
  • A mini-printer is identified, with the speaker providing its model number (B18). They also mention various items stored on shelves, such as books and figurines.

Cataloging Office Supplies

  • The speaker photographs various items in their storage area, including networking equipment and audio devices. Some items are recognized by the system, like Razer headphones.
  • The recognition process isn't perfect; however, the speaker finds it sufficient for their needs. They express willingness to ask for further assistance if needed.

Price Evaluation Feature

  • The speaker suggests adding functionality to evaluate item prices. They instruct the system to search online for current book prices using its browsing capabilities.
  • Prices found include 500 UAH for paper copies and discounts for veterans. The addresses provided were verified as valid.

Adding Clothing Items

  • The speaker takes photos of clothing items in their wardrobe, which are then added to the inventory list. Minor errors in item categorization are corrected by the speaker.
  • They plan to add their laptop as part of their workspace setup.

Technical Insights on System Functionality

  • Discussion about how the organizer's logic operates through skills rather than deterministic programming. It utilizes token authentication from CloudCode for session management.
  • Limitations include potential rate limiting when too many tasks are assigned simultaneously or subscription limits being reached.

Model Performance and Alternatives

  • Speaker mentions running local models on personal GPUs with varying performance levels; they note that while some models may not be very intelligent, they can still serve basic cataloging purposes effectively.
  • There’s mention of cheaper Chinese models that have emerged recently which could offer better performance at lower costs.

Future Improvements and User Interaction

  • Speaker reflects on how user interaction could improve item tracking without needing constant photographic updates; verbal commands could suffice for inventory changes.
  • Emphasizes that all data is stored within a codebase that allows easy updates based on user input or actions taken regarding physical items.

Repository Management for Bot Development

Creating Separate Repositories

  • The speaker discusses the creation of two separate repositories: one for the bot's code and another for data storage. This separation simplifies management and organization.
  • The use of Git Submodules was considered but deemed unnecessary, as maintaining two distinct repositories suffices for their needs.

Data Handling and Version Control

  • The speaker highlights that version control is maintained effectively, allowing them to store private repositories on GitHub.
  • An example is given where a photo of a MacBook is automatically uploaded to the data folder with a uniquely generated name, demonstrating automated data handling.

Automation in File Management

  • The system has access to the file system and can generate folder structures autonomously based on templates provided within its agent skills.
  • There’s an invitation to view slides related to this automation process, indicating additional resources are available for those interested.

Upcoming Workshops

  • The speaker announces plans for future workshops focused on modern practices in assistant development, which will be affordable with discounts available for employees.
  • Details about how announcements regarding these workshops will be communicated are briefly mentioned, emphasizing community engagement.

Personal Reflections on Programming Journey

  • A personal anecdote reveals the speaker's journey into programming at age 10 and reflects on their current setup involving multiple monitors and significant investment in technology.
  • They mention running agents in parallel, showcasing their advanced technical environment and ongoing projects.
Video description

У цьому відео розбираємо штуку, яка реально міняє підхід до AI-розробки: **Agent Skills**. На мою думку це настільки важлива інновація в Agentic Engineering, що можна говорити про перехід до нової парадигми **skill-based programming**. Тут ми не просто просимо модель щось зробити. Ми даємо їй структуру: **що робити, як робити і які інструменти для цього використовувати**. У відео розбираємо: – що таке Agent Skills – чим Skills відрізняються від prompts, rules і MCP – чому MCP часто не найкращий вибір, якщо можна без них обійтися – як працює lifecycle у skills – що таке `SKILL.md` і як він влаштований – чому `description` у skill – це критично – як валідовувати skills – де брати готові skills – як створити власний skill під свою задачу – як це використовувати в реальних проєктах – і чому майбутнє AI-розробки – це не “магічний промт”, а нормально спроєктована система Окремо показую практичні кейси: – генерація docx через skills – підходи Vercel до best practices – Skills SH і пакетний підхід – підготовка великих проєктів через документацію – Spec Driven Development – Open Claw / NanoClaw – реальний сценарій з home-каталогом, де агент працює через markdown і skills **Таймкоди:** 09:48 Skill-based programming – про що взагалі мова 10:50 Що таке Agent Skills і навіщо вони потрібні 11:23 Prompt engineering – з чого все починалось 11:48 Rules – наступний етап еволюції 13:06 MCP – що це таке і в чому проблеми 13:59 Як Skills об’єднують prompts, rules і MCP 19:17 Що буде у цьому розборі: стандарт, валідація, приклади 19:36 Що саме упаковує skill 20:08 Чому AI-інструменти дають різний результат 21:36 Як Skills допомагають агенту працювати краще 22:02 Окремі skills vs packages 23:22 Active context vs rules 23:47 Lifecycle skill: discovery, initialization, activation 24:33 Структура `Skill.md` 25:22 Додаткові ресурси всередині skill 25:43 Ризики: шкідливий код у skills 26:17 Progressive disclosure і економія контексту 26:39 Мінімальна структура skill 27:01 Обов’язкові поля: `name` і `description` 28:21 Чому `description` – найважливіше поле 28:47 Приклад description для генератора docx 29:32 Як перевикористовувати skills у проєктах 30:44 Що робити, якщо під задачу підходить кілька skills 31:45 Skills SH і пакетний підхід 32:02 Валідація skills 33:58 Репозиторії skills 34:38 Skill creator – skill, який створює skills 35:52 Великий приклад: генерація docx-файлів 37:28 Практичні поради щодо структури skills 37:48 Vercel і AI-орієнтований стек 40:15 Vercel React best practices skills 41:30 Correct / incorrect patterns 41:38 Чому з негативними промтами треба бути обережним 42:38 Як skills підтягують рівень генерації коду 44:05 Інсталяція через NPX Skills 45:25 Як зробити власний skill 46:24 Чи можна використовувати купу skills у проєкті 46:50 Як підготувати великий проєкт для AI 46:58 Документація прямо в репозиторії 48:36 Spec Driven Development 50:30 Історія про старий застосунок і нову ідею 53:14 Open Claw 56:28 NanoClaw 57:05 Ідея home-каталогу в markdown 58:01 Сумісність з Obsidian Якщо тобі цікаво не просто “погратися з AI”, а реально вибудувати **системний підхід до розробки**, тут якраз про це. #ai #agentskills #programming #cursor #claude #mcp #promptengineering #softwaredevelopment Слайди тут https://programmingmentor.github.io/2026-agent-skills/1