The real opportunity with Agent Skills isn't just packaging prompts. It's providing a mechanism that enables a clean split: LLM as the control plane (planning, choosing tools, handling ambiguous steps) and code or sub-agents as the data/execution plane (fetching, parsing, transforming, simulating, or executing NL steps in a separate context).
This requires well-defined input/output contracts and a composition model. I opened a discussion on whether Agent Skills should support this kind of composability:
Also, with writing, working strictly from top to bottom has its disadvantages. It makes sense to emulate the human writing process and work in passes, fleshing sections out and, conversely, summarizing them.
Current LLMs can brute-force these things through emulation/observation/mimicry, but they aren't as good as doing it the right way. Not only would I like to see "skills" but also "processes," where you define a well-specified order in which tasks are accomplished in sequence. Repeatable templates. These would essentially include variables in the templates, set for replacement.
You can do this with Gemini commands and extensions.
https://cloud.google.com/blog/topics/developers-practitioner...
Of course this requires substantial buy-in from application owners (create the vocabulary) and from users (agree to expose and share the sentences they generate), but the results would be worth it.
Additionally, I can't even get Claude or Codex to reliably use the prompt and simple rules ("use this command to compile") in an AGENTS.md or whatever markdown file is required. Why would I assume they will reliably handle skills prompts spread about a codebase?
I've even seen tool usage deteriorate while it's thinking and self-commanding through its output to, say, read code from a file. Sometimes it uses tail, while other times it gets confused by the output and then writes a basic Python program to parse lines and strings from the same file, effectively producing the same output as before. How bizarre!
If AI were deterministic, what difference would a different AI model make?
IIUC their most recent arc focuses on prompt optimization [0], where you can optimize, using DSPy and an optimization algorithm called GEPA [1], with relative weights on different things like errors, token usage, and complexity.
[0] https://docs.boundaryml.com/guide/baml-advanced/prompt-optim... [1] https://github.com/gepa-ai/gepa?tab=readme-ov-file
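To make the "relative weights" idea concrete, here's a rough sketch of the kind of weighted objective such an optimizer might maximize. This is illustrative Python only, not the actual BAML/DSPy/GEPA API; all names are made up.

```python
# Hypothetical sketch of a weighted objective for prompt optimization:
# penalize failures, token cost, and prompt complexity with relative weights.

def score_candidate(errors: int, tokens_used: int, complexity: float,
                    w_err: float = 1.0, w_tok: float = 0.01, w_cpx: float = 0.1) -> float:
    """Higher is better."""
    return -(w_err * errors + w_tok * tokens_used + w_cpx * complexity)

# An optimizer (e.g. an evolutionary loop like GEPA) would propose prompt
# variants, evaluate each against a dev set, and keep the highest-scoring one.
candidates = {"variant_a": (2, 1800, 3.0), "variant_b": (1, 2400, 4.5)}
best = max(candidates, key=lambda name: score_candidate(*candidates[name]))
print(best)
```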
> Parsing a known HTML structure
In most cases, HTML structures that are being parsed aren't known. If they're known, you control them, and you don't need to parse them in the first place. If they're someone else's, who knows when they'll change, or under what condition they're different.
But really, I don't see the stuff you're talking about happening in prod for non-one-off use cases. I see LLMs used in prod exactly for data where you don't know ahead of time what its shape will be, and there's an enormous number of such cases. If the same logic is needed every time, of course you don't have an LLM execute that logic; you have the LLM write a deterministic script.
Skills essentially boil down to distributed parts of a main prompt. If you consider a state model, you can see the pattern: the task is the state, and combining the task's specific skills defines the current prompt augmentation. When the task changes, another prompt emerges.
In the end, it is the clear guidance of the Agent that is the deciding factor.
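As a toy illustration of that state model (all names here are made up, not any particular framework's API):

```python
# The active task (state) selects which skill snippets augment the base prompt.

SKILLS = {
    "pdf-extraction": "Use pdftotext first; fall back to OCR for scanned pages.",
    "code-review":    "Check error handling, naming, and test coverage.",
}

TASK_TO_SKILLS = {
    "summarize-report": ["pdf-extraction"],
    "review-pr":        ["code-review"],
}

def build_prompt(base_prompt: str, task: str) -> str:
    """Combine the base prompt with the skills relevant to the current task."""
    snippets = [SKILLS[name] for name in TASK_TO_SKILLS.get(task, [])]
    return "\n\n".join([base_prompt] + snippets)

print(build_prompt("You are a helpful agent.", "review-pr"))
```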
Transforming an arbitrary table is still hard, especially a table on a webpage or in a document. Sometimes I even struggle to find the right library. The effort doesn't seem worth it for a one-off transformation either. An LLM can be a great tool for such tasks.
MCP does three things conceptually: it lets you build a bridge between an agent and <something else>, it specifies a UI+API layer between the bridge and the LLM, and it formalizes the description of that bridge in a tool-calling format.
It's that UI+API layer that's the biggest pain in the ass, in my opinion. Sometimes you need it; for instance, if you wanted an agent to access your emails, a high quality MCP server that can't destroy your life through enthusiastic tool calling makes sense.
If, however, you have, say a CLI tool or simple API that's reasonably self documenting and you're willing to have it run, and/or if you need specific behavior with a different context setting, then a skill can just be a markdown file that explains what, how, why.
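For illustration, a minimal SKILL.md along those lines might look like this. The frontmatter fields mirror what I understand the published format to expect; the changelog tool itself is hypothetical.

```
---
name: changelog-tool
description: Generate a release changelog from git history. Use when the user asks for release notes or a changelog.
---

# Changelog tool

## What
Summarizes merged commits between two tags into a Markdown changelog.

## How
Run `scripts/changelog.sh <from-tag> <to-tag>` and group the output by
commit prefix (feat, fix, chore).

## Why
Keeps release notes consistent without hand-writing them each release.
```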
All the public MCP servers I've seen have been a disaster, with too many tools and tokens polluting the context. MCP is really most useful when you need tight integration with some other environment and can write a little custom wrapper to provide it.
I will say, when using MCP, be selective about which tools you enable. A lot of the time they come with, say, 30 tools and you only personally care about 5 of them. The other 25 are just rotting your context.
The durable pattern here isn't a specific file format. It's on-demand capability discovery: a small index with concise metadata so the model can find what's available, then pull details only when needed. That's a real improvement over tool calling and MCP's "preload all tools up front" approach, and it mirrors how humans work. Even as models bake more know-how into their weights, novel capabilities will always be created faster than retraining cycles. And even if context becomes unlimited, preloading everything up front remains wasteful when most of it is irrelevant to the task at hand.
So even if "Skills" gets replaced, discoverability and progressive disclosure likely survive.
The problem isn’t having a standard way for agents to branch out. The problem is that AI is the new Javascript web framework: there’s nothing wrong with frameworks, but when everyone and their son are writing a new framework and half those frameworks barely work, you end up with a buggy, fragmented ecosystem.
I get why this happens. Startups want VC money, established companies then want to appear relevant, and software engineers and students feel pressured to prove they're hireable. You end up with one giant pissing contest where half the players likely see the ridiculousness of the situation but have little choice other than to join the party.
We'll see how many of these are around in a few years.
The agent loop architectural pattern (and that’s the relevant bit) is going to continue to matter. There will be new patterns for sure, but tool calling plus while loop (which is all an “agent” is) is powerful and highly general.
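That loop is small enough to sketch in a few lines. `call_model` and `TOOLS` below are placeholders, not a real SDK:

```python
# Tool calling plus a while loop: the whole "agent" pattern in one sketch.

def run_agent(user_goal: str, call_model, TOOLS: dict, max_steps: int = 20):
    messages = [{"role": "user", "content": user_goal}]
    for _ in range(max_steps):
        reply = call_model(messages)          # returns text and/or requested tool calls
        messages.append({"role": "assistant", "content": reply["content"]})
        if not reply.get("tool_calls"):       # no more tools requested: we're done
            return reply["content"]
        for call in reply["tool_calls"]:
            result = TOOLS[call["name"]](**call["arguments"])
            messages.append({"role": "tool", "name": call["name"], "content": str(result)})
    return "step limit reached"
```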
Right now models have roughly all of the written knowledge available to mankind, minus some obscure held-out private archives and so on. They have excellent skills and a general ability to construct plausible sequences of actions to accomplish work, but we need to hold their hands to really get decent performance across a wide range of activities. Skills, agent frameworks, and MCP carve out different domains of that problem. Successful solutions provide training data for future models, which might then be generalized, or used to create a vast mountain of synthetic data following successful patterns, making the next generation of models incredibly useful for a huge number of tasks by default.
It might also be possible that by studying the problem and identifying where mode collapse and training issues prevent the right sort of generalization, they could tweak the architecture and solve the deficiency through normal training runs, thereby discarding the need for all the bespoke, artisanal agent specifications.
So basically a reusable prompt, like the previous commenter asked for?
This may all be very wrong, though, as it's mostly conjecture from the little I've worked with skills.
This lets you trigger a skill with '/foo' in a way that resembles the way you'd use the command line.
Claude Code is very good at using well-defined skills without a command, though, but in scenarios where there is some nuance between similar skills, they are useful.
BUT what makes them powerful is that you can include code with the skill package.
Like I have a skill that uses a Go program to traverse the AST of a Go project to find different issues in it.
You COULD just prompt it but then the LLM would have to dig around using find and grep. Now it runs a single executable which outputs an LLM optimised clump of text for processing.
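The commenter's tool is in Go; as a rough analogue of the same idea, here's a sketch in Python using the standard ast module that emits a compact, LLM-friendly report instead of making the model grep around. The specific check is just an example.

```python
# Walk a project's AST and emit a terse report the LLM can consume directly.

import ast
from pathlib import Path

def report(project_root: str) -> str:
    findings = []
    for path in Path(project_root).rglob("*.py"):
        tree = ast.parse(path.read_text(), filename=str(path))
        for node in ast.walk(tree):
            # example check: public functions without a docstring
            if isinstance(node, ast.FunctionDef) and not node.name.startswith("_"):
                if ast.get_docstring(node) is None:
                    findings.append(f"{path}:{node.lineno} {node.name}: missing docstring")
    return "\n".join(findings) or "no issues found"

if __name__ == "__main__":
    print(report("."))
```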
Inversely, you can persist/summarize a larger bit of context into a skill, so a new agent session can easily pull it in.
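A minimal sketch of that persist-to-skill idea; the directory layout and frontmatter fields here are illustrative, not guaranteed to match any particular client's conventions.

```python
# Dump a session summary into a skill folder so a fresh agent session can
# discover it from the index and pull it in on demand.

from pathlib import Path

def persist_as_skill(name: str, description: str, summary: str,
                     root: str = ".claude/skills") -> None:
    skill_dir = Path(root) / name
    skill_dir.mkdir(parents=True, exist_ok=True)
    (skill_dir / "SKILL.md").write_text(
        f"---\nname: {name}\ndescription: {description}\n---\n\n{summary}\n"
    )

persist_as_skill(
    "billing-module-notes",
    "Context about the billing module refactor. Use when touching billing code.",
    "Key decisions: invoices are immutable; totals are recomputed from line items.",
)
```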
So yes, it's just turtles, sorry, prompts all the way down.
https://github.com/alganet/skills/blob/main/skills/left-padd...
Either way, that’s hilarious. Well done.
<conspiracy_mode> maybe all of them were designed to occupy the full context window of earlier GPT models </conspiracy_mode>
Apart from Google Inc., I have not seen a single "AI company" propose an RFC that was reviewed by the IETF and became a proper internet standard. [0]
"MCP" was one of the worst so-called "standards" ever built since the JWT was proposed. So I do not take Anthropic seriously when they create so-called "open standards" especially when the reference implementation is in Javascript or TypeScript.
> I have not seen a single "AI company" propose an RFC that was reviewed by the IETF and became a proper internet standard.
Why would the IETF have anything to do with LLM/agent standards? This seems like a category error. They also don’t ratify web standards, for example.
Like the Deno vs npm package ecosystems that didn't work together for many years.
There are multiple intermixed and inconsistent concepts out in the wild: AGENTS.md vs CLAUDE.md vs .github/instructions; skills vs commands; and so on.
When I work on a project, do all the files align? If I work in an org, where developers have agent choice, how many of these instructions and skills "distros" do I need to put (pollute?) my repo with?
It is not healthy when you have an obsession this bad, seriously. Seek help.
Although Skills are just md files, it's good to see them "donate" it.
Their goal seems to be simple: focus on coding and improving it. They've found a great niche there, and hopefully a revenue-generating business.
OpenAI, on the other hand, doesn't give me the same vibes; they don't seem very focused. They're playing catch-up with both Google's models and Anthropic.
Apple has shortcuts, but they haven’t propped it up like a standard that other people can use.
By contrast, this is something you can use even if you have nothing to do with Claude, and the tools you create will be compatible with the wider ecosystem.
Many, many MCPs could and should just be a skill instead.
Paper & applications published here: https://earthpilot.ai/metaskills/
```
web-artifacts-builder

Suite of tools for creating elaborate, multi-component claude.ai HTML artifacts using modern frontend web technologies (React, Tailwind CSS, shadcn/ui). Use for complex artifacts requiring state management, routing, or shadcn/ui components - not for simple single-file HTML/JSX artifacts.
```
Say I want to build a landing page with some relatively static content. I don't know it yet, but it's just going to be Bootstrap CSS, no SPA/React(ish); it'll be fine as a templated server-side thing. But I don't know how to express this in words. Could the skill _evolve_ based on what my preferences are and what is possible for a relative novice to grok and construct?
This is a simple example, but it could extend to say using sqlite+litestream instead of postgres or using Gradient boosted trees instead of an expensive transformer based classifier.
---
persona: hacker
description: logical, talks about computers a lot, enjoys coffee, somewhat snarky and arrogant
---
<more details here>

1. For an experienced Claude Code user, you can already build such an agent persona quite trivially by using the /agents settings.
2. It doesn't actually replace agents. Most people I know use pre-defined agents for some tasks, but they still want the ability to create ad-hoc agents for specific needs. Your standard, by requiring them to write markdown files, does not solve this ad-hoc issue.
3. It does not seem very "viral" or income-generating. I know this is premature at this point, but without charging users for the standard, is it reasonable to expect to make money off of this?
"you're absolutely right!"
Please tell us how you REALLY feel about JavaScript.
And of course Claude Code has custom slash commands which are also very similar.
Getting a lot of whiplash from all these specifications that are hastily put together and then quickly forgotten.
Other than that it appears MCP prompts end up as slash commands provided by an MCP Server (instead of client side command definitions).
But the actual knowledge that is encoded in skills/commands/mcp prompts is very similar.
But skills don't really solve the problem. Turning that workaround into a standard feels strange. Standardizing a patch isn't something I'd expect from Anthropic; it's unclear what their endgame is here.
The value of standardizing skills is that the skills you define work with any agentic tool. It doesn't matter how simple they are; if they don't work easily, they have no use.
You need a practical and efficient way to give the LLM your context. Just as every organization has its own standards, best practices, and architectures that should be documented, because new developers don't know these upfront, LLMs also need your context.
An LLM is not an all-knowing brain; it's a plan-do-check-act text-processing machine.
Marketing. That defines pretty much everything Anthropic does beyond frontier model training. They're the same people producing sensationalized research headlines about LLMs trying to blackmail folks in order to prevent being deleted.
This is not the first time; perhaps an expectation adjustment is in order. This is also the same company that has an exec telling people on his Discord (15 minutes of fame recently) that Claude has emotions.
I think that they often do solve the problem, just maybe they have some other side effects/trade offs.
The best one we have thought of so far.
It has been published as an open specification.
Whether it is a standard isn't for them to declare.
Could one make a copyleft type license such that the generated code must be licensed free and open and under the same license? How enforceable are licenses on these skills anyway, if one can take in the whole skill with an agent and generate a legally distinct but functionally close variant?
It does code execution in an Apple container if your Skill requires any code execution.
It also proves the point that Skills are basically repackaged MCPs (if you look into my code).
For example, you can't have a directory named "Stripe-Skills" that will give you a breakdown of last week's revenue (unless you write into the skill how to connect to Stripe and get that information). So most remote, existing services are better used as MCPs (essentially APIs).
These two solutions look, feel, and smell like the same thing. Are they the same thing?
Any OpenCode users out there have any hot or nuanced takes?
It is functionally a skill. I suppose once Antigravity supports skills, I will make it one officially.
I'm authoring equivalent in CUE, and assimilating "standard" provider ones into CUE on the fly so my agent can work with all the shenanigans out there.
npx ai-agent-skills install frontend-design
20 of the most-starred Claude skills ever, now open across Claude Code, Cursor, Amp, VS Code: anywhere that supports the spec. Would love some feedback on it.
github.com/skillcreatorai/Ai-Agent-Skills
There's no real benefit to the MCP protocol over a regular API with a published "client" that a local LLM can invoke. The only downside is you'd have to pull this client first.
I am using local "skill" as reference to an executable function, not specifically Claude Skills.
If the LLM/Agent executes tools via code in a sandbox (which is what things are moving towards), all LLM tools can be simply defined as regular functions that have the flexibility to do anything.
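A hedged sketch of that idea, with two toy functions standing in for tools and no real sandboxing (a production setup would isolate the execution environment):

```python
# Tools are plain functions; the model writes ordinary code against them
# instead of emitting a JSON tool-call schema, and the host executes it.

def add(a: float, b: float) -> float:
    return a + b

def word_count(text: str) -> int:
    return len(text.split())

SANDBOX_GLOBALS = {"add": add, "word_count": word_count}

# Stand-in for code the model generated; composes two "tools" in one step.
model_generated_code = "result = add(word_count('hello agent world'), 1)"

scope: dict = {}
exec(model_generated_code, SANDBOX_GLOBALS, scope)  # real systems sandbox this
print(scope["result"])
```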
I seriously doubt MCP will exist in any form a few years from now
It's a much better system in my experience.