    from deno_sandbox import DenoDeploy

    sdk = DenoDeploy()

    with sdk.sandbox.create() as sb:
        # Run a shell command
        process = sb.spawn("echo", args=["Hello from the sandbox!"])
        process.wait()

        # Write and read files
        sb.fs.write_text_file("/tmp/example.txt", "Hello, World!")
        content = sb.fs.read_text_file("/tmp/example.txt")
        print(content)
Looks like the API protocol itself uses websockets: https://tools.simonwillison.net/zip-wheel-explorer?package=d...

> Deno Sandbox gives you lightweight Linux microVMs (running in the Deno Deploy cloud) ...
Hit a snag: Sprites appear network-isolated from Fly's 6PN private mesh (fdf:: prefix inside the Sprite, not fdaa::; no .internal DNS). So a Tokenizer on a Fly Machine isn't directly reachable without public internet.
Asked on the Fly forum: https://community.fly.io/t/can-sprites-reach-internal-fly-se...
@tptacek's point upthread about controlling not just hosts but request structure is well taken - for AI agent sandboxing you'd want tight scoping on what the proxy will forward.
> The real key materializes only when the sandbox makes an outbound request to an approved host. If prompt-injected code tries to exfiltrate that placeholder to evil.com? Useless.
That seems clever.
It's a little HTTP proxy that your application can route requests through, and the proxy is what handles adding the API keys or whatnot to the request to the service, rather than your application, something like this for example:
Application -> tokenizer -> Stripe
The secrets for the third party service should in theory then be safe should there be some leak or compromise of the application since it doesn't know the actual secrets itself.
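A minimal sketch of that tokenizer pattern (all names hypothetical; a real tokenizer like Fly's sits in the request path and handles TLS and sealed secrets): the proxy alone holds the credential and injects it, and only for an allowlisted upstream, so the application never sees the secret.

```python
# Hypothetical sketch of the tokenizer idea: the proxy alone holds the
# real secret and injects it, but only for an approved upstream host.
REAL_SECRET = "sk_live_real_key"          # known only to the proxy
APPROVED_HOSTS = {"api.stripe.com"}       # tight host allowlist

def inject_secret(host: str, headers: dict) -> dict:
    """Return headers for the upstream request, adding the credential
    only when the destination host is on the allowlist."""
    if host not in APPROVED_HOSTS:
        raise PermissionError(f"host not allowed: {host}")
    return {**headers, "Authorization": f"Bearer {REAL_SECRET}"}
```

If the application is compromised, the attacker can still make requests through the proxy, but only to approved hosts, and the key itself never leaves the proxy.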
Cool idea!
(The credential thing I'm actually proud of is non-exfiltratable machine-bound Macaroons).
Remember that the security promises of this scheme depend on tight control over not only what hosts you'll send requests to, but what parts of the requests themselves.
I cannot remember what the platform was called, let me know if you do.
It's a sandbox that uses envoy as a transparent proxy locally, and then an external authz server that can swap the creds.
The idea is extended further in that the goal is to allow an org to basically create their own authz system for arbitrary upstreams, and then for users to leverage macaroons to attenuate the tokens at runtime.
It isn't finished but I'm trying to make it work with ssh/yubikeys as an identity layer. The authz macaroon can have a "hole" that is filled by the user/device attestation.
The sandbox has some nice features like browser forwarding for Claude oauth and a CDP proxy for working with Chrome/Electron (I'm building an Obsidian plugin).
I'm inspired by a lot of the fly.io stuff in tokenizer and sprites. Exciting times.
Presumably the proxy replaces any occurrence of the placeholder with the real key, without knowing anything about the context in which the key is used, right? Because if it knew that the key was to be used for e.g. HTTP basic auth, it could just be added by the proxy without using a placeholder.
So all the attacker would have to do then is find an endpoint (on one of the approved hosts, granted) that echoes back the value, e.g. "What is your name?" -> "Hello $name!", right?
But probably the proxy replaces the real key when it comes back in the other direction, so the attacker would have to find an endpoint that does some kind of reversible transformation on the value in the response to disguise it.
It seems safer and simpler to, as others have mentioned, have a proxy that knows more about the context add the secrets to the requests. But maybe I've misunderstood their placeholder solution or maybe it's more clever than I'm giving it credit for.
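The two directions described above can be sketched in a few lines (my guess at how such a scheme could work, not Deno's actual implementation): substitute the real key on the way out, and scrub it back to the placeholder if an approved endpoint echoes it back.

```python
# Hypothetical placeholder scheme: the sandbox only ever sees PLACEHOLDER;
# the proxy swaps it for the real key outbound and scrubs it inbound.
PLACEHOLDER = "DENO_SECRET_PLACEHOLDER_b14043a2"  # what the sandbox sees
REAL_KEY = "sk-real-key"                          # held only by the proxy

def rewrite_outbound(body: str) -> str:
    # Replace the placeholder with the real key before forwarding upstream.
    return body.replace(PLACEHOLDER, REAL_KEY)

def scrub_inbound(body: str) -> str:
    # Replace the real key with the placeholder if the upstream echoes it.
    return body.replace(REAL_KEY, PLACEHOLDER)
```

Note that the inbound scrub only catches a verbatim echo; as pointed out above, an endpoint that returns any reversible transformation of the value (base64, reversed string, etc.) would defeat it.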
    await using sandbox = await Sandbox.create({
      secrets: {
        OPENAI_API_KEY: {
          hosts: ["api.openai.com"],
          value: process.env.OPENAI_API_KEY,
        },
      },
    });

    await sandbox.sh`echo $OPENAI_API_KEY`;
    // DENO_SECRET_PLACEHOLDER_b14043a2f578cba75ebe04791e8e2c7d4002fd0c1f825e19...
It doesn't prevent bad code from USING those secrets to do nasty things, but it does at least make it impossible for them to steal the secret permanently. Kind of like how XSS attacks can't read httpOnly cookies but they can generally still cause fetch() requests that take actions using those cookies.
Doesn't help much if the secret can appear anywhere in the request, presumably; if its use could be restricted to specific headers only, it would be much more powerful.
Agreed, and this points to two deeper issues:

1. Fine-grained data access (e.g., sandboxed code can only issue SQL queries scoped to particular tenants)

2. Policy enforced on data (e.g., sandboxed code shouldn't be able to send PII even to APIs it has access to)
Object-capabilities can help directly with both #1 and #2.
I've been working on this problem -- happy to discuss if anyone is interested in the approach.
Same idea with more languages on OCI. I believe they have something even better in the works, that bundles a bunch of things you want in an "env" and lets you pass that around as a single "pointer"
I use this here, which eventually becomes the sandbox my agent operates in: https://github.com/hofstadter-io/hof/blob/_next/.veg/contain...
Had some previous discussion that may be interesting on https://news.ycombinator.com/item?id=46595393
> via an outbound proxy similar to coder/httpjail
looks like AI slop ware :( I hope they didn't actually run it.
This isn’t the traditional “run untrusted plugins” problem. It’s deeper: LLM-generated code, calling external APIs with real credentials, without human review. Sandboxing the compute isn’t enough. You need to control network egress and protect secrets from exfiltration.
Deno Sandbox provides both. And when the code is ready, you can deploy it directly to Deno Deploy without rebuilding.
I don't know personally how to even type ’ on my keyboard. According to find in chrome, they are both considered the same character, which is interesting.
I suspect some word processors default to one or the other, but it's becoming all too common in places like Reddit and emails.
Also, “em-dashes are something only LLMs use” comes perilously close to “huh, proper grammar, must’ve run this by a grammar checker”.
(we do this all the time; eg. a new popular saying lands in an episode of a tv show, and then other people start adopting it, even subconsciously)
(that's what Gemini would say)
Now that I think further, doesn't this also potentially break HTTP semantics? E.g. if the key is part of the payload, then a data.replace(fake_key, real_key) can change the body length without actually updating the Content-Length header, right?
Lastly, this still doesn't protect you from other sorts of malicious attacks (e.g. 'DROP TABLE Users;')... right? This seems like a mitigation, but hardly enough to feel comfortable giving an LLM direct access to prod, no?
A 2 vCPU, 4 GB RAM, and 40 GB disk instance on Hetzner costs 4.13 USD per month.
The same here is:
$127.72 without pro plan, and $108.72 with pro plan.
This means to break even, I can only use this for 4.13/127.72*730 = 23.6 hours every month, or, less than an hour daily.
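Spelled out, the break-even arithmetic (using 730 hours as an average month, and the prices quoted above):

```python
hetzner_monthly = 4.13          # USD/month, 2 vCPU / 4 GB / 40 GB on Hetzner
sandbox_monthly = 127.72        # USD/month, same size here without the pro plan
hours_per_month = 730

break_even_hours = hetzner_monthly / sandbox_monthly * hours_per_month
print(round(break_even_hours, 1))  # ≈ 23.6 hours of runtime per month
```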
The real question is whether the microVMs can run on just plain old Linux, self-hosted.
Unfortunately there's no other way to make money. If you're 100% liberally licensed, you just get copied. AWS/GCP clone your product, offer the same offering, and they take all the money.
It sucks that there isn't a middle ground. I don't want to have to build castles in another person's sandbox. I'd trust it if they gave me the keys to do the same. I know I don't have time to do that, but I want the peace of mind.
So many sandbox products these days though. What are people using in production and what should one know about this space? There's Modal, Daytona, Fly, Cloudflare, Deno, etc
Nearly all players in this space use Gvisor or Firecracker.
Many a time I have tried to figure out a self-scaling EC2-based CI system but could never get everything scaled and warm in less than 45 seconds, which is sucky when you're waiting on a job to launch. These microVM-as-a-service thingies do solve a problem.
(You could use lambda, but that’s limited in other ways).
Looks like the main innovation here is linking outbound traffic to a host with dynamic variables - could that be added to deno itself?
Why limit the lifetime to 30 mins?
I really like it. Startup times are now better than node (if not as good as bun). And being able to put your whole "project" in a single file that grabs dependencies from URLs reduces friction a surprising amount compared to having to have a whole directory with package.json, package-lock.json, etc.
It's basically my "need to whip up a small thing" environment of choice now.
How do you know what domains to allow? The agent's behavior is not predefined.
[0] https://simonwillison.net/2025/Jun/16/the-lethal-trifecta/
Will give these a try. These are exciting times, it's never been a better time to build side projects :)
We recently built our own sandbox environment backed by firecracker and go. It works great.
For data residency, i.e. making sure the service is EU bound, there is basically no other way. We can move the service anywhere we can get hardware virtualisation.
As for the situation with credentials, our method is to generate CLIs on the fly and expose them to the LLMs and then they can shell script them whichever way they want. The CLIs only contain scoped credentials to our API which handles oauth and other forms of authentication transparently. The agent does not need to know anything about this. All they know is that they can do
$ some-skillset search-gmail-messages -q "emails from Adrian"
In our own experiments we find that this approach works better and it just makes sense given most of the latest models are trained as coding assistants. They just love bash, so give them the tools.
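A hypothetical sketch of that pattern (all names invented for illustration): the generated CLI carries only a narrowly scoped token for your own gateway API, which performs the real OAuth upstream, so the agent shells out to the CLI without ever touching a real credential.

```python
import argparse

# Hypothetical generated CLI: the agent runs
#   some-skillset search-gmail-messages -q "emails from Adrian"
# and the only credential baked in is a scoped token for our own
# gateway API, which handles the real Gmail OAuth server-side.
SCOPED_TOKEN = "tok_search_gmail_readonly"   # scoped to one capability
API_BASE = "https://api.example.internal"    # invented gateway endpoint

def parse_args(argv):
    parser = argparse.ArgumentParser(prog="some-skillset")
    parser.add_argument("command")                    # e.g. search-gmail-messages
    parser.add_argument("-q", "--query", required=True)
    return parser.parse_args(argv)

def build_request(command: str, query: str) -> dict:
    """Assemble the request the CLI would send to the gateway API."""
    return {
        "url": f"{API_BASE}/skills/{command}",
        "headers": {"Authorization": f"Bearer {SCOPED_TOKEN}"},
        "params": {"q": query},
    }
```

The agent only ever sees the CLI's name and flags; revoking or re-scoping the token happens at the gateway without touching the sandbox.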
I know that each of these things is subtly different, but they're similar enough that the bootable snapshot creation workflow (which I expect is a common one) has some sharp edges, since you have to interact with all three APIs at the same time.
Also, the CLI doesn't give a useful error when you try to create a snapshot from a currently attached volume.
Finally, updating a snapshot is more steps than I'd ideally like. I would much rather be able to make changes in a sandbox with a snapshot root and have them persist as a new snapshot. I kind of get why this isn't currently the case, but the volume/snapshot dance feels (for my use case) like it's missing some abstraction.
That said, now that I've got a snapshot set up it's a nice experience. I've got an alias for `deno sandbox create --root dev --ssh` and I can `claude` in yolo mode without much fear.
Congratulations to the team :)
I realize this is using other interactions, but I'd like a bit more observability than just the isolated environment... I'm not even saying VS Code specifically, but something similar at the least.
Just an idea…
It uses web workers on a web browser. So is this Deno Sandbox like that, but for server? I think Node has worker threads.
It's about 10x what a normal VM would cost at a more affordable hoster. So you better have it run only 10% of the time or you're just paying more for something more constrained.
A full month of runtime would be about $50 for a 2 vCPU / 1 GB RAM / 10 GB SSD mini-VM that you can easily get for $5 elsewhere.
Mentioned the same in this comment as well: https://news.ycombinator.com/item?id=46881920
Those limitations from other tools was exactly why I made https://github.com/danthegoodman1/netfence for our agents
Even if this were true, "everyone building X independently" is evidence that one company should definitely build X and sell it to everyone.
It's really useful to just turn a computer on, use a disk, and then plop its url in the browser.
I currently do one computer per project. I don't even put them in git anymore. I have an MDM server running to manage my kids' phones, a "help me reply to all the people" computer that reads everything I'm supposed to read, a dumb game I play with my son, a family todo list no one uses but me, etc, etc.
Immediate computers have made side projects a lot more fun again. And the nice thing is, they cost nothing when I forget about them.
SSH in, it resumes where you left off, auto-suspends on disconnect. $0.50/month stopped.
I have the same pattern - one box per project, never think about them until I need them.
The short answer is no. And more so, I think that "Everyone I know in my milieu already built this for themselves, but the wider industry isn't talking about it" is actually an excellent idea generator for a new product.
Here's my list of code execution sandboxing agents launched in the last year alone: E2B, AIO Sandbox, Sandboxer, AgentSphere, Yolobox, Exe.dev, yolo-cage, SkillFS, ERA Jazzberry Computer, Vibekit, Daytona, Modal, Cognitora, YepCode, Run Compute, CLI Fence, Landrun, Sprites, pctx-sandbox, pctx Sandbox, Agent SDK, Lima-devbox, OpenServ, Browser Agent Playground, Flintlock Agent, Quickstart, Bouvet Sandbox, Arrakis, Cellmate (ceLLMate), AgentFence, Tasker, DenoSandbox, Capsule (WASM-based), Volant, Nono, NetFence
A quick search popped this up:
https://news.ycombinator.com/item?id=45486006
If we can spin up microVMs so quickly, why bother with Docker or other containers at all?
pretty smart. why isn't this the norm?
Next step for me is creating a secrets proxy like credit card numbers are tokenized to remove risk of exfiltrating credentials.
Edit: It’s nice that Deno Sandbox already does this. Will check it out.
Can you configure Deno Sandbox to run on a self-hosted installation of Deno Deploy (deployd), or is this a SaaS-only offering?
That website does exist. It may hurt your eyes.