undefined | Better HN

0 pointsHarHarVeryFunny20d ago0 comments

Many of the Chinese models are open weights, so if you are concerned about them "phoning home", then anyone can just self-host and run them themself, or use via a US provider such as OpenRouter.

0 comments

19 comments · 4 top-level

falcor8420d ago· 13 in thread

There's a higher-order concern here that I'm paranoid enough to voice: that if used as a coding agent, an AI model affiliated with a country's government might try to make my software susceptible to attacks by that government's intelligence forces.

And note that I'm not singling out China here.

zozbot23420d ago

> that if used as a coding agent, an AI model affiliated with a country's government might try to make my software susceptible to attacks by that government's intelligence forces.

Note that if such a trigger were to exist, the behavior has to be completely reproducible by definition, e.g. when put into the right setting with the right input context, the model starts behaving maliciously with at least some well-defined probability. I don't think any such incident has ever been described, it's a purely theoretical concern.

Avicebron20d ago

I don't think it's a stretch that you can train/align a model to avoid "hatespeech" or other topics deemed $Unacceptable you can align a model to favor a certain ideological viewpoint and have that alignment subtly influence the output.

How do most Chinese models handle Tienanmen square or discussions on Han superiority?

3 more replies

SpicyLemonZest20d ago

Such incidents have been extensively described. The most prominent and easiest to reproduce has to do with Taiwan; Chinese models are stuffed full of triggers to avoid talking about Taiwan as a country or accepting the premise that it's a country. Try asking Deepseek about country code +886!

2 more replies

Humorist229020d ago

It's more comical than sinister, but I have an example in this vein.

I was using Claude to work on a pet project which itself has a "generate with AI" feature. The default model the project uses was Gemini (because it was cheaper and more reliably produces the correct output format). Claude kept changing the default model to Opus when working on entirely unrelated parts, and I kept noticing it because Opus would mangle the output and break the rendered page. It also did this to the .env file in addition to the default.

add-sub-mul-div20d ago

Giving up our agency to AI has the potential to turn us into NPCs, period. Economically, politically, socially. They've invented a vehicle for inserting any idea they want into our consumption and output.

moron4hire20d ago

Isn't this only a concern for yolocoding? All the AI-advocates tell me that "good" use of AI should include human review. Of course, they never seem able to explain why the boss that makes you use coding agents to go fast wouldn't be the same boss that pressures you to "just ship it, it's working" and skip review, so I absolutely believe your concern is valid.

HarHarVeryFunnyOP19d ago

If you're that paranoid, then you shouldn't be using any online services at all, and should not have an internet connection to your PC. Never use a compiler that you have not bootstrapped yourself without the use of any other compiler binary.

Even with these precautions you may still be hacked by state-level actors using a whole variety of sophisticated attack vectors. There may be Stuxnet-like software hidden on your hard drive where you cannot see it. If you do not have a TEMPEST hardened compute environment then anything you type on your keyboard or display on your screen may be getting stolen.

That said, it would be a fantastic achievement if someone could create a coding model that managed to hide a backdoor in the code it was generating. although surely simpler to hack you in 100 other ways.

imjonse20d ago

Since that is valid for every model from any country, it's a good idea to review the code the agent creates :)

beepbooptheory20d ago

Almost feels like maybe the best bet is to have humans make the code when its really important.

throw123456789120d ago

Because people cannot be manipulated.

sometimelurker20d ago

you can finetune the ccp propaganda out of them, then your mostly fine. if you want to be more safe you can finetune their public base models to not have ccp propagnada, and then proceed with the rest of the training (costs more tho)

stevehawk20d ago

so use the cheap model to do the work and the expensive domestic model to audit?

SpicyLemonZest20d ago

Or I can just use the domestic model, accepting that I'm paying some premium in order to reduce the complexity of my dependencies and the amount of time I have to spend thinking about supply chain risk. It's the same reason I don't buy things from Alibaba even though many things I buy from Amazon are surely available there for less.

1 more reply

kube-system20d ago· 2 in thread

Most American companies are using frontier or near frontier models.

And OpenRouter’s architecture makes it inherently a compliance nightmare.

It’s much easier for the typical company to go with a provider where they can pay as they go and have a single data processing agreement.

JumpCrisscross20d ago

> OpenRouter’s architecture makes it inherently a compliance nightmare

Why?

kube-system20d ago

Because the platform is designed to send data to numerous different backend data processors.

Using something like Bedrock is a lot easier for compliance because the only processor is Amazon.

1 more reply

chrsw20d ago

Very few American companies know how to properly set up and self-host their own models. Even fewer actually do it. It in the context of your typical large enterprise it's not as simple as buying a rack of servers and downloading a model off Hugging Face.

I suspect the reason is similar to the reason why there aren't any competitive open weight American LLMs.

xnx20d ago

Yes. Open weights are great and are a good option to hosted models under the right circumstances. I'm glad that China releases open weight models (which in some cases are sort-of be distilled versions of hosted US models).

j / k navigate · click thread line to collapse

0 comments

19 comments · 4 top-level

falcor8420d ago· 13 in thread

And note that I'm not singling out China here.

zozbot23420d ago

> that if used as a coding agent, an AI model affiliated with a country's government might try to make my software susceptible to attacks by that government's intelligence forces.

Avicebron20d ago

How do most Chinese models handle Tienanmen square or discussions on Han superiority?

3 more replies

SpicyLemonZest20d ago

2 more replies

Humorist229020d ago

It's more comical than sinister, but I have an example in this vein.

add-sub-mul-div20d ago

moron4hire20d ago

HarHarVeryFunnyOP19d ago

imjonse20d ago

Since that is valid for every model from any country, it's a good idea to review the code the agent creates :)

beepbooptheory20d ago

Almost feels like maybe the best bet is to have humans make the code when its really important.

throw123456789120d ago

Because people cannot be manipulated.

sometimelurker20d ago

stevehawk20d ago

so use the cheap model to do the work and the expensive domestic model to audit?

SpicyLemonZest20d ago

1 more reply

kube-system20d ago· 2 in thread

Most American companies are using frontier or near frontier models.

And OpenRouter’s architecture makes it inherently a compliance nightmare.

It’s much easier for the typical company to go with a provider where they can pay as they go and have a single data processing agreement.

JumpCrisscross20d ago

> OpenRouter’s architecture makes it inherently a compliance nightmare

Why?

kube-system20d ago

Because the platform is designed to send data to numerous different backend data processors.

Using something like Bedrock is a lot easier for compliance because the only processor is Amazon.

1 more reply

chrsw20d ago

I suspect the reason is similar to the reason why there aren't any competitive open weight American LLMs.

xnx20d ago

j / k navigate · click thread line to collapse