Enterprise procurement (and some mid-market) wants dedicated infrastructure. Full stop. Security questionnaires are brutal on shared tenancy Compliance (SOC2, HIPAA, FedRAMP) is infinitely easier with isolation
"You share infrastructure with our competitors?" kills deals
K8s namespaces give you 80% of single-tenant benefits Resource isolation, blast radius containment, independent scaling Customer-specific customization becomes trivial Noisy neighbor problems disappear
Sales-led growth with FDE (forward deployment engineers) commands premium pricing Implementation fees offset infrastructure costs Retention is higher when customers feel they have "their own thing"
IMHO, where multi-tenant still makes sense:
Product-led growth at <$10K ACV True horizontal SaaS with zero customization You're optimizing for scaling to millions of users, not hundreds of mid-market customers
Am I off base here? What am I missing? Would love to hear from folks running B2B SaaS at scale.
What killed forum culture? Was this inevitable, or did we lose something valuable? Are there examples of forums that are thriving today?
What's particularly impressive: - Speed: Near-instant responses even for complex queries - Quality: Accurate, well-sourced answers with citations - Integration: Seamlessly pulls from the knowledge graph and fresh web results
I'm wondering: - What model(s) are they running under the hood? - How are they achieving such low latency at scale? - Are they using some kind of speculative execution or caching strategy? - How does their infrastructure differ from standalone LLM APIs?
For those who've worked on similar systems or have insights into Google's approach, I'd love to hear your thoughts on what makes this possible.
Infrastructure (Compute): ~$140 (R$821 BRL) Cloud Logging: ~$1,300 (R$7,554 BRL)
Ratio: Logging cost 9.2x the actual servers.
https://imgur.com/jGrxnkh
I fixed the loop and paused the `_Default` sink immediately.
I opened a billing ticket requesting a "one-time courtesy adjustment" for a runaway resource—standard practice for first-time anomalies on AWS/Azure.
I have been rejected twice.
The latest response: "The team has declined the adjustment request due to our internal policies."
If you run GKE, the `_Default` sink in Log Router captures all container stdout/stderr.
There is NO DEFAULT CAP on ingestion volume which is an absurd!
A simple while(true); do echo "error"; done can bankrupt a small project.
Go to Logging -> Log Router. Edit _Default sink.
Add an exclusion filter: resource.type="k8s_container" severity=INFO (or exclude specific namespaces).
Has anyone successfully escalated a billing dispute past Tier 1 support recently?
It seems their policy is now to enforce full payment even on obvious runaway/accidental usage which is absurd since its LOGS! TEXT!
Cost Summary
Total Cost: $928.45 over 70 days
Average per Request: $0.06
Cost per Task (Request): Ranges from $0.00 to $2.78, with 65.7% costing under $0.05
Projected Monthly Cost: ~$416 (based on average daily spend of $13.86)
Request Patterns
Requests per 5 Hours: Average 70.7, ranging from 1 to 451
Average Time Between Requests: 6 minutes 33 seconds
Median Time Between Requests: Just 13 seconds (shows bursts of activity)
Peak Activity: 1-2 PM (10.4% of all requests at 1 PM)
Busiest Day: Saturday with 21.7% of requests
Token Efficiency
Average Tokens per Request: 83,371 tokens
Median Tokens per Request: 38,342 tokens
Average Output per Request: 876 tokens
Cache Hit Rate: 88.8% (excellent! saves money)
Cost per 1,000 Tokens: $0.0009 (very efficient due to caching)
Cost per 1,000 Output Tokens: $0.14
Notable Stats
Most Expensive Request: $2.78 using 6.8M tokens (mostly cached)
Total Hours of Active Usage: 1,692 hours (~9 requests/hour)
Most Used Models: claude-4.5-sonnet-thinking, claude-3.5-sonnet, and others
Related: https://news.ycombinator.com/item?id=45914307
Context: I'm an Founder & CTO building an AI-First CRM product.
Here's what happened:
October: Started the month thinking "I'll stay within the Pro limits, no problem." By mid-month, Cursor hit me with a $280 invoice. By month end? $348.56 total in on-demand charges. I literally maxed out the $400 limit.
November: It's only November 12 and I've already been invoiced $289.38:
Cost per request: Claude 4.5 Sonnet Thinking ranges from $0.02 to $0.06 depending on context size. Doesn't sound like much until you realize you're hitting it 200+ times per day.
I tried 7 different models (GPT-5, Gemini 2.5 Pro, Cheetah, etc.) thinking I'd save money. Claude still ate 85% of my budget because, honestly, it's the best.
Am I more productive? Absolutely. Is it worth $638 every 6 weeks? Idk. That's $5,500+ annually just for code assistance.
So I'm curious:
What are YOU spending? Am I an outlier or is this the new normal?
Have you changed behavior to cut costs? (Using faster models? Being more selective? Bringing your own API keys?)
At what price point would you stop? $100/month? $500? $1000?
Is anyone actually staying within the included limits? Or is that just marketing?
I feel like we're in this weird phase where the value is obvious but the pricing model hasn't stabilized.
Would love to hear how others are navigating this.
Let's experiment with the other side of the marketplace: a thread for people and companies who are actively looking to buy a solution for a problem they have.
This is for founders, managers, and engineers who have a budget and are tired of searching for the right tool, library, or service. It's also for builders and vendors to find qualified leads for problems they genuinely solve.
Guidelines: - For Buyers: Please post a top-level comment describing your need. Follow the template below to give solution providers the best context.
- For Sellers/Builders: Please reply directly to a buyer's comment if you have a relevant solution. Do not post a top-level comment to advertise your product.
Buyer Template (please use this in your top-level comment):
PROBLEM: (Be specific. Instead of "I need a CRM," try "I need a lightweight, self-hostable CRM for a 3-person dev tools company that primarily integrates with GitHub and Slack. Salesforce is too complex.") BUDGET: (e.g., "$50-$200/month", "Up to $5,000 one-time for a perpetual license", "Flexible for the right solution.")
WHAT I'VE TRIED: (e.g., "We've tried HubSpot but found it too marketing-focused. We looked at Close but the per-seat pricing doesn't work for us. We've considered building our own but would rather not.")
IDEAL SOLUTION: (Describe what the perfect solution would look like. e.g., "An open-source tool with optional paid support, or a SaaS with a transparent API and fair pricing. Must have a clean, fast UI.")
CONTACT: (e.g., "My contact info is in my profile," or "Reply here and I'll reach out," or "email me at hn-buying[at]mydomain.com")