technoabsurdist on Hacker News

^ title. I've been renting MI300Xs because they're cheaper than H100s, and my experience has been generally OK (smoother than I expected, given how much people shit on AMD online). ROCm 6.x seems decent out of the box now, and I'll happily spend 30 more minutes setting up my GPU if it means paying 20% less. That being said, it's still annoying to run LLM inference on AMD's hardware (e.g. you have to install vLLM from source). And there are some other small details that still suck. As a small example, nvidia-smi gives you a nice clear interface, while rocm-smi dumps 3 pages of output that's hard to navigate.

Would be curious to hear experiences from other folks experimenting with AI workloads.
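To put a rough number on the "30 more minutes for 20% cheaper" trade-off, here is a minimal sketch. It assumes both GPUs finish the job in the same wall-clock time and that the extra setup time is billed at the discounted rate; neither assumption comes from the post, and the function name is made up for illustration.

```python
def break_even_hours(discount: float, extra_setup_hours: float) -> float:
    """Job length (in hours) beyond which the cheaper GPU costs less overall.

    Assumptions (illustrative, not from the post):
      - both GPUs complete the job in the same amount of time
      - the extra setup time is billed at the discounted hourly rate

    With a baseline price p per hour and job length T:
      cheaper GPU cost = (1 - discount) * p * (T + extra_setup_hours)
      baseline cost    = p * T
    Setting the two equal and solving for T gives the expression below.
    """
    if not 0 < discount < 1:
        raise ValueError("discount must be a fraction between 0 and 1")
    if extra_setup_hours < 0:
        raise ValueError("extra_setup_hours must be non-negative")
    return (1 - discount) * extra_setup_hours / discount


if __name__ == "__main__":
    # 20% cheaper, 30 extra minutes of setup
    print(break_even_hours(0.20, 0.5))
```

Under those assumptions the break-even point is about two hours: for anything longer than that, the half hour of setup pays for itself. If the two GPUs differ in throughput for a given workload, the result shifts accordingly.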

6 | technoabsurdist | 1y ago | 2

12

Show HN: Chisel – GPU development through MCP

(github.com)

1 | technoabsurdist | 1y ago | 0

13

Show HN: Chisel – Profile AMD MI300X kernels locally

(github.com)

2 | technoabsurdist | 1y ago | 0

14

Show HN: Chisel – AMD GPU development that feels local but runs in the cloud

(pypi.org)

2 | technoabsurdist | 1y ago | 2

15

Crackd – unbiased technical skill comparison among students

(crackd.io)

2 | technoabsurdist | 1y ago | 1

technoabsurdist

Recent submissions

AI Could Democratize One of Tech's Most Valuable Resources

Show HN: Wafer – Profile, inspect assembly, and iterate on CUDA within your IDE

Show HN: GPU Profiling That's Useful in 60 Seconds

Show HN: We made PyTorch profiling usable for ML engineers

Chip Benchmark: Hardware-Centric Performance Insights for AI Workloads

ChipBenchmark: Open-Source Benchmarking for LLM Performance Across Hardware

Using AMD MI300X for High-Throughput, Low-Cost LLM Inference

Profile CUDA kernels with one command, zero GPU setup

Show HN: Profile GPU Kernels with One Command, Zero GPU Setup

Show HN: Chisel – Profile GPU Kernels Without a GPU (Nvidia and AMD)

Ask HN: Is anyone using AMD GPUs for their AI workloads?

Show HN: Chisel – GPU development through MCP

Show HN: Chisel – Profile AMD MI300X kernels locally

Show HN: Chisel – AMD GPU development that feels local but runs in the cloud

Crackd – unbiased technical skill comparison among students
