Skip to content
Better HN
Top
New
Best
Ask
Show
Jobs
Search
⌘K
undefined | Better HN
0 points
ImJasonH
7mo ago
0 comments
Share
Is anybody working on making building specialized things easier and cheaper?
0 comments
default
newest
oldest
-_-
7mo ago
Yes! At
https://RunRL.com
we offer hosted RL fine-tuning, so all you need to provide is a dataset and reward function or environment.
selim-now
7mo ago
yes! check out
https://distillabs.ai/
– follows a similar approach except the evaluation set is held out before the synthetic data generation, which I would argue makes it more robust (I'm affiliated)
j
/
k
navigate · click thread line to collapse