It currently lets you:
- build evaluation pipelines as graphs
- run them against datasets
- track how output quality changes over time
https://www.pipevals.com