Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.
Dispatch differs from alternative solutions by allowing developers to write simple Python code: it has a minimal API footprint, which usually only requires using a function decorator (no complex ...