Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.m4trix.dev/llms.txt

Use this file to discover all available pages before exploring further.

Product updates for @m4trix/evals, generated from conventional commits in the monorepo. Scopes in commit messages route entries to the Agents, Evals, or Tracing changelogs. Last regenerated 2026-05-26 (UTC).
May 9, 2026
Fixes
v0.34.2

Bug fixes

April 18, 2026
FeaturesFixesImprovements
v0.34.1

New features

  • introduction of tag filters (evals) (6f71580)

Bug fixes

  • datasets now accept top level and filter (evals) (0c110fa)

Improvements

  • update import paths to use .js extensions for consistency (97e481c)
  • update import paths to use .js extensions for consistency (evals) (6dc73ac)
April 9, 2026
Features
v0.33.0

New features

  • introduced sampling (evals) (4fd81ac)
March 27, 2026
Features
v0.32.0

New features

  • introduced grouped test case export (evals) (0505039)
March 22, 2026
Features
v0.31.0

New features

  • introduced triggeredAt (evals) (6258460)
  • introduced trigger timestamp (evals) (e0cabeb)
March 20, 2026
Features
v0.29.0

New features

  • introduced test case name in meta (evals) (a7959dc)
  • introduced experiment name (evals) (c2bf0d7)
  • adjustment of dataset and test case naming (evals) (f941bd7)
  • introduction of ci mode (evals) (3aa1ec1)
  • introduction of tags (evals) (3b82060)
  • new naming convention (evals) (24f2dd9)
  • new run structure with run configs (evals) (65e46db)
March 7, 2026
Features
v0.25.0

New features

  • new features and project restructure (c5bdf34)
February 26, 2026
FeaturesFixes
v0.23.0

New features

  • removed cpu concurrency defintion (evals) (189299e)
  • improved concurrency for evals (evals) (50c3bcf)

Bug fixes

  • replaced the json diff package due to invalid package hashes (evals) (7a12092)
February 21, 2026
Features
v0.16.0

New features

  • integrate json-diff for enhanced diff logging and evaluation (evals) (0050228)
  • enhance RunDetailsView and RunView for better score and metric display (evals) (5bf9416)
  • update dependencies and enhance CLI functionality (evals) (2d29850)
February 20, 2026
Features
v0.13.0

New features

  • enhance test case handling with reruns and aggregation support (evals) (fb4cfb2)
  • improved cli (evals) (471394a)
  • improved cli (evals) (5f077da)
February 19, 2026
Features
v0.10.0

New features

February 18, 2026
Features
v0.8.0

New features

  • update version to 0.7.0, add output handling in evaluator, and enhance test case structure (#27) (evals) (eaf9d8f)
  • first version (#26) (evals) (9146b35)