LLM fixes; new benchmarks

VikParuchuri released this 11 Feb 16:41
· 154 commits to master since this release
a357967

New benchmarks

Overall

Marker is now benchmarked against LlamaParse, Docling, and Mathpix (see the README for how to run the benchmarks). It compares favorably with these alternatives on speed, LLM-as-judge scoring, and heuristic scoring.

[Image: overall benchmark results]

Table

Table conversion is benchmarked against Gemini Flash:

[Image: table benchmark results]

Update gemini model

  • Use the new genai library
  • Update to Gemini 2.0 Flash
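
For readers unfamiliar with the newer client, here is a rough sketch of a text call to Gemini 2.0 Flash through the google-genai package. It is not Marker's own code. The helper name `ask_gemini`, the `GOOGLE_API_KEY` environment variable, and the model id `gemini-2.0-flash` are assumptions made for illustration.

```python
import os

# Assumed model id for "gemini flash 2.0"; check the provider's model list.
MODEL_NAME = "gemini-2.0-flash"


def ask_gemini(prompt, client=None, model=MODEL_NAME):
    """Send a text prompt to Gemini and return the response text.

    `client` may be any object exposing `models.generate_content(...)`,
    which makes the helper easy to exercise with a stub.
    """
    if client is None:
        # Imported lazily so the helper can be used without the package
        # installed when a client is supplied (pip install google-genai).
        from google import genai

        # GOOGLE_API_KEY is a hypothetical variable name for this sketch.
        client = genai.Client(api_key=os.environ["GOOGLE_API_KEY"])

    response = client.models.generate_content(model=model, contents=prompt)
    return response.text
```

Calling `ask_gemini("Summarize this table: ...")` with a valid key set would return the model's reply as a string. Passing the client in explicitly keeps the network dependency out of the helper itself.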

Misc bugfixes

  • Fix bug with OCR heuristics not being aggressive enough
  • Fix bug with empty tables
  • Ensure references get passed through in LLM processors

What's Changed

Full Changelog: v1.3.5...v1.4.0