Skip to content

Releases: VikParuchuri/marker

Speedups, bug fixes

23 Oct 16:30
6b25f06
Compare
Choose a tag to compare
  • Fix some edge case OCR bugs
  • ~20% end to end speedup by improving layout and text detection

Fix OCR bugs

23 Oct 02:12
2f3f0d7
Compare
Choose a tag to compare
  • Fix bbox issue with OCR and resizing
  • Fix issue with layout bboxes missing after OCR

Fix misc bugs

22 Oct 22:27
189d660
Compare
Choose a tag to compare
  • Ensure we don't have 0 area table boxes
  • Ensure fullymergedblock gets a valid input

Fix layout bugs

22 Oct 20:52
6bee852
Compare
Choose a tag to compare
  • Improve layout, which improves output quality
  • Fix header level detection bugs

Fix OOM errors

21 Oct 13:52
93a3ca6
Compare
Choose a tag to compare
  • Add batch size for table rec model to avoid OOM
  • Enable configuring batch size
  • Fix error with debugging

Bugfixes, output quality improvement

18 Oct 19:48
361f9b5
Compare
Choose a tag to compare
  • Fix MPS bug with torch 2.5
  • Fix heading bug with zero line blocks
  • Improve output quality when visual boxes and text boxes are offset

Better tables, improved output quality, header levels

17 Oct 22:37
ea845fd
Compare
Choose a tag to compare

Tables!

  • Integrate custom table model for better table rendering - this uses a new state of the art open table model

Markdown output

  • Adjust block detection to improve markdown output globally
  • Assign layout labels to blocks in a better way - will improve quality globally
  • Better line spacing in markdown output
  • Push footnotes to end of page

Header levels

  • Add detection for header levels like #, ##, etc.
  • Add computed table of contents

Bugfixes/misc

  • Fix bug with pagination not working
  • Much better debugging with debug image output
  • Python 3.13 support

OCR and misc improvements; demo app

19 Aug 21:30
97ce051
Compare
Choose a tag to compare
  • Language no longer needs to be specified
  • Fix OCR memory leak
  • Add marker GUI demo app to test out conversion
  • Add progress for equation detection
  • Improve table recognition slightly
  • Add table benchmark

Significant speedup

12 Jul 18:04
054fac1
Compare
Choose a tag to compare

This release has a 15% GPU speedup, 3x CPU, 7x MPS. The speedup comes from new surya models for layout and text detection that are a lot more efficient.

This is a "best case" speedup, if you need to OCR or do equation recognition, the speedup will be lower. But it will still be a lot faster.

Fix transformers bugs

30 Jun 15:20
0f792b5
Compare
Choose a tag to compare
  • New transformers version introduces a new kwarg in donut models. Handle this case by ignoring it.
  • New transformers version breaks MPS compatibility by using torch .isin to do a comparison. Handle this by setting the pytorch mps fallback setting.