Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

VikParuchuri / marker Public

Notifications You must be signed in to change notification settings
Fork 1.3k
Star 21.5k

Code
Issues 181
Pull requests 22
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Releases: VikParuchuri/marker

Releases · VikParuchuri/marker

Speedups, bug fixes

23 Oct 16:30

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Speedups, bug fixes

Fix some edge case OCR bugs
~20% end to end speedup by improving layout and text detection

Assets 2

Loading

tzengshinfu reacted with thumbs up emoji

All reactions

👍 1 reaction

1 person reacted

Fix OCR bugs

23 Oct 02:12

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Fix OCR bugs

Fix bbox issue with OCR and resizing
Fix issue with layout bboxes missing after OCR

Assets 2

Loading

tzengshinfu reacted with thumbs up emoji

All reactions

👍 1 reaction

1 person reacted

Fix misc bugs

22 Oct 22:27

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Fix misc bugs

Ensure we don't have 0 area table boxes
Ensure fullymergedblock gets a valid input

Assets 2

Loading

All reactions

Fix layout bugs

22 Oct 20:52

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Fix layout bugs

Improve layout, which improves output quality
Fix header level detection bugs

Assets 2

Loading

All reactions

Fix OOM errors

21 Oct 13:52

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Fix OOM errors

Add batch size for table rec model to avoid OOM
Enable configuring batch size
Fix error with debugging

Assets 2

Loading

All reactions

Bugfixes, output quality improvement

18 Oct 19:48

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Bugfixes, output quality improvement

Fix MPS bug with torch 2.5
Fix heading bug with zero line blocks
Improve output quality when visual boxes and text boxes are offset

Assets 2

Loading

hyericlee and futurewin reacted with thumbs up emoji

hyericlee and futurewin reacted with heart emoji

All reactions

👍 2 reactions
❤️ 2 reactions

2 people reacted

Better tables, improved output quality, header levels

17 Oct 22:37

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Better tables, improved output quality, header levels

Tables!

Integrate custom table model for better table rendering - this uses a new state of the art open table model

Markdown output

Adjust block detection to improve markdown output globally
Assign layout labels to blocks in a better way - will improve quality globally
Better line spacing in markdown output
Push footnotes to end of page

Header levels

Add detection for header levels like #, ##, etc.
Add computed table of contents

Bugfixes/misc

Fix bug with pagination not working
Much better debugging with debug image output
Python 3.13 support

Assets 2

Loading

jesseclin, PACHAKUTlQ, omega-lua, and mohit2152sharma reacted with thumbs up emoji

jrzkaminski reacted with laugh emoji

h-arnold and konradr reacted with heart emoji

All reactions

👍 4 reactions
😄 1 reaction
❤️ 2 reactions

7 people reacted

OCR and misc improvements; demo app

19 Aug 21:30

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

OCR and misc improvements; demo app

Language no longer needs to be specified
Fix OCR memory leak
Add marker GUI demo app to test out conversion
Add progress for equation detection
Improve table recognition slightly
Add table benchmark

Assets 2

Loading

yiyibooks, omega-lua, kksasa, JH6588, cxkjtd, caseylai, and Jfxhuangdao reacted with thumbs up emoji

gcgbarbosa, heldilira, cthulhu-tww, kuengroc, RalfNorthman, svmrw, Louis454545, MateoWartelle, dubsuar, nj-crossml, and 3 more reacted with hooray emoji

All reactions

👍 7 reactions
🎉 13 reactions

20 people reacted

Significant speedup

12 Jul 18:04

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Significant speedup

This release has a 15% GPU speedup, 3x CPU, 7x MPS. The speedup comes from new surya models for layout and text detection that are a lot more efficient.

This is a "best case" speedup, if you need to OCR or do equation recognition, the speedup will be lower. But it will still be a lot faster.

Assets 2

Loading

yiyibooks, ngirard, Kilowon, johnconnor-sec, Daerkle, h-arnold, jaelliot, Harvester62, lucasmelojs, omega-lua, and 2 more reacted with thumbs up emoji

mclevey, 651961, and ngirard reacted with laugh emoji

mattvr, 651961, FBruzzesi, ngirard, Blair-Johnson, yiyibooks, yasyf, dubsuar, cpursley, jrzkaminski, and Mutaz94 reacted with rocket emoji

All reactions

👍 12 reactions
😄 3 reactions
🚀 11 reactions

21 people reacted

Fix transformers bugs

30 Jun 15:20

VikParuchuri

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

Loading

Fix transformers bugs

New transformers version introduces a new kwarg in donut models. Handle this case by ignoring it.
New transformers version breaks MPS compatibility by using torch .isin to do a comparison. Handle this by setting the pytorch mps fallback setting.

Assets 2

Loading

All reactions

Previous 1 2 3 4 5 Next

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.