Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

VikParuchuri / marker Public

Notifications You must be signed in to change notification settings
Fork 1.3k
Star 21.5k

Code
Issues 181
Pull requests 22
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Releases: VikParuchuri/marker

Releases Tags

Releases · VikParuchuri/marker

Pagination, bug fixes

17 Jun 17:04

VikParuchuri

v0.2.14

fe9343c

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

Pagination, bug fixes

Add a setting to enable output pagination
Enable convert.py to use mps (but less memory efficient than cpu/cuda)
Fix bug with inference ram setting
Fix bug with pdf names with dots in them
Fix bug with images at the end of blocks

Assets 2

All reactions

Fix convert.py bug

30 May 01:55

VikParuchuri

v0.2.13

53125ac

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

Fix convert.py bug

Fix model device check.

Assets 2

Harvester62, ggHydraLinn, and adarshmadrecha reacted with thumbs up emoji

All reactions

👍 3 reactions

3 people reacted

Specify page range

29 May 18:09

VikParuchuri

v0.2.12

aa8e7f0

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

Specify page range

Make it more clear MPS can't be used with convert.py
Specify page range in convert with start_page and max_pages

Assets 2

All reactions

Python 3.12 compatibility

28 May 22:36

VikParuchuri

v0.2.11

a3334ce

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

Python 3.12 compatibility

Remove ray to enable python 3.12 compatibility
Removing ray frees a lot of VRAM (since we can use torch shared tensors), so on average with convert.py each process takes 3GB VRAM. This enables much higher throughput (was between 4.5GB and 5GB before).

Assets 2

All reactions

OCR speedups

28 May 04:34

VikParuchuri

v0.2.10

7bf2e91

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

OCR speedups

Pull in new surya and pdftext versions for speedups in OCR and text extraction, respectively
Refine heuristics to reduce OCR false positives (and true positives, unfortunately)
Enable float batch multipliers

Assets 2

mrchengshunlong and 651961 reacted with laugh emoji

tuanbmstu, 651961, and Harvester62 reacted with heart emoji

All reactions

😄 2 reactions
❤️ 3 reactions

4 people reacted

Speed improvements

23 May 23:24

VikParuchuri

v0.2.9

0d9b0db

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

Speed improvements

Enable parallel text extraction, with worker count settings
Bump surya version to pull in layout/line segmentation speed improvements, and OCR bug fix

Assets 2

pauloeli and SidneyRey reacted with thumbs up emoji

yiyibooks, mrchengshunlong, yasyf, heldilira, SidneyRey, and Jaitely-involead reacted with heart emoji

All reactions

👍 2 reactions
❤️ 6 reactions

7 people reacted

Faster OCR

18 May 04:28

VikParuchuri

v0.2.8

cc9d830

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

Faster OCR

OCR is now ~2.5x faster, due to improvements in surya

Assets 2

tcluri, mrchengshunlong, SebastianBodza, nischalj10, yiyibooks, xtyrrell, abhirupghosh, and ggHydraLinn reacted with rocket emoji

FBruzzesi, omega-lua, 651961, and xtyrrell reacted with eyes emoji

All reactions

🚀 8 reactions
👀 4 reactions

11 people reacted

Speed up inference

17 May 22:57

VikParuchuri

v0.2.7

a056562

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

Speed up inference

(from surya) faster ocr, line detection, layout inference
Unpin transformers version after testing

Should be significantly faster now, but haven't fully benchmarked, since I'm running low on time this week!

Assets 2

umerghafoor reacted with heart emoji

All reactions

❤️ 1 reaction

1 person reacted

Fix memory leak

16 May 22:46

VikParuchuri

v0.2.6

74adf35

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

Fix memory leak

Fix a memory leak (fixed in surya, bumped the version). This caused high CPU memory usage on long docs.
Improve load_all_models to take device and dtype

Assets 2

651961, wch930, Watterry, seanzhang-zhichen, and mrchengshunlong reacted with thumbs up emoji

All reactions

👍 5 reactions

5 people reacted

Marker v2

10 May 16:02

VikParuchuri

v0.2.5

6f8b239

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified

Learn about vigilant mode.

Compare

Choose a tag to compare

View all tags

Marker v2

Basically a full rewrite!

Main features:

Extracts and saves images
Improved table formatting
Better markdown wrapping
Better reading order on complex docs
Improved OCR engine with more language options
Simple pip package install (no more required system dependencies), so can be used easily on Windows
Can be used commercially (pymupdf and layoutlmv3 dependencies removed)

It takes ~2x as long to run now, but seems like a decent tradeoff.

See the README for details.

Assets 2

Watterry reacted with thumbs up emoji

skrugly, tcluri, NickTaylor-, dubsuar, SimonB97, 651961, jstjoe, PostApoc, mrchengshunlong, asdfMaciej, and 5 more reacted with hooray emoji

651961, yiyibooks, FelisDwan, PostApoc, asdfMaciej, VAlexandersson, and Watterry reacted with heart emoji

651961, tcluri, mrdrozdov, PostApoc, and asdfMaciej reacted with rocket emoji

All reactions

👍 1 reaction
🎉 15 reactions
❤️ 7 reactions
🚀 5 reactions

19 people reacted

Previous 1 2 3 4 5 Next

Previous Next

Footer

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.