Releases: VikParuchuri/marker

Support word, powerpoint, excel, html, epub + math improvements

28 Feb 23:55
b985880

Support xlsx, docx, pptx, html, epub

Marker now supports additional document formats. Run pip install marker-pdf[full] to install all of the dependencies.
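As a quick sketch (the file name is a placeholder, and this assumes marker_single accepts the new formats directly once the extra dependencies are installed):

```shell
# Quote the extra so the brackets survive in shells like zsh
pip install "marker-pdf[full]"

# Convert a Word document the same way as a PDF
marker_single report.docx
```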

Improved text detection

OCR should now work better due to an improved text detection model.

Inline math improvements

  • Better inline math detection with an improved model.
  • Inline math lines are now detected via model inference.
  • New --redo-inline-math option to enable the highest-quality math detection
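As a hedged usage sketch (the file name is a placeholder; the flag spelling follows the list above):

```shell
# Opt into the highest-quality inline math detection
marker_single paper.pdf --redo-inline-math
```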

Misc improvements

  • Support for the Claude model
  • Improved benchmarking scripts
  • Better line merging with the new text detection model


Full Changelog: v1.5.5...v1.6.0

Fix LLM layout relabeling bug

19 Feb 23:04
141da8c

When text blocks were relabeled by the LLM in --use_llm mode, lines sometimes failed to merge into the blocks properly, resulting in missing text. This release fixes that.

Full Changelog: v1.5.4...v1.5.5

Download models from r2; fix inline math

19 Feb 17:53
434c0ce

Download models from R2

  • Models now download from Cloudflare R2, which is fast and reliable

Inline math improvements

  • Fix issues with inline math and escape characters

Full Changelog: v1.5.3...v1.5.4

v1.5.3

19 Feb 03:43
27d2b9e

Windows fixes

  • Fix an issue with the Streamlit app and permissions
  • Fix the torch classes issue

Memory leak fix

Fixed a memory leak that occurred when repeatedly reusing the same converter.
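The fix matters for the common pattern of building one converter and calling it on many files. A minimal sketch of that pattern, assuming the PdfConverter / create_model_dict / text_from_rendered API from the project README (file names are placeholders):

```python
from marker.converters.pdf import PdfConverter
from marker.models import create_model_dict
from marker.output import text_from_rendered

# Load the models once and reuse the same converter for every file;
# before this fix, this pattern leaked memory across conversions.
converter = PdfConverter(artifact_dict=create_model_dict())

for path in ["a.pdf", "b.pdf", "c.pdf"]:
    rendered = converter(path)
    text, _, images = text_from_rendered(rendered)
```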

Convert.py enhancements

  • Disable tqdm progress bars when converting multiple files

Full Changelog: v1.5.2...v1.5.3

Fix LLM service issue

14 Feb 01:49
0af86b1

Fix an issue with initializing the LLM service when no default is specified.

Fix OCR issue

14 Feb 01:06
9721c17

Fix an issue with OCRing documents that contain a mix of good and bad pages.

Inline math; speed up LLM calls; allow local models

13 Feb 21:09
b87d088

Inline math

Marker now handles inline math when --use_llm is set. This makes reading scientific papers a lot nicer! The feature has been optimized for speed.

Local LLMs

We now support Ollama. When passing the --use_llm flag, you can select the Ollama inference service like this:

marker_single FILEPATH --use_llm --llm_service marker.services.ollama.OllamaService

You can set the --ollama_base_url and --ollama_model options. By default, it uses llama3.2-vision.
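Putting the options together (the base URL shown is Ollama's usual local default, assumed here; the file name is a placeholder):

```shell
marker_single paper.pdf --use_llm \
  --llm_service marker.services.ollama.OllamaService \
  --ollama_base_url http://localhost:11434 \
  --ollama_model llama3.2-vision
```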

Batch LLM calls

LLM calls are now batched across processors for a significant speedup if you're passing --use_llm.
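To illustrate the idea only — this is a generic sketch of batching concurrent LLM requests with a thread pool, not marker's actual implementation; batch_llm_calls and fake_llm are hypothetical names:

```python
from concurrent.futures import ThreadPoolExecutor

def batch_llm_calls(prompts, call_fn, max_workers=4):
    # Dispatch all prompts through a shared pool so network latency
    # overlaps instead of accumulating one request at a time.
    # pool.map preserves the order of the input prompts.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(call_fn, prompts))

# Stand-in for a real LLM request (e.g. a Gemini call).
def fake_llm(prompt):
    return prompt.upper()

results = batch_llm_calls(["fix table", "fix math"], fake_llm)
print(results)  # ['FIX TABLE', 'FIX MATH']
```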

Misc fixes

  • Biology PDFs now work a lot better - leading line numbers are stripped
  • Improved OCR heuristics
  • Updated the examples

Full Changelog: v1.4.0...v1.5.0

LLM fixes; new benchmarks

11 Feb 16:41
a357967

New benchmarks

Overall

Benchmarked against LlamaParse, Docling, and Mathpix (see the README for how to run the benchmarks). Marker performs favorably against these alternatives in speed, LLM-as-judge scoring, and heuristic scoring.

Table

Table conversion was also benchmarked against Gemini Flash.

Update gemini model

  • Use the new google-genai library
  • Update to Gemini Flash 2.0

Misc bugfixes

  • Fix bug with OCR heuristics not being aggressive enough
  • Fix bug with empty tables
  • Ensure references get passed through in llm processors

Full Changelog: v1.3.5...v1.4.0

Bump gemini version

06 Feb 02:08
0aa40a5

When using the optional LLM mode, there appears to be a bug with Gemini Flash 1.5. This release bumps the default to Gemini Flash 2.0, which appears to resolve the issue.

Fix pytorch bug

31 Jan 03:00
dba5b4c

There was a bug with PyTorch 2.6 and MPS that caused errors during inference; this has been fixed.