OpenReader WebUI is a document reader with Text-to-Speech capabilities, offering a TTS read along experience with narration for both PDF and EPUB documents. It can use any OpenAI compatible TTS endpoint, including Kokoro-FastAPI.
- π― TTS API Integration: Compatible with OpenAI text to speech API, Kokoro FastAPI TTS, or any other compatible service; enabling high-quality voice narration
- πΎ Local-First Architecture: Uses IndexedDB browser storage - no server uploads required
- π Optional Server-side documents: Manually upload documents to the next backend for all users to download
- π Read Along Experience: Follow along with highlighted text as the TTS narrates
- π§ Audiobook Creation: Create and export audiobooks from PDF and ePub files with m4b format
- π EPUB Support: Read EPUB files with table of contents
- π PDF Support: Read PDF files with clean text extraction
- π¬ Follow along with highlighted text as the TTS narrates
- π² Mobile Support: Works on mobile devices, and can be added as a PWA web app
- π¨ Customizable Experience:
- π Set TTS API base URL (with optional API key)
- ποΈ Adjustable playback speed
- π Customize PDF text extraction margins
- π£οΈ Multiple voice options (checks
/v1/audio/voices
endpoint) - π¨ Multiple app theme options
β Audiobook creation and download (m4b format)
β Get PDFs on iOS working
β More documents formats: .txt, .docx, .md, etc.
β End-to-end Testing: Playwright tests (in progress)
β Accessibility Improvements: Ongoing
docker run --name openreader-webui \
-p 3003:3003 \
-v openreader_docstore:/app/docstore \
ghcr.io/richardr1126/openreader-webui:latest
(Optionally): Set the TTS API_BASE
URL and/or API_KEY
to be default for all devices
docker run --name openreader-webui \
-e API_BASE=http://host.docker.internal:8880/v1 \
-p 3003:3003 \
-v openreader_docstore:/app/docstore \
ghcr.io/richardr1126/openreader-webui:latest
Requesting audio from the TTS API happens on the Next.js server not the client. So the base URL for the TTS API should be accessible and relative to the Next.js server. If it is in a Docker you may need to use
host.docker.internal
to access the host machine, instead oflocalhost
.
Visit http://localhost:3003 to run the app and set your settings.
Note: The
openreader_docstore
volume is used to store server-side documents. You can mount a local directory instead. Or remove it if you don't need server-side documents.
docker stop openreader-webui && docker rm openreader-webui
docker pull ghcr.io/richardr1126/openreader-webui:latest
Create or add to a docker-compose.yml
:
volumes:
docstore:
services:
openreader-webui:
container_name: openreader-webui
image: ghcr.io/richardr1126/openreader-webui:latest
environment:
- API_BASE=http://host.docker.internal:8880/v1
ports:
- "3003:3003"
volumes:
- docstore:/app/docstore
restart: unless-stopped
Screen.Recording.2025-02-22.at.2.32.02.PM.mov
-
Clone the repository:
git clone /~https://github.com/richardr1126/OpenReader-WebUI.git cd OpenReader-WebUI
-
Install dependencies:
npm install
-
Configure the environment:
cp template.env .env # Edit .env with your configuration settings
Note: The base URL for the TTS API should be accessible and relative to the Next.js server
-
Start the development server:
npm run dev
or build and run the production server:
npm run build npm start
Visit http://localhost:3003 to run the app.
Dev server runs on port 3000 by default, while the production server runs on port 3003.
For feature requests or ideas you have for the project, please use the Discussions tab.
For general questions, you can reach out to me on Bluesky. If you encounter issues, please open an issue on GitHub following the template (which is very simple).
Contributions are welcome! Fork the repository and submit a pull request with your changes.
- Kokoro-FastAPI for the API wrapper
- react-pdf
- react-reader
- Kokoro-82M for text-to-speech
- linux/amd64 (x86_64)
- linux/arm64 (Apple Silicon)
- linux/arm/v7 (Raspberry Pi)
- Framework: Next.js (React)
- Containerization: Docker
- Storage: IndexedDB (in browser db store)
- PDF:
- EPUB:
- UI:
- TTS: (tested on)
- NLP: compromise NLP library for sentence splitting
This project is licensed under the MIT License.