Blueprint by Mozilla.ai for finetuning a Speech-To-Text model in your own language

mozilla-ai/speech-to-text-finetune

This blueprint guides you to ...

👉 📖 For more detailed guidance on using this project, please visit our Docs here

Built with

  • Python 3.10+
  • Open-Source Tool 1
  • Open-Source Tool 2
  • ...

Quick-start

  1. Select the language you want to finetune on
  2. Common Voice uses ISO 639 language codes (https://en.wikipedia.org/wiki/List_of_ISO_639_language_codes) to map languages to unique identifiers. Find your language in that list and use its identifier from the "Set 1" column (the two-letter ISO 639-1 code) as input.
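
To illustrate the lookup in step 2, here is a minimal sketch that maps a language name to its Set 1 (ISO 639-1) identifier. The `ISO_639_1` table is only a small hand-picked excerpt of the full list, and `language_id` is a hypothetical helper that is not part of this project.

```python
# Illustrative excerpt only: a handful of ISO 639-1 ("Set 1") codes.
# The full list is on the Wikipedia page linked in step 2.
ISO_639_1 = {
    "english": "en",
    "french": "fr",
    "german": "de",
    "greek": "el",
    "hindi": "hi",
    "spanish": "es",
    "swahili": "sw",
}


def language_id(name: str) -> str:
    """Return the two-letter ISO 639-1 code for a language name (hypothetical helper)."""
    key = name.strip().lower()
    if key not in ISO_639_1:
        raise KeyError(f"{name!r} is not in this excerpt; look it up in the full ISO 639 list")
    return ISO_639_1[key]


if __name__ == "__main__":
    print(language_id("Greek"))
```

A language missing from the excerpt raises a `KeyError` instead of guessing, since a wrong identifier would select the wrong language.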

How it Works

Pre-requisites

  • System requirements:

    • OS: Windows, macOS, or Linux
    • Python 3.10 or higher
    • Minimum RAM:
    • Disk space:
  • Dependencies:

    • Dependencies listed in pyproject.toml
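
As a quick sanity check for the Python requirement above, the following sketch compares the running interpreter against the 3.10 minimum. `MIN_PYTHON` and `meets_python_requirement` are names assumed here for illustration and are not part of this project.

```python
import sys

# Minimum version taken from the system requirements above.
MIN_PYTHON = (3, 10)


def meets_python_requirement(version=None) -> bool:
    """Return True if `version` (default: the running interpreter) is at least MIN_PYTHON."""
    current = tuple(version) if version is not None else sys.version_info[:2]
    return tuple(current[:2]) >= MIN_PYTHON


if __name__ == "__main__":
    if meets_python_requirement():
        print("Python version OK")
    else:
        print(f"Python {MIN_PYTHON[0]}.{MIN_PYTHON[1]} or higher is required")
```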

Troubleshooting

I am getting this error: ValueError: ffmpeg was not found but is required to load audio files from filename.

Install ffmpeg by following the instructions for your OS at https://ffmpeg.org/download.html. For example, on Ubuntu: sudo apt install ffmpeg
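
To confirm that ffmpeg is visible before loading any audio, a minimal sketch using only the Python standard library is shown here. `find_ffmpeg` is a hypothetical helper, not part of this project. It only checks that an `ffmpeg` executable is on your `PATH`, which is an assumption about how the audio loader locates it.

```python
import shutil


def find_ffmpeg(which=shutil.which):
    """Return the path of the ffmpeg executable on PATH, or None if it is not found.

    `which` is injectable so the lookup can be replaced in tests.
    """
    return which("ffmpeg")


if __name__ == "__main__":
    path = find_ffmpeg()
    if path is None:
        print("ffmpeg not found on PATH; install it and reopen your terminal")
    else:
        print(f"ffmpeg found at {path}")
```

If the check still fails right after installing, opening a new terminal session usually refreshes `PATH`.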

License

This project is licensed under the Apache 2.0 License. See the LICENSE file for details.

Contributing

Contributions are welcome! To get started, you can check out the CONTRIBUTING.md file.
