This project is a web-based application that transcribes audio files into text using Azure Speech-to-Text SDK. It is built with Python, powered by Streamlit, and deployed on Heroku.
- Upload audio files for transcription.
- Utilizes Azure Speech-to-Text SDK for accurate and fast transcriptions.
- Simple and user-friendly web interface created with Streamlit.
- Secure and scalable deployment on Heroku.
Here's how the app looks:
To run this project locally, ensure you have the following installed:
- Python 3.8 or later
- Azure Subscription to access the Speech-to-Text service
- Streamlit
- Azure SDK for Python
-
Clone the repository:
git clone /~https://github.com/s-shahpouri/your-repo-name.git cd your-repo-name
-
Create and activate a virtual environment:
python -m venv venv source venv/bin/activate # On Windows: venv\Scripts\activate
-
Install the required packages:
pip install -r requirements.txt
-
Set up your
.env
file:- Create a
.env
file in the root directory. - Add your Azure Speech-to-Text key and region:
AZURE_SPEECH_KEY=your_azure_key
- Create a
-
Run the Streamlit app:
streamlit run app.py
-
Open the provided URL in your browser (e.g.,
http://localhost:8501
). -
Upload an audio file and click the "Transcribe" button to see the transcription.
-
Install the Heroku CLI and log in:
heroku login
-
Create a Heroku app:
heroku create your-app-name
-
Set your environment variables on Heroku:
heroku config:set AZURE_SPEECH_KEY=your_azure_key AZURE_REGION=your_azure_region
-
Push your code to Heroku:
git push heroku main
- Python - Programming language
- Streamlit - Web framework
- Azure Speech-to-Text SDK - Speech recognition
- Heroku - Deployment platform
Contributions are welcome! Please fork the repository and create a pull request with your changes.
This project is licensed under the MIT License. See the LICENSE file for details.