Documentation 📰📃😃

Create and activate a Python virtual environment

# Create the virtual environment
py -m venv ll_env

# Activate the virtual environment (Windows)
ll_env\Scripts\activate

The Procedure

Step 1. Run record_batch.py to create a new batch of phrases & words

Work through a list of words or phrases and record each one by holding down the space bar on your keyboard while you say it.

Each recorded word or phrase gets its own line in the words.txt file, and its audio is saved as a .wav file in the new-audios/ directory.

Example:

assurance (new-audios\assurance.wav)
deep-seated (new-audios\deep-seated.wav)
burden (new-audios\burden.wav)
load (new-audios\load.wav)
accomplishments (new-audios\accomplishments.wav)
...
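The pairing shown in the example can be sketched in a few lines of Python. This is only an illustration, not the code in record_batch.py: the helper names `audio_path_for` and `listing_line` are hypothetical, and it assumes each .wav file is named after its word or phrase exactly as in the listing above.

```python
from pathlib import PureWindowsPath

# Directory name taken from the example listing; Windows-style separators
# are used because the listing shows backslashes.
NEW_AUDIO_DIR = PureWindowsPath("new-audios")


def audio_path_for(entry: str) -> PureWindowsPath:
    """Return the .wav path assumed to hold the recording for an entry."""
    return NEW_AUDIO_DIR / f"{entry}.wav"


def listing_line(entry: str) -> str:
    """Format an entry the way the example listing shows it."""
    return f"{entry} ({audio_path_for(entry)})"


if __name__ == "__main__":
    for entry in ["assurance", "deep-seated", "burden"]:
        print(listing_line(entry))
```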

Step 2. Clean up your new batch

Trim any dead or silent audio from each .wav file.
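Trimming can be done by hand in an audio editor. As a rough sketch of how it could be scripted, the snippet below uses only the Python standard library. It is not the project's own tooling: the function names and the threshold of 500 are assumptions, and it only handles 16-bit mono PCM files.

```python
import sys
import wave
from array import array


def trim_silence(samples, threshold=500):
    """Drop leading and trailing samples quieter than the threshold."""
    loud = [i for i, s in enumerate(samples) if abs(s) >= threshold]
    if not loud:
        return samples[:0]  # the whole clip is silent
    return samples[loud[0]:loud[-1] + 1]


def trim_wav(src, dst, threshold=500):
    """Trim a 16-bit mono PCM .wav file and write the result to dst."""
    with wave.open(src, "rb") as reader:
        params = reader.getparams()
        if params.sampwidth != 2 or params.nchannels != 1:
            raise ValueError("this sketch only handles 16-bit mono audio")
        samples = array("h", reader.readframes(params.nframes))
    if sys.byteorder == "big":
        samples.byteswap()  # WAV data is little-endian
    trimmed = trim_silence(samples, threshold)
    if sys.byteorder == "big":
        trimmed.byteswap()
    with wave.open(dst, "wb") as writer:
        writer.setparams(params)  # frame count is corrected on close
        writer.writeframes(trimmed.tobytes())
```

A call such as `trim_wav("new-audios/burden.wav", "new-audios/burden_trimmed.wav")` would write a shortened copy and leave the original untouched. The threshold usually needs tuning to the microphone's noise floor.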

Structure of the data

Note: all of the data is in the data/ directory.

data/
- audio/ (folder)
- backed-up/ (folder)
- dict/ (folder)
- sentences/ (folder)
- videos/ (folder)
- markdown_links.txt (text file)
- mistakes.json (json file)
- words.json (json file)

data/audio/ (audio) Brendan's recordings of pronunciations, saved as .wav audio files
- audio/phrases/ (phrases)
- audio/words/ (words)

data/dict/ (dict) stands for dictionary; it contains data saved from the Webster dictionary API
- dict/audio/ (.wav audio) downloaded from the dictionary API's media endpoint
  - dict/audio/phrases/ (phrases)
  - dict/audio/words/ (words)
- dict/json/ (json) JSON response from the dictionary API, saved as {searched word} + .json
  - dict/json/phrases/ (phrases) phrase file names are snake case, example: a_thrill_swept.json
  - dict/json/words/ (words)
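To make the layout concrete, here is a small sketch that builds paths following the tree above. The helper names are hypothetical. Two rules are assumptions rather than documented behaviour: an entry containing a space is treated as a phrase, and recordings under data/audio/ are assumed to use the same underscore-joined file names as a_thrill_swept.json.

```python
from pathlib import PurePosixPath

DATA_DIR = PurePosixPath("data")


def kind_of(entry: str) -> str:
    """Assumption: an entry with a space in it is a phrase, otherwise a word."""
    return "phrases" if " " in entry.strip() else "words"


def file_stem(entry: str) -> str:
    """Lowercase the entry and join its parts with underscores."""
    return "_".join(entry.lower().split())


def dict_json_path(entry: str) -> PurePosixPath:
    """Path of the saved dictionary API response for an entry."""
    return DATA_DIR / "dict" / "json" / kind_of(entry) / f"{file_stem(entry)}.json"


def recording_path(entry: str) -> PurePosixPath:
    """Path of the personal recording for an entry (naming is assumed)."""
    return DATA_DIR / "audio" / kind_of(entry) / f"{file_stem(entry)}.wav"


if __name__ == "__main__":
    print(dict_json_path("a thrill swept"))
    print(recording_path("burden"))
```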

To-Do

Completed

- Only pages 5 and 6 are left to complete
- Currently at: Harsh (id = 124) as of 10/07/2024
- Fix recordings that are too long because of blank (silent) audio. Write a script to scan for and output all files with a duration > 3 seconds.
- Pip install the package for environment variables, then fix the api_req file so it can be removed from .gitignore
- If the md file would say "List of words:", then write a message like "This word or phrase doesn't seem to exist in the English dictionary. Maybe I misunderstood the text in your image."
- Mark edited audio files with a flag so that volumex(2) is not applied to them again during video creation
- Update every JSON object in the words.json file with its new duration from after.json
- Write a script to separate these into two categories: 1. Phrases and 2. Words.
- Update the long-entries/before.json file
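For reference, the scan for recordings longer than 3 seconds could look roughly like the sketch below. It is not the script used in this repository. The function names are hypothetical, it relies only on the standard library `wave` module, and it assumes the files are uncompressed PCM .wav files.

```python
import wave
from pathlib import Path


def wav_duration(path) -> float:
    """Return the length of a .wav file in seconds."""
    with wave.open(str(path), "rb") as reader:
        return reader.getnframes() / reader.getframerate()


def find_long_recordings(folder, limit_seconds=3.0):
    """List (path, duration) pairs for .wav files longer than the limit."""
    results = []
    for path in sorted(Path(folder).rglob("*.wav")):
        duration = wav_duration(path)
        if duration > limit_seconds:
            results.append((path, duration))
    return results


if __name__ == "__main__":
    # data/audio is the recordings folder described in the structure section.
    for path, duration in find_long_recordings("data/audio"):
        print(f"{path}: {duration:.2f} s")
```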

Pending

- Currently at: Harsh (id = 198 (125-198)) as of 10/13/2024
- Update the Requirements.txt
- Finish the MoviePy script to prompt to record using each word in a sentence
- Write script to separate into batches by date

Requirements (.txt)

certifi==2024.8.30
charset-normalizer==3.3.2
comtypes==1.4.7
idna==3.9
PyAudio==0.2.14
pydub==0.25.1
pypiwin32==223
pyttsx3==2.97
pywin32==306
requests==2.32.3
setuptools==74.1.2
SpeechRecognition==3.10.4
typing_extensions==4.12.2
urllib3==2.2.3
