Restructuring Spotdl #812

ghost · 2020-08-17T14:18:57Z

Spotdl is currently a sprawling codebase. This is an attempt to simplify it from the ground up and make it easier to contribute to.

If you see this message, I humbly ask that you spare an hour or two to look through the code.
Ofc not (even I wouldn't spend an hour or two...)

Please do look through the 'Working Docs' and 'Temp/spotdl' folders and leave your thoughts on:

Clarity of code - How easy is it to figure out what's going on?
Completeness of documentation - Is everything documented and is the documentation clear?
Simpler ways to implement the same functionality using less code.

Some mess-ups with the last commit and me trying to revert to a previous commit. Damn, it should have been named 'Problems & Solutions III'. Well, here is the fix. The interface definitions have been finished. If I just figure out the code class dubbed soulOfSpotDl and the 'tools' proposed, I can get down to code. Cheerio. 😁

Updated the logging guidelines for more clarity in the resulting logs, updated the logging messages interspersed throughout the code to match the new guidelines, and updated workingDocs. Chucked in a package diagram for good measure; it forces you to take stock and stop roaming around in circles. Put up the new stuff since 'Problems & Solutions I' under README updates.

Got in fresh embedding code that should work in theory; tests not yet done. Slight changes to the package diagram. Some more minor folder renamings and stuff. One step closer to completion (4 more steps to go, assuming someone else will write the search provider, which I'm hoping will be YTmusic). Thinking of ways to highlight the not-so-great function input variable 'v2_version=' used in mutagen; it tells you absolutely nothing about what the variable does.

So, my idea is to do the downloads and conversions in parallel to speed things up. Threading is not parallel processing; multiprocessing is. Fiddled around with multiprocessing. Tried calculating the SHA512 checksum for 33,326 files: single process ~25 mins, 16 processes (since I use an 8-core Hyper-V processor) ~just 3 mins. Far better than I expected.
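For anyone who wants to repeat that kind of experiment, here is a rough sketch of hashing many files with a process pool. It is not the script used for the numbers above; the names `sha512_of_file` and `checksum_many` are made up for illustration, and the timings you get will depend on disk speed as much as on core count.

```python
import hashlib
from multiprocessing import Pool


def sha512_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-512 digest of one file, read in chunks."""
    digest = hashlib.sha512()
    with open(path, "rb") as handle:
        # iter() with a sentinel keeps reading until read() returns b"".
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def checksum_many(paths, processes=None):
    """Hash many files in parallel and return {path: digest}.

    processes=None lets Pool pick one worker per CPU it can see.
    """
    paths = list(paths)
    if not paths:
        return {}
    with Pool(processes=processes) as pool:
        digests = pool.map(sha512_of_file, paths)
    return dict(zip(paths, digests))


if __name__ == "__main__":
    import sys

    # Usage (hypothetical file name): python checksums.py file1 file2 ...
    for path, digest in checksum_many(sys.argv[1:]).items():
        print(digest, path)
```

Passing `processes=1` gives a single-process baseline to compare against.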

ritiek · 2020-08-25T11:32:02Z

The structure looks good to me overall. I have mostly some style concerns at this point. I see you're using camelCase for naming your classes, functions/methods and variables. AFAIK this isn't the preferred way in Python. The style guide (PEP 8) recommends CapWords (TitleCase) for classes and lower_case_with_underscores for method/function and variable names.
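To make the naming point concrete, a small sketch of the PEP 8 style. `SongObj` is the class name this PR ends up using; the attributes and the method are invented for the example.

```python
class SongObj:  # classes: CapWords
    def __init__(self, song_name, artist_names):  # arguments: lower_case_with_underscores
        self.song_name = song_name
        self.artist_names = artist_names

    def get_display_name(self):  # methods: lower_case_with_underscores
        return f"{', '.join(self.artist_names)} - {self.song_name}"


# Variables follow the same lower_case_with_underscores rule.
example_song = SongObj("Song A", ["Artist One", "Artist Two"])
display_name = example_song.get_display_name()
```

The camelCase equivalents would be `songObj`, `songName` and `getDisplayName`.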

ritiek · 2020-08-25T11:36:48Z

Also, let's not inherit from object in classes. This used to make sense in Python 2 but since spotdl now only targets Python 3, there is no reason to inherit from object.
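A quick illustration of why the explicit base is redundant in Python 3 (the class names are placeholders):

```python
class WithExplicitObject(object):  # Python 2 era spelling
    pass


class WithoutExplicitObject:  # Python 3: object is already the implicit base
    pass


# Both classes end up with exactly the same base class.
print(WithExplicitObject.__bases__)
print(WithoutExplicitObject.__bases__)
```

Both print statements show `object` as the only base, so dropping `(object)` changes nothing at runtime.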

ghost · 2020-08-25T13:46:47Z

Ok, cool, I didn't know that about the object inheritance thing. My primary reference for the std-lib is Beazley's Python reference (it might be a bit outdated on that point, but it covers most of my needs). And about the style guide, I really don't see a reason to conform to a style just because the PEP guidelines say so. The intention behind standardized naming is ease of reading (at least according to Code Complete, which is a very thorough reference on good code practices). If there are any good reasons to change the naming style (I haven't thought about it extensively, my bad), let me know.

One other thing of note about naming: the aim of a naming convention is to make the code easier to read, and as a result easier to contribute to. Having potential contributors pause to think about naming styles (considering that PEP defines one each for variables, functions, classes, modules...) will just end up being a drag. Nor is spotdl so big (unlike the std-lib, for which PEP was created) that it needs naming styles that indicate what is a function, what is a class and what is a variable.

I looked through the PEP naming conventions (at stackoverflow); they also note that 'internal consistency matters most'. If I think about it now, underscores do add readability. Do you think we have to change it? If yes, are we talking about underscore-based naming or about the whole PEP8 convention?

ritiek · 2020-08-25T14:52:08Z

I'd say let's just stick with the whole of the PEP8 convention. I personally do not conform to a few things either, such as the character count per line, but overall I think it's a nice base to conform to and it does help with readability, at least for me (or perhaps I've grown used to it). spotdl is indeed not a really big code base but it's still big enough to get lost in (the reason we're doing this refactor). The greater community also mostly comes from the same common ground that is PEP8, and this will also make it easier to use code check tools such as flake8, which could help make sure new contributors adhere to the same code style. So, I don't see a good reason not to follow PEP8, at least for the naming convention.

On a side note, let's also avoid spaces around "=" in kwargs.
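For clarity, this is the spacing meant here. The function below is a hypothetical helper, not something from the codebase:

```python
def download_song(link, output_format="mp3", overwrite=False):
    """Hypothetical helper, used only to show the spacing."""
    return f"{link} -> {output_format} (overwrite={overwrite})"


# Preferred: no spaces around "=" for keyword arguments and defaults.
result = download_song("https://example.invalid/track", output_format="m4a", overwrite=True)

# Discouraged by PEP 8 (it runs identically, it just reads differently):
# result = download_song("https://example.invalid/track", output_format = "m4a", overwrite = True)
```

The one exception PEP 8 makes is a default on an annotated parameter, e.g. `def f(limit: int = 200)`, where the spaces are kept.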

ghost · 2020-09-01T18:43:24Z

“The primary goal of any programmer is managing complexity”

ABCs were actually implemented in this iteration, as I thought they would help; they were subsequently removed as they didn't help as much as expected.

I do accept that the original authors found a necessity for ABCs, but just because something was there doesn't mean it should continue to be there. Also, ABCs were never used in production code. Most of what I've done, with help from @rocketinventor and with @ritiek's permission, is to strip back the unnecessary, improve spotDL's core functionality and enhance its simplicity. As I understand it, the interfaces.md document tells contributors all they need to stick to the interfaces, and ABCs don't enforce the interfaces; ABCs don't cede that much control.

I believe that if spotDL had hundreds of interfaces, so that manually checking for interface enforcement was not an option, ABCs would be sensible. I'd rather spare future contributors the trouble of handling the weird behaviors ABCs throw up.

If there are specific ways you believe ABC’s will help please do let us know.
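To make the trade-off concrete, here is a small sketch of what an ABC does and does not check. `SearchProvider` and the subclasses are hypothetical and are not taken from interfaces.md.

```python
from abc import ABC, abstractmethod


class SearchProvider(ABC):
    @abstractmethod
    def search(self, query):
        """Return a list of matches for query."""


class IncompleteProvider(SearchProvider):
    pass  # forgets to implement search()


class WrongSignatureProvider(SearchProvider):
    def search(self):  # implements search() but drops the 'query' parameter
        return []


# A missing method is caught, but only when the class is instantiated.
try:
    IncompleteProvider()
    missing_method_caught = False
except TypeError:
    missing_method_caught = True

# A wrong signature is NOT caught: this instantiates without complaint
# and only fails later, when someone calls search("some song").
wrong_signature_provider = WrongSignatureProvider()
```

So an ABC guards against forgotten methods but not against mismatched arguments or return types; those still depend on contributors following the documented interface.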

ghost · 2020-09-11T14:22:42Z

Hey @ritiek, could you please go through the code again?

Mikhail Zex added 16 commits August 9, 2020 12:39

Restructuring Ideas I

dc5f6ca

Fleshing out the ideas behind restructuring/recoding of spot-dl

Restructuring Ideas II

31c2e99

The basic ideas behind the restructuring attempt have been put down and will hopefully be updated as and when required. Now we can get straight to the code business.

Problems & Solutions I

bc597d1

Starting to work on the restructuring proper. You won't find any new code here, but then a little design before coding can go a long way. I do my design via markdown; you should find a lot of that here. 😁

Problems & Solutions V (+some code)

e1d391c

Fixed up some inconspicuous typos, ran some hacky tests, made minor changes to interface definitions and added some (in a way) working code.

Basic Logging Code

1f08101

Fixed some typos, updated working docs, did some tests on logging in ./Hacks and got a working and configured hierarchical logger set up.
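For readers unfamiliar with hierarchical loggers, a minimal sketch using the standard `logging` module. The logger names and the format string are assumptions for illustration, not the configuration from this commit.

```python
import logging


def configure_root(level=logging.INFO):
    """Configure the package-level logger once; children inherit from it."""
    root = logging.getLogger("spotdl")
    root.setLevel(level)
    if not root.handlers:  # avoid stacking duplicate handlers on repeat calls
        handler = logging.StreamHandler()
        handler.setFormatter(
            logging.Formatter("%(name)s | %(levelname)s | %(message)s")
        )
        root.addHandler(handler)
    return root


package_logger = configure_root()

# Dotted names build the hierarchy: these are children of "spotdl".
search_logger = logging.getLogger("spotdl.search")
download_logger = logging.getLogger("spotdl.download")

# No handler is attached here; the record propagates up to "spotdl".
search_logger.info("child loggers reuse the parent's handler and level")
```

Each module only asks for its own named logger, and handlers and levels are set in one place.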

Basic Authorization Code

bb63ab3

Nothing fancy here, just some typo fixes, guideline updates, design notes and recycled code.

Basic 'tools' Code (Not sure if it works yet)

700a99c

Restructured the Temp folder to resemble a Python package. Minor name changes, a few edits, lots of questions and untested code. I might even have a hacky implementation of the Metadata Search Interface.

Problems & Solutions VI (+ code edits)

ac112ac

Not much code this time around, was (re)figuring out the interfaces/object ideas, still have to change the docs to mirror the changes in code.

Problems & Solutions VII (+ hackey tests)

2bcd236

Added the genre lookup. Did the docstrings...

'Utils' sub-package almost complete

ef3b8c6

Some typos and a lot of code... ༼ つ ◕_◕ ༽つ (Yayy!!!)

Updated Helpers and reference responses

9e902c9

Was adding fresh functionality to spotifyHelpers.py and added a few more spotify-api responses to the REFS folder for reference

Trials on Song download

1c4a685

Looking into various libraries that can be used to download audio from YouTube, and also into the speedy format conversion issue

rocketinventor and others added 10 commits August 25, 2020 23:34

Updated YTM song-search code

2a9c346

Added "yt_music" class

235108d

Copied from the code I wrote for the original library. Includes "test" code.

Update yt_music search to specs

684fcde

Also updated objects.md with additional metadata suggestions.

Update yt_music search to specs

669503d

Also updated objects.md with additional metadata suggestions.

Updated ytmusic to show all results

65ade1b

Also includes (commented out) code to get metadata.

Merge branch 'reStructure/reCode' of https://github.com/Mikhail-Zex/s…

a7aa37f

…potify-downloader into reStructure/reCode

Filtering YTM Responses with a focus on song matching

ed565ce

YouTube Music's search response is a sprawling, over-nested JS object. This adds code to filter out unnecessary data from those messy responses; it can handle all of the common response structures.
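As an illustration of the general idea (not the filtering code from this commit), one simple way to pull the useful bits out of a deeply nested response is a recursive walk that collects every value stored under a key of interest. The sample response below is invented and much smaller than a real one.

```python
def collect_values(node, wanted_key):
    """Walk nested dicts/lists and return every value stored under wanted_key."""
    found = []
    if isinstance(node, dict):
        for key, value in node.items():
            if key == wanted_key:
                found.append(value)
            else:
                found.extend(collect_values(value, wanted_key))
    elif isinstance(node, list):
        for item in node:
            found.extend(collect_values(item, wanted_key))
    # Strings, numbers and None are leaves: nothing to collect.
    return found


# Invented stand-in for a nested search response.
sample_response = {
    "contents": {
        "sections": [
            {"items": [{"videoId": "abc123", "title": "Song A", "tracking": {"x": 1}}]},
            {"items": [{"videoId": "def456", "title": "Song B"}], "ads": ["..."]},
        ]
    }
}

video_ids = collect_values(sample_response, "videoId")
```

Because the walk does not depend on the exact nesting path, the same function keeps working when the surrounding structure differs between response types.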

More references for YTM, YTM data extraction fixed

83447a6

Looking around @rocketinventor's YTM code

29b25a2

Wrote up a partial YouTube Music based search provider based off @rocketinventor's original code. I'm sure his version will be better

cleaned up repo tree

d5c2e71

Mikhail Zex added 11 commits September 2, 2020 22:39

Multiprocess download Complete

1e08da8

mutagen issues

406bfdb

A dead simple CLI

384d163

Some users might even cringe at its simplicity

Final Push (I feelDead already)

0cca8e9

Got the CLI up and running Updated readme and stuff... I'm up for a longgg break.

Final

5a72c40

spotDL v3 rework

a482d7c

reworked search interface Complete

dab7b09

Much tighter, tidier search interface

Finished Search Utils + Cleanup

b0f8d42

I filled up PURPOSES.md for all of the code written, had to delete some very big functions (lots of effort wasted) and added a simple cmd line utility to ensure line count doesn't exceed 200 lines

downloader fixed (i guess)

5dcec82

Managing to get the progress bar to sync across multiple processes was a nightmare. The download should be more reliable and less breaky now. It still could do with a little rewriting. I'll do that asap.
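One common way to keep a progress bar consistent across processes, sketched here under assumptions and not necessarily how this commit does it, is to let workers push progress events onto a shared queue and have the parent process be the only one that aggregates and draws. All names below are hypothetical.

```python
import queue


def report_progress(progress_queue, song_id, percent):
    """Called inside a worker process: push one progress event."""
    progress_queue.put((song_id, percent))


class ProgressTracker:
    """Lives in the parent process and is the only owner of the bar state."""

    def __init__(self, song_ids):
        self.percent_by_song = {song_id: 0 for song_id in song_ids}

    def drain(self, progress_queue):
        """Apply every event currently waiting in the queue."""
        while True:
            try:
                song_id, percent = progress_queue.get_nowait()
            except queue.Empty:
                return
            # Guard against a late or repeated event moving a song backwards.
            previous = self.percent_by_song.get(song_id, 0)
            self.percent_by_song[song_id] = max(previous, percent)

    def overall_percent(self):
        """Average completion across all tracked songs."""
        if not self.percent_by_song:
            return 0.0
        return sum(self.percent_by_song.values()) / len(self.percent_by_song)
```

With real worker processes the queue would be something like `multiprocessing.Manager().Queue()` handed to each worker; a plain `queue.Queue` is enough to try the tracker in a single process.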

Downloaders rewritten

b0dd771

overview helper scripts cleaned up and fixed

3642064

Mikhail Zex added 7 commits September 12, 2020 22:01

Fixed some nitpicky error + audio normalization

c1579aa

There were oddball cases with inexplicable errors and fluctuation (pumping) of audio loudness. All of those are fixed.

Fixed lethal nitpicky errors + refactors + partial docs

6483427

There were a few 1-line errors that could grind the download process to a halt. Fixed those. Refactored songObj -> SongObj across the codebase to keep in line with the naming convention. Started working on the docs.

some more nitpicky errors + partial docs

fff92c5

EDM-like songs get their last 1-2 seconds clipped/cut/deleted by ffmpeg. We counter this here.

An ad hoc CLI

e7987e9

ad hoc CLI usage comments?

f0ba2a9

CLI error fix

c157e96

Working Packaged Stuff - Finally ! ༼ つ ◕_◕ ༽つ

696174c

You can use this now.

ghost marked this pull request as ready for review September 16, 2020 16:53

ghost requested a review from ritiek September 16, 2020 16:53

Mikhail Zex added 7 commits September 16, 2020 22:30

Create README.md

f46596a

fixed errors due to pyTube3 Patch

2d5cc57

Also added error handling for those rare cases where you don't get a YouTube match at all. P.S. It's ironic that a 'patch', something that fixes errors, actually causes them, but then, whatever...

fixed wrong song len error

d19f093

Songs used to display the wrong song length, but no more, see comment at spotdl\download\downloader.py:145 for more details.

Created CONTRIBUTING.md

c169911

A fresh CONTRIBUTING.md and minor README changes to notify potential contributors of the same.

minor fix

174d29f

Update README.md

9cc6f75

Merge branch 'master' of https://github.com/spotDL/spotify-downloader …

34da1aa

…into reStructure/reCode

ghost merged commit f7e9d4b into spotDL:master Sep 30, 2020

This pull request was closed.
