Magic: The Gathering

This project contains code to scrape and parse various sources of MTG data.

Deckstats

TBC

Logic flow:

HTTP-triggered Cloud Function to scrape decklists.
- Uses Pyppeteer as JS-rendered.
- Scraped raw HTML stored in GCS
- Adds single Cloud Task that references the stored GCS file
Cloud Task triggers AppEngine to parse the HTML
- Parsed decklist CSV stored in GCS bucket
The GCS storage event triggers another Cloud Function
- Adds a Cloud Task for each individual deck in the list
Cloud Tasks trigger AppEngine to fetch-parse-write each individual deck
- Uses plain-ol' JSoup as standard HTML
- Raw HTML stored in GCS (at Fetch step)
- Parsed deck CSV stored in GCS bucket

Notes:

Some of the above bit clunky - no real reason to use AppEngine at all - but some code already existed, so just reused.
Currently only handles "containsCard" use case i.e. searching for decks with specific commander card

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
bq		bq
functions		functions
src		src
.gitignore		.gitignore
README.md		README.md
mtg-init.sh		mtg-init.sh
pom.xml		pom.xml