This set of tools will parse .dat
files from the KLRS Weather stations into a set of JSON files. Each sensor will become a separate datastream named datastream-<COLUMN NAME>.json
, with the records for that column in datastream-<COLUMN NAME>-records.json
.
These files can then be imported using node-dataservice
or another tool.
There are two tools in this repository, verify
and convert
. These cover the "Extract" and "Transform" steps of an ETL, and "Load" is handled by the node-dataservice tool.
Install Ruby, preferably Ruby 2.1.4 or newer. Then use the verify
tool to check the .dat
file:
$ bin/verify data/source/.../TOA5_5264.FiveMin.dat
File: data/2012/September/TOA5_5264.FiveMin.dat
Columns: 9
Column 1: TIMESTAMP
units: TS
stats:
readings: 0
Column 2: RECORD
units: RN
stats:
readings: 8286
Example: 2012-08-04 17:10 0
...
This will validate the input file conforms to the expected schema before running the convert
tool.
The convert
tool will generate intermediary JSON files with the data and metadata to upload to the RPI Data Service:
$ bin/convert data/source/.../TOA5_5264.FiveMin.dat
This will create a set of JSON files in the current directory. One JSON file for each column (except TIMESTAMP column), and one JSON file for the records in each column.
The datastream JSON files can be fed into node-dataservice
:
$ cd node-dataservice
$ bin/data-service create datastream <userUID> <sensorUID> < ~/.../datastream-COLUMN.json
And the command for loading observation records:
$ cd node-dataservice
$ bin/data-service create records <datastreamUID> < ~/.../datastream-COLUMN-records.json
MIT License
James Badger jpbadger@ucalgary.ca