With the signal group (sg) selector service, it is possible to match traffic lights to routes. This is done by matching linestring geometries (MAP topologies) of traffic lights to the route geometry. In our application we often use the term LSA synonymously for SG/Traffic Light. LSA means Lichtsignalanlage in German.
The service provides matching for OSM and DRN. For DRN see: /~https://github.com/priobike/priobike-graphhopper-drn
If you (re)use our work, please cite both papers:
@inproceedings{matthes2022matching,
title={Matching Traffic Lights to Routes for Real-World Deployments of Mobile GLOSA Apps},
author={Matthes, Philipp and Springer, Thomas},
booktitle={2022 IEEE International Smart Cities Conference (ISC2)},
pages={1--7},
year={2022},
doi={10.1109/ISC255366.2022.9922560},
organization={IEEE}
}
@inproceedings{matthes2023geo,
title={GeoAI-Powered Lane Matching for Bike Routes in GLOSA Apps},
author={Matthes, Philipp and Springer, Thomas and Jeschor, Daniel},
booktitle={The 31st ACM International Conference on Advances in Geographic Information Systems (SIGSPATIAL '23)},
pages={1--4},
year={2023},
doi={10.1145/3589132.3625583},
organization={ACM}
}
This matching is achieved using above filtering pipeline. The filtering pipeline consists of these steps:
- Proximity matching – Exclude all signal groups that are too far away from the route.
- Bearing matching - Exclude all signal groups that have too much angle difference with regards to the route.
- Length matching - Exclude all signal groups which can not be projected onto the route without a too big length difference.
- Overlap matching - Under the remaining signal groups, find overlaps and decide for the better matches.
- Adding the crossings that are not connected (have no MAP-Topologies) from this dataset: https://metaver.de/trefferanzeige?docuuid=C498DEED-985C-11D5-889E-000102B6A10E
This filtering pipeline is defined as a hypermodel and tuned by Optuna on a training dataset. The best configuration is available with a RESTful API.
This matching is achieved using a machine learning model, as shown above. This filtering includes the following steps:
- Proximity matching – Exclude all signal groups that are too far away from the route.
- ML matching – Extract features for each MAP topology with regards to the route, and make a binary classification ("match" or "no match").
- Overlap Cleanup – Detect overlaps and only select the topologies with the highest class probability for "match".
- Adding the crossings that are not connected (have no MAP-Topologies) from this dataset: https://metaver.de/trefferanzeige?docuuid=C498DEED-985C-11D5-889E-000102B6A10E
- Build and run the development setup
Run the development setup with
docker-compose up
On the very first startup, this will download the base Docker images and build the containers (can take some time). We provide example data so that you can understand our data format and try it out for yourself. This data is inherited from the Urban Data Platform Hamburg, which uses a FROST API. With the running docker-compose setup, this data is included in the PostGIS (running in the background) - see backend/run-preheating.sh.
- Send a request to the REST Endpoint at POST /routing/select
Select signal groups along a given route. The body of the POST request should contain a route as follows:
{
"route": [
{ "lon": <longitude>, "lat": <latitude>, "alt": <altitude> },
...
]
}
Perform an example request with the example preset route:
curl --data "@backend/data/priobike_route_ost_west.json" 'http://localhost:8000/routing/select'
To select the OSM-specific algorithmic matcher, use:
curl --data "@backend/data/priobike_route_ost_west.json" 'http://localhost:8000/routing/select?matcher=legacy&routing=osm'
To select the DRN-specific algorithmic matcher, use:
curl --data "@backend/data/priobike_route_ost_west.json" 'http://localhost:8000/routing/select?matcher=legacy&routing=drn'
To select the OSM-specific machine learning matcher, use:
curl --data "@backend/data/priobike_route_ost_west.json" 'http://localhost:8000/routing/select?matcher=ml&routing=osm'
To select the DRN-specific machine learning matcher, use:
curl --data "@backend/data/priobike_route_ost_west.json" 'http://localhost:8000/routing/select?matcher=ml&routing=drn'
Results are in the following structure:
{
"route": [
{
"lon": 9.990909,
"lat": 53.560863,
"alt": 9.99,
"signalGroupId": "hamburg/271_14",
"distanceOnRoute": 0,
"distanceToNextSignal": 80.0591490111176
},
[...],
{
"lon": 9.978001,
"lat": 53.564378,
"alt": 20.22,
"signalGroupId": null,
"distanceOnRoute": 1011.180699223152,
"distanceToNextSignal": null
}
],
"signalGroups": {
"hamburg/271_14": {
"label": "hamburg/271_14",
"position": {
"lon": 9.9902099,
"lat": 53.5614479
},
"bearing": 305.47283546104495,
"geometry": [
[
9.9902099,
53.5614479
],
[...]
[
9.9895951,
53.561724
]
],
"id": "hamburg/271_14",
"lsaId": "271_14",
"connectionId": "14",
"laneType": "Radfahrer",
"datastreamDetectorCar": null,
"datastreamDetectorCyclists": null,
"datastreamCycleSecond": "12864",
"datastreamPrimarySignal": "12399",
"datastreamSignalProgram": "9579"
},
[...]
},
"crossings": [
{
"name": "Rentzelstraße / An Der Verbindungsbahn",
"position": {
"lon": 9.978001,
"lat": 53.564378
},
"connected": true
},
]
}
Response Structure:
- route: The waypoints of the route, with the signal groups and the current distance to the next signal, if exists.
- signalGroups: A more detailled dictionary with information about all the signal groups, including datastream (FROST) ids.
- crossings: A list of intersections along the route, including intersections that are not connected (and have no MAP-Topologies).
In our app, we initially only supported routing based on OpenStreetMap (OSM) data. For this we also developed and studied our two matching approaches (algorithmic and ML). During the evaluation we came to the conclusion that routes based on OSM-data contain a lot of routing errors with respect to a cycling-specific-routing. Later we implemented routing based on an other data source than OSM. Specifically we used the so called Digitales Radverkehrsnetz (DRN) Hamburg providing us a much more detailed and correct representation of the cycle paths in Hamburg. With the new data source we were able to achieve much better results with respect to a cycling-specific-routing which manifests itself, for example, in the following points:
- Available cycle paths are used a lot more
- Fewer unnecessary and incorrect detours on or off the cycle path
As a result the routes based on DRN differ from the routes based on OSM. Since our two matching algorithms were tuned/trained on the characteristics of OSM-routes in comparison to the turn topologies, we also re-tuned/-trained them for the new DRN-based routes. To choose the routing specific matching approaches, append the query parameter routing=osm
/routing=drn
to the /select
-endpoint (as shown here: Response Format)
Note: the following benchmarks utilize the complete dataset. The ML model is trained on a part of this dataset.
Cleanup, build and run the container and enter the shell:
docker-compose down -v && \
docker-compose up -d --no-deps backend && \
docker exec -it sg-selector-backend /bin/bash
Stay in the shell and prepare the data we need for experiments:
cd backend && \
poetry run python manage.py load_constellations && \
poetry run python manage.py load_errors
Keep in mind that switching between benchmarks requires starting from above, to ensure that no data is mixed up.
After the run_analysis
command is executed, it will print the benchmark score to the command line as the profiler runs through the routes. After the run is finished, results w.r.t routing errors and route-lane constellations are stored in backend/backend/analytics/logs
.
Used for the benchmark in the paper.
poetry run python manage.py load_lsas_from_file_old_json_format --path ../data/lsadata_hamburg.json && \
poetry run python manage.py load_example_routes ../data/example_routes_osm.json && \
poetry run python manage.py load_bindings ../data/bindings_old/ && \
poetry run python manage.py run_analysis --route_data osm_old
This should result in:
Profiling algorithm topo-osm-2022-trained-on-osm-2022 ... (148 routes)
TP: 920, FP: 207, FN: 123
Precision: 0.82
Recall: 0.88
F1: 0.847926267281106
and
Profiling algorithm topo-osm-2022-trained-on-osm-2023 ... (148 routes)
TP: 908, FP: 221, FN: 135
Precision: 0.80
Recall: 0.87
F1: 0.8360957642725597
and
Profiling algorithm ml-osm-2022-trained-on-osm-2022 ... (148 routes)
TP: 936, FP: 57, FN: 107
Precision: 0.94
Recall: 0.90
F1: 0.919449901768173
poetry run python manage.py load_lsas_from_file --path ../data/sgs-2023-01-11T14_30_50.004510.json && \
poetry run python manage.py load_example_routes ../data/example_routes_osm.json && \
poetry run python manage.py load_bindings ../data/bindings_osm/ && \
poetry run python manage.py run_analysis --route_data osm
This should result in:
Profiling algorithm topo-osm-2023-trained-on-osm-2022 ... (52 routes)
TP: 615, FP: 159, FN: 143
Precision: 0.79
Recall: 0.81
F1: 0.8028720626631854
and
Profiling algorithm topo-osm-2023-trained-on-osm-2023 ... (52 routes)
TP: 637, FP: 136, FN: 121
Precision: 0.82
Recall: 0.84
F1: 0.8321358589157413
and
Profiling algorithm ml-osm-2023-trained-on-osm-2022 ... (52 routes)
TP: 614, FP: 52, FN: 144
Precision: 0.92
Recall: 0.81
F1: 0.8623595505617978
Note: The OSM benchmark data has no routing errors or constellations marked.
poetry run python manage.py load_lsas_from_file --path ../data/sgs-2023-01-11T14_30_50.004510.json && \
poetry run python manage.py load_example_routes ../data/example_routes_drn.json && \
poetry run python manage.py load_bindings ../data/bindings_drn/ && \
poetry run python manage.py run_analysis --route_data drn
This should result in:
Profiling algorithm topo-drn-2023-trained-on-drn-2023 ... (49 routes)
TP: 655, FP: 95, FN: 83
Precision: 0.87
Recall: 0.89
F1: 0.8803763440860215
and
Profiling algorithm ml-drn-2023-trained-on-drn-2023 ... (49 routes)
TP: 676, FP: 24, FN: 62
Precision: 0.97
Recall: 0.92
F1: 0.9401947148817803
This library is available under MIT License. Contributions are welcome. Here is our current progress:
- Publish usable library with our topologic feature matching pipeline
- Incorporate Python scripts for random route generation
- Polish and push management commands for hypermodel training and testing
- Add Route Composer web application which is used for training dataset generation
- Include experimental matching approaches (ML-Model, Dijkstra, Probabilistic)
- Advance experimental ML-based feature matching approach
Planned research on this topic is completed. Currently we focus on using the DRN dataset for routing and re-tuning our matching approaches.
After running docker-compose up (in debug-mode such that the additional subcomponents also are runnning) under the following URLs sites are accessible:
- Frontend for the composer-backend (to create the ground truth dataset): http://localhost:3000/composer?route_id=1
- Composer-Overview for statistics of all the samples as well as functions to perform a check whether all the samples in the database as well as in the .json-files (e.g. for persistent storate in repo) are the same: http://localhost:3000/composer/overview
- Frontend for the demo-backend: localhost:3000/demo?route_id=999&matcher=ml
- Frontend for the jiggle_vis-backend: http://localhost:3000/jiggle_vis?route_id=1
- Frontend for the projection_vis-backend: http://localhost:3000/projection_vis?route_id=1&lsa_id=2182&method=extended
In the route composer we used Carto (CARTO Basemaps Terms of Service: https://drive.google.com/file/d/1P7bhSE-N9iegI398QYDjKeVhnbS7-Ilk/view) in combination with OpenStreetMap (available under the Open Data Commons Open Database License, https://www.openstreetmap.org/copyright) for the map tiles.
This module is split into two components, the backend
service that selects signal groups and the frontend
service that is used to visualize the algorithms of the backend
service. Additionally, the frontend
service is used to manually create route-sg-mappings which can be used to validate the backend
service. To realize this, the backend
service provides additional subcomponents to communicate with the frontend
via REST endpoints. However, these components are stripped in the deployed backend
service since they are only used for development.
The routing
-subcomponent (backend/backend/routing
) is the only one being used in production and thus deployed.
analytics
(backend/backend/analytics
)- Used to analyze different matching approaches.
- With the management command (see)
run_analysis.py
(backend/backend/analytics/management/commands/run_analysis.py
) different approaches can be set as values in thestrategies
-dict and thus tested against the whole ground truth dataset.
composer
(backend/backend/composer
)- Used to create the ground truth dataset.
demo
(backend/backend/demo
)- Used to test the matching approaches visually with some example routes.
jiggle_vis
(backend/backend/jiggle_vis
)- Used to try out a jiggle-/data-augmentation-feature visually with different MAP topologies.
ml_evaluation
(backend/backend/ml_evaluation
)- Used to evaluate different ML-models, feature subsets, projection methods and feature transformation methods as well as to perform hyperparameter tuning.
- Thus it contains for example management commands for the following things:
- Train models.
- Tune hyperparameters.
- Generate dataset with features from ground truth with labeled route and MAP topology samples.
- Compute feature selection metrics such as transinformation, correlations and timings and perform RFE.
- Thus it contains for example management commands for the following things:
- Configs for different configurations of features, feature transformations and other things can be found here:
backend/backend/ml_evaluation/configs
- Those are also referenced in all kind of scripts and thus perform in dependence of the things set there.
- To use a config in production it needs to be copied to
backend/backend/routing/matching/ml/configs_production
. Additionally, the created and to the configs corresponding models (ML models and feature transformation models) need to get copied frombackend/backend/ml_evaluation/models
tobackend/backend/routing/matching/ml/models
. The seperation has been done, because during evaluation a lot of configs and models can get created which would take up a lot of space in production and also make it very convoluted.
- Used to evaluate different ML-models, feature subsets, projection methods and feature transformation methods as well as to perform hyperparameter tuning.
projection_vis
(backend/backend/projection_vis
)- Used to visualize different route-segment-selection strategies
- Currently there exist two projection strategies and one alternative strategy. The strategies are implemented here: (
backend/backend/routing/matching/projection.py
)
Help us improving this documentation. If you have any problems or unclarities, feel free to open an issue.