
Map Matcher

Brief introduction

For training (note that this used to be done in Python in the mapfiles directory, but has now been moved here):

  1. Generate image pairs. A pair can either be matching or disjoint (a sketch of the generation loop follows this list).

  2. For a matching pair:

    • Find a random small area of the training basemap with at least X features large enough to be distinctive for your kernel size.
    • Shift the area in a random direction by a small random amount (a small percentage of the area size).
    • If the shifted area and the original area have enough large features in common, then draw both images. Otherwise try again with a different area.
  3. For a disjoint pair:

    • Find a random small area of the training basemap with at least X features large enough to be distinctive for your kernel size.
    • Find another random small area of the training basemap.
    • If the two areas have any features in common, don't draw them. Otherwise draw!
  4. At the end of training dataset generation, you should have a set of image pairs, and a CSV like this:

image1Path, image2Path, matches
image1.png, image2.png, True
image3.png, image4.png, False
...
  5. Run python3 classifyPrecision.py
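
As a rough illustration of steps 1–4, the sketch below generates matching and disjoint pairs from a basemap modelled as a list of feature bounding boxes and writes a CSV like the one above. All of the names and thresholds here (AREA_SIZE, MIN_FEATURES, draw_pair, and so on) are assumptions made for the sketch, not the repository's actual API, and the drawing step is stubbed out.

# Minimal sketch of the pair-generation loop. The basemap is modelled as a
# list of feature bounding boxes (x, y, w, h); every constant and helper name
# below is an assumption for illustration only.
import csv
import random

AREA_SIZE = 500          # assumed size of a training area in basemap units
MIN_FEATURE_SIZE = 20    # "large enough to be distinctive for your kernel size"
MIN_FEATURES = 5         # the "at least X features" threshold
MAX_SHIFT = 0.2          # shift by at most 20% of the area size

def features_in(area, features):
    ax, ay = area
    return [f for f in features
            if ax <= f[0] <= ax + AREA_SIZE and ay <= f[1] <= ay + AREA_SIZE]

def distinctive(feats):
    return [f for f in feats if f[2] >= MIN_FEATURE_SIZE and f[3] >= MIN_FEATURE_SIZE]

def random_area(extent):
    return (random.uniform(0, extent - AREA_SIZE), random.uniform(0, extent - AREA_SIZE))

def shifted(area):
    dx = random.uniform(-MAX_SHIFT, MAX_SHIFT) * AREA_SIZE
    dy = random.uniform(-MAX_SHIFT, MAX_SHIFT) * AREA_SIZE
    return (area[0] + dx, area[1] + dy)

def draw_pair(a, b, index):
    # Placeholder: the real code renders the basemap for each area to a PNG.
    return f"image{2 * index + 1}.png", f"image{2 * index + 2}.png"

def generate_pairs(features, extent, n_pairs, csv_path):
    rows = []
    while len(rows) < n_pairs:
        matching = random.random() < 0.5
        a = random_area(extent)
        if len(distinctive(features_in(a, features))) < MIN_FEATURES:
            continue  # not enough large features; try a different area
        b = shifted(a) if matching else random_area(extent)
        shared = set(features_in(a, features)) & set(features_in(b, features))
        if matching and len(distinctive(list(shared))) < MIN_FEATURES:
            continue  # shifted area no longer shares enough large features
        if not matching and shared:
            continue  # "disjoint" areas must have no features in common
        img1, img2 = draw_pair(a, b, len(rows))
        rows.append((img1, img2, matching))
    with open(csv_path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["image1Path", "image2Path", "matches"])
        writer.writerows(rows)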

Then for searching:

  1. Given a PDF map of an area, split it up into 128 * 128 pixel tiles so that they fit on the GPU with a decent batch size (a tiling sketch follows this list).
  2. Offset the grid, then draw the paper map tile grid again.
  3. For the search area, draw 128 * 128 tiles, making sure to keep roughly the same scale.
  4. Offset the search area grid and do the same again.
  5. Run multiGridSearch.py, which matches each PDF tile against all of the search tiles, and does the same with the offset grids.
  6. In theory, this should identify clusters of matched areas. Given that you know the arrangement of the PDF tiles relative to each other, you should hopefully be able to spot the part of the search area that the PDF map covers.
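
A rough sketch of the tiling in steps 1–4, assuming the PDF has already been rendered to a raster image (a NumPy array here). The tile size constant, the half-tile offset used for the second grid, and the function name are assumptions for the sketch, not taken from the repository's scripts.

# Minimal tiling sketch: yield every full 128x128 tile of an image, optionally
# from a shifted grid origin to handle tile-boundary effects.
import numpy as np

TILE = 128

def split_into_tiles(image, offset=(0, 0)):
    """Yield (row, col, tile) for every full 128x128 tile of `image`."""
    oy, ox = offset
    height, width = image.shape[:2]
    for y in range(oy, height - TILE + 1, TILE):
        for x in range(ox, width - TILE + 1, TILE):
            yield y // TILE, x // TILE, image[y:y + TILE, x:x + TILE]

# Usage: build both grids for the rendered PDF map (and likewise for the search area).
pdf_image = np.zeros((1024, 1024), dtype=np.uint8)   # stand-in for the rendered PDF
main_grid = list(split_into_tiles(pdf_image))
offset_grid = list(split_into_tiles(pdf_image, offset=(TILE // 2, TILE // 2)))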

Features of classifyPrecision

classifyPrecision has two modes. The first searches for the set of parameters (evaluated after a low number of epochs) that gives the best precision while still fitting on your GPU. Once that combination is found, the second mode trains a model until it stops improving against a validation dataset, with a set epoch patience and after lowering the learning rate.
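
A minimal sketch of what the second mode's training policy looks like (early stopping after an epoch patience, plus lowering the learning rate when validation precision stalls), shown with PyTorch and a dummy model and dataset. The patience values, the metric, and the overall structure are assumptions about what classifyPrecision does, not its actual code.

# Early stopping with patience + ReduceLROnPlateau, on a stand-in model/dataset.
import torch
from torch import nn

model = nn.Sequential(nn.Flatten(), nn.Linear(16, 2))        # stand-in for the CNN classifier
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
scheduler = torch.optim.lr_scheduler.ReduceLROnPlateau(optimizer, mode="max", factor=0.1, patience=3)
loss_fn = nn.CrossEntropyLoss()

x_train, y_train = torch.randn(64, 4, 4), torch.randint(0, 2, (64,))
x_val, y_val = torch.randn(32, 4, 4), torch.randint(0, 2, (32,))

best_precision, epochs_without_improvement, patience = 0.0, 0, 5
for epoch in range(100):
    model.train()
    optimizer.zero_grad()
    loss = loss_fn(model(x_train), y_train)
    loss.backward()
    optimizer.step()

    model.eval()
    with torch.no_grad():
        preds = model(x_val).argmax(dim=1)
        true_pos = ((preds == 1) & (y_val == 1)).sum().item()
        precision = true_pos / max((preds == 1).sum().item(), 1)

    scheduler.step(precision)                 # lower the learning rate when precision stalls
    if precision > best_precision:
        best_precision, epochs_without_improvement = precision, 0
        torch.save(model.state_dict(), "best_model.pt")
    else:
        epochs_without_improvement += 1
        if epochs_without_improvement >= patience:
            break                             # stop once it no longer improves on validation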

Both of these steps are designed to run on a personal desktop computer, so both support stopping and picking up where they left off (see the checkpointing sketch below).
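
One way the stop-and-resume behaviour could look, again assuming PyTorch; the checkpoint path and the saved fields are assumptions, not classifyPrecision's actual on-disk format.

# Minimal checkpointing sketch: save enough state to resume training later.
import os
import torch

CHECKPOINT = "training_checkpoint.pt"

def save_checkpoint(model, optimizer, epoch, best_precision):
    torch.save({"model": model.state_dict(),
                "optimizer": optimizer.state_dict(),
                "epoch": epoch,
                "best_precision": best_precision}, CHECKPOINT)

def load_checkpoint(model, optimizer):
    if not os.path.exists(CHECKPOINT):
        return 0, 0.0                                   # nothing saved yet: start from scratch
    state = torch.load(CHECKPOINT)
    model.load_state_dict(state["model"])
    optimizer.load_state_dict(state["optimizer"])
    return state["epoch"] + 1, state["best_precision"]  # pick up where it left off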

Parameters

The results of the parameter grid search favour this parameter combo:

combo = {
    'nlayers': 4,
    'downSample': None,
    'leaky_cnn': True,
    'leaky_classifier': True,
    'base_channels': 32, 
    'padding': 0,
    'classifier_layers': 4,
    'classifier_hidden': 32,
    'learning_rate': 1e-3
}

And if that doesn't fit on your GPU, this is also a good parameter combo:

combo = {
    'nlayers': 3,
    'downSample': None,
    'leaky_cnn': True,
    'leaky_classifier': False,
    'base_channels': 16, 
    'padding': 0,
    'classifier_layers': 4,
    'classifier_hidden': 128,
    'learning_rate': 1e-3
}
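
For illustration, here is one way a combo like the above could be turned into a model. How each key is interpreted (for example that base_channels doubles per convolutional layer, that the two tiles are stacked as input channels, or that downSample is simply ignored when set to None) is an assumption made for this sketch, not a reading of the repository's code.

# Hypothetical model builder driven by a parameter combo like the ones above.
import torch
from torch import nn

def build_model(combo, in_channels=2, tile_size=128):
    act_cnn = nn.LeakyReLU() if combo["leaky_cnn"] else nn.ReLU()
    act_clf = nn.LeakyReLU() if combo["leaky_classifier"] else nn.ReLU()

    # Convolutional feature extractor: channels assumed to double each layer.
    layers, channels = [], in_channels
    for i in range(combo["nlayers"]):
        out_channels = combo["base_channels"] * (2 ** i)
        layers += [nn.Conv2d(channels, out_channels, kernel_size=3, padding=combo["padding"]),
                   act_cnn,
                   nn.MaxPool2d(2)]
        channels = out_channels
    cnn = nn.Sequential(*layers)

    # Work out the flattened feature size by running a dummy tile pair through the CNN.
    with torch.no_grad():
        flat = cnn(torch.zeros(1, in_channels, tile_size, tile_size)).numel()

    # Fully connected classifier head.
    clf_layers, width = [], flat
    for _ in range(combo["classifier_layers"] - 1):
        clf_layers += [nn.Linear(width, combo["classifier_hidden"]), act_clf]
        width = combo["classifier_hidden"]
    clf_layers.append(nn.Linear(width, 1))   # single logit: matching vs disjoint
    return nn.Sequential(cnn, nn.Flatten(), *clf_layers)

model = build_model(combo)                   # either combo above works here
optimizer = torch.optim.Adam(model.parameters(), lr=combo["learning_rate"])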

Future work

Training + grid generation

  • Training + testing images
  • When generating the training/test dataset, make sure that the sharedPixelArea matches the model kernel size.
  • Generate training data using Manchester, Leeds and Sheffield, then validate against Birmingham.
  • Calculate a "paint coverage" factor rather than using the number of features (see the sketch after this list).
  • Test the current best model against its own validation dataset to see which ones it got wrong. Might show what the problem is!
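
A rough sketch of one way the "paint coverage" factor mentioned above could be computed, as the fraction of a rendered tile's pixels that are not background. This illustrates the idea only and is not existing repository code.

# Hypothetical paint-coverage measure for a grayscale tile.
import numpy as np

def paint_coverage(tile, background_value=255):
    """Fraction of pixels in a grayscale tile that contain drawn map features."""
    return float(np.count_nonzero(tile != background_value)) / tile.size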

Grid matching

  • When matching, once a few tiles have been matched up, it would be more efficient to start the search for the next tile at, and around, the tiles where the initial clusters are.
  • If two tiles are blank, skip matching them? Or just skip classifying, and match.
  • Offset the main grid and search again using that - this will help at tile boundaries.
