Skip to content
This repository was archived by the owner on May 27, 2024. It is now read-only.
This repository was archived by the owner on May 27, 2024. It is now read-only.

Remove duplicate stops from TNC-to-GTFS transform #10

@barbeau

Description

@barbeau

Is your feature request related to a problem? Please describe.
PR #9 is implementing a method to convert TNC trips into GTFS format to do analysis using existing GTFS-based tools.

Many of these TNC trips, especially those to a census tract centroid, have duplicate locations. Right now a new stop in stops.txt is used for each of these entries.

Describe the solution you'd like

It would be nice to de-duplicate these entries, so all visits to the same latitude and longitude use the same stop_id.

It looks like onebusaway-gtfs-modules supports this:
http://developer.onebusaway.org/modules/onebusaway-gtfs-modules/1.3.4-SNAPSHOT/onebusaway-gtfs-merge-cli.html

...as part of the "fuzzy" match solution. It would be nice to run this code via the Java objects in our code, or trigger this post conversion automatically. Perhaps using a command-line parameter.

Describe alternatives you've considered

Alternately, we could do our own deduplication in our code (e.g., using HashSets), but then we also need to efficiently handle holding references to all stops in memory.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions