Skip to content

Latest commit

 

History

History
47 lines (33 loc) · 2.57 KB

File metadata and controls

47 lines (33 loc) · 2.57 KB

Data notes for analysis

Caveats and gotchas to keep in mind when analyzing Project Sidewalk data — historical conditions in specific releases that affect how certain columns should be interpreted. If you're working with a database dump or the public data API, check whether any of these apply to the time range you're analyzing.

This is an append-only log: when a release changes data semantics or fixes a data bug with lasting analytical impact, add a dated, version-tagged entry here (newest first) so future analysts aren't caught out.

Data caveats by release

v7.8.5 — audit_task.task_start corrected

Before v7.8.5, the task_start column in audit_task was incorrectly set to the session start time rather than that individual task's start time. It's correct going forward. Existing rows were back-filled to a close approximation using the timestamp of the TaskStart event in audit_task_interaction — but only ~90% of audit_task rows have a corresponding TaskStart event, so the remaining ~10% are still approximate. The same fix was applied to the DC database (which is no longer collecting new data).

v7.5.0 (May 2022) — gallery interaction tables gained user_id

Before v7.5.0, the gallery_task_interaction and gallery_task_environment tables had no user_id column, so rows from before then can't be tied to a specific user. Added in #2895.

Separately, a small number of labels have a temporary_label_id that doesn't match any such ID in the audit_task_interaction logs — most do match. Background in #2154 (comment).

v2 — anonymous users and audit_task.completed

Up to the v2 release, records for anonymous users weren't marked as complete, so the completed column of audit_task is unreliable for anonymous contributions from that era. Discussion in #403.

Gallery screenshot tips

Console snippets for staging clean label screenshots in the Gallery's expanded card view:

// Hide the label icon/marker in the expanded view.
sg.cardContainer.getModal().pano.labelMarker.marker.setVisible(false);

// Inspect the marker's current heading/pitch, then reposition it.
sg.cardContainer.getModal().pano.labelMarker.marker.getPosition();           // current { heading, pitch }
sg.cardContainer.getModal().pano.labelMarker.marker.setPosition({ heading: newH, pitch: newP });