Good blog post. In general, there
is only an issue here because the Taxi Driver data was published without
inviting driver(s) to opt-in to being tracked for a day, week, month, year,
or eternity. Whether one thinks taxi drivers are public or private
individuals performing a service for a fee, it is better for everyone in
society if we apply common courtesy and ask them if they mind monitoring
before even using their not-so-well-anonymized data.
Tell them what we are doing. Show
them how. Ask for permission. Record the Opt-in and filter
out the Opt-outs.
Motto: "Do First, Think, Do it Again"
Nicklin <[address removed]>
[betaNYC] A Day in the Life of a New York City Taxi
The tl;dr version is that
we need to be looking beyond the immediate privacy implications at the
broader issues it raises for open data and freedom of information.
On Tue, Jul 15, 2014 at 3:08
PM, Jeremy Barth <[address removed]>
Chris, I'm curious what your
views are on the de-anonymization issue? Over in computer security
among many), the NYC taxi data dump is being treated as a privacy fiasco,
precisely because of the poorly-executed anonymization.
On Mon, Jul 14, 2014 at 10:01
AM, Chris Whong <[address removed]>
Thanks for the comments.
The data don't include the make/model of the vehicle. Since
the medallions are de-anonymizable, I wonder if you could determine the
make and model from the medallion somehow (I think this is a stretch).
I like the traffic density
idea, but I am not sure how I could get that kind of data on-demand for
a given taxi trip, especially since these are historic. Good idea
On Mon, Jul 14, 2014 at 9:54
AM, Steve C <[address removed]>
Great visualization! I like Steve's environmental
layer suggestion (my background is air pollution control / air quality)... a
- Using Google traffic data (is it easily
accessible?) to see the percentage of time a random taxi's estimated route
is in a given traffic density (green, yellow, red)
- Distinguish b/w type of vehicle (is it
possible to know whether it is hybrid/conventional and make/model?) to
then estimate tailpipe emissions over the course of the day and route (I
can help with this if needed)
- Air sensors mounted on taxis and vehicles
is inevitable in my opinion, and that's when things will get really interesting
On Mon, Jul 14, 2014 at 8:55
AM, Steven Hodas <[address removed]> wrote:
Ditto, it's fantastic.
On Mon, Jul 14, 2014 at 8:08
AM, Florent Peyre <[address removed]> wrote:
Pretty awesome visualization
On Jul 14, 2014, at 6:46 AM, Chris Whong <[address removed]>
On Mon, Jul 14, 2014 at 6:44
AM, Chris Whong <[address removed]> wrote:
I'm excited to share this
little side project that I've been working on with the FOIL'd NYC
taxi data. It
offers a glimpse into a single 24 hour period, for a single taxi, and tallies
up fares, taxes, and passengers.
I've already seen some interesting
things, such as unexpected gaps during prime service hours for some taxis
(why? Long lunch breaks? Shift Change?) as well as some
pretty big variance in daily earnings.
I hope you like it! Blog
post coming on the tech recipe, there's a lot going on under the hood.
BetaNYC envisions an informed and empowered public who can leverage technology, design, and data to address their problems and hold government accountable.
We are dedicated to civic technology, smart communities, & open government. We are a nonpartisan good government community using technology, design, and data to empower individuals and local communities, provides for an honest and efficient government, and build a civically‐engaged ecosystem.
BetaNYC is building a connected NYC, by the people, for the people, for the 21st Century.
Noel Hidalgo, Executive Director
Community Leaders & Co-Organizers - Benjamin Arancibia, Emily Goldman, Hayley Richardson, Kara Chesal, Lauren Rennee, Lucio Tolentino, Terrance Beckett, Volkan Unsal, and Yasi Razvan
## Important Community Resources
• TALK.beta is NYC's online home to civic technology, open data, and open government.