Thanks Andrew for posting about NYC 311 data.
I believe this is the correct link to start.
But when I sort by created_date on the socrata site it shows entries from 2003. The title says 2010 to present. More data is better so I'm only asking because when I download as CSV. I only receive 2 million records (actually, 2,117,547) but if you run a count(*) query using Socrata using this link:
You receive over 7 million records from Socrata via the API
Is the download limited?
For example, I'd like to select year(created_date) and group by that year but there seems to be no way to do this with the Socrata query language.
The YEAR function is supported in BigQuery so a query such as
SELECT count( Complaint_Type ), Complaint_Type, YEAR(Created_Date) as year1 FROM [nyc.311] group by Complaint_Type,year1 order by 1 desc
is possible and easy.
And it returns: Heating complaints are the top complaint and the download data from Socrata only goes back to
Query Results2:56pm, 7 May 2014
|2||100650||GENERAL CONSTRUCTION||2013|| |
|4||84084||Street Light Condition||2013|| |
|6||70594||PAINT - PLASTER||2013