October 4, 2013
Many types of real-world messy data. Mostly in tabular form, usable for machine learning
It would be good to see far PySpark has come, and how ready is is for production use. I'd love to hear from others that've successfully integrated PySpark into their workflow / data pipelines.
I like speakers and Q&A. I've already setup the system and will need more time (than a meetup) to implement something more advanced.
Machine Learning and data science engineer.