Enabling Python to Become a Better Big Data Citizen - Wes McKinney

Details

#ODSC meetup group is co-hosting this event with The New York Python Meetup Group (https://www.meetup.com/nycpython/).

Enabling Python to Become a Better Big Data Citizen - Wes McKinney

The Python ecosystem has long struggled to interoperate with the Apache Hadoop and Spark ecosystems because of architectural issues around JVM-Python interoperability and the high cost of moving data between processes. Despite this, Python has been used extensively, if in a limited role, for processing streams of serialized data sent via UNIX pipes or other means. In this talk, Wes McKinney explains current efforts to enable pandas and other third-party Python libraries to be used in a more native and performant way within big data computation frameworks such as Apache Spark, Apache Impala (incubating), and Apache Drill, as well as with storage projects such as Apache Kudu (incubating) and Apache Parquet.

Sponsors:

Open Data Science Conference (http://odsc.com/)

Cloudera (https://cloudera.com/)

New York Accelerate AI (AIx)
ODSC Office
394 Broadway, 6th floor · New York, NY 10013