Enabling Python to Become a Better Big Data Citizen - Wes McKinney

Name: Enabling Python to Become a Better Big Data Citizen - Wes McKinney
Start: 2016-02-17T19:00:00-05:00
End: 2016-02-17T21:30:00-05:00
Location: ODSC Office

Hosted by New York Accelerate AI (AIx)

New York Accelerate AI (AIx)

Details

http://photos1.meetupstatic.com/photos/event/1/4/0/1/600_442085121.jpeg

#ODSC meetup group is co-hosting this event with The New York Python Meetup Group (https://www.meetup.com/nycpython/).

Enabling Python a Become a Better Big Data Citizen - Wes McKinney

The Python ecosystem has long struggled with interoperability with the Apache Hadoop and Spark ecosystems due to architectural issues around JVM-Python interoperability and the high cost of moving data between processes. In spite of that, Python has been used extensively as a limited tool for processing streams of serialized data sent via UNIX pipes or other means. In this talk, Wes McKinney explains current efforts to enable pandas and other 3rd-party Python libraries to be used in a more native and performant way within big data computation frameworks like Apache Spark, Apache Impala (inc), and Apache Drill, as well as with storage projects like Apache Kudu (inc) and Apache Parquet.

Sponsors:

Open Data Science Conference (http://odsc.com/)

Cloudera (https://cloudera.com/)

New York Accelerate AI (AIx)

Enabling Python to Become a Better Big Data Citizen - Wes McKinney

New York Accelerate AI (AIx)

Details

Related topics

You may also like