What we're about

DataPhilly is a community run group for anyone interested in gaining insights from data. Topics include (but are not limited to) predictive analytics, applied machine learning, big data, data warehousing and data science. We <3 data!

Join our google group to help plan future events! https://goo.gl/up9OlT
Send us a message: http://goo.gl/RvxB6J !
Found a space we can use for future meetups? https://goo.gl/Ru0eth
Found a speaker for an upcoming meetup? https://goo.gl/9DJxq0
Found a sponsor for our events? https://goo.gl/JLVfqh

Upcoming events (2)

March Speaker Series: Data Science at Scale

Seer Interactive

Thanks to this month's sponsors Seer Interactive (https://www.seerinteractive.com) and Iqvia (https://www.iqvia.com/) for their generous support of DataPhilly! We couldn't make DataPhilly happen without their help. If you're interested in sponsoring future events please fill out our form at https://goo.gl/JLVfqh This month we have two excellent talks on Data Science at Scale. Annmarie Stockinger will be giving a talk on "Forecasting at Scale" and Sharath Bennur will be giving a talk on "deploying ML services at scale". **Forecasting at Scale** Predictions and projections are hard. This talk reviews how I built out a scalable system for forecasting marketing data that is usable for business applications and audiences. You'll leave this talk with a deeper understanding of forecasting methods and tactics for implementing forecasting on your own. *About Annmarie* Annmarie is a Data Science Manager at Seer Interactive committed to bringing data science methods to clients in a usable and reproducible way. Annmarie wears many hats from client side communication to pipelining and loves finding new ways to make marketing more data-driven. **ML Services at Scale** A majority of data science projects fail to make it into production. Some common reasons include an inability to scale the models, lack of robust code and processes and insufficient infrastructure around the machine learning. A combination of newer technologies like Kubernetes and Airflow, along with better processes and software engineering best practices can make it significantly easier to deploy ML services at scale. An overview of our learnings around scaling machine learning at enterprise scale will be presented. *About Sharath* Sharath Bennur is ML lead at Iqvia, where he builds machine learning services for a number of applications. He’s passionate about how ML is created and consumed within organizations. He also wears a complementary ML architect hat.

Open Science and Bioinformatics? April R-Ladies & DataPhilly event

Please join us April 16th to learn about the open science movement and how it is impacting the traditionally closed world of biology research. We are excited to be hosting Olga Botvinnik – a bioinformatics scientist and advocate for open data and open science. She will be sharing some insights from her research and also her career experiences more generally. This event is a collaboration between R-Ladies Philly and DataPhilly - no background in bioinformatics is needed! ==Estimated Agenda== 6:00: Food and casual networking (refreshments generously provided by Vistar Media) 6:30: Intro to the Childhood Cancer Data Lab by Jaclyn Taroni, PhD 6:45: "If I wasn't such a mess" by Olga Botvinnik, PhD - followed by time for Q&A and additional conversations Olga Botvinnik is a bioinformatics scientist at Chan Zuckerberg Biohub, a non-profit biomedical research institute. She is a genomics expert interested in a “grand unified theory of cells,” by applying computer science and machine learning algorithms to biological data, especially interested in sequencing weird creatures such as ticks. She holds two S.B. degrees, one in Mathematics and one in Biological Engineering from the Massachusetts Institute of Technology. She also holds a M.S. in Bioinformatics and Biomolecular Engineering from UC Santa Cruz, and she completed her education with a PhD in Bioinformatics and Systems Biology from UC San Diego. She is a NumFOCUS John Hunter Technical Fellowship and NDSEG Fellowship recipient. She runs a weekly Twitch livestreaming channel called “Bioinformatics Beyonce” that showcases real bioinformatics work such as genome assembly, open-source software and cloud computing, and interviews with scientists about their research and day-to-day work. Open science, open data, open source. Jaclyn Taroni, PhD is a data scientist at the Childhood Cancer Data Lab, an Alex’s Lemonade Stand Foundation initiative located right here in Philadelphia (https://www.ccdatalab.org). The CCDL supports open science and specifically cancer research by creating tools to make biological big data easier to access, mine, and reuse. ==Getting there== The event is hosted by our friends at Azavea – 990 Spring Garden St. You can enter via the north side of the building where there is a security guard. Take the elevator to the 5th floor. For public transit and other information about the building: https://www.azavea.com/directions/ ==About our sponsors== Vistar Media: On the Vistar Media engineering team, we never stop looking for our next addition. We’re kinda hoping that addition is you. We are located in Old City, Philadelphia. We're a startup with an established, well-tested codebase, but we aren't afraid to shake things up. We want to solve our problems with the right tools, whether they’re cutting-edge, or tried-and-true. We're happy to explore new territory (who isn't?), but we won't jump on something shiny for the sake of it. Our engineers work with many different languages and platforms every day. If this gets you excited, we'd love to hear from you. https://www.vistarmedia.com/open-positions Azavea is a B Corporation that creates civic geospatial software for the web. We build custom application; develop data analytics; and manage several open source projects (GeoTrellis, Raster Vision, Raster Foundry, and others). All of our work is aimed at advancing the state of the art in geospatial technology and applying it for civic, social, and environmental impact. We work in many domains, including: elections, planning, water, transportation, climate change and land conservation. Azavea is currently recruiting for an Operations Engineer (more at https://careers.azavea.com/) as well as for two fellowship programs, Summer of Maps and our Open Source Fellowship.

Past events (79)

Global Diversity CFP Day!


Photos (72)

Find us also at