Sydney Data Eng meetup, April Edition


Details
We've got another exciting in-person edition this month at the MSFT Reactor.
The meetup is at 5pm on Wednesday 12th April.
Schedule:
- 5pm: doors open
- 5.30pm: announcements and welcome
- 5.40pm: 1st Talk
- 6.10pm: 2nd Talk
- 6.40pm: Networking
- 7.10pm: doors close
Speakers:
π€ Ann Clark, AI Ontologist, Nearmap
Talk Title: Unlocking the Power of Definition Management with Ontology and Graph Technology
Talk Summary: Discussion of using graph structures to document truths about the livable world in both human and machine readable ways, from a knowledge and vocabulary management lens.
Speaker Bio: Ann has about a decade of experience managing master data, semantic metadata, and enterprise technical knowledge in the energy and commercial data sectors. She earned a Master's in Library and Information Science from the University of Arizona in the US, and has lived in Sydney since 2022.
π€ Simon Aubury, Principal data engineer, Thoughtworks
Talk Title: That looks weird! Exploring Mastodon user behaviour with Kafka & DuckDB
Talk Summary: Mastodon is a decentralized social networking platform. Users are members of a specific Mastodon instance, and servers are capable of joining other servers to form a federated social network. To understand user behaviour in a distributed data system - you need a distributed data processing system!This talk describes the tools & techniques for data collection and data processing. How Apache Kafka, DuckDB and Seaborn are used to perform exploratory data analysis of user activity, server popularity and language usage. Exposing the surprising behaviour and trends from a distributed analysis of a federated social network.
Speaker Bio: Data Geek at ThoughtWorks
π Platform Host: DataEngBytes - https://www.youtube.com/dataengau
π Catering: Cloud Shuttle π
π¬ Join our Slack Group here: https://goo.gl/forms/DVNazDmNBg1FFm2X2
Remember to bring along some great questions!
COVID-19 safety measures

Sponsors
Sydney Data Eng meetup, April Edition