Clustering On Unstructured Data

Hosted By
Josh P. and Pavan .
Details

With more and more unstructured data (images, PDFs, etc.) being generated, it is becoming increasingly important to extract insights and summaries from that data via clustering. This talk discusses modern approaches such as vector databases, transformers and experimentation stacks that provide the foundation for Relevance AI's unstructured data platform.

About the Speaker
Jacky Wong is the founding data scientist at Relevance AI, the unstructured data experimentation platform that currently serves millions of users across the construction, gaming and education industries. Before Relevance AI, he worked across WooliesX, partnered with organisations like Salesforce and ranked highly (top 5%) in a number of data science competitions hosted by Google, Atlassian and EY, spanning natural language processing, tabular data, geospatial prediction and image processing.

***

Event Contingencies:

  1. Where possible, we will try to host "on-site" events, which include some food and drinks and networking time. We will also try to host the presentation and question time as an "online" component for these events.
  2. We are set up to host "online only" events when it isn't safe to host "on-site" events.

***

Online Events:

We will be using YouTube streaming to host this event, so you can ask questions through live chat. Depending on the speaker's preference, we may make the recording available shortly after the event has finished for those unable to attend.

***

Connect with PyData Sydney Community:

Twitter:
https://twitter.com/pydatasydney

LinkedIn:
https://www.linkedin.com/company/pydata-sydney

Slack:
https://pydatasydney.slack.com/

Invite link:
https://join.slack.com/t/pydatasydney/shared_invite/zt-uk6n6vkk-wmJjMYZ1BFREZYM7MijXFA
(DM our socials if we forget to update an expired link)

***

About PyData:

PyData is an educational program of NumFOCUS, a 501(c)(3) non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R.

NumFOCUS Code of Conduct: https://numfocus.org/code-of-conduct

  • PyData is dedicated to providing a harassment-free meeting experience for everyone, regardless of gender, sexual orientation, gender identity and expression, disability, physical appearance, body size, race, or religion.
  • We do not tolerate harassment of meeting participants in any form.
  • All communication should be appropriate for a professional audience including people of many different backgrounds.
  • Sexual language and imagery is not appropriate for any conference venue, including talks.
  • Be kind to others.
  • Do not insult or put down other attendees.
  • Behave professionally.
  • Remember that harassment and sexist, racist, or exclusionary jokes are not appropriate for PyData.