Past Meetup

Talk 1: Housing Valuation | Talk 2: Distilling Media Personas

This Meetup is past

107 people went

The Metro @ thebridge

#01-07, Ascent, 2science Park drive · Singapore

How to find us

Metro Room (just by the taxi stand, and opposite the bar area)

Location image of event venue


For August, we are happy to invite 2 speakers:

1. Pei Ying Chua from LinkedIn, to speak about using data science in housing valuation

2. Chin Hui Han from DSO, to speak about understanding an organization through its media publications


Talk 1: Data science in the absence of complete information and tools—case study on house valuation

In the ideal world, we would receive data that is clean, harmonized, and complete. However this rarely happens; in fact, most of the data we work with requires intensive review and cleaning before we can use it accurately and effectively. This case study provides a real-life example of how publicly available datasets were cleaned and harmonized in order to conduct a house valuation exercise. Emphasis is placed on how one can use the core principles and methods of data science in the absence of comprehensive data sources and fancy analytic tools.


Pei Ying is an experienced data science lead with a demonstrated history of working in the tech space. She holds a Masters in Neuroscience from Cambridge University and a Masters in Management from Harvard. Starting out in computational neuroscience, she pivoted to lead behavioral psychology, data science, and economics departments in large internet companies such as Grab and LinkedIn. Outside of her professional work, she also believes in infusing data-driven approaches in all aspects of life, including cooking and shopping.


Talk 2: What is the SAF thinking? Understanding an Organization Through its Publications.

In the media age, both organizations and individuals curate online personas. The goal is to gain insights into the Singapore Armed Forces by performing data analytics on its official online publications. This talk will be in 3 sections: the data engineering process and hacks of building a corpus of SAF online publications, the data quirks and its impact on data analytics, and the insights gained such as the possibility of “group thinking” using topic modeling and trends analysis. There will be a Jupyter demo of the data pipelines used.


Hui Han is a senior system engineer with DSO National Laboratories. A computer scientist by training, Hui Han holds a Bachelor’s and Master’s Degree in Computer Science from Carnegie Mellon University, USA. For work, he leads a team to develop data engineering and machine learning systems for the Singapore Armed Forces. Outside of work, he tinkers with data science problems and ponders about the computational universe.



- 1830 - 1900: Networking [casual; small talk]
- 1900 - 2000: Housing Valuation with Data Science [core]
- 2000 - 2100: Understanding an Organization Through its Media Publications [core]


The Metro @ thebridge
(just by the taxi stand, and opposite the bar area)
2 Science Park Drive


- Opens: 18 July 2018 (Wed), 10:00 PM
- Closes: 31 July 2018 (Tue), 10:00 PM

Please note of our new attendance policy ( Attendees are REQUIRED to RSVP to the event to be allowed into the venue. People who are neither in the "going" nor "wait" lists will be turned away. Please be considerate and update your RSVP if you are not able to make it.