Skip to content

About us

Visit our new site at https://nyhackr.org/

Meet with other users of the open-source programming language R. Previously, this meetup focused only on the R language, but is now focused on all open-source data analysis tools; including but not limited to, Python, Julia, C++, Stan, etc.

Learn and share tricks and techniques from and with other users. Beginners to advanced users are all welcome.

Upcoming events

1

See all
  • $7.00
    Special k: The Science (or Art) of Determining the Optimal k in Cluster Analysis

    Special k: The Science (or Art) of Determining the Optimal k in Cluster Analysis

    Kimball Hall, 246 Greene St, New York, NY, US

    External registration required at nyhackr.

    This month we have Jason Bryer talking about clustering.

    Thank you to NYU for hosting us.

    Everybody attending must RSVP through the registration form at nyhackr. There is a charge for in-person and virtual tickets are free.

    Space is extremely limited and in-person registration closes at 3 PM the day of the talk.

    About the Talk:
    Cluster analysis is a statistical procedure for grouping observations using an observation-centered approach as compared to variable-centered approaches (e.g. PCA, factor analysis). As an unsupervised method true cluster membership is usually not known. Hence, determining the optimal number of clusters, or k, poses unique challenges. A review of six common metrics for determining k with several clustering methods using two data sets will be explored. An introduction to two bootstrapping fit statistics will be provided along with validation techniques for evaluating the validity and stability of the cluster results across bootstrap samples.

    About Jason:
    Dr. Jason Bryer is currently an Assistant Professor and Associate Director in the Data Science and Information Systems department at the City University of New York. He is currently the Principal Investigator of the FIPSE ($3 million #P116F150077) and IES funded ($3.8 million R305A210269) Diagnostic Assessment and Achievement of College Skills (DAACS), which is a suite of technological and social supports designed to optimize student learning. Dr. Bryer’s other research interests include quasi-experimental designs with an emphasis on propensity score analysis, data systems to support formative assessment, and the use of open source software for conducting reproducible research. He is the author of over a dozen R packages, including three related to conducting propensity score analyses.

    The venue doors open at 6:30 PM America/New_York where we will continue enjoying pizza together (we encourage the virtual audience to have pizza as well). The talk, and livestream, begins at 7:00 PM America/New_York.

    Remember, register at nyhackr.

    • Photo of the user
    1 attendee

Group links

Organizers

Members

13,864
See all