Skip to content

An evening with Dr. Leland Wilkinson (Visualization for large structured data)

Photo of Pramit Choudhary
Hosted By
Pramit C.
An evening with Dr. Leland Wilkinson (Visualization for large structured data)

Details

Hi Everyone,
Hoping everyone is keeping safe. Working on planning the next discussion with Dr. Leland Wilkinson. This will be a virtual event

Topic:
Computing a Distance-preserving Matrix Sketch Algorithm to Enable Visualizations of Large Rectangular Datasets

Description:
Suppose we wish to explore visually an n by p matrix of real numbers where n and p are quite large (say, n ~10^9 and p ~ 10^4). We present a new algorithm for subsetting data matrices that makes this exploration feasible. We select a subset of rows and columns of X_np : X_np -> X[a,b]_mk, where m < < n and k < < p and a is a row index array of length m and b is a column index array of length k. We restrict our selection of X_mk to be distance-preserving, where distances between the rows of X_mk are linearly related to the distances between the corresponding rows of X_np

BIO:
Leland Wilkinson is Chief Scientist at H2O and an Adjunct Professor of Computer Science at the University of Illinois Chicago. He received an A.B. degree from Harvard in 1966, an S.T.B. degree from Harvard Divinity School in 1969, and a Ph.D. from Yale in 1975. Wilkinson wrote the SYSTAT statistical package and founded SYSTAT Inc. in 1984. After the company grew to 50 employees, he sold SYSTAT to SPSS in 1994 and worked there for ten years on research and development of visualization systems. Wilkinson subsequently worked at Skytree and Tableau before joining H2O.
https://en.wikipedia.org/wiki/Leland_Wilkinson
https://www.cs.uic.edu/~wilkinson/

Wilkinson is a Fellow of the American Statistical Association, an elected member of the International Statistical Institute, and a Fellow of the American Association for the Advancement of Science. He has won the best speaker award at the National Computer Graphics Association and the Youden prize for the best expository paper in the statistics journal Technometrics. He has served on the Committee on Applied and Theoretical Statistics of the National Research Council and is a member of the Boards of the National Institute of Statistical Sciences (NISS) and the Institute for Pure and Applied Mathematics (IPAM). In addition to authoring journal articles, the original SYSTAT computer program and manuals, and patents in visualization and distributed analytic computing, Wilkinson is the author (with Grant Blank and Chris Gruber) of Desktop Data Analysis with SYSTAT. He is also the author of The Grammar of Graphics, the foundation for several commercial and opensource visualization systems (IBMRAVE, Tableau, Rggplot2, and Bokeh).
https://www.amazon.com/Grammar-Graphics-Statistics-Computing/dp/0387245448

In collaboration with NUMFOCUS, h2o.ai
https://numfocus.org/
https://www.h2o.ai/

Code of Conduct:
https://pydata.org/code-of-conduct/

SPONSORS:
NUMFOCUS
h2o.ai

Photo of PyData SoCal group
PyData SoCal
See more events