papers

I primarily study networks using tools from multivariate analysis.

My work with Karl Rohe has focused on fast approaches to spectral estimation via sparse linear algebra. Our primary project together has been developing a method for network co-factor analysis for settings with missing edge data, which we applied to a large network of citations between statistics papers. We have spent some time working to understand effective regularization strategies for spectral estimators, as well as developing diagnostic tools for PCA and varimax rotation. As part of an outgrowth of Karl’s murmuration project I developed extensive infrastructure to sample and analyze the Twitter following graph (see code), and Twitter data remains near and dear to my heart.

I am currently working with Keith Levin on causal interpretations of network regression, and with Jiwei Zhao on semi-parametric inference for experimental guardrails in a data fusion setting.

Previously at Facebook, I developed internal tooling to understand post content using neural hypergraph embeddings. At Facebook I also developed a diagnostic to assess the out-of-sample reliability of rolling classifiers based on differential calibration of the classifier over time.

I keep Google Scholar up to date, and post any research related code to Github (personal, lab).

publications

  1. Welcome to the Tidyverse. Hadley Wickham, Mara Averick, Jennifer Bryan, Winston Chang, Lucy D’Agostino McGowan, Romain François, Garrett Grolemund, Alex Hayes, Lionel Henry, Jim Hester, Max Kuhn, Thomas Lin Pedersen, Evan Miller, Kirill Müller, David Robinson, Dana Paige Seidel, Vitalie Spinu, Kohske Takahashi, Davis Vaughan, Claus Wilke, Kara Woo, Hiroaki Yutani. Journal of Open Source Software, 2019. pdf

Last updated on 2022-04-27.