# talks

## upcoming talks

None currently scheduled. Last updated on 2022-10-16.

## past talks

**Estimating network-mediated causal effects via spectral embeddings**

2022-10-14 @ 4:15 pm in MSC 1210, Statistics Graduate Student Association Seminar

The last several years have seen a renewed and concerted effort to incorporate network data into standard tools for regression analysis, and to make network-linked data legible to practicing scientists. Thus far, this literature has primarily developed tools to infer associative relationships between nodal covariates and network structure. In contrast, we augment a statistical model for network regression with counterfactual assumptions and show how causal effects on a network can be partitioned into a direct effect that is uninfluenced by the network, and an indirect effect that is induced by homophily. slides (for a general audience)

**Estimating indirect effects induced by homophily via spectral network regression** 2022-07-07, Tianxi Li and Can Le Joint Lab Meeting

The last several years have seen a renewed and concerted effort to incorporate network data into standard tools for regression analysis, and to make network-linked data legible to practicing scientists. Thus far, this literature has primarily developed tools to infer associative relationships between nodal covariates and network structure. In contrast, we augment a statistical model for network regression with counterfactual assumptions and show how causal effects on a network can be partitioned into a direct effect that is uninfluenced by the network, and an indirect effect that is induced by homophily. slides (for an audience familiar with network regression)

`distributions3`

: From basic probability to probabilistic regression

2022-06-23, UseR 2022

Achim Zeileis, Moritz Lang and Alex Hayes

The `distributions3`

package provides a beginner-friendly and lightweight interface to probability distributions. It allows to create distribution objects in the S3 paradigm that are essentially data frames of parameters, for which standard methods are available: e.g., evaluation of the probability density, cumulative distribution, and quantile functions, as well as random samples. It has been designed such that it can be employed in introductory statistics and probability courses. By not only providing objects for a single distribution but also for vectors of distributions, users can transition seamlessly to a representation of probabilistic forecasts from regression models such as GLM (generalized linear models), GAMLSS (generalized additive models for location, scale, and shape), etc. We show how the package can be used both in teaching and in applied statistical modeling, for interpreting fitted models, visualizing their goodness of fit (e.g., via the `topmodels`

package), and assessing their performance (e.g., via the `scoringRules`

package). video, slides

**The Low Hanging Fruit of the Twitter Following Graph**

2021-08-11, Joint Statistical Meetings

Alex Hayes and Karl Rohe

In recent applied work on the Twitter media ecosystem, we have found that Twitter metadata (such as follows, likes, quotes, retweets, mentions, etc) is often more informative than the actual content of tweets themselves. The metadata, in some sense, is the right data to use for many inference tasks. In particular, we find that embedding the Twitter following graph is highly informative. However, collecting the following graph is rather challenging due to API rate limits, and storing graphs can also be challenging. We present some computational infrastructure to make access and storage of this high signal data more straightforward, and suggest that research progress would be well served by an increased focus on instrumentation. slides

**Solving the model representation problem with broom**

2019-01-25, `rstudio::conf(2019)`

The R objects used to represent model fits are notoriously inconsistent, making data analysis inconvenient and frustrating. The broom package resolves this issue by defining a consistent way to represent model fits. By summarizing essential information about fits in tidy tibbles, broom makes it easy to programmatically work with model objects. Combining broom with list-columns results in an especially powerful way to work with many model fits at once. This talk will feature several case studies demonstrating how broom resolves common problems in data analysis. video, slides

**Solving the model representation problem with broom**

2018-09-19, Statistics Graduate Student Seminar

slides

**Convenient data analysis with broom**

2018-11-30, RStudio Webinar Series

Broom is a package that converts statistical objects into tibbles. This consistent structure makes it easier to accomplish many standard modelling tasks. In this webinar I’ll demonstrate how to use to broom to work with many models at once. We’ll see how broom makes it easier to visualize models, work with bootstrapped fits and assess model diagnostics. video, slides

**Solving the model representation problem with broom**

2018-09-19, Madison R User Group

slides

## slides from various informal presentations

**Identifiability of homophily and contagion in social networks**

2022-02-23, Madison Networks Reading Group

slides

**Triangles & networks**

2021-02-17, STAT 992 Seminar on Tensors

slides

**A new way to think about citations**

2020-11-17, Rohe Lab Group Meeting

slides

**The linear probability model**

2019-11-19, STAT 992 Seminar Course presentation

slides

**rstudio internship progress update**

2018-07-23, RStudio tidyverse team

slides

**Locally Interpretable Model-Agnostic Explanations**

2018-03-29, Rice DataSci club

slides

**Your First R Package**

2018-02-22, Rice DataSci club

slides