Carter Butts, Sociology @ UC Irvine
Daniel McFarland, Sociology @ Stanford
Padhraic Smyth, Computer Science @ UC Irvine
Example: Interactions among students
The problem of interest for today's talk involves a set of entities and events among them occurring over time. Each of these events can be called edges or relational events. One example we will return to throughout today's talk will be one where the nodes represent students in a particular classroom and the edges represent direction acts of communication. The individuals might be distinguished by some covariate, such as age, as represented by the different colors here. One of the most fundamental goals is to characterize the differential propensity to interact, and better understandhow these differences are associated with known covariates. For example, we might want to ask whether interactions between same-gender students are more likely and, furthermore, how this varies by context. To answer these types of questions, today we will discuss a statistical model for interaction in small groups of people. ----- The problem of interest for today's talk involves a set of entities and events among them occurring over time. Each event represents some interaction directed from one entity to another. I might refer to these entities as nodes, vertices, or individuals. Each of these events can be called edges or relational events. One example we will return to throughout today's talk will be one where the nodes represent students in a particular classroom and the edges represent direction acts of communication. The individuals might be distinguished by some covariate, such as age, as represented by the different colors here. This type of interaction data is of particular interest to Sociologists who study social networks and the dynamics of small groups. One of the most fundamental goals is to characterize the differential propensity to interact, and better understandhow these differences are associated with known covariates. For example, we might want to ask whether interactions between same-gender students are more likely and, furthermore, how this varies by context. To answer these types of questions, today we will discuss a statistical model for interaction in small groups of people. I will be talking about some of the challenges that arise and how we propose to solve them.
Aggregate over time and analyze the weighted network
(Holland, JASA 1981) (Feinberg, JASA 1985)
Time 1 $p(Y_1 | \theta_1)$ |
Time 2 $p(Y_2 | \theta_2)$ |
Time t $p(Y_t | \theta_t)$ |
rate of interaction $(i,j)$ at time $t$
vector of covariates about $(i,j)$ at time $t$
Model rate of each edge using previous history
Assume event rates change only when events occur.
Rate of kth observed event
Represents the fact that no event occurred between event k-1 and event k
For more info see (Butts, 2008).
$\lambda_{ij}(t) = \exp\{\beta'X_{ij}(t)\}$
$\log \lambda_{ij}(t) = \beta_0 + \beta_1 X_{ij1} + \beta_2 X_{ij2} + \beta_3 X_{ij3} + \beta_4 X_{ij4} + \beta_5 X_{ij5} + \beta_6 X_{ij6}$
$\log \lambda_{ij}(t) = \beta_0 + $ $\beta_1 X_{ij1}$$ + \beta_2 X_{ij2} + \beta_3 X_{ij3} + \beta_4 X_{ij4} + \beta_5 X_{ij5} + \beta_6 X_{ij6}$
$\log \lambda_{ij}(t) = \beta_0 + \beta_1 X_{ij1} + $$\ \beta_2 X_{ij2}$$ + \beta_3 X_{ij3} + \beta_4 X_{ij4} + \beta_5 X_{ij5} + \beta_6 X_{ij6}$
$\log \lambda_{ij}(t) = \beta_0 + \beta_1 X_{ij1} + \beta_2 X_{ij2} +$ $\ \beta_3 X_{ij3}$$ + \beta_4 X_{ij4} + \beta_5 X_{ij5} + \beta_6 X_{ij6}$
$\log \lambda_{ij}(t) = \beta_0 + \beta_1 X_{ij1} + \beta_2 X_{ij2} + \beta_3 X_{ij3} + \beta_4 X_{ij4} + \beta_5 X_{ij5} + \beta_6 X_{ij6}$
Additive effects that model turn-taking in conversation (Gibson, 2003)
$\log \lambda_{ij}(t) = \beta_0 + \beta_1 X_{ij1} + \beta_2 X_{ij2} + \beta_3 X_{ij3} + $ $\ \ \beta_4 X_{ij4}$$ + \beta_5 X_{ij5} + \beta_6 X_{ij6}$
E.g. $X_{ij4}$ immediately after event (12,15)
$\log \lambda_{ij}(t) = \beta_0 + \beta_1 X_{ij1} + \beta_2 X_{ij2} + \beta_3 X_{ij3} + \beta_4 X_{ij4} + $ $\ \ \beta_5 X_{ij5}$$ + \beta_6 X_{ij6}$
E.g. $X_{ij5}$ immediately after event (12,15)
$\log \lambda_{ij}(t) = \beta_0 + \beta_1 X_{ij1} + \beta_2 X_{ij2} + \beta_3 X_{ij3} + \beta_4 X_{ij4} + \beta_5 X_{ij5} + $ $\ \ \beta_6 X_{ij6}$
E.g. $X_{ij6}$ immediately after event (12,15)
$\log \lambda_{ij}(t) = \beta_0 + \beta_1 X_{ij1} + \beta_2 X_{ij2} + \beta_3 X_{ij3} + \beta_4 X_{ij4} + \beta_5 X_{ij5} + \beta_6 X_{ij6}$
E.g. Entire matrix $\lambda$ immediately after event (12,15)
$\lambda_{ij}(t_k)$ |
$(i_k,j_k)$ |
Parameters are Gaussians drawn from upper level
For each classroom $j$ and effect $p$:
$\begin{align}Y_j \sim& \mbox{REM}(\theta_j,\mathbf{X}_j) \\ \theta_{jp} \sim& \mbox{Normal}(\mu_p,\sigma_p^2) \\ \mu_p, \sigma_p \propto & 1/\sigma_p \end{align}$
Simulated data:
$\begin{align} p(\theta,\mu,\sigma | \mathbf{Y}, \mathbf{X}) \propto & \prod_{j=1}^J \ p(Y_j|\theta_j,\mathbf{X}_j) \prod_{p=1}^P \ p(\theta_{jp}|\mu_p,\sigma_p) p(\mu_p,\sigma_p)\end{align}$
At each time $t$, rank $\lambda_{ij}(t)$
Precision @ k: Proportion of the next events that were ranked higher than k
Unnormalized distribution of interest: $g(\Theta)$
$$ \begin{align} \pi(\Theta_1,\ldots,\Theta_J) \propto& \prod_{j=1}^J h_j(\Theta_j) \\ h_j(\Theta_j) \propto & \exp \{-g(\Theta_j)/t_j \} \end{align} $$ where $t_0 < \cdots < t_J$ and $h_j(\Theta_j)$ is the target distribution for chain $j$.
Swap between chains $j$ and $k$ at iteration $t$ with acceptance probability: $$\begin{align} \min \left\{ 1, \frac{h_j(\Theta_{k}^{(t)})h_{k}(\Theta_{j}^{(t)})}{h_j(\Theta_j^{(t)})h_{k}(\Theta_{k}^{(t)})} \right\} \end{align} $$
(Geyer 1991), (Madras 2003)
Individual effects
Event effects
Participation shifts
Autocorrelation
Event context
Individual effects
Event effects
Participation shifts
Autocorrelation
Event context
Individual effects
Event effects
Participation shifts
Autocorrelation
Event context
Individual effects
Event effects
Participation shifts
Autocorrelation
Event context
Individual effects
Event effects
Participation shifts
Autocorrelation
Event context
$D(y,\theta) = -2\log p(y|\theta)$
$p_D = \frac{1}{L} \sum_{l=1}^L D(y,\theta^l) - D(y,\hat{\theta})$
$DIC = \frac{1}{L} \sum_{l=1}^L D(y,\theta^l) + p_D$
(Spiegelhalter,2002)
A | B | C | D | E | F | |
Covs. + Recency + Pshift | 413775 | 419375 | 463227 | 484837 | 418384 | 463451 |
Covs. + Recency | 489450 | 527203 | 546694 | 595583 | 526441 | 544513 |
Individual mixing | X | X | X | |||
Edgewise effects | X | X | X | |||
Broadcast effects | X | X | X | X | ||
Interaction w/ event context | X | X |
Special thanks to Carter Butts and Padhraic Smyth.
$\begin{align}\frac{\theta_{jp} - \mu_{p}}{\sigma_p} \sim& t_{\nu}\\ \nu \sim & \mbox{Exponential}(r)\end{align}$
$\mu_{p} = X_j \beta_p$
$\begin{align} \mu_p \sim & \mbox{Normal}(\rho,\tau)\\ \sigma_p \sim & \mbox{Gamma}(\alpha,\beta) \end{align}$
$\begin{align} \hat{\theta}_j =& \frac{y_{.j}/\sigma_j^2 + \mu/\tau^2}{1/(\sigma_j^2+\tau^2)} \\ \hat{v}_j =& \frac{1}{1/(\sigma_j^2+\tau^2)} \\ \hat{\mu} =& \frac{\sum_j \frac{1}{1/(\sigma_j^2+\tau^2)} y_{.j}}{\sum_j \frac{1}{1/(\sigma_j^2+\tau^2)}}\\ y_{.j} | \mu, \tau \sim & N(\mu, \sigma_j^2 + \tau^2) \\ p(\theta|\mu,\tau,y) \sim& N(\hat{\theta}_j, \hat{v}_j) \\ p(\mu,\tau|y)=&\frac{p(\mu,\tau,\theta|y)}{p(\theta|\mu,\tau,y)} \forall \theta \\ \end{align} $
$p(\tau|y) \propto p(t) v_{\mu}^{1/2} \prod_j (\sigma_j^2 + \tau^2)^{-1/2} \exp(-(y_{.j}-\mu)^2/(2(\sigma_j^2+\tau^2)))$
Only a big effect on AB-BA. Reference prior putting weight on smaller sigmas. Gamma(2,20) a strong prior, but AB-BA resisting.
Under strong prior, some of the lower level effects $\theta_{jp}$ become more pronounced.