Epilepsy

Epilepsy affects over 50 million individuals worldwide, making it one of the most common neurological disorders.

Manual detection of seizures consumes significant time and resources from experienced medical experts in the epilepsy unit.

How can we create a better system?

Our Solution

We combine graph neural networks (GNNs) with self-supervised learning (SSL), with the goal of fully automating seizure detection.

Why GNNs?

GNNs model graph structures, which represent a collection of objects (nodes) and their relationships (edges).

GNNs are perfect for modelling the brain, given its many complex interconnections, whether between individual neurons or entire brain regions. GNNs allow us to harness the interconnectedness between different sites where we collect our data (e.g., EEG electrodes).

This is demonstrated in recent work from our lab, Graph representations of iEEG data for seizure detection with graph neural networks, which shows a clear advantage over other deep learning methods such as convolutional neural networks (CNNs). For more details about GNNs, I’ll have several blog posts in the coming weeks.

The Data

We used the open-access OpenNeuro ds003029 dataset, featuring iEEG seizure recordings from 25 patients collected across four U.S. epilepsy centers. iEEG recording is essential in epilepsy care units for monitoring seizures and investigating resective surgery options. Our models may also help pinpoint the epileptogenic zone, serving as an ideal foundation for future clinical applications.

Self-Supervised Learning (SSL)

As mentioned in Self-Supervised Learning: An Introduction, model performance with supervised learning is limited by the availability of labelled data. We decided to combine SSL techniques to boost the current supervised GNN model used by our lab at the Krembil Brain Institute. Each SSL model we tested uses a GNN itself, so it can pass along its graph structure—more specifically a graph representation with node features and edge features—to the supervised model, as a “better starting point” for the input.

GNN Encoder

We used the following encoder for each SSL method:

Here $x$ denotes our input graph representation, the graph and its features. We then transform $x$ into a single vector $z = \text{Enc}(x)$, called the encoding.

For more details see below, or skip to the next section to see the models!

$x = (\textbf{X}, \textbf{E}, G)$ is the input graph representation, $G = (V,E)$ is the graph, $\textbf{X} = (\textbf{x}_1, \textbf{x}_2, \dots, \textbf{x}_{|V|})$ are the node features with $\textbf{x}_i \in \mathbb{R}^d$, $\textbf{E} = (\textbf{e}_{i,j})_{(i,j) \in E}$ are the edge features with $\textbf{e}_{i,j} \in \mathbb{R}^k$. This graph representation $x$ is fed into an Edge-Conditioned Convolution (ECC), given by the update:

$$\begin{aligned}\textbf{x}_i' = \sum_{j \in \mathcal{N}(i)} F_{\theta}(\textbf{e}_{i,j})\textbf{x}_{j} \end{aligned}$$

where $F_{\theta}: \mathbb{R}^k \to \mathbb{R}^{d' \times d}$ is a multilayer perceptron (MLP), which transforms each edge $\textbf{e}_{i,j}$ to a matrix that acts on each node feature in the neighbourhood of $\textbf{x}_i$, denoted $\mathcal{N}(i)$, including $\textbf{x}_i$ itself (self-loop). For our purposes, we will choose $d' > d$ to “expand” the node features initially. The new graph representation, $x' = (\textbf{X}', \textbf{E}, G)$, is then fed into a Graph Attention (GAT) layer, given by:

$$\begin{aligned}\textbf{x}_i'' = g\bigg( \sum_{j \in \mathcal{N}(i)} \alpha_{ij} W \textbf{x}_j'\bigg) \end{aligned}$$

where $\alpha_{ij}$ are the attention coefficients, which are learnable coefficients that essentially determine how important node $j$ is to node $i$, $W$ is a learnable weight matrix, and $g$ is some nonlinear activation function.

After applying ECC and GAT consecutively, we aggregate the node features using a global mean pooling layer, then flatten the result to give our encoding.
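To make the encoder concrete, here is a minimal NumPy sketch of the two message-passing steps and the mean-pool readout. This is illustrative only: the actual implementation uses PyTorch Geometric layers, and all function and variable names here (`ecc_layer`, `gat_layer`, `encode`, the toy single-head attention) are my own assumptions, not the lab's code.

```python
import numpy as np

def ecc_layer(X, E, edges, mlp):
    """Edge-Conditioned Convolution: x'_i = sum_{j in N(i)} F_theta(e_ij) x_j.
    X: (n, d) node features; E: dict (i, j) -> edge feature of shape (k,);
    edges: list of (i, j) pairs (self-loops included);
    mlp: F_theta, mapping a (k,) edge feature to a (d', d) matrix."""
    n, _ = X.shape
    d_out = mlp(next(iter(E.values()))).shape[0]
    X_new = np.zeros((n, d_out))
    for (i, j) in edges:
        X_new[i] += mlp(E[(i, j)]) @ X[j]
    return X_new

def gat_layer(X, edges, W, a, g=np.tanh):
    """Single-head graph attention: x''_i = g(sum_j alpha_ij W x'_j),
    with attention scores computed from the concatenated projected features."""
    n = X.shape[0]
    H = X @ W.T  # project every node feature with the learnable matrix W
    out = np.zeros_like(H)
    for i in range(n):
        nbrs = [j for (u, j) in edges if u == i]
        scores = np.array([np.tanh(a @ np.concatenate([H[i], H[j]]))
                           for j in nbrs])
        alpha = np.exp(scores) / np.exp(scores).sum()  # softmax over N(i)
        out[i] = g(sum(al * H[j] for al, j in zip(alpha, nbrs)))
    return out

def encode(X, E, edges, mlp, W, a):
    """ECC -> GAT -> global mean pool, giving the encoding z = Enc(x)."""
    X1 = ecc_layer(X, E, edges, mlp)
    X2 = gat_layer(X1, edges, W, a)
    return X2.mean(axis=0)  # global mean pooling over nodes
```

In practice each of these pieces corresponds to an off-the-shelf PyTorch Geometric layer (edge-conditioned convolution, graph attention, and global mean pooling), with the learnable parameters trained end to end.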

The Models

SSL Models

We selected four SSL methods:

  1. Relative Positioning (RP)

  2. Temporal Shuffling (TS)

  3. Contrastive Predictive Coding (CPC)

  4. Variance-Invariance-Covariance Regularization (VICReg)

Each model used the GNN encoder described above.

Relative Positioning

I’ll first give an overview of Relative Positioning (RP). Given two time windows in our EEG signal, we train our SSL model to distinguish when these two windows are close in time or far apart. In theory, if our SSL model is trained on this task, it should learn different representations of the data so that it can factor in these temporal differences, which is crucial for our application.

We’ll denote our windows of the signal as $x_t$ and $x_{t'}$, where $t$ and $t'$ represent the starting time indices. We then give a “pseudolabel” $y$ to the pair $(x_t, x_{t'})$, where $y = 1$ if the pair is close in time and $y = 0$ if the pair is far apart in time, with respect to some hyperparameters $\tau_{+}, \tau_{-}$:

$$y = \begin{cases} 1 &\text{if } |t - t'| \leq \tau_{+} \\ 0 &\text{if } |t - t'| > \tau_{-}\end{cases}$$
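The pseudolabelling rule above is easy to state in code. A small sketch (the function name is mine; pairs falling between the two thresholds get no label and are simply discarded from training):

```python
def rp_pseudolabel(t, t_prime, tau_plus, tau_minus):
    """Relative Positioning pseudolabel for a window pair.
    Returns 1 if the windows are close (|t - t'| <= tau+),
    0 if far apart (|t - t'| > tau-), and None otherwise."""
    gap = abs(t - t_prime)
    if gap <= tau_plus:
        return 1
    if gap > tau_minus:
        return 0
    return None  # ambiguous pair: not used for training
```

For example, with the thresholds used later in this post ($\tau_+ = 12$s, $\tau_- = 90$s), a pair of windows starting 10 s apart is labelled 1, one 100 s apart is labelled 0, and one 50 s apart is discarded.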

Here’s a diagram of the RP model:

The model first encodes $x_t$ and $x_{t'}$ using our GNN encoder, before applying a projection function (in this case, a 3-layer MLP), before contrasting with the function:

$$\text{Contr}(z_t, z_{t'}) = |z_t - z_{t'}| = c$$
where the absolute value is taken entrywise. Then $c$ is passed through plain old logistic regression to give our prediction $\hat{y}$, for which we compute our error with the Binary Cross Entropy (BCE) loss:

$$\mathcal{L}(y, \hat{y}) =- \Big[y \log \hat{y} + (1-y) \log(1 - \hat{y})\Big]$$
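Putting the contrastive head together, here is a small NumPy sketch of the step from the two encodings to the loss (function names and the explicit sigmoid weights are illustrative assumptions, not the lab's implementation):

```python
import numpy as np

def rp_head(z_t, z_tp, w, b):
    """Contrast two encodings and score the pair.
    c = |z_t - z_t'| (entrywise), then logistic regression:
    y_hat = sigmoid(w . c + b)."""
    c = np.abs(z_t - z_tp)
    return 1.0 / (1.0 + np.exp(-(w @ c + b)))

def bce_loss(y, y_hat, eps=1e-12):
    """Binary cross-entropy between pseudolabel y and prediction y_hat."""
    y_hat = np.clip(y_hat, eps, 1 - eps)  # guard against log(0)
    return -(y * np.log(y_hat) + (1 - y) * np.log(1 - y_hat))
```

Note that identical encodings give $c = \mathbf{0}$, so the prediction collapses to $\sigma(b)$: the head can only separate pairs through entrywise differences in their encodings.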

Temporal Shuffling

The Temporal Shuffling (TS) method is very similar to RP. Instead of labelling graph pairs, we label graph triplets. Here we have three windows $x_t, x_{t'}, x_{t''}$, and our pretext task is to correctly predict whether the triplet is ordered in time or shuffled (i.e., unordered). Given hyperparameters $\tau_+$ and $\tau_-$, we require that the first and last windows of every triplet are close enough in time, that is, $|t - t''| \leq \tau_+$, and in our experiments we rule out redundant permutations of the triplet by requiring $t < t''$. The midpoint between these two time indices is given by:

$$M = t + \frac{t''-t}{2}$$

Now dependent on the value $t'$ (the middle window), we generate the pseudolabel:

$$y(x_t, x_{t'}, x_{t''}) = \begin{cases} 1 &\text{if }t'\in (t,t'') \text{ or } t' \in (t'', t)\\0 &\text{if } |t'-M| > \tau_-\end{cases}$$

Hence, $y = 1$ if $t < t' < t''$, meaning that the triplet is temporally ordered, and $y = 0$ if $t'$ is sufficiently far from the midpoint between $t$ and $t''$, i.e., it is temporally shuffled. Here’s an overview of the model:
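As with RP, the TS pseudolabel is a short piece of code. A sketch under the conventions above ($t < t''$ and $|t - t''| \leq \tau_+$ already enforced when sampling; the function name is mine):

```python
def ts_pseudolabel(t, t_prime, t_dprime, tau_plus, tau_minus):
    """Temporal Shuffling pseudolabel for a window triplet.
    Assumes t < t'' and |t - t''| <= tau+ (enforced at sampling time).
    Returns 1 if the middle window lies between the endpoints (ordered),
    0 if it is far from the midpoint M (shuffled), None otherwise."""
    assert t < t_dprime and abs(t - t_dprime) <= tau_plus
    M = t + (t_dprime - t) / 2  # midpoint of the endpoint windows
    if t < t_prime < t_dprime:
        return 1
    if abs(t_prime - M) > tau_minus:
        return 0
    return None  # neither ordered nor clearly shuffled: discard
```

Triplets that are neither ordered nor far enough from the midpoint get no pseudolabel and are dropped, mirroring the discarded middle band of pairs in RP.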

Implementation

We mainly used the PyTorch Geometric (PyG) library for our implementation. For more details, please see the GitHub repository! We have several notebooks written for understanding PyG and our specific models.

Preliminary Results

We trained both the RP and TS models with $\tau_+ = 12\text{s}$ and $\tau_- = 90\text{s}$ on 115,000 data points each, with a 70%/20%/10% split for training, validation, and testing. The GNN encoder was then transferred to the supervised pipeline and retrained, to compare against a baseline supervised (non-pretrained) model using a single ECC and GAT layer. We obtain the accuracies below, averaged over all 26 patients. So far we don’t see a major increase from using the pretrained encoder in RP or TS. This could be due to several reasons; here are a few hypotheses:

  • The hyperparameters $\tau_+$ and $\tau_-$ are not properly configured for our task.

  • The task of detecting whether a seizure occurs is not complicated, as indicated by the 95.78% accuracy on the base model, and therefore the model may not benefit from much more information given in the pretraining.

  • The variance of the preictal, ictal, and nonictal windows may “confuse” temporal-based SSL methods, as these states are hypothesized to contain different signal characteristics, and simplifying this into two classes may actually harm the performance of our SSL methods by their construction.

  • The encoder or projector layers are not optimally configured, e.g., we could try different graph layers besides ECC or GAT, or we could try different layer sizes of the projector MLP.

Seizure Detection (Binary)

| Model | Test Accuracy |
| --- | --- |
| Supervised | 95.78% |
| Supervised + RP | 95.65% |
| Supervised + TS | 96.13% |

I will update this with more information as the project progresses, with more methods, and on different tasks (e.g., multiclass classification of preictal, ictal, and postictal). Predicting before (preictal) or after (postictal) a seizure occurs is a much more interesting question, and a much more challenging task, given our previous results in the supervised pipeline. We’ll see how well our self-supervised learning methods fare on this task compared to the binary classification task, and I’ll likely be tweaking the hyperparameters $\tau_+$ and $\tau_-$ too. Hopefully, I’ll have some updates on the results of CPC and VICReg as well (and a new SSL model)! Until next time.
