experimental comparison of connectivity measures with simulated eeg signals
TRANSCRIPT
![Page 1: Experimental comparison of connectivity measures with simulated EEG signals](https://reader035.vdocument.in/reader035/viewer/2022080108/575070241a28ab0f07d398e4/html5/thumbnails/1.jpg)
TECHNICAL NOTE
Experimental comparison of connectivity measureswith simulated EEG signals
Minna J. Silfverhuth • Heidi Hintsala •
Jukka Kortelainen • Tapio Seppanen
Received: 2 November 2011 / Accepted: 23 April 2012 / Published online: 22 May 2012
� International Federation for Medical and Biological Engineering 2012
Abstract Directional connectivity measures exist with
different theoretical backgrounds, i.e., information theoretic,
parametric-modeling based or phase related. In this paper, we
perform the first comparison in this extend of a set of con-
ventional and directed connectivity measures [cross-corre-
lation, coherence, phase slope index (PSI), directed transfer
function (DTF), partial-directed coherence (PDC) and
transfer entropy (TE)] with eight-node simulation data based
on real resting closed eye electroencephalogram (EEG)
source signal. The ability of the measures to differentiate the
direct causal connections from the non-causal connections
was evaluated with the simulated data. Also, the effects of
signal-to-noise ratio (SNR) and decimation were explored.
All the measures were able to distinguish the direct causal
interactions from the non-causal relations. PDC detected less
non-causal connections compared to the other measures.
Low SNR was tolerated better with DTF and PDC than with
the other measures. Decimation affected most the results of
TE, DTF and PDC. In conclusion, parametric-modeling-
based measures (DTF, PDC) had the highest sensitivity of
connections and tolerance to SNR in simulations based on
resting closed eye EEG. However, decimation of data has to
be carefully considered with these measures.
Keywords Biomedical signal processing �Computational biology � Electroencephalography
1 Introduction
Brain connectivity is a functional methodology to assess
connections between brain networks or areas. It is suitable
for detection and monitoring the subtle changes in brain
function. The early-stage signs of a serious neurological
illness leading to a disability may often emerge as a
functional abnormality, preceding detectable anatomical
damage in brain.
Conventionally used coherence and cross-correlation
measure strength of connection pair-wise between time
series measured from different locations, to confirm a
relationship (high value of coherence) in certain frequency
(band), between, e.g., EEG or fMRI time series (e.g., [11]).
However, by using these measures, direction cannot be
assigned. Therefore, there is no way to determine the flow
of information, to answer which brain area A is affecting
the processing of another brain area B.
Examples of directional connectivity measures [1, 8, 11,
12] are measures derived from information theory, like
transfer entropy (TE) [14], or parametric-modeling-based
Granger causality, directed transfer function (DTF) [6] and
partial-directed coherence (PDC) [2], or phase-related
measure phase slope index (PSI) [9]. TE has an advantage
of the model-free non-linear approach but requires com-
putational capacity. DTF and PDC are fast to compute but
model order estimation needs consideration [12]. PSI is fast
but a long data segment is necessary for reliable estimation.
Comparison of directional measures with different the-
oretical backgrounds (information theoretic, parametric-
modeling based or phase related) has only been performed
extensively in theory [11]. However, parametric-modeling-
based measures have been widely compared [e.g., 1, 4, 8,
12, 16], but in general with quite a simple connectiv-
ity patterns. In this paper, we explore the robustness of
M. J. Silfverhuth (&) � J. Kortelainen � T. Seppanen
Department of Computer Science and Engineering,
University of Oulu, Box 4500, 90014 Oulu, Finland
e-mail: [email protected]
H. Hintsala
Centre for Environmental and Respiratory Health Research,
Institute of Biomedicine, University of Oulu,
BOX 5000, 90014 Oulu, Finland
123
Med Biol Eng Comput (2012) 50:683–688
DOI 10.1007/s11517-012-0911-y
![Page 2: Experimental comparison of connectivity measures with simulated EEG signals](https://reader035.vdocument.in/reader035/viewer/2022080108/575070241a28ab0f07d398e4/html5/thumbnails/2.jpg)
cross-correlation, coherence, PSI, DTF, PDC and TE for
assessment of directional influences. Analysis is applied to
EEG-based multi-node simulation data with known direc-
tional connectivity pattern in different EEG frequency
bands. Also, the effects of signal-to-noise ratio (SNR) and
decimation were tested.
2 Methods and results
2.1 Simulation of the data
The connectivity data were simulated to test the perfor-
mance of the measures with a predefined causality structure
(Fig. 1). The simulation model design was inspired by the
model used in the study of Porcaro et al. [12] on functional
connectivity estimation, but with eight-node construction
here. Resting EEG (eyes closed) was applied in the simu-
lations as a source signal (S1, S2). EEG applied in the
simulation was measured from 25 persons with 20 elec-
trodes for 40 s with Fs = 1,000 Hz. For the simulation,
random sampling of the person, electrode and the data
segment (10,000 samples) was performed. Also, the noise
was regenerated for each repetition (N = 500). The con-
nectivity equations of the directional connectivity pattern
used are the following in function of sample time t:
x1ðtÞ ¼ S1ðtÞ þ 0:1 � x1ðt � 1Þ þ e1ðtÞx2ðtÞ ¼ x1ðt � 3Þ þ 0:1 � x2ðt � 1Þ þ e2ðtÞx3ðtÞ ¼ 2x1ðt � 6Þ þ 0:1 � x3ðt � 1Þ þ e3ðtÞ
x4ðtÞ¼x2ðt�12Þþx3ðt�10Þþ0:1 �x4ðt�1Þþ0:3 �e4ðtÞ
x5ðtÞ ¼ 3x3ðt � 5Þ þ 0:1 � x5ðt � 1Þ þ e5ðtÞx6ðtÞ ¼ 2x4ðt � 4Þ þ 0:1 � x6ðt � 1Þ þ e6ðtÞx7ðtÞ ¼ S2ðtÞ þ 0:1 � x7ðt � 1Þ þ e7ðtÞx8ðtÞ ¼ 2x7ðt � 6Þ þ 0:1 � x8ðt � 1Þ þ e8ðtÞ:
White Gaussian noise ei was added to the signal of each
node xi with SNR = [1, 5, 10]. A simulation (length 10,000
samples) was repeated 500 times for each SNR level.
Signals were decimated, resulting in typical sampling
frequency (Fs) 500 and 250 Hz, from the original Fs of
1,000 Hz. The power line artifact was removed from the
data with 50 Hz notch filter. Simulation and decimation
were performed with Matlab R2010a. Decimation was
implemented utilizing decimate-function, by first filtering
the signals with a phase neutral low pass Chebyshev type I
filter, with a cutoff frequency of 0.8 9 (Fs/2)/sampling
factor, and then re-sampling the smoothed signal at a
lower rate. The loss of detection power of the AR-based
connectivity measures during decimation is clearly
expected because the original model order 12 was chosen
based on the time delays in the original time series
(Fs = 1,000 Hz). The time delays in the decimated time
series cannot be the same and therefore the detection power
decreases. Thus, model orders 6 and 3 were also tested with
Fs = 500 Hz and Fs = 250 Hz, respectively.
2.2 Connectivity measures
In this study cross-correlation, magnitude-squared coher-
ence, PSI, DTF, PDC and normalized TE were applied to
the connectivity analysis of the simulated data. Mathe-
matical descriptions for the measures and further details are
presented in the Table 1 in the ‘‘Appendix’’. Detailed
descriptions are provided in the following articles: [11]
(cross-correlation and magnitude-squared coherence), [9]
(PSI), [6] (DTF), [2] (PDC) and [14] (TE).
Filtered signals were divided into segments of 4 s,
having a 50 % overlap. In case of PSI, different trials
(length = 10 s data/trial) were done consecutive for to
reach the statistical significance. The segments were nor-
malized to have zero mean and unit variance. For the
analysis performed with cross-correlation and TE the sig-
nals were also filtered with a finite impulse response (FIR)
Equiripple phase neutral filter (Astop 40–50 dB, Apass
0.1 dB, transition 1–2 Hz) to the following frequency
bands: 1–4 Hz (d), 4–8 Hz (h), 8–13 Hz (a), 13–30 Hz (b)
and 30–80 Hz (c). The analysis with other measures was
performed in the same frequency bands. Autoregressive
(AR) model order was defined to be the longest delay in the
connectivity pattern, i.e., 12.
The connectivity analyses were implemented with
Matlab R2010a. Following toolboxes were used in the
connectivity analyses: software to estimate the phase-slope
Fig. 1 The simulation scheme of the directional connectivity pattern.
S1 and S2 denote the source signals, X1–X8 the nodes of the pattern and
D the sample delay of each connection with Fs = 1,000 Hz
684 Med Biol Eng Comput (2012) 50:683–688
123
![Page 3: Experimental comparison of connectivity measures with simulated EEG signals](https://reader035.vdocument.in/reader035/viewer/2022080108/575070241a28ab0f07d398e4/html5/thumbnails/3.jpg)
indexes [9, 10], BioSig software made for biosignal anal-
ysis [3] and TIM toolbox for the estimation of information-
theoretic measures from time series [5, 13].
2.3 Statistical analysis
Significant connections were determined by calculating a
distribution with the connectivity analyses of non-connected
white Gaussian noise signals (N = 4,000). The risk levels of
1–5 % (statistical thresholds) were determined from the
distribution. The results from the connectivity analyses (i.e.,
amount of direct causal connections vs. other relations
between the measures) were assessed by Chi-square tests.
The tests were performed with SPSS for Win 16.0.
2.4 Detecting the connections
The ability of the measures to distinguish the direct causal
connections from the non-causal connections was evaluated
with the simulated data (Fs = 1,000 Hz, SNR = 10) (Fig. 2).
Direct causal connection means that the direct link
between nodes was found, e.g., X1–X2 in Fig. 1. Non-
causal connection is either indirect causal connection (e.g.,
X1–X5 in Fig. 1) or false connection where there is no
connectivity between nodes (e.g., X4–X5 in Fig. 1).
All the measures were able to separate the direct causal
interactions from the non-causal relations in all frequencies
that are statistically significant with p \ 0.01. PDC detec-
ted less non-causal connections from all the measures.
Magnitude-squared coherence detected more non-causal
connections than any other measure. Cross-correlation
and PSI managed to detect almost all the direct causal
connections (99–100 %). DTF and TE, instead, missed
quite a lot of the direct causal connections (36–27 %).
Performance of coherence and PDC was equal when it
comes to the amount of detected direct causal connections;
they both detected 87 % of connections.
2.5 Effect of SNR
Reducing the SNR decreased the amount of the detected
direct causal connections while detected non-causal con-
nections either decreased or remained the same (Fig. 3).
Decreasing the SNR from 10 to 5 did not have a statisti-
cally significant (p \ 0.01) effect to the performance of the
measures. Instead, decreasing the SNR from 5 or 10 to 1
had a strong effect. Further, analysis showed that this
effect was strongest in the c-band. DTF and PDC tolerated
the low SNR better than the other measures. In case of
SNR = 1, PDC was already one of the most sensitive
connectivity measure by detecting more direct causal
connections than the other measures together with cross-
correlation and PSI. TE detected only 36 % of the direct
causal connections with SNR = 1, and missed the half of
the connections that it was able to detect with SNR = 10.
2.6 Effect of decimation
DTF and PDC detected more direct causal connections with
original Fs (1,000 Hz) than with the decimated data.
However, amount of detected non-causal connections was
reduced (PDC, DTF, and TE) with decimation (Fig. 4)
(model order 12). The effect increased when the sampling
factor increased, i.e., more detected direct causal connec-
tions were found with Fs = 500 Hz than with Fs = 250 Hz.
Decimation affected also the detected direct causal
connections found with TE. The effect was dependent on
frequency in the analysis with TE: higher Fs (250 to 500 to
1,000 Hz) resulted in significantly (p \ 0.01) less detected
direct causal connections in the d frequencies (602 to 557 to
Fig. 2 Percentages of the detected direct causal connections and
non-causal connections averaged over the frequency bands. CC cross-
correlation, Coh coherence. Perfect results would be 0 % of non-
causal connections and 100 % of direct causal connections
Fig. 3 The effect of SNR in the amount of detected non-causal
connections. Results are averaged over the frequency bands
Med Biol Eng Comput (2012) 50:683–688 685
123
![Page 4: Experimental comparison of connectivity measures with simulated EEG signals](https://reader035.vdocument.in/reader035/viewer/2022080108/575070241a28ab0f07d398e4/html5/thumbnails/4.jpg)
468) and significantly (p \ 0.01) more detected direct
causal connections in the a (4 to 165 to 669), b (2 to 96 to
611) and c (3 to 1 to 104) frequencies. In h, changes were
508 to 681(significant, p \ 0.01) to 690 (insignificant),
respectively. The effect of decimation to the amount of
detected non-causal connections was similar as in case of
the direct causal connections for DTF, PDC and TE but the
strength of the effect was smaller (Fig. 4). With model
orders 6 and 3 (with Fs = 500 Hz and Fs = 250 Hz,
respectively), results were similar (not shown).
3 Conclusions
Directed connectivity measures with different theoretical
backgrounds, i.e., information theoretic, parametric-mod-
eling based or phase related, has earlier been compared in
this extend only in theory [11]. In the present work, we
compared a set of conventional and directed connectivity
measures with different theoretical backgrounds experi-
mentally with the simulation data based on resting eyes
closed EEG. We also explored the effect of noise level and
preprocessing (decimation). Main results were that all the
measures were able to separate direct causal interactions
from the non-causal relations in all frequencies. However,
PDC had the highest specificity and coherence the lowest
specificity of connections. Low SNR was tolerated better
with DTF and PDC than with the other measures. Deci-
mation affected the results of DTF, PDC and TE.
Coherence, as an undirected measure, detected the con-
nections equally between all the nodes that were somehow
connected, i.e., had the same source signal (S1 or S2); this
resulted in low specificity. PDC, instead, was able to dis-
tinguish the direct connections from the indirect ones, thus
PDC was found to have the highest specificity. The results
are consistent with the theoretical assumptions [7] as well as
with the previous directed connectivity studies made with
the measures based on the AR modeling, showing PDC to
be good direct connectivity estimator [1, 15, 16]. In our
study, cross-correlation and PSI were the most sensitive
measures by detecting over 10 % more of the direct causal
connections than the other measures. However, all the
measures had reasonable sensitivity, i.e., they detected the
direct causal connections within reasonable accuracy.
In this study, DTF and PDC tolerated the low SNR
better than other measures. This was expected as the SNR
has less influence on the AR-based measures, because
additive noise is a part of the underlying parametric model.
With SNR = 1, TE missed the half of the connections that
it was able to detect with SNR = 10. Decreasing the SNR
from 5 or 10 to 1 had a strong effect. This result was
consistent with the simulation study of Astolfi et al. [1],
where they compared the performance of DTF, direct DTF
and PDC on the connectivity analyses with the data sim-
ulated with neural mass model. They found no statistically
significant difference in the connectivity analysis made
with SNR = [10, 5, 3], but the increase in the error func-
tion was significant when SNR was reduced to 1.
Decimation affected the analysis made with DTF, PDC
and TE, but not the analysis made with cross-correlation,
coherence or PSI. This confirmed the findings in a recent
simulation study [4], in which the decimation was found to
lead to wrong conclusions in the analyses made with DTF and
PDC in certain circumstances. In their study, also high-pass,
low-pass and notch filtering of the signals were shown to
confuse the connectivity analyses made with DTF and PDC.
In addition, different effects of the decimation in the distinct
frequency bands are clear because the EEG has different
powers in the frequency bands. However, it remained unclear
why change of model order to keep the delay constant did not
ameliorated the results with AR-based measures.
First, limitation of the present study was that the simu-
lated connections were linear, while the activity of the
neural systems has a non-linear nature. However, the
applicability of the measures used in this study on non-
linear connectivity has been studied elsewhere [7, 9, 11].
Second, another question is how close to real data we can
reach with this kind of simulation data. However, in real
signals directional connectivity structure is typically not
known. To obtain the most reliable data structure, we
applied a real EEG signal to construct the simulated data
with predefined connections. Also, for future studies it is
suggested that different EEG signals, for example different
resting close eye EEG, resting open eye EEG, continuous
EEG oscillation in various tasks, and event-related EEG
need to be tested. Also, models with various node numbers,
connectivity relationships between nodes and time delays
need to be tested to run a comprehensive conclusion. For
Fig. 4 The effect of decimation in the amount of detected non-
causal connections. Results are averaged over the frequency bands
686 Med Biol Eng Comput (2012) 50:683–688
123
![Page 5: Experimental comparison of connectivity measures with simulated EEG signals](https://reader035.vdocument.in/reader035/viewer/2022080108/575070241a28ab0f07d398e4/html5/thumbnails/5.jpg)
example, the effect of decimation may not be as obvious in
cases with longer delay. Furthermore, a separate estimation
for the correctness of the network should be performed in
future studies. Here, it was assumed that the number of
correctly detected connections reflects the performance
also in the network level.
This is the first experimental comparison of connectivity
measures with different theoretical backgrounds, i.e.,
information theoretic, parametric-modeling based or phase
related, in the directed connectivity analysis of the simu-
lated data. Parametric-modeling-based measures had the
highest sensitivity of connections and tolerance to SNR in
simulations based on resting closed eye EEG. However,
decimation of data has to be carefully considered with
these measures. It would be very interesting to apply these
measures to real intracranial EEG signals or EEG source
imaging results and it will be a subject in the further
studies.
Acknowledgments This work was supported by the Academy of
Finland.
Appendix
See Table 1.
Table 1 Mathematical description of the connectivity measures
Connectivity
measure
Equation
Cross-correlation The cross-correlation function Cxy between time-series signals x(t) and y(t) can be defined as
CxyðsÞ ¼PN�s
k¼1
xðk þ sÞy�ðkÞ ð1Þ
where N denotes the number of samples,s 2 �m;�mþ 1. . .m� 1;m½ � the time lag between the signals when m is the maximal time lag considered and *
the complex conjugate [5]. Maximum of the absolute values of the cross-correlation function describes the strength of the dependency, and the lag at
the corresponding value the time delay between the signals. In this study, the analyzed sequences were normalized so the autocorrelations at zero lag
are identically 1.0.
In the present study, m = 500 samples was applied
Coherence Magnitude-squared coherence can be defined as
cxyðf Þ ¼ Xðf ÞY�ðf Þh ij j2Xðf Þh ij j Yðf Þh ij j ð2Þ
where X(f) and Y(f) denote the Fourier transforms of time-series signals x(t) and y(t), respectively, Y* the complex conjugate of Y, �j j the magnitude and
�h i the average [5]
In the present study, Hamming window with length of 4,096 samples was applied
PSI Normalized PSI can be defined as
WXY ¼ =P
f2F
C�XY ðf ÞCXY ðf þ d f Þ !
=stdðWXY Þ ð3Þ
where CXY ðf Þ ¼ SXY ðf Þ=ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiSXXðf ÞSYY ðf Þ
pdenotes the complex coherency, S the cross-spectral matrix, df the frequency resolution, =ð�Þ the imaginary part,
F the set of frequencies over which the slope is summed and stdðWXY Þ is estimated by the jackknife method [9].
In the present study, a segment length of 2.5 s, epoch length of 5.0 s and number of epochs = 50 were applied in the computation of PSI
DTF DTF from node j to i can be defined as
c2ijðf Þ ¼
Hijðf Þj j2Pm
n¼1
Hinðf Þj j2ð4Þ
where H denotes the AR model’s transfer function and m the amount of variables [7].
In the present study model orders of 12, 6, and 3 were applied, as explained in the Sect. 2
PDC PDC from node j to i can be defined as
pijðf Þ ¼ Aijðf Þffiffiffiffiffiffiffiffiffiffiffiffiffiffia�j ðf Þajðf Þp ð5Þ
where A(f) denotes the Fourier transformed matrix including the parameters of the AR model, ajðf Þ the jth column of the matrix A, * the transposition and
complex conjugate operation [8, 16]. In the present study model orders of 12, 6, and 3 were applied, as explained in the section Sect. 2
TE Differential entropy of time-series signal x(t) can be defined as
HðXÞ ¼ �R
lðxÞ log½lðxÞ�dx ð6Þwhere l(x) denotes the probability density function of x [15].
Transfer entropy from time-series signal X(t) to Y(t) can be defined as
TEX!Y ¼ HðY ; YsÞ � HðYÞ½ �. . .
. . .� HðY ;Xs; YsÞ � HðYs;XsÞ½ �;ð7Þ
where H(*) denotes the (differential) (joint) entropy and * the past of the signals [6, 15].
Normalized difference between the two amounts of transfer information [from x(t) to y(t) and from y(t) to x(t)] can be defined as
nTEX!Y ¼ TEX!Y�TEY!X
TEX!YþTEY!Xð8Þ
Normalized transfer entropies were calculated according (8) for maximal transfer entropies computed according (7) with different lags (maximal
lag = 20 samples).
Med Biol Eng Comput (2012) 50:683–688 687
123
![Page 6: Experimental comparison of connectivity measures with simulated EEG signals](https://reader035.vdocument.in/reader035/viewer/2022080108/575070241a28ab0f07d398e4/html5/thumbnails/6.jpg)
References
1. Astolfi L, Cincotti F, Mattia D, Lai M, Baccala L, Fallani F,
Salinari S, Ursino M, Zavaglia M, Babiloni F (2007) Comparison
of different cortical connectivity estimators for high-resolution
EEG recordings. Hum Brain Mapp 28:143–157
2. Baccala LA, Sameshima K (2001) Partial directed coherence: a
new concept in neural structure determination. Biol Cybern
84(6):463–474
3. BioSig, ‘‘BioSig—toolbox for biosignal analysis’’ (online).
Available: http://biosig.sourceforge.net/
4. Florin E, Gross J, Pfeifer J, Fink GR, Timmermann L (2010) The
effect of filtering on Granger causality based multivariate cau-
sality measures. NeuroImage 50(2):577–588
5. Gomez-Herrero G, Wu W, Rutanen K, Soriano M, Pipa G,
Vicente R (2010) Assessing coupling dynamics from an ensemble
of time series (online). Available: arXiv:1008.0539v1
6. Kaminski MJ, Blinowska KJ (1991) A new method of the
description of the information flow in the brain structures. Biol
Cybern 65(3):203–210
7. Kaminski M (2007) Multichannel data analysis in biomedical
research. In: Jirsa VK, McIntosh AR (eds) Handbook of brain
connectivity. Springer, Berlin, pp 327–355
8. Kus R, Kaminski M, Blinowska KJ (2004) Determination of EEG
activity propagation: pair-wise versus multichannel estimate.
IEEE Trans Biomed Eng 51(9):1501–1510
9. Nolte G, Ziehe A, Nikulin VV, Schlogl A, Kramer N, Brismar T,
Muller K-R (2008) Robustly estimating the flow direction of
information in complex physical systems. Phys Rev Lett 100(23):
234101
10. Nolte G, Ziehe A, Nikulin VV, Schlogl A, Kramer N, Brismar T,
Muller K-R, Phase-slope index—software (online). Available:
http://ml.cs.tu-berlin.de/causality/
11. Pereda E, Quiroga RQ, Bhattacharya J (2005) Nonlinear multi-
variate analysis of neurophysiological signals. Prog Neurobiol
77(1–2):1–37
12. Porcaro C, Zappasodi F, Rossini PM, Tecchio F (2009) Choice of
multivariate autoregressive model order affecting real network
functional connectivity estimate. Clin Neurophysiol 120:436–448
13. Rutanen K, Gomez-Herrero G, TIM-toolbox for estimation of
information-theoretic measures from time-series (online). Avail-
able: http://www.cs.tut.fi/*timhome/tim.htm
14. Schreiber T (2000) Measuring information transfer. Phys Rev
Lett 85:461–464
15. Velez-Perez H, Louis-Dorr V, Ranta R, Dufaut M (2008) Con-
nectivity estimation of three parametric methods on simulated
electroencephalogram signals. Conf Proc IEEE Eng Med Biol
Soc 2008:2606–2609
16. Winterhalder M, Schelter B, Hesse W, Schwab K, Leistritz L,
Timmer J, Witte H (2006) Detection of directed information flow
in biosignals. Biomed Tech (Berl) 51(5–6):281–287
688 Med Biol Eng Comput (2012) 50:683–688
123