online open neuroimaging mass meta-analysis

40
Online open neuroimaging mass meta-analysis Finn ˚ Arup Nielsen Lundbeck Foundation Center for Integrated Molecular Brain Imaging at Department of Informatics and Mathematical Modelling Technical University of Denmark and Neurobiology Research Unit, Copenhagen University Hospital Rigshospitalet August 3, 2011

Upload: others

Post on 22-Apr-2022

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Online open neuroimaging mass meta-analysis

Online open neuroimaging mass meta-analysis

Finn Arup Nielsen

Lundbeck Foundation Center for Integrated Molecular Brain Imaging

at

Department of Informatics and Mathematical Modelling

Technical University of Denmark

and

Neurobiology Research Unit,

Copenhagen University Hospital Rigshospitalet

August 3, 2011

Page 2: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Online open neuroimaging mass meta-analysis

Online: On the Web

Open: “Open Science”

Neuroimaging: Talairach coordinates, text mining, brain volume, brain

regions, . . .

Mass meta-analysis: Google-like large-scale automatic processing across

multiple variables.

Finn Arup Nielsen 1 August 3, 2011

Page 3: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Neuroinformatics: Brede tools

Brede Toolbox: A program package primarily written in Matlab. Handles

visualization, linear modeling, multivariate analysis, locations (Talairach

coordinates), volumes, papers, texts.

Brede Database: Basically a collection of XML files with data from neu-

roimaging papers as well as ontologies. Distributed with the Brede Tool-

box. “Output” and query services to the Brede Database (generated with

the Brede Toolbox) is available on the Internet: http://neuro.imm.dtu.dk

Brede Wiki: A wiki with data from neuroimaging papers as well as

ontologies. Both freeform text and “semantically” organized within Me-

diaWiki templates.

Finn Arup Nielsen 2 August 3, 2011

Page 4: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Brede Toolbox with the Brede Database

Graphical user interface of

Brede Toolbox used to en-

ter data into the Brede

Database.

Brede Database: A database

with results from published

neuroimaging studies as well

as ontologies for, e.g., brain

regions and brain functions

(?).

Data stored in XML avail-

able on the Web

Finn Arup Nielsen 3 August 3, 2011

Page 5: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

The Brede Database on the Web

Presentation on the Web

Off-line meta-analysis and generation

of indices and visualization in static

HTML.

Interactive search on coordinates

from Web page or within a image

analysis program (?).

Finn Arup Nielsen 4 August 3, 2011

Page 6: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Searching on Talairach

coordinate

Result after search for nearest

coordinates to (14, 14, 9) with

the Brede Database.

Translation of the data from

XML to SQL (?)

Perl + SQLite web-script

Similar searches possible in Anto-

nia Hamilton’s AMAT programs,

BrainMap, SumsDB and Brede

Wiki.

Finn Arup Nielsen 5 August 3, 2011

Page 7: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Online experiment search (multiple coordinates)

Online search on two coordinates

in left and right amygdala in

the experiments recorded in the

Brede Database.

“Related volume” also available

from the “original” BrainMap

database (?):

http://neuro.imm.dtu.dk/services/jerne/ninf/

Search available to the Brede

Database from SPM plugin (?).

Finn Arup Nielsen 6 August 3, 2011

Page 8: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Coordinates-to-volume

transformation

Coordinates in an article con-

verted to volume-data by fil-

tering each point (kernel den-

sity estimation) (??)

One volume for each article or

one volume for a set of coor-

dinates in multiple articles.

Yellow coordinates from a

study by (?), with grey wire-

frame indicating the isosur-

face in the generated volume

Finn Arup Nielsen 7 August 3, 2011

Page 9: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Kernel density esti-

mators for coordi-

nates

−6 −4 −2 0 2 4 60

0.5

1

Exa

mpl

e lo

catio

ns

−6 −4 −2 0 2 4 60

1

2

σ = 0.05 (Too small)

−6 −4 −2 0 2 4 60

0.1

0.2

0.3

σ = 3.00 (Too Large)

−6 −4 −2 0 2 4 60

0.5

1

σ = 0.49 (LOO CV optimal)

’Talairach coordinate’ in centimeter

P

roba

bilit

y de

nsity

val

ue

Figure 1: Example in one dimension with six co-ordinates and their kernel density estimate.

Regard the coordinates as being gen-

erated from a distribution p(x), where

x is in 3D Talairach space (?).

Kernel methods (N kernels centered

on each location: µn) with homoge-

neous Gaussian kernel in 3D Talairach

space x

p(x) =(2πσ2)−3/2

N

N∑

ne− 1

2σ2(x−µn)

2

σ2 fixed (σ = 1cm) or optimized with

leave-one-out cross-validation (?).

Finn Arup Nielsen 8 August 3, 2011

Page 10: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Brede brain region tax-

onomy/ontology

Taxonomy of neuroanatomi-

cal areas with items linked in a

hierarchy with “Brain” in the

top root and smaller areas in

the leafs. WOROI is the ID.

Records parent region, child

region, naming variations,

Links to other brain region on-

tologies

Links to digital brain atlases

(AAL, Claus Svarer, Alexan-

der Hammers)

Finn Arup Nielsen 9 August 3, 2011

Page 11: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Example with Brain region ontology

The ontology enables one to get all names for PCC and its subregions.

Output is (24 names in total):

’Posterior cingulate gyrus’

’Posterior cingulate’

’Posterior cingulate area’

’Posterior gyrus cinguli’

’Posterior cingulate cortex’

’Left posterior cingulate gyrus’

’Left posterior cingulate’

’Posterior cingulate gyrus, left’

... e.g., BA23, retrosplenial, ...

Suitable for text mining where you identify as many occurrences in a

corpus that is not using a controlled vocabulary, such as ordinary scientific

articles.

Finn Arup Nielsen 10 August 3, 2011

Page 12: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Example: Get PCC locations

Get all posterior cingulate

locations that match on of

the naming variation for

the regions and its subre-

gion.

Model the locations with

kernel density estimation,

and convert the density to

a probability.

Volume written to an Ana-

lyze file

Viewed in the external

MRIcro program

Finn Arup Nielsen 11 August 3, 2011

Page 13: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Topics ontology

WOEXT: 18 Vision (visual perception)

WOEXT: 470 Visual object processing

WOEXT: 126 Visual object recognition

WOEXT: 22 Object recognition

WOEXT: 23 Face recognition

WOEXT: 136 Visual word recognition

WOEXT: 502 Visual body recognition

WOEXT: 137 Visual letter recognition

Topics, such as brain functions and mental disorders, organized in a hi-

erarchy. Example: episodic memory retrieval, OCD, 5-HT2A receptor.

Used to label each neuroimaging experiment

Other efforts: MeSH (too coarse), BrainMap, Cognitive Atlas (Poldrack),

Cognitive Paradigm Ontology (?)

Cognitive components are “open to interpretation”

Finn Arup Nielsen 12 August 3, 2011

Page 14: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Supervised data mining

Volume for a specific tax-

onomic component: “Pain”

Volume threshold at statisti-

cal values determined by re-

sampling statistics (?). Red

areas are the most significant

areas: Anterior cingulate, an-

terior insula, thalamus. In

agreement with “human” re-

viewer (?).

Implementations of supervized

datamining in the Brede Tool-

box and in GingerALE.

Finn Arup Nielsen 13 August 3, 2011

Page 15: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Unsupervised data

mining

Construction of a matrix

X(experiments× voxels)

Decomposition of this matrix

by multivariate analysis PCA,

ICA, NMF, clustering ??.

Other technique: Replicator

dynamics ?.

Comparison of components

with resting-state ?

Finn Arup Nielsen 14 August 3, 2011

Page 16: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Text representation:

a “bag-of-words”

‘memory’ ‘visual’ ‘motor’ ‘time’ ‘retrieval’ . . .

Fujii 6 0 1 0 4 . . .

Maddock 5 0 0 0 0 . . .

Tsukiura 0 0 4 0 0 . . .

Belin 0 0 0 0 0 . . .

Ellerman 0 0 0 5 0 . . .

... ... ... ... ... ... . . .

Representation of the ab-

stract of the articles in “bag-

of-word”. Table counts how often a word occurs

Finn Arup Nielsen 15 August 3, 2011

Page 17: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Exclusion of “stop words”: common words (the, a, of, ...), words for brain

anatomy, and a large number of common words that appear in abstracts.

Mostly words for brain function are left. More advanced extraction: Match

to ontologies

Finn Arup Nielsen 16 August 3, 2011

Page 18: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Grouping of words from articles

1 2 3 4

1

2

3

4

Component

Num

ber

of c

ompo

nent

s

memoryretrievalepisodictimepain

memoryretrievalepisodictimememories

painpainfulmotorsomatosensoryheat

memoryretrievalepisodictimememories

facialexpressionsfacesrecognitionemotion

painpainfulmotorsomatosensoryheat

memoryretrievalepisodicautobiographicamemories

facialexpressionsfacesrecognitionemotion

painpainfulmotorsomatosensoryheat

eyevisualmovementsspatialhumans

Figure 2: Grouped words.

Multivariate analysis (NMF)

of the text in posterior cingu-

late articles to find “themes”,

which can be represented with

weights over words and arti-

cles (?).

Most dominating words: mem-

ory, retrieval, episodic

pain, painful, motor, so-

matosensory

facial, expressions, faces,

eye, visual, movements

Finn Arup Nielsen 17 August 3, 2011

Page 19: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Combining text analysis and coordinates

Is there a difference in

how brain functions dis-

tribute in the cingulate

gyrus?

Possible to find the cor-

responding articles for

the coordinates — and

text mine these articles

for clustering and label

the coordinate accord-

ing to cluster.

Sagittal plot of mem-

ory (magenta) and pain

(yellow).

Finn Arup Nielsen 18 August 3, 2011

Page 20: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Text and volume: Functional atlas

Figure 3: Functional atlas in 3D visualization.

Automatic construction of

functional atlas, where words

for function become associ-

ated with brain areas

Two matrices: Bag-of-words

matrix, matrix from voxeliza-

tion of coordinates. NMF on

the product matrix.

Example components: Blue

area: visual, eye, time.

Black: motor, movements,

hand. White: faces, percep-

tual, face.

Finn Arup Nielsen 19 August 3, 2011

Page 21: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Combining ontologies and coordinates

1: Hot pain, Thermal pain, Warm temperature sensation

2: Somesthesis, Pain, Temperature sensation

3: Externally generated threat response, Threat, Externally generated emotion

4: Memory, Cognition, Memory retrieval

5: Finger movement, Localized movement, Motion, movement, locomotion

6: Language, Phonetic processing, Rhyme judgement

7: Self−reflection, Self processing, Self/other processing

8: Mental process, Sadness, Disgust

9: Face recognition, Objects (processing), Visual object recognition

10: Emotion, Unpleasantness, Fear

11: Happiness, Pleasantness, Sexual arousal

12: Vision (visual perception), Perception, Reading

13: Voice perception, Spatial neglect, Audition

14: Audiovisual speech perception, Multimodal perception, Congruent multimodal perception

15: Productive language, Verbal fluency, Cold pain

16: Syllable counting, Receptive language, Novelty seeking

17: Awake resting with eyes closed, Relaxed conscious state, Conscious state

Conversion of the Brede Database function taxonomy to a matrix and

using that together with matrix from voxelization of the coordinates in

the experiments and non-negative matrix factorization.

Finn Arup Nielsen 20 August 3, 2011

Page 22: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Issues with Brede Database

Difficult to add new information to the Brede Database

Difficult to do incremental additions.

Finn Arup Nielsen 21 August 3, 2011

Page 23: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Issues with Brede Database

Difficult to add new information to the Brede Database

Difficult to do incremental additions.

Solution?

Wiki with structured data

Brede Wiki = MediaWiki templates +

Extraction + SQL + Neuroscience

Finn Arup Nielsen 22 August 3, 2011

Page 24: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Principles of the Brede Wiki

Structured information is stored in the so-

called “templates” of Mediawiki.

Template use simple so it is easy to

convert data all template instantiations

to an SQL representation: No wiki for-

mating in field values, non-nested tem-

plates, lower case field names (a one-to-

one mapping of MediaWiki templates and

ontology classes). (?)

Link as much as possible in the template

values.

Link to external sites whenever possible.

Finn Arup Nielsen 23 August 3, 2011

Page 25: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Brede Wiki templates

Templates may describe a pa-

per with bibliographic infor-

mation or a researcher or jour-

nal.

Hierarchical templates: Brain

regions, Topics, Organiza-

tions, Software.

Multiple templates on each

page, e.g., to describe subject

group, brain scan, experimen-

tal condition, Talairach co-

ordinate, brain volume, gene

personality association.

Finn Arup Nielsen 24 August 3, 2011

Page 26: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Storing of volumes

Finn Arup Nielsen 25 August 3, 2011

Page 27: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Queries

Structured content can be ex-

tracted (like DBpedia on Wikipedia)

Queries are possible, but not within

the wiki

Query on nearby coordinates with

an off-wiki script.

So-called “SKOS file” (?) gener-

ated for brain region and topic hier-

archies from the structured content.

Finn Arup Nielsen 26 August 3, 2011

Page 28: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Brede Wiki and Tool-

box integration

You can get data from the Brede Wiki into Matlab, e.g., paper in the

Brede Wiki (?):

>> title = ’Brain maps of Iowa gambling task’;

>> Ls = brede_web_bw2loc(title);

>> figure, brede_ta3_frame, brede_ta3_loc(Ls)

Get the page from the Web site and extract the

information within the templates and convert

to a structure that fits the Brede Toolbox and

Database.

Finally, plot the locations.

Finn Arup Nielsen 27 August 3, 2011

Page 29: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Issues

Contribution is difficult: Presently “raw” data entry ©..⌢

Online interactive meta-analysis is not immediately available ©..⌢

Finn Arup Nielsen 28 August 3, 2011

Page 30: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Brede Wiki for personality genetics

Data entry in the wiki

in a table-like interface:

Gene, polymophism, ge-

notype, inventory, trait,

personality scores, sub-

ject group, PMID.

“Normal” Brede Wiki

keeps track of data en-

try.

Data can also be ex-

ported to the Brede

Wiki.

So far typed in data from 87 studies with 2815 personality scores.

Finn Arup Nielsen 29 August 3, 2011

Page 31: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Meta-analysis across traits and polymorphisms

Large-scale data mining across all recorded personality traits and poly-

morphisms and present the result on the wiki.

Order meta-analytic results, e.g., with respect to P -value

Finn Arup Nielsen 30 August 3, 2011

Page 32: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

MAOA uVNTR/reward dependence

Forest plot generated by the wiki for the “warrior gene” and Cloninger’s

reward dependence with meta-analysis and Cochrane’s test.

Finn Arup Nielsen 31 August 3, 2011

Page 33: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Issues

Difficult to store more heterogeneous data:

Lacking free floating text. ©..⌢

Integration of different content: Image, volumes, maps ©..⌢

Finn Arup Nielsen 32 August 3, 2011

Page 34: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Data for meta-analysis in a MediaWiki

Example from Brede Wiki with on-

line meta-analytic results from a

standard data set.

Data is represented in a simple

comma-separated values page on

the wiki

Data downloaded and analyzed off-

line in the R programming language

Resulting plot added to the relevant

wikipage.

Finn Arup Nielsen 33 August 3, 2011

Page 35: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Meta-analysis with a MediaWiki-based wiki

Data sets from structural neu-

roimaging meta-analyses across

multiple brain regions by ??

Represent the data a comma-

separated values on a separate

page.

Web script extracts the data

and present a meta-analysis on

a Web page.

Online forest plot and funnel

plots.

Finn Arup Nielsen 34 August 3, 2011

Page 36: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Data quality

Spamming is a serious problem on smaller open wikis.

On an open wiki you need daily surveillance and block vandals and delete

pages

MediaWiki has numerous methods deal with spamming

MediaWiki can also restrict who is able to edit and signup

On the Brede Wiki incomplete data entry is indicated with a prominent

red color

What data is incomplete is not apparent for users during search

Finn Arup Nielsen 35 August 3, 2011

Page 37: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Data quality: Outlier detection

What about data entry

errors and other percu-

liarities?

Data mining for outliers

using an automated al-

gorithm that looks at

the redundancy between

the anatomical label and

the 3D coordinate (?).

Here “parietal” in “left

superior parietal lobe”

does not “fit” with z = −53 and “right” in “Right occipitotemporal

cortex” does not fit with x = −50.

Finn Arup Nielsen 36 August 3, 2011

Page 38: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Open science and Data sharing

Open Science = Open Methods + Open Data

Data from the Brede Database distributed as XML, comma-separated

values file, JSON, SKOS.

Brede Database aggregated into Antonia Hamilton’s AMAT, — an thereby

also SumsDB.

Federated into Neuroscience Information Framework

Acawiki, http://acawiki.org/, academic summaries.

Finn Arup Nielsen 37 August 3, 2011

Page 39: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

The Brede Wiki available from

http://neuro.imm.dtu.dk/wiki/

Brede Database

http://neuro.imm.dtu.dk/services/brededatabase

Brede Toolbox

http://neuro.imm.dtu.dk/software/brede

Finn Arup Nielsen 38 August 3, 2011

Page 40: Online open neuroimaging mass meta-analysis

Online mass meta-analysis

Thanks!

Finn Arup Nielsen 39 August 3, 2011