a geometric database for gene expression data

25
A Geometric Database for Gene Expression Data Baylor College of Medicine Gregor Eichele Christina Thaller Wah Chiu James Carson Rice University Tao Ju Joe Warren

Upload: barto

Post on 22-Feb-2016

37 views

Category:

Documents


0 download

DESCRIPTION

A Geometric Database for Gene Expression Data. Rice University Tao Ju Joe Warren. Baylor College of Medicine Gregor Eichele Christina Thaller Wah Chiu James Carson. Overview. Genes are blueprints for creating proteins - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: A Geometric Database for Gene Expression Data

A Geometric Database for Gene Expression Data

Baylor College of MedicineGregor EicheleChristina ThallerWah ChiuJames Carson

Rice UniversityTao JuJoe Warren

Page 2: A Geometric Database for Gene Expression Data

Overview

• Genes are blueprints for creating proteins• For given tissue, only a subset of genes are

generating proteins (expressed)• New laboratory method for determining which

genes are being expressed (Eichele)• Collect expression data over mouse brain for all

30K genes in mouse genome• Problem: Compare expression of different images

Page 3: A Geometric Database for Gene Expression Data

Gene Expression Images

Page 4: A Geometric Database for Gene Expression Data

Example Query

Page 5: A Geometric Database for Gene Expression Data

Brain Atlas

• Difficulty in comparing expression images– Variations in image– No explicit boundaries of anatomical regions

• Solution: Brain atlas– Deformable to images– Explicit modeling of anatomical boundaries– Store the expression data on the atlas– Efficient searching

Page 6: A Geometric Database for Gene Expression Data

• Standard – label image with anatomical regions, deformed onto target image using uniform grid• Brain Warping [Toga, 1999]

• Other deformable modeling tools• Active contours, simplex meshes, etc.

Brain Atlas: Review

Page 7: A Geometric Database for Gene Expression Data

• Model brain as a coarse quad mesh with each quad assigned to an anatomical region

• Edges shared by two quads from different regions defined a network of crease edges

• Subdivision of crease edges yields a network of smooth creases curves bounding regions

Subdivision Mesh as Brain Atlas

Page 8: A Geometric Database for Gene Expression Data

Gene Expression Database

• Collect gene expression data at key cross-sections

• Deform subdivision meshes at those cross-sections onto expression images – Semi-automatic fitting algorithm

• Store gene expressions back onto the mesh.– Multi-resolution structure accelerates comparison

Page 9: A Geometric Database for Gene Expression Data

Mesh Fitting

• Global fitting– Accounts for deformation resulted from imaging

• Local fitting– Accounts for anatomical deviation and tissue distortion

in sectioning– Minimize deviation of the mesh boundary from the

image boundary (Scattered data fitting [Hoppe, 1996])– Relax the internal mesh vertices under energy

constraints

Page 10: A Geometric Database for Gene Expression Data

Minimizing Deformation Energy• Penalize non-affine deformation of the mesh

during the fitting process– Triangulate each quad – Penalize deviation:

T (p4)p1

p3 p2

p4 p1= T (p1)

p3= T (p3) p2= T (p2)

T p4

ˆ

ˆ ˆ

ˆ

• Related to mesh parameterization

4 4( )p T p

Page 11: A Geometric Database for Gene Expression Data

Fitting Results

0

0.02

0.04

0.06

0.08

0.1

0.12

0 20 40 60 80 100

Error plot before and after global fit for 110 images.

Page 12: A Geometric Database for Gene Expression Data

• Automatic annotation – Distribution: ubiquitous, scattered, regional, none.– Strength: +++, ++, +, -– Apply filter to determine distribution and strength of

expression using data stored with the mesh quads• Optimized searching

– Using the multi-resolution structure of subdivision mesh

– Based on Multiscale Image Searching [Chen et.al. 97]– Works with convex norms: L1, Chi-square, etc.– Graphical search interface

Storing Expression With Atlas

Page 13: A Geometric Database for Gene Expression Data

• Database of gene expression data and deformed atlases– currently 1207 images from 110 genes.

• Web server: www.geneatlas.org – Viewing and downloading expression images.– Viewing atlases (using Java Applet).– Graphical interface for searching gene images.– Textual interface for searching annotation.

• It’s all online!

Accessing the Database via the Web

Page 14: A Geometric Database for Gene Expression Data

• Build 3D atlas for mouse brain – Represented as subdivision solid– Partitioned into anatomical regions by surface network– Supports fully 3D queries

• Future work– Deform the mesh onto expression images– Store the expression data onto the mesh– Efficient searching algorithm– User interface to pose 3D queries

Current Work and Future Plans

Page 15: A Geometric Database for Gene Expression Data

Conclusion

• Subdivision meshes for anatomic modeling:

– Flexible control allows easy deformation.

– Explicit modeling of region boundaries.

– Fast multi-resolution comparison of data.

Page 16: A Geometric Database for Gene Expression Data

Acknowledgement

This work is supported in part by:

• A training fellowship from the W.M. Keck Foundation to the Gulf Coast Consortia through the Keck Center for Computational and Structural Biology.

• The Burroughs Wellcome Fund, NLMT15LM07093 and NIHP41RR02250.

• NSF grant ITR-0205671.

Page 17: A Geometric Database for Gene Expression Data

• Identify major anatomical regions in the Paxino’s Atlas (coronal figures).

• Layout triangular mesh for each coronal figure that conforms to region boundaries.

• Construct prisms from triangles, and fit the subdivided volume to the underlying data.

Constructing a Partitioned 3D Atlas

Page 18: A Geometric Database for Gene Expression Data

• Coloring of major anatomical regions in each coronal figure in the Paxino’s Atlas. (Online)

Electronic Paxino’s Atlas

Page 19: A Geometric Database for Gene Expression Data

• Pack uniform triangular grid into anatomic regions, annotated with colors.

• Identify and group consecutive meshes with same topology into one Layer.

2D Triangular Meshing

Page 20: A Geometric Database for Gene Expression Data

• Construct triangular prisms for each layer. (no topology changes)

• Color each prism by the color of the triangles.

• Crease faces: separate the volume into sub-volumes corresponding to each anatomic region.

Crease quad Crease triangle

3D Layered Mesh

Page 21: A Geometric Database for Gene Expression Data

Subdivided Mesh

Page 22: A Geometric Database for Gene Expression Data

3D Brain Anatomy

Page 23: A Geometric Database for Gene Expression Data

• Deform layers in z-direction to more accurately fit boundaries of anatomical regions

• Optimize surface network to fit data and bend minimally

Fit Layered Mesh

Page 24: A Geometric Database for Gene Expression Data

• Apply filter to high-res raw images and compute low-res expression images

• Align images in z-direction using centers of mass, rotate in x-y plane using line of symmetry

• Determine tilt angle of each image versus z-axis using cross-correlation to synthetic cut of atlas

• Fit cross-sections of 3D atlas to the images using deformable modeling methods.

• Map expression data from image back onto 3D prisms that intersect the image plane.

Mapping Expression Data onto Atlas

Page 25: A Geometric Database for Gene Expression Data

• Specify 3D query regions using 2D layering– Select set of triangles in 2D layer view, visualize

corresponding layer of triangular prisms in 3D.– GUI: Separate views of selection window (2D) and

volume-viewing window (3D).

• Display of 3D search results– Quick view of 3D expression patterns as point clouds

lying in the expression image slices.– Optionally, view of raw 2D expression images used to

generate 3D point clouds (with links to genepaint.org).

Querying the 3D Database