medical image registration

Medical Image Registration

Kumar Rajamani

Registration

Spatial transform that maps points from one image to corresponding points in another image

xy

zTime axis: aging, development…

………

… … …

Anatomy

Label

Function

Pathology,Histology…

Cross-subject, same modality

Sam

e s

ub

ject,

cro

ss m

od

ality

…

Commoncoordinatesystem

Image Registration

Data Fusion

Atlases for population studies

Motion Correction for image reconstruction

Surgery Planning

Affine Scale / Skew

Example Registration Transformations

Rigid Deformable

SimilarityRigid + Scale

Original

5

Image comparison

Difference beforeregistration

Difference afterregistration

Registeredmoving image

Before Registration

B-Splines based Registration

After Registration

*GE Owned

B-Splines based Registration (Hill et al.)

Dense NR Registration (Hill et al.)

Registration Criteria

Landmark-based– Features selected by the user

Segmentation-based– Rigidly or deformably align binary

structures Intensity-based

– Minimize intensity difference over entire image

Spatial Transformation

Rigid– Rotations and translations

Affine– Also, skew and scaling

Deformable– Free-form mapping

Registration framework pieces

2 input images, fixed and moving Metric - determines the “fitness” of the

current registration iteration Optimizer - adjusts the transform in an

attempt to improve the metric Interpolator - applies transform to image

and computes sub-pixel values

11

Registration — General Problem

Problem: Find transform T such that image f[p] matches m[T(p)]

Domain ofan individual

subject

Commoncoordinatesystem

f m

)( pq T

transformmetricoptimizer

)]([m],[fminarg ppd T

T

Interpolate

Registration Framework

Transforms

x’=T(x|p)=T(x,y|tx,ty,θ)

Goal: Find parameter values that optimize image similarity metric

y

x

t

t

y

x

y

xx

cossin

sincos

'

''

Optimizer

Often require derivative of image similarity metric (S)

j j

j

ji

x

x

MFSMFS

p

p

p

p '

'

),,|(),,|(

m

nn

m

m

p

x

p

x

p

x

p

x

p

x

p

xp

x

p

x

p

x

J

'''

'''

'''

1

1

1

2

2

2

1

2

1

2

1

1

1

Understanding the Transform Jacobian

J shows how changing p shifts a transformed point in the moving image space.

This allows efficient use of a pre-computed moving-image gradient to infer changes in corresponding-pixel intensities for changes in p

Now we can update dS/dp by just updating J

16

Jacobian and Image Gradient

Transforms

Before we discuss specific transforms, let’s discuss the…

Fixed Set = the set of points (i.e. physical coordinates) that are unchanged by the transform

The fixed set is a very important property of a transform

18

Identity Transform

Maps every point to itself Only used for testing Fixed set = everything (i.e., the entire

space)

Translation Transform Fixed set = empty set Translation can be closely approximated by:

– Small rotation about distant origin, and/or…– Small scale about distant origin– Both of these do have a fixed point

Optimizers will frequently (accidently) do translation by using either rotation or scale– This makes the optimization space harder to use– The final transform may be harder to understand

20

Translation Transform

Fixed set is an empty set

Scaling Transform

Isotropic vs. anisotropic Fixed set is the origin of the coordinates

C

C

2D Rotation Transform

Rotation transforms are typically specific to either 2D or 3D

Fixed set = origin = “center” = C

24

C

C

Rotations in Polar Coordinates

y

x

y

xx

cossin

sincos

'

''

iii

i i

erereyx

rrreyx

)()','(

)sin,cos(),(

Transforms

Affine TransformS=Id reduces to Rigid case,

(e.g. Head motion correction)

Affine Transforms also encode scale/skew factors

tAx

txRSxT tS

.],,[

Scaling Rotation Translation

Original

Rigid

Optimization

Search for value of θ that minimizes cost function S

Gradient descent algorithm– Update of parameter

– G is the variation from the gradient of the cost function

is step length of algorithm

)exp(

'

' Geeee

S

Si

iii

Sequence of translations and metric values at each iteration of the optimizer

Scaling, Rotation,Translation

P=arbitrary point C=fixed point of transformation D=scaling factor Θ=rotation angle P and C are complex numbers (x+iy) or reiθ

Store derivates of P in Jacobian matrix for optimizer

Rigid if D=1, otherwise similarity transform

CDeCPPTP i)()('

Affine Transformation

Collinearity is preserved x’=A x + T A is a complex matrix of coefficients With fixed point

– x’=A (x–C) + C A is optimized similar to the scaling

factor

Quaternions

Quotient of two vectors– Q= A / B

Operator that produces second vector– A= Q B

Represents orientation of one vector with respect to another, as well as ratio of their magnitudes– Versor-rotates vector– Tensor-changes vector magnitude

Scalars and Versors

Quaternion represented by 4 numbers– Versor

• Direction – parallel to axis of rotation• Rotation angle• Norm – function of rotation angle

– Tensor• Magnitude

Rigid Transform in 3D

Use quaternions instead of phasors P’=V*(P-C)+C P’=V*P+T, T=C-V*C P=point, V=Versor, T=Translation, C=fixed

point Transform represented by 6 parameters

– Three numbers representing versor– Three components of fixed coordinate system

Image Interpolators

2 functions– Compute interpolated intensity at

requested position– Detect whether or not requested position

lies within moving-image domain

Nearest Neighbor

Uses intensity of nearest grid position Computationally cheap Doesn’t require floating point

calculations

Linear Interpolation

Computed as the weighted sum of 2n-1 neighbors

n=dimensionality of image Weighting is based on distance

between requested position and neighbors

B-spline Interpolation

Intensity calculated by multiplying B-spline coefficients with shifted B-spline kernels

Higher spline orders require more pixels to computer interpolated value

Third-order B-spline kernels typically used because good tradeoff between smoothness and computational burden

Metrics

Scalar function of the set of transform parameters for a given fixed image, moving image, and transformation type

Typically samples points within fixed image to compute the measure

Mean Squares

Mean squared difference over all the pixels in an image

Intensities are interpolated for the moving image

For gradient-based optimization, derivative of metric is also required

2))],(()([1

),,|( pxMxFN

MFpS i

N

i i TT

Mean Squares

Optimal value of zero Interpolator will affect computation time

and smoothness of metric plot Assumes intensity representing the

same homologous point is in both images

Images must be from same modality

Mean Squares

Smoothness affected by interpolator

Normalized Correlation

Computes pixel-wise cross-correlation between the intensity of the two images, normalized by the square root of the autocorrelation of each image

For two identical images, metric =1 Misalignment, metric <1


-1 added for minimum-seeking optimizers

N

i

N

iii

N

iii

xTMxF

xTMxFMFpS

))(()(

))(()(1),,|(

22

T

Difference Density

Each pixel’s contribution is calculated using bell-shaped function

f(d) has a maximum of 1 at d=0 and minimum of zero at d=+/-infinity

d is difference in intensity b/w F and M

2)/(1

1)(

ddf

Multi-modal Volume Registration by Maximization of Mutual Information

Wells W, Viola P, Atsumi H, Nakajima S, Kikinis R

Registering Images from Same Modality Typical measure of error is sum of

squared differences between voxels values

This measure is directly proportional to the likelihood that the images are correctly registered

Same measure is NOT effective for images of different modalities

Figure 8.9 from the ITK Software Guide v 2.4, by Luis Ibáñez, et al.62

MI Inputs

T1 MRI Proton density MRI

Relationship Between Images of Different Modalities Example: We should be able to construct a

function F() that predicts CT voxel value from corresponding MRI value

Registration could be evaluated by computing F(MR) and comparing it to the CT image– Via sum of squared differences (or correlation)

In practice, this is a difficult and under-determined problem

Mutual Information

Theory: Since MR and CT both describe the underlying anatomy, there will be mutual information between the two images

Find the best registration by maximizing the information that one image provides about the other

Requires no a priori model of the relationship Assumes that max. info. is provided when the

images are correctly registered

Notation

Reference (fixed) volume: u(x) Test (moving) volume: v(x) x: coordinates of the volume T: transformation from coordinate frame

of reference volume to test volume v(T(x)): test volume voxel associated

with reference volume voxel u(x)

Mutual Information

Defined in terms of entropies

If there are any dependencies, H(A,B)<H(A)+H(B)

)BA,()B()A()BA,(

),(log),()BA,(

)(log)()B(

)(log)()A(

HHHI

dabapbapH

dbbpbpH

daapapH

ABAB

BB

AA

Maximizing Mutual Information

h(v(T(x))) encourages transformations that project u into complex parts of v

Last term of MI eqn contributes when u and v are functionally related

Together, last two terms of MI eqn identify transforms that find complexity and explain it well

)))((),(()))((())(()))((),((

)))((),((maxargˆ

xTvxuhxTvhxuhxTvxuI

xTvxuITT

Parzen Windowing

Used to estimate probability density P*(z)

Entropy estimated based on P*(z)

Finding Maximum of I(T)

To find maximum of mutual information, calculate its derivative:

Derivative of reference volume is 0, b/c not a function of T

Entropies depend on covariance of Parzen window functions

)))((),((*)))(((*))((*)( xTvxuhdT

dxTvh

dT

dxuh

dT

dTI

dT

d

Stochastic Maximization of Mutual Information

Similar to gradient descent Steps are taken that are proportional to dI/dT Repeat:

– A {sample of size NA drawn from x}

– B {sample of size NB drawn from x}

– T T+λ(dI/dT)

λ is the learning rate Repeated a fixed # of times, or until

convergence

Stochastic Approximation

Uses noisy derivative estimates instead of the true derivative for optimizing a function

Authors have found that technique always converges to a transformation estimate that is close to locally optimal– NA=NB=50 has been successful

The noise introduced by the sampling can effectively penetrate small local minima

MRI-CT Example

Coarse to fine registration Images were smoothed by convolving with

binomial kernel Rigid transform represented by displacement

vectors and quaternions Images were sampled and tri-linear

interpolation was used 5 levels of resolution

– 10000, 5000(*4) iterations

Initial Condition of MR-CT Registration

Final Configuration for MR-CT Registration

Initial Condition of MR-PET Registration

Final Configuration for MR-PET Registration

Application Register 2 MRIs of brain (SPGR and

T2-weighted) to visualize anatomy and tumor– Create at 3-D model for surgical planning

and visualization

3-D Model

Tumor(green), Vessels(red), Ventricles(blue), Edema (orange)

Correlation Conventional correlation aligns two

signals by minimizing a summed quadratic difference between their intensities

If intensity of one signal is negated, then intensities no longer agree, and alignment by correlation will fail

Mutual information is not affected by negation of either signal

Occlusion

Correlation is significantly affected by occlusion because intensity is substantially different

Occlusion will reduce mutual information at alignment– But “mutual information measure degrades

gracefully when subject to partially occluded imagery”

Comparison to Other Methods

Many researchers use surface-based methods to register MRI and PET imagery– Need for reliable segmentation is a drawback

Others use joint entropy to characterize registration– “not robust”: difficulty describing partial overlap– Mutual Information is better because it has a larger

capture range• Additional influence from term that rewards for complexity

in portion of test volume into which reference volume is transformed

Comparison to Other Methods

Woods et al. register MR and PET by minimizing range of PET values associated with a particular MR intensity value– Effective when test volume distribution is

Gaussian– Mutual Information can handle data that is multi-

modal– Woods’ measure is sensitive to noise and outliers

Registration Tools/ Softwares

Elastix (ITK) ANTS (ITK) Plastimatch Slicer3D (ITK) MevisLab Fiji

Conclusions

Intensity based techniques work directly with volumetric data (vs. segmentation)

Mutual information does not rely on assumptions about nature of imaging modalities

Have also used this technique to register 3D volumetric images to video images of patients

April 2011

87

Kumar Rajamani

GE Global Research

[email protected]

medical image registration

Documents