2203937 active data pools on tsm

29
IBM Software Group September 2007 | Advanced Technical Support © 2007 IBM Corporation http://w3.ibm.com/ibm/presentations Support Technical Exchange Understanding Tivoli Storage Manager Active Data Pools Dave Daun IBM Advanced Technical Support [email protected]

Upload: dens-cant-be-perfect

Post on 03-Jan-2016

55 views

Category:

Documents


1 download

DESCRIPTION

active data pools

TRANSCRIPT

Page 1: 2203937 Active Data Pools on TSM

IBM Software Group

September 2007 | Advanced Technical Support © 2007 IBM Corporation

Support Technical Exchange

Understanding Tivoli Storage Manager Active Data Pools

Dave DaunIBM Advanced Technical [email protected]

Page 2: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 2

Agenda

Overview of Active Data Pools

Defining Active Data Pools

Removing Inactive Files from an Active Data Pool

Restoring from Active Data Pools

Restrictions and Potential Issues

Implementation Ideas

Demonstration of Active Data Pools

Page 3: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 3

Active Data Pools

New feature in TSM 5.4► Long standing requirement from the TSM community

► New Storage Pool Type

► Only for Backup data

► Only for Sequential Access pools

► Implementation is similar to Copy Storage Pools

► Restore will come automatically from the Active Data Pool

Why use Active Data Pools► Faster restore of data

► Useful for disaster recovery

► Minimizes tape mount activity during restore

Page 4: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 4

TSM Pool Types

Random Disk Sequential Disk

(FILE)

Primary

Copy Pool

Active Data Pool

Sequential TapeSequential Disk

(FILE)

Sequential Tape Sequential Disk

(FILE)

Sequential Tape

Page 5: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 5

Collocation of Active DataActive-data pool on

sequential-access disk for fast restoreStorage hierarchy contains

active and inactive dataActive/inactive data in copy pool for disaster recovery

New

Active backup versions are those that still reside on client machine (have not been modified or deleted)

Active data pools are sequential-access--typically sequential-access disk. Can also be on tape.

Active backup versions exist in both a primary pool and one or more active data pools.

Active data pools are populated either synchronously with backup or asynchronously (similar to copy pools)

► Synchronous copy via a primary pool definition

► Asynchronous copy with new command: COPY ACTIVEDATA

► Asynchronous only for LAN-Free backups

Page 6: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 6

Removing Inactive Versions from Active Data Pools

Inactive versions in a Active Data Pool are treated as expired files

► Version is inactivated when a new backup version is received from the client

Reclamation removes inactive versions from Active Data Pools

► Reclamation must be run regularly

► MOVE DATA or MOVE NODEDATA will also remove inactive versions from an Active Data Pool

• RECONSTRUCT defaults to YES on MOVE DATA or MOVE NODEDATA in an Active Data Pool

Page 7: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 7

Active Data Pools Disk vs. Tape

Benefits of Active Data Pools on sequential disk

► Optimized access to active versions for fast restore

► Reduced size of disk pool if only active versions are stored

► Reduced data movement in preparation for restore of active data

Benefits of Active Data Pools on sequential tape

► Reduced storage requirement (onsite or offsite) while protecting data against media failure or disaster

► Simplified tape management because of fewer tapes

Page 8: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 8

Defining an Active Storage Pool

Define the pool

► DEFINE STGPOOL ACTIVE_POOL … POOLTYPE=ACTIVEDATA

Add the active data pool to an existing primary pool definition (optional)

► UPDATE STGPOOL PRIM_POOL … ACTIVEDATAPOOLS=ACTIVE_POOL

Schedule a command to copy from the primary pool to the active pool

► COPY ACTIVEDATA PRIM_POOL ACTIVE_POOL MAXPROC=4

Create a new policy domain to enable active data pools (or update an existing domain)

► DEFINE DOMAIN ACT_DOMAIN … ACTIVEDESTINATION=ACTIVE_POOL

Page 9: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 9

Controlling Which Nodes Use Active Data Pools

Policy Set

Policy Domain

ACTIVEDESTINATION

Management Class

Copygroup

NODES

Active Pool

Primary Pool

Page 10: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 10

Controlling Which Nodes Use Active Data Pools

1. Use an existing domain: Simply use the UPDATE DOMAIN command and specify the list of

active-data pools in the ACTIVEDESTINATION parameter.

2. Restrict which nodes can write to an active-data pool:

Decide which nodes are critical and should use Active Data Pools

Define a new domain and specify the list of active-data pools in the

ACTIVEDESTINATION parameter.

Update the node definitions to assign the nodes to the new domain

Complete the policy structure

Page 11: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 11

Getting Active Versions Into an Active Data Pool

Using simultaneous copy (not available for LAN-Free)► New parameter on DEFINE and UPDATE STGPOOL command—ACTIVEDATAPOOLS

► Specifies the names of active-data pools where the server simultaneously writes data during a client backup operation.

► The combined total number of storage pools in the COPYSTGPOOLS and the ACTIVEDATAPOOLS cannot exceed three.

COPY ACTIVEDATA command► Use this command to copy active versions of backup data from a primary storage pool to

an active-data pool.

► Similar to BACKUP STGPOOL

► Should be used in addition to simultaneous copy

Page 12: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 12

Removing Inactive Files from an Active Data Pool

When files are deactivated, the logical occupancy of the volume it is stored on is reduced by the size of the file. This causes reclamation to run sooner for the volume.

Reclamation (reconstruction) is used to remove inactive files from the active data pool.

Reconstruction will build the new aggregate by copying just the active files and not any deleted or inactive files.

The COPY ACTIVEDATA command and MOVE DATA/NODEDATA commands will reconstruct aggregates as they are copied or moved within an active-data pool. Do not specify RECONSTRUCT=NO on the MOVE DATA/NODEDATA commands when dealing with active-data pools.

Page 13: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 13

Collocation of Active Data – An Example

Client

Active data pool

A0 B0 C0 D0

A0 B0 C0 D0

A0 B0 C0 D0

1. Client backs up A0, B0, C0, D0 to primary pool with simultaneous write to active data pool.

Server

Active/inactive primary pool

B1A0 B0 C0 D0 E1

Client Server

B1 E1

B1A0 B0 C0 D0 E1

B1A0 B0 C0 D0 E1

2. Client backs up B1, E1 with simultaneous write to active data pool. B0 deactivated.

B1A0 B0 C0 D0 E1

Client Server

B1A0 B0 C0 D0 E1

B1A0 C0 D0 E1

3. Reclamation removes inactive B0 from active data pool.

B1A0 B0 C0 D0 E1

Client Server

B1A0 B0 C0 D0 E1

B1A0 C0 D0 E1

4. Client restores active files A0, C0, D0, B1, and E1 from active data pool.

A0 C0 D0 B1 E1

Active/inactive primary pool

Active/inactive primary pool

Active/inactive primary pool

Active data pool Active data pool

Active data pool

Page 14: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 14

Client data restore when using Active Data Pools

When restoring client data, the server will select the active version of a file from an Active Data Poll when appropriate. The restore order has been modified to include Active Data Pools:

1. Active Data Pool (FILE)

2. DISK (random)

3. FILE (primary or copypool).

4. Active Data Pool sequential onsite.

5. Sequential onsite-volume (primary, copy or Active Data Pool), priority considers: mounted(idle) -> automated -> manual

Page 15: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 15

Restoring Volumes or Storage Pools with Active Data Pools

Two new parameters on the RESTORE STGPOOL and RESTORE VOLUME commands:

► ACTIVEDATAONLY

• Specifies that active versions of backup files are to be restored from active-data pools only.

► ACTIVEDATAPOOL

• Specifies the name of the active data pool from which the active versions of backup files are to be restored. If this parameter is not specified, files are restored from any active data pool in which active versions of backup files can be located.

Note: Inactive files cannot be restored from active data pool and will be deleted from the primary pool.

Page 16: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 16

Active Data Pools - The Q OCC and Q CONTENT commands

Query Occupancy Command:

► Number of Files

• Reduced when non-aggregated files are deleted during deactivation

• Reduced by number of logical files only when aggregate is deleted► Physical Size reflects the actual size of the aggregates (as it does today)

► Logical Space Occupied

• Reduced by size of logical file being deleted

• Reduced by size of logical file that is removed during reclamation (reconstruction)

Query Content Command:

► Will show all logical files in an aggregate regardless if it has been removed

Page 17: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 17

Active Data Pools – Restrictions and Potential Issues

Can only be used for Backup data, not Archive data or HSM data► NDMP Backups do not go to Active Data Pools

Performing a point-in-time restore from active data pools is not allowed.

Must plan for time needed to do active data copies► Concurrent copies will be slower

► COPY ACTIVEDATA may be an additional process

COPY ACTIVEDATA runs slower than BACKUP STGPOOL► Difference is due to additional processing needed to verify the active/inactive

state of logical files in aggregates and act accordingly.

► Must check ownership of files and see of Node is authorized to used Active Data Pools for each file

Page 18: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 18

Active Data Pool Implementation Ideas

Use an Active Data Pool as a substitute for “disk only” pool

► Reduces the size of the disk requirement

► Faster restores from disk (non-point-in-time restores)

BackupsActivePool

PrimaryPool

TapeCopyPool(for offsite)

Migration

Page 19: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 19

Active Data Pool Implementation Ideas

Use an Active Data Pool for on-site media recovery

► Reduces the size of the tape requirement

► Faster restores from tape (non-point-in-time restores)

► No DRM support for Active Data Pools – offsite would be difficult

BackupsActivePool

PrimaryPool

TapeCopyPool(for offsite)

Migration

Page 20: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 20

Active Data Pool Implementation Ideas

Use an Active Data Pool as a staging area for an anticipated restore

► Reduces the size of the disk staging area

► Faster restores when enough time is available to perform the staging operation

► Use a special Policy Domain to enable a single node copy

BackupsActivePool

PrimaryPool

TapeCopyPool(for offsite)

Migration

Copy when needed

Page 21: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 21

Active Data Pool Implementation Ideas

Use an Active Data Pool as an electronic vaulting target

► Reduces the amount of data which must be stored in the remote location

► Faster disaster restores

► Replaces the Copy Storage Pool

BackupsActivePool

PrimaryPool

Migration

ExtendedSAN

Page 22: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 22

Active Data Pool Demonstration

Server Layout

DISKPOOL1RandomPrimary Pool

DISKPOOL2RandomPrimary Pool

ACTPOOLSequentialActive Data Pool

POOL101POOL102

POOL201

STGPOOL VOLUMES

VOL001VOL002…..VOL014

DEVCLASS

DISK

ACTDEV

NODE

ACTNODE1(ACTDOM1)

ACTNODE2(ACTDOM2)

Con

curr

ent

Page 23: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 23

Active Data Pool Demonstration

Server Scripts

►FILEVOLS – displays the status, utilization and percent reclaimable of the ACTPOOL volumes

►RECLAIM – runs reclamation on the ACTPOOL with a threshold of 10%

►ACTLOG5 – displays the last five minutes of Activity Log records

►DISK1DESTROY – marks POOL101 volume in DISKPOOL1 destroyed and off-line

Windows Scripts

►Touchfiles – updates the last access date of specified number of files

Page 24: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 24

Active Data Pool Demonstration

Highlights

►Backup repeatedly and run reclamation to see the inactive files removed from the active pool

►Backup with concurrent copy to the active pool

►Use the COPY ACTIVEDATA command to populate the active pool

►Restore data from an active data pool

►Destroy a primary volume and restore from the active pool

Page 25: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 25

Active Data Pools – Q DOMAIN

Page 26: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 26

Active Data Pools – Q STGPOOL

Page 27: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 27

Active Data Pools – Q STGPOOL

Page 28: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 28

Active Data Pools – COPY ACTIVEDATA

Page 29: 2203937 Active Data Pools on TSM

IBM Software Group | Tivoli software

TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 29

Questions???