2203937 active data pools on tsm
DESCRIPTION
active data poolsTRANSCRIPT
IBM Software Group
September 2007 | Advanced Technical Support © 2007 IBM Corporation
Support Technical Exchange
Understanding Tivoli Storage Manager Active Data Pools
Dave DaunIBM Advanced Technical [email protected]
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 2
Agenda
Overview of Active Data Pools
Defining Active Data Pools
Removing Inactive Files from an Active Data Pool
Restoring from Active Data Pools
Restrictions and Potential Issues
Implementation Ideas
Demonstration of Active Data Pools
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 3
Active Data Pools
New feature in TSM 5.4► Long standing requirement from the TSM community
► New Storage Pool Type
► Only for Backup data
► Only for Sequential Access pools
► Implementation is similar to Copy Storage Pools
► Restore will come automatically from the Active Data Pool
Why use Active Data Pools► Faster restore of data
► Useful for disaster recovery
► Minimizes tape mount activity during restore
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 4
TSM Pool Types
Random Disk Sequential Disk
(FILE)
Primary
Copy Pool
Active Data Pool
Sequential TapeSequential Disk
(FILE)
Sequential Tape Sequential Disk
(FILE)
Sequential Tape
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 5
Collocation of Active DataActive-data pool on
sequential-access disk for fast restoreStorage hierarchy contains
active and inactive dataActive/inactive data in copy pool for disaster recovery
New
Active backup versions are those that still reside on client machine (have not been modified or deleted)
Active data pools are sequential-access--typically sequential-access disk. Can also be on tape.
Active backup versions exist in both a primary pool and one or more active data pools.
Active data pools are populated either synchronously with backup or asynchronously (similar to copy pools)
► Synchronous copy via a primary pool definition
► Asynchronous copy with new command: COPY ACTIVEDATA
► Asynchronous only for LAN-Free backups
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 6
Removing Inactive Versions from Active Data Pools
Inactive versions in a Active Data Pool are treated as expired files
► Version is inactivated when a new backup version is received from the client
Reclamation removes inactive versions from Active Data Pools
► Reclamation must be run regularly
► MOVE DATA or MOVE NODEDATA will also remove inactive versions from an Active Data Pool
• RECONSTRUCT defaults to YES on MOVE DATA or MOVE NODEDATA in an Active Data Pool
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 7
Active Data Pools Disk vs. Tape
Benefits of Active Data Pools on sequential disk
► Optimized access to active versions for fast restore
► Reduced size of disk pool if only active versions are stored
► Reduced data movement in preparation for restore of active data
Benefits of Active Data Pools on sequential tape
► Reduced storage requirement (onsite or offsite) while protecting data against media failure or disaster
► Simplified tape management because of fewer tapes
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 8
Defining an Active Storage Pool
Define the pool
► DEFINE STGPOOL ACTIVE_POOL … POOLTYPE=ACTIVEDATA
Add the active data pool to an existing primary pool definition (optional)
► UPDATE STGPOOL PRIM_POOL … ACTIVEDATAPOOLS=ACTIVE_POOL
Schedule a command to copy from the primary pool to the active pool
► COPY ACTIVEDATA PRIM_POOL ACTIVE_POOL MAXPROC=4
Create a new policy domain to enable active data pools (or update an existing domain)
► DEFINE DOMAIN ACT_DOMAIN … ACTIVEDESTINATION=ACTIVE_POOL
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 9
Controlling Which Nodes Use Active Data Pools
Policy Set
Policy Domain
ACTIVEDESTINATION
Management Class
Copygroup
NODES
Active Pool
Primary Pool
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 10
Controlling Which Nodes Use Active Data Pools
1. Use an existing domain: Simply use the UPDATE DOMAIN command and specify the list of
active-data pools in the ACTIVEDESTINATION parameter.
2. Restrict which nodes can write to an active-data pool:
Decide which nodes are critical and should use Active Data Pools
Define a new domain and specify the list of active-data pools in the
ACTIVEDESTINATION parameter.
Update the node definitions to assign the nodes to the new domain
Complete the policy structure
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 11
Getting Active Versions Into an Active Data Pool
Using simultaneous copy (not available for LAN-Free)► New parameter on DEFINE and UPDATE STGPOOL command—ACTIVEDATAPOOLS
► Specifies the names of active-data pools where the server simultaneously writes data during a client backup operation.
► The combined total number of storage pools in the COPYSTGPOOLS and the ACTIVEDATAPOOLS cannot exceed three.
COPY ACTIVEDATA command► Use this command to copy active versions of backup data from a primary storage pool to
an active-data pool.
► Similar to BACKUP STGPOOL
► Should be used in addition to simultaneous copy
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 12
Removing Inactive Files from an Active Data Pool
When files are deactivated, the logical occupancy of the volume it is stored on is reduced by the size of the file. This causes reclamation to run sooner for the volume.
Reclamation (reconstruction) is used to remove inactive files from the active data pool.
Reconstruction will build the new aggregate by copying just the active files and not any deleted or inactive files.
The COPY ACTIVEDATA command and MOVE DATA/NODEDATA commands will reconstruct aggregates as they are copied or moved within an active-data pool. Do not specify RECONSTRUCT=NO on the MOVE DATA/NODEDATA commands when dealing with active-data pools.
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 13
Collocation of Active Data – An Example
Client
Active data pool
A0 B0 C0 D0
A0 B0 C0 D0
A0 B0 C0 D0
1. Client backs up A0, B0, C0, D0 to primary pool with simultaneous write to active data pool.
Server
Active/inactive primary pool
B1A0 B0 C0 D0 E1
Client Server
B1 E1
B1A0 B0 C0 D0 E1
B1A0 B0 C0 D0 E1
2. Client backs up B1, E1 with simultaneous write to active data pool. B0 deactivated.
B1A0 B0 C0 D0 E1
Client Server
B1A0 B0 C0 D0 E1
B1A0 C0 D0 E1
3. Reclamation removes inactive B0 from active data pool.
B1A0 B0 C0 D0 E1
Client Server
B1A0 B0 C0 D0 E1
B1A0 C0 D0 E1
4. Client restores active files A0, C0, D0, B1, and E1 from active data pool.
A0 C0 D0 B1 E1
Active/inactive primary pool
Active/inactive primary pool
Active/inactive primary pool
Active data pool Active data pool
Active data pool
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 14
Client data restore when using Active Data Pools
When restoring client data, the server will select the active version of a file from an Active Data Poll when appropriate. The restore order has been modified to include Active Data Pools:
1. Active Data Pool (FILE)
2. DISK (random)
3. FILE (primary or copypool).
4. Active Data Pool sequential onsite.
5. Sequential onsite-volume (primary, copy or Active Data Pool), priority considers: mounted(idle) -> automated -> manual
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 15
Restoring Volumes or Storage Pools with Active Data Pools
Two new parameters on the RESTORE STGPOOL and RESTORE VOLUME commands:
► ACTIVEDATAONLY
• Specifies that active versions of backup files are to be restored from active-data pools only.
► ACTIVEDATAPOOL
• Specifies the name of the active data pool from which the active versions of backup files are to be restored. If this parameter is not specified, files are restored from any active data pool in which active versions of backup files can be located.
Note: Inactive files cannot be restored from active data pool and will be deleted from the primary pool.
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 16
Active Data Pools - The Q OCC and Q CONTENT commands
Query Occupancy Command:
► Number of Files
• Reduced when non-aggregated files are deleted during deactivation
• Reduced by number of logical files only when aggregate is deleted► Physical Size reflects the actual size of the aggregates (as it does today)
► Logical Space Occupied
• Reduced by size of logical file being deleted
• Reduced by size of logical file that is removed during reclamation (reconstruction)
Query Content Command:
► Will show all logical files in an aggregate regardless if it has been removed
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 17
Active Data Pools – Restrictions and Potential Issues
Can only be used for Backup data, not Archive data or HSM data► NDMP Backups do not go to Active Data Pools
Performing a point-in-time restore from active data pools is not allowed.
Must plan for time needed to do active data copies► Concurrent copies will be slower
► COPY ACTIVEDATA may be an additional process
COPY ACTIVEDATA runs slower than BACKUP STGPOOL► Difference is due to additional processing needed to verify the active/inactive
state of logical files in aggregates and act accordingly.
► Must check ownership of files and see of Node is authorized to used Active Data Pools for each file
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 18
Active Data Pool Implementation Ideas
Use an Active Data Pool as a substitute for “disk only” pool
► Reduces the size of the disk requirement
► Faster restores from disk (non-point-in-time restores)
BackupsActivePool
PrimaryPool
TapeCopyPool(for offsite)
Migration
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 19
Active Data Pool Implementation Ideas
Use an Active Data Pool for on-site media recovery
► Reduces the size of the tape requirement
► Faster restores from tape (non-point-in-time restores)
► No DRM support for Active Data Pools – offsite would be difficult
BackupsActivePool
PrimaryPool
TapeCopyPool(for offsite)
Migration
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 20
Active Data Pool Implementation Ideas
Use an Active Data Pool as a staging area for an anticipated restore
► Reduces the size of the disk staging area
► Faster restores when enough time is available to perform the staging operation
► Use a special Policy Domain to enable a single node copy
BackupsActivePool
PrimaryPool
TapeCopyPool(for offsite)
Migration
Copy when needed
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 21
Active Data Pool Implementation Ideas
Use an Active Data Pool as an electronic vaulting target
► Reduces the amount of data which must be stored in the remote location
► Faster disaster restores
► Replaces the Copy Storage Pool
BackupsActivePool
PrimaryPool
Migration
ExtendedSAN
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 22
Active Data Pool Demonstration
Server Layout
DISKPOOL1RandomPrimary Pool
DISKPOOL2RandomPrimary Pool
ACTPOOLSequentialActive Data Pool
POOL101POOL102
POOL201
STGPOOL VOLUMES
VOL001VOL002…..VOL014
DEVCLASS
DISK
ACTDEV
NODE
ACTNODE1(ACTDOM1)
ACTNODE2(ACTDOM2)
Con
curr
ent
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 23
Active Data Pool Demonstration
Server Scripts
►FILEVOLS – displays the status, utilization and percent reclaimable of the ACTPOOL volumes
►RECLAIM – runs reclamation on the ACTPOOL with a threshold of 10%
►ACTLOG5 – displays the last five minutes of Activity Log records
►DISK1DESTROY – marks POOL101 volume in DISKPOOL1 destroyed and off-line
Windows Scripts
►Touchfiles – updates the last access date of specified number of files
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 24
Active Data Pool Demonstration
Highlights
►Backup repeatedly and run reclamation to see the inactive files removed from the active pool
►Backup with concurrent copy to the active pool
►Use the COPY ACTIVEDATA command to populate the active pool
►Restore data from an active data pool
►Destroy a primary volume and restore from the active pool
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 25
Active Data Pools – Q DOMAIN
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 26
Active Data Pools – Q STGPOOL
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 27
Active Data Pools – Q STGPOOL
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 28
Active Data Pools – COPY ACTIVEDATA
IBM Software Group | Tivoli software
TSM Active Data Pools | September 2007 l © 2007 IBM Corporation 29
Questions???