chapter 5: working with data sets

Introduction of z/OS Basics

Chapter 5: Working with data sets

Chapter 05 Working with Datasets

Chapter objectives

Be able to:

Explain what a data set is

Describe data set naming conventions and record formats

List some access methods for managing data and programs

Explain what catalogs and VTOCs are used for

Be able to create, delete, and modify data sets

Key terms in this chapter

block size

catalog

data set

high level qualifier (HLQ)

library

logical record length (LRECL)

member

PDS and PDSE

record format (RECFM)

system managed storage (SMS)

virtual storage access method (VSAM)

What is a data set?

A data set is a collection of logically related data records stored on one disk storage volume or a set of volumes.

A data set can be:– a source program– a library of macros– a file of data records used by a processing program.

You can print a data set or display it on a terminal. The logical record is the basic unit of information used by a program running on z/OS.

Dataset Naming

What an access method is

Defines the technique used to store and retrieve data.

Includes system-provided programs and utilities to define and process data sets.

Commonly used access methods include the following:

– VSAM: Virtual Storage Access Method

– QSAM: Queued Sequential Access Method

– BSAM: Basic Sequential Access Method

– BDAM: Basic Direct Access Method

– BPAM: Basic Partitioned Access Method

DASD: Use and terminology

Direct Access Storage Device (DASD) is another name for a disk drive.

DASD volumes are used for storing data and executable programs.

Data sets in a z/OS system are organized on DASD volumes.– A disk drive contains cylinders– Cylinders contain tracks– Tracks contain data records.

Datasets

Using a data set

To use a data set, you first allocate it.

– To set aside (create) space for a new data set on a disk.

– To establish a logical link between a job step and any data set

Then, access the data using macros for the access method that you have chosen.

Various ways to allocate a data set:– ISPF data set panel, option 3.2– Access Method Services– TSO ALLOCATE command – job control language (JCL)

Allocating space on DASD volumes

How space is specified:– explicitly (SPACE parameter)– implicitly (SMS data class) System Management Facilities

Logical records and blocks:– Smallest amount of data to be processed– Grouped in physical records named blocks

• Each block has a distinct location and unique address

Data set extents:

– Space for a disk data set is assigned in extents

• Contiguous number of disk drive tracks (or cylinders)– 128 to 255 extents per volume

Data set record formats

record

block block

record record record

record

recordrecord recordrecord record record

record record

record

record record

record

recordrecord

Fixed records.

Fixed blocked records. BLKSIZE = n * LRECL

RDWVariable records.

Variable blocked records. BLKSIZE >= 4 + n * largest LRECL

Undefined records. No defined internal structure for access method.

Record and block descriptors words are each 4 bytes long

one physical block on disk is one logical record and all the blocks/records are the same size

several logical records are combined into one physical block

one logical record as one physical block

four-byte Record Descriptor Word (RDW) at the beginning of the record

several variable-length logical records (each with an RDW) in one physical block

Extra BDW at beginning with total length of block

variable-length physical records/blocks with no predefined structure

Types of data sets

We discuss three types in this class:

– Sequential, partitioned, and VSAM

A sequential data set is a collection of records written and read in sequential order from beginning to end.

A partitioned data set (PDS) is a collection of sequential data sets, called members.

– Consists of a directory and one or more members.

– Also called a library.

A PDSE is a partitioned data set extended.

Types of Non-VSAM datasets

PDS versus PDSE

PDS data sets:

– Simple and efficient way to organize related groups of sequential files.

PDSE data sets:

– Similar to a PDS, but advantages include:

• Space reclaimed automatically when a member is deleted

• Flexible size• Can be shared• Faster directory searches

What is a data set, and how is it stored

Record 1 Record 2 Record 3 Record 4etc ...

Sequential Data Set

Available space

Partitioned Data Set

Previously used space recoverable by compress utility

Directory

Entry for COMPJCL Entry for JCOPY Entry for SORT1

COMPJCL

Partitionedand

Sequential

How data is stored in a z/OS system

Data is stored on a direct access storage device (DASD), magnetic tape volume, or optical media.

You can store and retrieve records either directly or sequentially.

You use DASD volumes for storing data and executable programs, including the operating system itself, and for temporary working storage.

You can use one DASD volume for many different data sets, and reallocate or reuse space on the volume.

General Dataset Specifications

Allocating a Dataset in ISPF

How data sets are named Data set naming convention

– Unique name• Maximum 44 characters

– Maximum of 22 name segments: level qualifier • The first name in the left: high level qualifier (HLQ)• The last name in the right: low level qualifier (LLQ) • Level qualifiers are separated by '.'

– Each level qualifier:• From 1 up to 8 characters• The first must be alphabetical (A-Z) or special (@ # $)• The 7 remaining: alphabetical, national, numeric (0-9) or hyphen (-)• Upper case only

– Example: MYID.JCL.FILE2 HLQ: MYID 3 qualifiers

Member name of partitioned data set

– 8 bytes long

– First byte: alphabetical (A-Z) or special (@ # $)

– The 7 remaining: alphabetical, special, numeric (0-9)

Naming Conventions

The letters LIB somewhere in the name indicate that the data set is a library.

– The letters PDS are a lesser-used alternative for this.

The letters CNTL, JCL, or JOB somewhere in the name typically indicate the data set contains JCL (but might not be exclusively devoted to JCL).

The letters LOAD, LOADLIB, or LINKLIB in the name indicate that the data set contains executables. (A library with z/OS executable modules must be devoted solely to executable modules.)

The letters PROC, PRC, or PROCLIB indicate a library of JCL procedures.

Naming Conventions Various combinations are used to indicate source code

for a specific language, for example COBOL, Assembler, FORTRAN, PL/I, JAVA, C, or C++.

A portion of a data set name may indicate a specific project, such as PAYROLL.

Using too many qualifiers is considered poor practice. For example, P390A.A.B.C.D.E.F.G.H.I.J.K.L.M.N.O.P.Q.R.S is a valid data set name (upper case, does not exceed 44 bytes, no special characters) but it is not very meaningful. A good practice is for a data set name to contain three or four qualifiers.

Again, the periods count toward the 44-character limit.

Catalogs and VTOCs

z/OS uses a catalog and a volume table of contents (VTOC) on each DASD volume to manage the storage and placement of data sets.

– Lists the data sets on a volume

– Lists the free space on the volume.

Volume Table of Contents

LABEL(volser)

Extents

tracks

free spaceYOUR.DATAMY.DATA

trackstracks

Record 1 on the first track

– Contains vosler, a 6-char volume serial number

– Contains pointer to Volume Table of Content

– Lists the data sets that reside on the volume

– Informatioin about location and size of each data set

– Other data set attributes

– ICKDSF, standard program, creates label and VTOC

• Owner can specify location and size of VTOC

How a catalog is used

A catalog associates a data set with the volume on which the data set is located.

Locating a data set requires:– Data set name– Volume name– Unit (volume device type)

Typical z/OS system includes a master catalog and numerous user catalogs.

Catalog Structure

IBMUSER.A1USER.A1SYS1.A1

Catalog Structure

SYSTEM.MASTER.CATALOG

Data Set-SYS1.A1or

HLQs (alias)IBMUSER...USER

Master Catalog

USERCAT.COMPANY

Data Set withHLQ=USER

User Catalog

USERCAT.IBM

User Catalog

Data Set withHLQ=IBMUSER

volume (wrk002)unit (3390)

IBMUSER.A2IBMUSER.A3

volume (wrk001)unit (3390)

volume (012345)unit (tape)

USER.TAPE.A1

stores the full data set name and location of all data sets with a SYS1 prefix

Path to SYS1.A1 data

Path to IBMU

SER.A1 data

Locating a dataset in MVS

Catalog and Uncataloged Datasets

Note the ‘ // ‘ and parm statements used for Job Control Language

Dataset Control Blocks (DSCB)

VTOC Index Structure

ISPF option 3.4

Traditional Disk Capacity (DASD)

Large Volume (own device type)

Data management in z/OS

Data management involves all of the following tasks: allocation, placement, monitoring, migration, backup, recall, recovery, and deletion.– Setting aside (allocating) space on DASD volumes

– Automatically retrieving cataloged data sets by name

– Mounting magnetic tape volumes in the drive

– Establishing a logical connection between the application program and the medium

– Controlling access to data

– Transferring data between the application program and the medium

Data management in z/OS Storage management is done either manually or through

automated processes (or through a combination or both).

In z/OS, Data Facility Storage Management Subsystem (DFSMS) is used to automate storage management for data sets.

– Storage Management Subsystem (SMS).

– System programmer or storage administrator defines policies that automate the management of storage and hardware devices. • describe data allocation characteristics, performance and availability goals, backup

and retention requirements, and storage requirements for the system. – SMS governs these policies for the system and the Interactive Storage

Management Facility (ISMF) provides the user interface for defining and maintaining the policies.

– The data sets allocated through SMS are called system-managed data sets or SMS-managed data sets.

Data Facility Subsystem Managed Storage (DFSMS)

VSAM is Virtual Storage Access Method

– Primarily for applications

VSAM provides more complex functions than other disk access methods

VSAM record formats:– Key Sequence Data Set (KSDS)– Entry Sequence Data Set (ESDS)– Relative Record Data Set (RRDS)– Linear Data Set (LDS)

VSAM Access Method

IDCAMS. An IBM program used to process access method services commands. It can be invoked as a job or jobstep, from a TSO terminal, or from within a user's application program.

Simple VSAM control interval

R1 R2 R3 free space in CIRDF

Record Descriptor Fields

Control Interval (CI): logical data area

Default Cl size is 4K bytes, max to 32K bytes

Contains data records, unused space, Record Descriptor Fields (RDF), and a CI Descriptor Field

VSAM Index Structure

VSAM Keyed Dataset

This is the most common use for VSAM. Each record has one or more key fields and a record can be retrieved (or inserted) by key value. This provides random access to data. Records are of variable length.

VSAM Sequential Dataset = ESDS

ESDS: Entry Sequence Data Set

This form of VSAM keeps records in sequential order. Records can be accessed sequentially. It is used by IMS, DB2, and z/OS UNIX.

VSAM - RRDS

This VSAM format allows retrieval of records by number; record 1, record 2, and so forth. This provides random access and assumes the application program has a way to derive the desired record numbers.

VSAM LDS

This is, in effect, a byte-stream data set and is the only form of a byte-stream data set in traditional z/OS files (as opposed to z/OS UNIX files). A number of z/OS system functions.

Basic Parms for VSAM dataset

Defining a VSAM KSDS using AMS.

z/OS UNIX file systems

z/OS UNIX System Services (z/OS UNIX) allows z/OS to access UNIX files.

A z/OS UNIX file system is hierarchical and byte-oriented.

Files in the UNIX file system are sequential files and are accessed as byte streams.

UNIX files and traditional z/OS data sets can reside on the same DASD volume.

MVS and File Systems

UNIX file system structure

chapter 5: working with data sets

Documents

chapter 6 listable sets and diophantine sets; hilbert’s

chapter 13 cardinality of sets

chapter 1.1 sets, set operations and number sets

selecting representative working cycles from large...

working with themes, style sets, backgrounds, quick parts

working with sets - elsp.ie topic - ordinary level -...

chapter 7 sets & probability

visualizing the evolution of working sets

chapter 2: sets and subsets

secondary analysis and working across data sets and...

chapter 1 sets

working with matrices -...

working with sas data sets

1 chapter 2 fuzzy sets versus crisp sets part one: theory

name: date: maths: higher level working with sets, indices...

working with picture sets - the critical thinking consortium...

chapter 3 sets(5)_4a

chapter 11 i sets enhance

chapter 2: entity-relationship model entity sets entity sets

metal working tools. metal working tools tap & die sets...