a00-203 sas programing exam

23
Exam : A00-203 Title : Sas Warehouse Development Specialist Concepts Exam Ver : 05.26.08

Upload: nadeem-shafique-butt

Post on 10-Mar-2016

218 views

Category:

Documents


3 download

DESCRIPTION

Ver : 05.26.08 Exam : A00-203 QUESTION 2: QUESTION 3: QUESTION 1: QUESTION 4: Actualtests.com - The Power of Knowing A00-203 Answer: D Answer: B Answer: B Which of the following lists contains terms that all apply to data availability? A. I only B. I and II only C. I, II and V only D. III, IV and V only QUESTION 5: QUESTION 6: Actualtests.com - The Power of Knowing A00-203 Answer: B Answer: B

TRANSCRIPT

Page 1: A00-203 SAS Programing Exam

Exam : A00-203 Title : Sas Warehouse Development Specialist Concepts Exam Ver : 05.26.08

Page 2: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

QUESTION 1: Certkiller .com consists of a number of distinct businesses which store their customer details in separate, unlinked operational systems. Which of the following dimension types allows customer data to be fully shared between the businesses in a dimensional model? A. union dimensions B. conformed dimensions C. hierarchical dimensions D. slowly changing dimensions Answer: B

QUESTION 2: Which of the following lists contains terms that all apply to data availability? A. RAID 0 Striping B. RAID 5 Mirroring C. RAID 5 Striping D. RAID 0 RAID 5 Mirroring Answer: B

QUESTION 3: Which of the following options can be used to determine whether SQL procedure statements have been passed to a relational database management system (RDBMS) and efficiently processed? A. the DBSRVTP system option B. the SQL procedure STIMER option C. the SQL procedure FEEDBACK option D. the SASTRACE and SASTRACELOC system options Answer: D

QUESTION 4:

Page 3: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

You are an employee of Certkiller .com. Study the exhibit carefully to view a pre-defined report generated from a data warehouse. Which of the identified elements could qualify as metadata? Exhibit:

A. I only B. I and II only C. I, II and V only D. III, IV and V only Answer: B

QUESTION 5: Which of the following satisfies a requirement to maintain only 7 days of history for the DAILY.CURRENT data set? A. proc append base = daily.day17 data = daily.current; run; B. proc datasets library = daily nolist; age current day1-day7; run; C. proc datasets lib = daily; rename day6 = day7 day5 = day6 day4 = day5 day3 = day4 day2 = day3 day1 = day2 current = day1 ; run; D. data daily.day1 daily.day2 daily.day3 daily.day4 daily.day5 daily.day6 daily.day7; set daily.current; run; Answer: B

QUESTION 6:

Page 4: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

In SAS ETL Studio, metadata facilitates which of the following? I.data model design II.multi-user development III.process documentation IV.data definition standardization A. II and III only B. II and IV only C. II, III, and IV only D. I, II, III, and IV Answer: C

QUESTION 7: Which of the following are likely to lead to a successful implementation of a data warehouse project? I.source code management II.software release management III.naming and programming standards IV.operational data recovery processes A. I and II only B. I and III only C. II and IV only D. III and IV only Answer: B

QUESTION 8: Which of the following is needed to ensure the continued integrity and reliability of data in a production data warehouse? A. monthly meetings with business users to review the functionality of applications B. regular meetings with IT to review hardware configuration and resource utilization C. status reports from production schedulers to ensure that warehouse jobs run successfully D. formal communication with source system owners to identify structure and content changes Answer: D

QUESTION 9: Which of the following is the recommended practice for setting up the production

Page 5: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

environment for a data warehouse application? A. Manually copy entities from development to the production environment. B. Assign the role of setting up a production environment to the system administrator. C. Document steps during development and repeat them in the production environment. D. Create a script in the development environment and run it in the production environment. Answer: D

QUESTION 10: Which of the following validate the transfer of binary files between systems? A. using a checksum and checking the file size B. using a checksum and comparing the first and last N records C. using record counts and comparing a random sample of records D. checking the file size and comparing a random sample of records Answer: A

QUESTION 11: Which of the following statements are true regarding retrieving archived tape data? I. SAS can read tape format data. II. Archived data retrieved to disk should have retention dates. III. Archived retrieval performance can dictate data storage type, hardware choice, and partitioning. IV. Relational database management system (RDBMS) archives can be processed without returning to disk. A. II and III only B. I, II and III only C. II, III and IV only D. I, II, III and IV Answer: B

QUESTION 12: Which of the following is impacted by a data warehousing retention and archiving policy? A. disk space requirements only B. creation of data marts and disk space requirements C. creation of data marts and versioning/change management

Page 6: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

D. disk space requirements and versioning/change management Answer: D

QUESTION 13: According to The Data Warehouse Lifecycle Toolkit, by Ralph Kimball, which of the following represent best practices when deploying a data warehouse? I. Enable warehouse access for all valid users at the same time. II. Deny warehouse access to end-users who have not received warehouse training. III. Provide end-users resource support for warehouse to legacy system reconciliation. IV. End-user education can begin shortly before the production deployment of the warehouse. A. III and IV only B. I, II and III only C. I, II and IV only D. II, III and IV only Answer: D

QUESTION 14: Which of the following is a true statement regarding backup strategies for SAS data sets after the load has been completed? A. metadata must be synced B. file system-level backups are sufficient C. warehouse must be taken off-line to perform backup D. incremental backups can be performed based on SAS data set metadata Answer: B

QUESTION 15: After deploying the warehouse on a UNIX system, jobs accessing the SAS dataset named SALES are performing poorly. The dataset SALES is stored in the file /disk1/sales.sas7bdat and is 3Gb in size. Which of the following methods can be used to most accurately measure the I/O read performance? A. "options fullstimer; libname in '/disk1'; data_NULL_; set in.sales(drop=_all_); run;" B. "timex cat /disk1/sales.sas7bdat > /dev/null" C. "timex cp /disk1/sales.sas7bdat /disk1/foo" and divide the time by 2 D. "timex mv /disk1/sales.sas7bdat /disk1/foo" and divide the time by 2

Page 7: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

Answer: A

QUESTION 16: Which of the following helps to reduce the I/O for a procedure that requires every row of a SAS table to be processed? A. indexing the SAS table B. using the SQL procedure C. partitioning the SAS table D. compressing the SAS data set Answer: D

QUESTION 17: You are an employee of Certkiller .com. Study the exhibit carefully to view a diagram that displays a data warehouse environment: Which of the following describes the optimal method to join a transformed DB2 table with an Oracle table and load the result set to the data warehouse using SAS ETL Server and its components? Exhibit:

A. Generate code with SAS ETL Studio which: connects to Certkiller 2 from Certkiller 1, transforms the DB2 data on Certkiller 1, downloads the Oracle data using the DOWNLOAD procedure to Certkiller 1, joins those result sets on Certkiller 1, downloads the joined data using the DOWNLOAD procedure to Certkiller 2,

Page 8: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

and populates the target data warehouse table. B. Generate code with SAS ETL Studio which: connects to Certkiller 2 from Certkiller 1, transforms the DB2 data on Certkiller 1, extracts the Oracle data from Certkiller 2 using Remote Library Services, joins those result sets on Certkiller 1, and loads the target data warehouse table with the joined data using Remote Library Services to Certkiller 2. C. Generate code with SAS ETL Studio which: connects to Certkiller 1 from Certkiller 2, transforms the DB2 data on host1, downloads the result set using the DOWNLOAD procedure, extracts the ORACLE data on Certkiller 2, joins those result sets on Certkiller 2, and populates the target data warehouse table. D. Generate code with SAS ETL Studio which: connects to Certkiller 1 from Certkiller 2, downloads the DB2 data using Remote Library Services, transforms the downloaded data on Certkiller 2, extracts the ORACLE data on Certkiller 2, joins those result sets on Certkiller 2, and populates the target data warehouse table. Answer: C

QUESTION 18: Which of the following products are needed on the local host to access data from ORACLE on a remote host that is not running SAS? A. SAS/ACCESS to ORACLE only B. SAS/ACCESS to ORACLE and SQL*NET C. SAS/ACCESS to ORACLE and SAS/CONNECT D. SAS/ACCESS to ORACLE and SAS/SHARE*NET Answer: B

QUESTION 19: Which one of the following delivers the best I/O performance when creating a temporary SAS data set from a permanent SAS data set? A. Create the new data set in a library on a different physical disk. B. Create the new data set in the same library as the existing data set. C. Create the new data set in a library managed by a different disk controller. D. Create the new data set in a different library than the existing data set but on the same

Page 9: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

physical disk. Answer: C

QUESTION 20: Which of the following job flow dependencies can be defined using SAS Schedule Manager? I.The programs in the job flow run simultaneously. II.The programs in the job flow are dependent on time. III.The programs in the job flow are dependent on completed job execution. IV.The programs in the job flow are dependent on changes in operating system directory structure. A. I only B. II and III only C. II, III and IV only D. I, II III, and IV Answer: C

QUESTION 21: Which of the following individuals is most likely to determine the requirements that define how data will be displayed and grouped in applications and reporting? A. business end-user B. warehouse architect C. data quality manager D. warehouse database administrator Answer: A

QUESTION 22: Which of the following are primary information that those responsible for running the warehouse jobs need? I. recovery and restart procedures in case of failure II. schedule dependencies between processing jobs III. table of business rules used to load the warehouse IV. list of all input data sources and persons responsible A. I and II only B. I, II and IV only C. I, III and IV only D. II, III and IV only

Page 10: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

Answer: B

QUESTION 23: Which of the following is verified by integration testing? A. Warehouse processes work together. B. Data tables are imported into the database. C. External data is combined with internal data. D. The data integration rules of the extract, transform, and load (ETL) processes work. Answer: A

QUESTION 24: Which of the following is/are data quality checks? I. matching control tables and totals II. comparing values to external sources III. checking program and function return codes IV. verifying that source data files were imported A. I only B. I and II only C. II and III only D. I, III and IV only Answer: B

QUESTION 25: Two physical disks are shared between the production and test environments. Which of the following is the recommended approach when full table scans are the predominant access method? A. Use one disk and its file system for the production system and the other disk and its file system for the test system. B. Create one logical volume with one overall file system using both disks and use directory structures to share space. C. Create two logical volumes on each physical disk and a file system per logical volume. Then, use one file system from each disk for test and one for production. D. Create two logical devices on each disk, then create two file systems. Then, with each file system, use one logical volume from each disk and stripe the data across the two selected logical volumes. Answer: D

Page 11: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

QUESTION 26: Which of the following allows data to be extracted from a single relational database management system (RDBMS) table using changed data capture? A. source RDBMS rollback log B. row level modification date/time stamp C. table level modification date/time stamp D. warehouse operational data store (ODS) metadata Answer: B

QUESTION 27: According to the SAS Rapid Warehousing Methodology, in which of the following phases must the acceptance test criteria be defined? A. final test B. assessment C. deployment D. requirements Answer: D

QUESTION 28: Which of the following objects can be defined as metadata in a data warehouse that uses SAS data sets for storage? A. indexes and tables only B. indexes, schemas, and tables only C. indexes, primary/foreign keys, and tables only D. indexes, primary/foreign keys, schemas, and tables Answer: C

QUESTION 29: Which of the following is the best action to perform during the warehouse process when the values in the source data do not conform to the documented business rules during data validation? A. Write a file of incorrect data for review by data owners. B. Create a program to update the original source data files. C. Change the validation code to correct the incorrect values.

Page 12: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

D. Load non-conforming records and mark them for future fix. Answer: A

QUESTION 30: Which of the following allow files to be shared in heterogeneous environments? I. NFS mounts II. a network filer III. FTP processes IV. an array attached to the hosts via a split bus A. I and III only B. I, II, and III only C. I, III, and IV only D. I, II, III, and IV Answer: B

QUESTION 31: The schema below is updated with monthly snapshots from a single source system. Exhibit:

In what order must the tables be loaded to correctly represent the data? A. 1. DIMB 2. DIMA 3. FACT B. 1. DIMA 2. DIMB 3. FACT C. 1. FACT 2. DIMB 3. DIMA D. The order is not important. Answer: B

QUESTION 32: Metadata for a SAS source table has been captured using the Source Designer Wizard in

Page 13: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

SAS ETL Studio. Which of the following table metadata attributes would be registered with the SAS Metadata Server? I.Cardinality II.Foreign Key III.Unique Index IV.Table Permissions A. I and IV only B. II and III only C. III and IV only D. I, II and III only Answer: B

QUESTION 33: Which of the following should be stored in the metadata for a data warehouse? A. business rules, logical model, and transformation rules B. business rules, conventions and target table data values C. derived data values, subject model and transformation rules D. business rules, target table data values and transformation rules Answer: A

QUESTION 34: A star schema design requires that history information is accurately maintained for a CUSTOMER dimension. According to The Data Warehouse Lifecycle Toolkit, by Ralph Kimball, which of the following slowly changing dimension techniques is the most appropriate to update the CUSTOMER dimension? A. Type I B. Type II C. Type III D. Type IV Answer: B

QUESTION 35: Which of the following are used to redirect processing from SAS to a relational database management system (RDBMS)? I. SAS/Export facility II. remote processing using SAS/CONNECT

Page 14: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

III. SQL pass-through facility with RDBMS-specific SQL IV. SAS WHERE clause used with SAS engine technology A. III and IV only B. I, III and IV only C. II, III and IV only D. I, II, III and IV Answer: A

QUESTION 36: An operational data source is updated by on-line transactional processing (OLTP) applications from 7:00am until 9:00pm every day. OLTP system off-line backups start at 11:00pm and finish by 1:00am. Transactional reporting systems run throughout the night, completing before the start of the on-line day at 7:00am. A 4.5 hour extract job is expected to capture data for a full day in each run. What is the earliest time that the extract job can run to acquire new data for the data warehouse? A. 1:01am B. 7:01am C. 9:01pm D. 11:01pm Answer: A

QUESTION 37: A physical data warehouse is being designed in a UNIX environment. Which of the following is true regarding the allocation of storage space in raw disk partitions versus file systems? I. Raw disk partitions are easier to manage. II. File systems incur more CPU overhead for writes. III. Raw disk partitions can be faster for databases such as Oracle. IV. Indirect-block based file systems need to write maintenance information to only one block. A. I and IV only B. II and IV only C. I and III only D. II and III only Answer: D

Page 15: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

QUESTION 38: Which of the following strategies can be used to increase data warehouse availability? A. archiving B. redundancy C. dependency D. concurrency Answer: B

QUESTION 39: When performing table loads, which of the following automatic data step variables can be used for error checking? A. _IORC_ and _SYSRC_ B. _ERROR_ and _IORC_ C. _ERROR_ and _SYSERR_ D. _SYSERR_ and _SYSRC_ Answer: B

QUESTION 40: What functionality is available in SAS Management Console for pushing metadata from a test environment to a production environment? A. Promotion and Replication B. Promotion and Distribution C. Publishing and Distribution D. Publishing and Replication Answer: A

QUESTION 41: In SAS ETL Studio, which one of the following transformations can be used to generate code for accessing data sources located on different hosts? A. Data Transfer B. Data Download C. Remote Connect D. Remote Services Answer: A

Page 16: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

QUESTION 42: A process flow diagram has been defined using the Process Designer Window within SAS ETL Studio. Which of the following is the purpose of the process flow diagram? A. It specifies the host where the job will be executed. B. It specifies SAS System options to use in the job execution. C. It specifies the processing steps that will load data into one or more target tables. D. It specifies expressions used to map source data columns into one or more target tables. Answer: C

QUESTION 43: Which of the following indicate that a Type I slowly changing dimension should be used? I. Disk space is constrained. II. History needs to be tracked over time. III. A changed record audit trail needs to be maintained. IV. Fields that are updated have little or no impact on analysis. A. I and II only B. I and III only C. I and IV only D. II and IV only Answer: C

QUESTION 44: The on-line data retention policy for the warehouse requires that 36 months of data be available. There is an additional requirement that up to 60 months worth of detail data must be able to be restored on-line consistent with the on-line version. Assume that the monthly load cycle has no out of time period transaction problems. Which of the following fails to meet the requirement for 60 months of data? A. For a Type I dimensional star schema: Remove the facts older than 36 months, archive to tape, and snapshot the dimension tables. B. For a Type II dimensional star schema: Remove the facts older than 36 months, archive to tape, and snapshot the dimension tables. C. For a Type III dimensional star schema: Remove the facts older than 36 months, archive to tape, and snapshot the dimension tables. D. For a Type II dimensional star schema: Remove the facts older than 36 months and archive to tape. There is no need to snapshot the dimension tables.

Page 17: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

Answer: A

QUESTION 45: Which of the following steps are necessary to implement job dependencies in a SAS data warehouse? I.Deploy the job in SAS ETL Studio. II.Define job dependencies in SAS ETL Studio. III.Create a Job Flow in SAS Management Console. IV.Schedule the Job Flow in SAS Management Console. A. II only B. I and II only C. III and IV only D. I, III and IV only Answer: D

QUESTION 46: In SAS ETL Studio, which of the following can be generated using the SQL Join transformation? I.performing a left join of tables II.performing a union join of tables III.joining tables based on user-specified keys IV.joining tables based on the first occurrence of a key variable A. I only B. I and III only C. I, III and IV only D. I, II, III and IV Answer: B

QUESTION 47: Which of the following is the best choice for a surrogate key for a dimension table in a star schema? A. a production key and an effective start date only B. a unique integer variable allocated in ascending numeric order C. a production key, an effective start date, and an effective stop date D. a unique character variable allocated in ascending alphabetic order Answer: B

Page 18: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

QUESTION 48: Which of the following backup scenarios least impacts the data warehouse host? A. using network filers for storage B. using directly attached devices C. running the backup software on a separate host with a high speed NFS connection D. using a storage area network and running the backup software on a separate host Answer: D

QUESTION 49: Which of the following are reasons for monitoring and collecting statistics on the data warehouse activity? I. planning for training II. planning for expanding the warehouse III. fine-tuning the warehouse performance IV. planning for growth in the data warehouse A. I and IV only B. III and IV only C. II, III and IV only D. I, II, III and IV Answer: C

QUESTION 50: In SAS ETL Studio, which of the following are stored as metadata when building a data warehouse? I.permissions II.table attributes III.column attributes IV.number of rows in a table A. III only B. II and III only C. I, II and III only D. II, III and IV only Answer: B

QUESTION 51:

Page 19: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

Which of the following must be considered when designing unit tests for the target load process? I. access capabilities to source data II. predefined test cases that check transformation rules III. validity of the data extracted from the operational data store (ODS) IV. test plan with conditions and results that match the design specification A. II and IV only B. I, II, and IV only C. II, III, and IV only D. I, II, III, and IV Answer: A

QUESTION 52: Which of the following describes the dimension tables in a star schema logical design? A. They are normalized and can join only through the fact table. B. They are denormalized and can join only through the fact table. C. They are normalized and can join through the fact table or each other. D. They are denormalized and can join through the fact table or each other. Answer: B

QUESTION 53: All possible test cases according to the business rules are run using generated data. Which of the following is the impact of using this data in the initial testing and development process? A. The physical data model of the warehouse can be tested. B. The developer can predict the production storage requirements. C. The developer can predict the results of the input source processes. D. Integrity problems in the source data can be identified early in the development life cycle. Answer: A

QUESTION 54: During the data warehouse testing process, which of the following SAS procedures can be used to determine the cardinality of data field values for a data set consisting of character data? A. FREQ only

Page 20: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

B. UNIVARIATE only C. FREQ and RANK only D. FREQ and UNIVARIATE only Answer: A

QUESTION 55: A star schema has been defined using surrogate keys for all dimensions. Changed data capture is being used to provide updates to the SALES fact table and to the PRODUCT and CUSTOMER dimension tables. Which of the following is the correct load sequence for processing the new data? A. 1. Update the SALES table with new data. 2. Update the PRODUCT and CUSTOMER tables. 3. Assign surrogate keys to the new SALES records. B. 1. Update the PRODUCT and CUSTOMER tables. 2. Update the SALES table with new data. 3. Assign surrogate keys to the new SALES records. C. 1. Update the PRODUCT and CUSTOMER tables. 2. Assign surrogate keys to the new SALES records. 3. Update the SALES table with new data. D. 1. Update the SALES table with new data. 2. Assign surrogate keys to the new SALES records. 3. Update the PRODUCT and CUSTOMER tables. Answer: C

QUESTION 56: Which of the following are considered metadata standards? A. DDE and XML B. OMG and XML C. JMS and OMG D. COM and DCOM Answer: B

QUESTION 57: Which of the following is the primary purpose of using a surrogate key with slowly changing dimensions? A. to provide uniqueness of the key in the table B. to reduce the storage requirements of the key

Page 21: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

C. to avoid the need to update existing keys in the table D. to provide a meaningful way for users to identify records by looking at the key Answer: C

QUESTION 58: Which of the following is a benefit of using a journaling file system in a large implementation? A. data redundancy B. data compression C. fewer I/O operations D. quicker recoverability Answer: D

QUESTION 59: For which of the following should metadata be captured? A. data exploitation and data storage only B. extract, transform, and load (ETL) processes only C. data storage and extract, transform, and load (ETL) processes only D. data exploitation, data storage, and the extract, transform, and load (ETL) processes Answer: D

QUESTION 60: Which of the following provides enhanced performance and minimized storage and avoids merge/join processing for look-up values within the data warehouse? A. SAS arrays B. SAS formats C. SAS SCL lists D. SAS macro variables Answer: B

QUESTION 61: The contents of a data dictionary extract are listed below: Exhibit: 203-18.jpg Which of the following describes the type of metadata listed?

Page 22: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

A. static business metadata B. static technical metadata C. dynamic business metadata D. dynamic technical metadata Answer: B

QUESTION 62: The diagram below (Martin notation) shows the logical relationship between two tables named CUSTOMER and ACCOUNT. Exhibit: 203-24.jpg Which of the following describes the relationship between the data in the tables? A. All customers have an account. B. All accounts have at lease one customer. C. There are more customers than accounts. D. There are more accounts than customers. Answer: B

QUESTION 63: Which of the following should be considered when assessing warehouse performance in the test environment? I. access permissions II. data reliability and quality III. software version and configuration IV. hardware specification and configuration A. I and II only B. I and IV only C. II and III only D. III and IV only Answer: D

QUESTION 64: In which of the following are the SAS Data Quality Cleanse functions available? A. SAS DATA step and SQL procedure only B. SAS Component Language (SCL) and SQL procedure only C. SAS DATA step and SAS Component Language (SCL) only D. SAS DATA step, SAS Component Language (SCL) and SQL procedure

Page 23: A00-203 SAS Programing Exam

A00-203

Actualtests.com - The Power of Knowing

Answer: D

QUESTION 65: According to the SAS Rapid Warehousing Methodology, which of the following test stages are performed when designing a warehouse? I. system test II. metadata test III. integration test IV. acceptance test A. II and III only B. III and IV only C. I, III and IV only D. I, II, III and IV Answer: C