sound forge

60001198—Second Edition

Sound Forge and the Intervoice System Audio File Guide

Sound Forge and the Intervoice System Audio File Guide

Document No. 60001198 Second Edition

Copyright 2004 Intervoice, Inc.

All rights reserved.

Trademarks of Intervoice, Inc. and/or its affiliates:

Names and marks for products provided by the Company’s suppliers are used herein for identification purposes and may be trademarks of their respective companies.

Windows NT and Windows 2000 are registered trademarks of Microsoft.

The Apache Software License, Version 1.1Copyright 2000–2001 The Apache Software Foundation. All rights reserved.

Java Runtime Environment Versions 1.2, 1.3, and 1.4 are products of Sun Microsystems, Inc. All rights reserved. Please refer to the license agreement on your system.

Sound Forge is a registered trademark of Sonic Foundry.

No part of this document may be reproduced without the written permission of Intervoice, Inc.

Any comments or suggestions concerning this publication should be directed to:

Technical Documentation DepartmentIntervoice, Inc.17811 Waterview ParkwayDallas, Texas 75252Telephone: (972) 454-8000Fax: (972) 454-8505E-mail: [email protected]

For more information about Intervoice, Inc., call (972) 454-8000, or visit the Intervoice Web site at www.intervoice.com.

The information in this document explains how to create audio files for InterSoft version 3 systems with IQTalk or VoiceXML Browser. The runtime .wav files created here can also be used on MTC 2.x systems. This document includes information on Sound Forge 6.0 and 7.0, Sonic Foundry Batch Converter 5.0, and CRVoice2. If earlier software versions are used, refer to Using Sound Forge for InterVoice Systems (document number 60000259) for Sound Forge 4.0 or Sound Forge on InterSoft Systems User’s Guide (document number 60000984) for Sound Forge 5.0. This document was previously released in December 2002 titled, Sound Forge and the Omvia Media Server Audio File Guide. This is the second edition of this document, November 2004.

Documentation Comment Form

Our goal is to provide accurate and efficient information to our customers. Please complete this form to help us improve the quality and usability of this product documentation.

• List the document you used by name, number, and edition.

____________________________________________________________________

____________________________________________________________________

• Did the document provide the information needed? ❒ Yes ❒ No

Explain______________________________________________________________

____________________________________________________________________

____________________________________________________________________

• Were you able to locate the information in a timely manner? ❒ Yes ❒ No

Explain______________________________________________________________

____________________________________________________________________

____________________________________________________________________

• How accurate did you find the information in the document to be?

❒ Very Accurate ❒ Accurate ❒ Somewhat Accurate ❒ Not Accurate

• When you need to find information, where is the first place you usually look?

❒ Table of Contents ❒ Index ❒ Headings ❒ Scan the text

• Was the index thorough? ❒ Yes ❒ No How could it be improved?

____________________________________________________________________

____________________________________________________________________

• If you completed the steps in this document, how easy were they to follow?

❒ Very Easy ❒ Easy ❒ Somewhat Easy ❒ Not Easy

• Overall, how helpful did you find the document to be?

❒ Very Helpful ❒ Helpful ❒ Somewhat Helpful ❒ Not Helpful

• Are there any technical errors in the document you would like to report?

____________________________________________________________________

____________________________________________________________________

• List any additional topics you would like to see covered in the documentation.

____________________________________________________________________

____________________________________________________________________

Fax to: (972) 454-8505E-mail to: [email protected]

60001198, Second Edition Table of Contents i

Table of Contents

List of Figures v

List of Tables vii

About This Document ix

Purpose and Audience ......................................................................... ixChapters ................................................................................................ xIntervoice Documentation.................................................................... xi

Documentation Formats................................................................. xiAdobe Acrobat .pdf Files......................................................... xiMicrosoft HTML Help .chm Files........................................... xi

Documentation Access .................................................................. xiProduct CD-ROMs .................................................................. xiWeb Site .................................................................................. xii

Entire Documentation Set................................................. xiiSite Preparation and Installation Document Set ............... xii

Hard Copy Documents............................................................ xiiRelated Intervoice Documents........................................................... xiii

InterSoft Version 3 .......................................................................xiiiSound Forge ................................................................................. xiii

Conventions ....................................................................................... xivInformation Symbols ................................................................... xiv

Technical Support ............................................................................... xvWestern Hemisphere ..................................................................... xvEastern Hemisphere ...................................................................... xv

Overview 1-1

Summary of Changes.........................................................................1-2Voice Recording Concept Diagram ................................................... 1-3Recommended Hardware...................................................................1-4

Specific Hardware........................................................................1-4Recommended Software ....................................................................1-5

ii Sound Forge and the Intervoice System Audio File Guide 60001198, Second Edition

Setting Up the Studio 2-1

Studio Setup Checklist .......................................................................2-2Installing the Hardware......................................................................2-3

Connecting the Sound Card to the Speakers................................2-4Installing the Sonic Foundry Batch Converter ..................................2-6Installing CRVoice2 ...........................................................................2-7Installing Sound Forge.......................................................................2-7Making Quality Recordings...............................................................2-8

Setting Microphone Levels ..........................................................2-8Placing the Microphone ...............................................................2-9Avoiding Clipping......................................................................2-10

Performing the Noise Floor Test......................................................2-12

Defining Audio File Formats 3-1

Intervoice Master Format...................................................................3-2

Using Sound Forge to Record Messages 4-1

Setting Up Automatic Labeling .........................................................4-2Setting Recording Attributes .............................................................4-5Recording Process Checklist .............................................................4-6Recording Methods............................................................................4-7

Continuous Recording .................................................................4-8Multiple Takes (No Regions).................................................4-8

Message-by-Message Recording .................................................4-8Multiple Takes Creating Regions...........................................4-8Create a New Window for Each Take....................................4-8

Recording Messages ..........................................................................4-9Continuous Recording .................................................................4-9

Multiple Takes (No Regions).................................................4-9Message-by-Message Recording ...............................................4-12

Multiple Takes Creating Regions.........................................4-12Extracting Regions.........................................................4-14

Create a New Window for Each Take..................................4-15Reviewing Messages........................................................................4-17Removing Intervoice Batch Scripts .................................................4-18

Postprocessing 5-1

Batch Converter Overview ................................................................5-2Downloading the Batch Converter ..............................................5-3Using the Sample Batch Script ....................................................5-3Converting the .wav Files to Runtime Format Overview............5-4

Using the Batch Converter.................................................................5-6Modifying the Sample Batch Script...................................................5-8

60001198, Second Edition Table of Contents iii

Naming and Converting .wav to .ivc to m.### Files 6-1

Filename Formats (.ivc to m.###)...................................................... 6-2Voice File Naming Convention.................................................... 6-3

Using the CRVoice2 Utility ............................................................... 6-4

Maintaining Audio Quality 7-1

Voice File Quality Issues ...................................................................7-2

Frequently Asked Questions A-1

FAQs ................................................................................................. A-2Recording Steps .................................................................... A-2Defining DC Offset............................................................... A-2Producing 64-KB Messages.................................................. A-3Converting Messages............................................................ A-3Using the Gain Leveling Preset ............................................ A-4Installing the Audiophile 2496 Sound Card ......................... A-4Turning Off AGC.................................................................. A-5Normalizing the Message Volume........................................ A-5Filtering Messages ................................................................ A-6Reducing Pops and Clicks in Messages................................ A-6

Glossary Glossary-1

Index Index-1

60001198, Second Edition List of Figures v

List of Figures

Figure 1-1. Voice Recording Concept Diagram............................. 1-3Figure 2-1. Voice Recording Hardware ......................................... 2-3Figure 2-2. Voice Recording Without Clipping ........................... 2-10Figure 2-3. Voice Recording with Clipping................................. 2-11Figure 4-1. Recording Process....................................................... 4-7Figure 5-1. .wav to .ivc Batch Conversion Process ....................... 5-2Figure 5-2. .wav to runtime .wav Batch Conversion Process........ 5-2Figure 6-1. Filename Formats (.ivc to m.###)............................... 6-2Figure 6-2. Transforming .wav Files to .ivc Files.......................... 6-3

60001198, Second Edition List of Tables vii

List of Tables

Table 2-1. Studio Setup Checklist ................................................. 2-2Table 3-1. Format Differences....................................................... 3-3Table 4-1. Recording Process Checklist........................................ 4-6

60001198, Second Edition About This Document ix

About This Document

Purpose and Audience

The Sound Forge and the Intervoice System Audio File Guide provides information on creating audio files using Sound Forge. Topics include studio set up, file formats and file naming conventions, post-processing using the Batch Converter, and troubleshooting. This document also provides detailed instructions on using the CRVoice2 utility to rename audio files to the Intervoice runtime format.

Sound Forge can be used for recording and editing, but it is required for voice file postprocessing, noise floor testing, and complex editing.

The audience for this guide includes voice talents and speech developers interested in an efficient method of postprocessing voice files, testing the noise floor, and editing complex voice recordings.

Sound Forge Versions and Screen Appearance

This document includes information for using Sound Forge version 6 or 7 for creating audio files for use on the Intervoice system. The appearance of the graphical user interface (GUI) screens pictured in this document may vary slightly from those on your system.

x Sound Forge and the Intervoice System Audio File Guide 60001198, Second Edition

Chapters

This document contains the chapters briefly described below.

Chapter 1—Overview

Provides the overview for this document. It contains a purpose statement for the entire document. It also contains a concept diagram and a list of required hardware and software.

Chapter 2—Setting Up the Studio

Describes the elements necessary to properly set up the recording studio. It explains how to install the voice recording hardware and configure volume controls. It also discusses how to set recording attributes and the microphone level, and how to perform the noise floor test.

Chapter 3—Defining Audio File Formats

Describes the voice file formats and explains the master format differences.

Chapter 4—Using Sound Forge to Record Messages

Describes how to set up the automatic labeling functions and describes the voice recording process, the methods of recording voice messages, and the use of Sound Forge to record voice messages.

Chapter 5—Postprocessing

Explains the steps to convert multiple voice files to the voice file format using the Sonic Foundry Batch Converter 5.0.

Chapter 6—Naming and Converting .wav to .ivc to m.### Files

Provides the Intervoice InterSoft IQTalk voice file naming convention and explains how to use the CRVoice2 utility to rename .ivc files to m.###.

Chapter 7—Maintaining Audio Quality

Explains the purpose of master voice files and stresses the importance of editing the files before compressing them. It also stresses the importance of retaining original master voice files.

60001198, Second Edition About This Document xi

Intervoice Documentation

All Intervoice standard product documentation is available online in various locations. Documents are included on the product CD-ROMs, on the product GUI Help menu, as well as on the Intervoice Web site. The online Core Product Documentation Guide, document number 60000021, includes descriptions of all available documents.

Documentation Formats

Intervoice provides its standard product documentation in two online formats: Adobe Acrobat .pdf files and Microsoft HTML Help .chm files.

Adobe Acrobat .pdf Files

The Adobe Acrobat Reader with the Search tool plug-in enables viewing and full-text searches of .pdf documents. Use Adobe Acrobat Reader with Search version 4 or later. For more information, go to their Web site: http://www.adobe.com.

Microsoft HTML Help .chm Files

Microsoft HTML Help is the help file standard for Windows. This file typically serves as the product online help as well as a standalone online document file. The compressed .chm file contains the entire document. The viewer includes a Table of Contents, Index, and Search. Use Internet Explorer version 4 or later. For more information, go to their Web site: http://msdn.microsoft.com.

Documentation Access

Intervoice provides its standard product documentation in these online locations: • Product CD-ROMs• Web Site• Hard Copy

Product CD-ROMs Intervoice product CD-ROMs include the product documentation online files. Sometimes the product CD-ROMs are a mix of Intervoice products as well as other vendor partner product CD-ROMs.

The most common locations to find online documentation are:• Documentation or Docs directory on product CD-ROM• Help menu on product GUI window• INTV\DOCS directory on the system hard drive

xii Sound Forge and the Intervoice System Audio File Guide 60001198, Second Edition

Web Site Entire Documentation Set

Intervoice provides an online source of its documentation through the company Web site. RealCare support customers, customers under warranty, channel partners, and distributors can visit the site to register for access to the documentation.

1 Go to http://www.intervoice.com/support/techinfo/

2 Select Technical Documents.

3 Log in with your user ID and password.

4 Choose an applicable document category or type in a keyword to search.

5 To find a list of documents that have been uploaded to the Web recently, view the Web Upload Report in the Document Update category.

Site Preparation and Installation Document Set

1 Go to http://www.intervoice.com/support/techinfo/

2 Select Site Preparation Information. (no password required)

3 View the list to determine which document applies to your system.

Hard Copy Documents

Because Intervoice primarily provides its product information as online documents on product CD-ROMs and the Web, only a limited number of documents are available as hard copy for shipments. These include product quick reference cards for hardware, system installation, as well as vendor partner products.

Contact your Sales Representative for more information.

60001198, Second Edition About This Document xiii

Related Intervoice Documents

The following documents contain useful information about your product. The documents are listed in functional order.

InterSoft Version 3

Loading InterSoft Software (InterSoft version 3) – Doc Number 60001263

Describes how to install the latest InterSoft version 3 software. It also describes how to edit configuration files and apply system utilities.

InterSoft System Administration (InterSoft version 3) – Doc Number 60001092

Explains how to prepare a system for the runtime production environment. It also provides commands and explanations of the basic system administration functions of an InterSoft version 3 system, including starting and stopping the runtime system, monitoring system status, creating and editing the various application tables, and working with voice messages.

InterSoft System Utilities (InterSoft version 3) – Doc Number 60001094Describes the system utilities used to prepare, operate, and monitor a system in runtime.

Troubleshooting InterSoft Systems (InterSoft version 3) – Doc Number 60001078

Explains how to isolate problems on an InterSoft version 3 runtime system and troubleshoot once a problem has been isolated. It describes the various tools available for problem isolation, explains how to use each tool (or what document to reference for instructions), and explains how to analyze the results of the diagnostic tests.

Sound Forge

Sound Forge ManualProvides overview and installation information. It describes how to use and optimize Sound Forge. See the product CD-ROM and Web site: http://mediasoftware.sonypictures.com/download/.

xiv Sound Forge and the Intervoice System Audio File Guide 60001198, Second Edition

Conventions

This document uses the following keyboard, command prompt, and menu conventions:

Information Symbols

Intervoice documents use the following symbols to help you quickly identify the type of information contained in a note box:

Example Instructs you to...

<Enter> Press the Enter key.

<Ctrl><Esc> Hold the Control key while pressing the Escape key.

[d:\appl]dir <Enter> Enter a command at a Windows command prompt.

Note These commands are not case-sensitive.

Edit > Paste > Save Select Edit from the menu bar, choose Paste, and then select Save from its menu.

Symbol Name Description

Warning Warns you about actions that can create electrical hazards.

Caution Cautions you about actions that can cause data loss.

General Provides additional general information about the topic being discussed.

Tip Provides a useful tip about the topic being discussed. The tip is located in the margin of the page for easy reference.

Document Reference

Refers you to a related document for further information on the product.

60001198, Second Edition About This Document xv

Technical Support

Intervoice wants its customers to be satisfied with its products. Your service contract and/or support guidelines specify which regional support office to contact.

If you need assistance, contact Intervoice technical support at the numbers listed below.

Western Hemisphere

For 24-hour support, please telephone or fax the following regional support office:

USA(outside USA) Tel: +1 800 955-4688 or +1 972 454-8130(within USA) Tel: (800) 955-4688 or (972) 454-8130

Eastern Hemisphere

During normal office hours, please telephone or fax one of the following regional support offices:

England(outside UK) Tel: +44 161 495 1234 Fax: +44 161 495 1007(within UK) Tel: 0161 495 1234 Fax: 0161 495 1007

Germany(outside Germany) Tel: +49 611 184 440 Fax: +49 611 184 4444(within Germany) Tel: 0611 184 440 Fax: 0611 184 4444

Singapore(outside Singapore) Tel: +65 395 4300 Fax: +65 395 4277(within Singapore) Tel: 0395 4300 Fax: 0395 4277

United Arab Emirates(outside UAE) Tel: +9714 332 5480 Fax: +9714 332 5580(within UAE) Tel: 04 332 5480 Fax: 04 332 5580

If out of hours support is included in your contract, please telephone:(outside UK) Tel: +44 208 750 6387(within UK) Tel: 0208 750 6387

Out of hours calls are directed to a support desk, where you can record your problem details. You will be contacted as soon as possible.

60001198, Second Edition 1-1

1

OverviewThe Sound Forge and the Intervoice System Audio File Guide provides information on creating audio files using Sound Forge. Topics include studio set up, file formats and file naming conventions, post-processing using the Batch Converter, and troubleshooting. This document also provides detailed instructions on using the CRVoice2 utility to rename audio files to the Intervoice runtime format.

Sound Forge can be used for recording and editing, but it is required for voice file postprocessing, noise floor testing, and complex editing.

The audience for this guide includes voice talents and speech developers interested in an efficient method of postprocessing voice files, testing the noise floor, and editing complex voice recordings.

This chapter provides the overview for this document. It contains a purpose statement for the entire document. It also contains a concept diagram and a list of required hardware and software.

For more information on preparing voice files, refer to the Sound Forge documents and Intervoice’s InterSoft runtime documents.

For more information on Sound Forge, visit theWeb site: http://mediasoftware.sonypictures.com/support/, or visit the Intervoice Web site at www.intervoice.com/support/techinfo/voicerecording.

In this chapter...

▼ Summary of Changes, 1-2

▼ Voice Recording Concept Diagram, 1-3

▼ Recommended Hardware, 1-4

▼ Recommended Software, 1-5

1-2 Sound Forge and the Intervoice System Audio File Guide 60001198, Second Edition

Summary of Changes

The major revisions to this document are summarized below:

Changes

• Updated the Web site URL for downloading Sound Forge and Batch Converter.

• Reorganized chapters into a more functional order.• Replaced Omvia Media Server term with Intervoice runtime

system (or more specifically with InterSoft IQTalk runtime system or VoiceXML Browser system).

Additions

• Added information about Sound Forge version 7.• Added information about recommended audio file formats for

VoiceXML Browser systems.

60001198, Second Edition Overview 1-3

Voice Recording Concept Diagram

Voice Recording Concept Diagram

Figure 1-1 illustrates the process of creating voice recordings.

Figure 1-1. Voice Recording Concept Diagram

Microphone

Speakers

Computer

{SoundCard

Voice Recording Software

Master .wav files

Runtime .wav files for

VoiceXML

CRVoice2 Utility

NA for VoiceXML

Intervoice System

Renaming

Sound ForgeBatch Converter

Recording

Processing

Preamplifier

Intervoice format .ivc files

for InterSoft IQTalk

m.### runtime files


Recommended Hardware

The following hardware is recommended:• Intel Pentium or compatible computer• Sound card• Preamplifier• Amplified speakers with cables• High-quality, unidirectional, low-impedance microphone• XLR Cable (connects the microphone to the preamplifier)• Y-adapter audio cable (connects the preamplifier to the sound card)

Specific Hardware

Intervoice recommends the following hardware for the best quality when recording:• M-Audio Audiophile 2496 sound card• Midiman Audio Buddy preamplifier• Sennheiser E845 supercardioid vocal microphone

Note In noisy environments, a head-mounted, noise-canceling electret microphone, such as the Shure WH20XLR, is recommended.

• Microphone base• Horizon 10-ft. XLR microphone cable (connects the microphone to

the preamplifier)• 6-ft shielded audio cable (1/4-inch to RCA jack) (Radio Shack

42-2373A)• 6-ft Y-adapter audio cable (1/8-inch to RCA (phono) or 3.5 mm

stereo plug to two RCA plugs (connects the preamplifier to the sound card)

• 3.5 mm stereo jack-to-jack shield adapter• Labtec speakers (AC powered)

60001198, Second Edition Overview 1-5

Recommended Software

Recommended Software

The following software is recommended:

Sound Forge 6:• Microsoft Windows 98, ME, or 2000• Intervoice CRVoice2 utility• Sonic Foundry Batch Converter 5.0

Sound Forge 7:• Microsoft Windows 2000 or XP• Intervoice CRVoice2 utility• Sonic Foundry Batch Converter 5.0


2

Setting Up the StudioThis chapter describes the elements necessary to properly set up the recording studio. It explains how to install the voice recording hardware and configure volume controls. It also discusses how to set recording attributes and the microphone level, and how to perform the noise floor test.

In this chapter...

▼ Studio Setup Checklist, 2-2

▼ Installing the Hardware, 2-3

▼ Installing the Sonic Foundry Batch Converter, 2-6

▼ Installing CRVoice2, 2-7

▼ Installing Sound Forge, 2-7

▼ Making Quality Recordings, 2-8

▼ Performing the Noise Floor Test, 2-12


Studio Setup Checklist

Use the checklist in Table 2-1 to ensure that you set up the studio properly.

Table 2-1. Studio Setup Checklist

Studio Setup Checklist

Set up hardware.

❏ Install the sound card, following the manufacturer’s instructions.

❏ Connect the preamplifier. Verify that the preamplifier is attached to a line-level input from the sound card. This may be labeled line or aux.

❏ Connect the microphone.

Note Using a penny, remove the gold plated screw in the microphone mount. Then attach the microphone mount to the top of the microphone base.

❏ Connect the speakers.

❏ Turn on the computer. Refer to the OEM documentation for installation instructions.

❏ Turn on the preamplifier.

Install software.

❏ Sonic Foundry Batch Converter 5.0

❏ CRVoice2 utility

❏ Sound Forge 6.0 or 7.0

Set up audio.

❏ On some sound cards, you must select correct audio input using the sound card control application. The recommended sound card, M-Audio Audiophile 2496, does not use an audio control application on the PC. In this configuration, the audio volume is controlled through the preamplifier.

❏ Set up audio levels. The input level should be set as loud as possible without clipping the .wav file.

❏ Perform noise floor tests.

60001198, Second Edition Setting Up the Studio 2-3

Installing the Hardware


The following hardware is required for voice recording:

Figure 2-1. Voice Recording Hardware

Pre

Sennheiser E845

Microphone

Jack-to-Jack Shield Adapter

Y-adapter Audio Cable

XLR Microphone

Cable

Midiman Audio Buddy Preamplifier

Labtec Speakers AC Powered with Cables

M-Audio Audiophile 2496

Sound Card

Power Cord

Audio Cable 1/4-inch to RCA

Jack


Connecting the Sound Card to the Speakers

To connect the speakers to the sound card, follow these steps:

1 You will need the Y-Adapter audio cable, the stereo jack-to-jack shield adapter, the speakers, and the sound card installed in your computer.

2 Insert the speaker connector into one end of the stereo jack.

3 Insert the Y-cable into the other end of the stereo jack-to-jack adapter.



4 Insert the Y-cable connectors into the sound card connectors in your computer.

Use the manufacturer’s instructions to install the other components.

Preamplifier Audio Cable

Y-Cable to Speakers

Input Jack to

Output Jack


Installing the Sonic Foundry Batch Converter

If Batch Converter 5.0 is not on the Sound Forge CD, Sound Forge 6.0 and 7.0 customers with a valid serial number can download Batch Converter 5.0 from this Web site URL: http://mediasoftware.sonypictures.com/download/step2.asp?DID=339&#Return/. It is important that you register your copy of Sound Forge and the Batch Converter to obtain technical support.

Note Sound Forge 7.0 users should contact Intervoice Customer Support if the serial number supplied with their copy of Sound Forge 7.0 fails to enable the Batch Converter 5.0.

Periodic updates are available to Sound Forge and the Batch Converter. Intervoice recommends checking the Web site first for the latest updates to the Batch Converter.

For more Sound Forge information, contact technical support at http://mediasoftware.sonypictures.com/support.

For Intervoice 24-hour support, contact the following support office:(outside USA) Tel: +1 800 955-4688 or +1 972 454-8130(within USA) Tel: (800) 955-4688 or (972) 454-8130

To install the Batch Converter:

1 Run the Batch Converter installation file. For example, if it was downloaded to your desktop, double-click it. Or, if it is on a CD-ROM, go to the directory and start the .exe file.

2 Follow the installation screens.

3 After installing the Batch Converter, download the Intervoice Example Batch Script file at: www.intervoice.com/support/techinfo/voicerecording.

4 Save the IVBExampleBatchScriptV6.zip file to the Batch Converter directory.

5 Unzip the batch script, IVBExampleBatchScriptV6.bcs.

6 Refer to the topic “Using the Sample Batch Script” on page 5-3 for more information.


Installing CRVoice2

Installing CRVoice2

Numerical file extensions are not recognized correctly by Windows, so Intervoice created a proprietary extension, .ivc, to help identify Intervoice files on Windows systems. However, when .ivc files are transferred to an Intervoice InterSoft IQTalk system, the .ivc extensions (for example BankABC502.ivc) must be replaced with numeric extensions such as m.502. The CRVoice2 utility renames the .ivc files to m.### files.

To install CRVoice2:

1 Download the file from the Intervoice Web site at: www.intervoice.com/support/techinfo/voicerecording.

2 Save the file to the directory where you will be converting your .wav master files to .ivc files for InterSoft IQTalk systems.

Refer to “Using the CRVoice2 Utility” on page 6-4 for more information.

Installing Sound Forge

To install Sound Forge:

1 Ensure that the Batch Converter, sample Intervoice script, and CRVoice2 utility have already been installed.

2 Insert the Sound Forge CD-ROM in the CD-ROM drive. The installation program should autostart.

3 Follow the installation instruction screens.

4 Go to the Web site and download the latest update to Sound Forge:http://mediasoftware.sonypictures.com/download/step2.asp?DID=465. (This update fixes an .ivc file issue.)

5 Run the downloaded file (such as soundforge70b.exe) and follow the onscreen installation instructions.

6 Refer to the “Making Quality Recordings” on page 2-8 and the other topics in this document for information about using Sound Forge.


Making Quality Recordings

To ensure that recordings are of the highest quality, set the microphone levels correctly, place the microphone properly, and avoid clipping. As a general rule, microphone levels should be as loud as possible without causing any clipping. You can adjust final volume levels in message files after they have all been recorded. Use the volume or the dynamic graphic preset. The best signals in the noise ratio are achieved when a loud, unclipped recording is made.

Setting Microphone Levels

To set microphones levels correctly, follow these steps:

1 Set the recording attributes using File > New as 11,025, 16 bit, Mono. (Refer to “Setting Recording Attributes” on page 4-5).

2 In the main Sound Forge window, select the Record button on the left of the button bar. The Record window appears.

3 In Sound Forge 6, check the Monitor checkbox beneath the Reset button to turn on the record meters.

Set the microphone volume to the highest level at which you can speak at a normal volume without clipping. This produces the best signal-to-noise ratio.



4 Speak into the microphone in your loudest normal voice. Adjust the microphone volume on the preamplifier so that clipping does not occur even when you speak at your loudest volume. (Clipping occurs when the level meters are pushed into the red zone and the word “Clip” appears.)

5 If clipping occurs, click the Reset button. Remember, if clipping occurs during the recording process, the recording is ruined.

6 On the preamplifier, lower the microphone level.

7 Repeat step 4 through step 6 until clipping no longer occurs. The microphone level is then set correctly.

Placing the Microphone

Intervoice recommends a supercardioid vocal microphone (such as the Sennheiser E845) to help eliminate background noise during recording. When using a desk-mounted microphone, ensure that the distance between the voice talent’s mouth and the microphone is fairly constant for each recording.

Most commercial microphones do not have an output signal level that is high enough to drive the sound card. Therefore, they require a microphone preamplifier. External microphone preamplifiers improve signal-to-noise ratios by keeping high-gain amplification stages out of the PCs electrical noise fields.

For more information about the Sennheiser E845 microphone, go to www.allprosound.com or www.fullcompass.com.


Avoiding Clipping

Sound Forge clips a recording when it exceeds the maximum storable amplitude or when any value beyond the maximum is cut off, resulting in audible distortion. Clipping destroys a recording’s quality; therefore, clipped recordings must be rerecorded. Clipping occurs when the level meters are pushed into the red zone (see Figure 2-3 on page 2-11).

Figure 2-2 illustrates a voice recording without clipping.

Figure 2-2. Voice Recording Without Clipping

To prevent clipping, set the recording input level just low enough to enable the voice talent to speak at normal volume without pushing the level meters into the red zone.

No clipping

No clipping



The flat tops in Figure 2-3 illustrate clipping.

Figure 2-3. Voice Recording with Clipping

Clipping

Clipping


Performing the Noise Floor Test

Always check the background noise and a noise floor of the recording studio and equipment before starting a recording session. Recordings made with high background noise will not provide a clear voice sound during playback. Sound Forge must be installed to perform this test. A high noise floor is a common cause of poor audio quality and clicks in compressed files. (See “Using Sound Forge to Record Messages,” Chapter 4 for more information.)

To check for noise using Sound Forge tools, follow these steps:

1 Ensure that the microphone levels on the preamplifier are set correctly for a recording session, that is, as loud as possible without causing clipping (see “Setting Microphone Levels” on page 2-8).

2 Verify that the recording attributes are 11,025 Hz (or 11.025 KHz), 16 bit, Mono.

3 Record five seconds of silence, at the standard volume level. Do not speak while recording. The recording studio should be as quiet as possible during this test. Stop recording. Close the Record dialog box.

4 With a sound file window open, go to the main Sound Forge window and select Process > DC Offset. The DC Offset – <filename> window appears.

5 In Sound Forge 6, set the Preset field to Intervoice DC Offset Removal.In Sound Forge 7, set the Preset field to Remove DC offset....


Performing the Noise Floor Test

6 Select the Automatically detect and remove option to remove the DC offset, and click OK.

7 To view statistics, select Tools > Statistics.

The Statistics window appears.

8 Verify that the RMS power in the statistics dialog box is below 60 dB (for example, –65 dB).

9 Verify that the Average value (DC offset) statistic is 0.

Perform this test regularly on systems used to record voice messages. The voice message fails the test if the noise floor is louder than –60 dB. Louder noise is represented by lower negative number.

For example, a –55dB noise floor is louder than –60 dB. The test may fail because the room noise is too loud (due to air conditioning, PC fan, paging system), or the sound card preamplifier is noisy.

Correct any problems before continuing to record. Noise floors greater than –60 dB (for example, –55 or –50 dB) can be heard in the playback process and cause pops and other audible degradation during the compression process.


3

Defining Audio File FormatsThis chapter describes the voice file formats and explains the master format differences.

The voice files are recorded as a standard Intervoice master .wav files, then depending on the runtime system, the files are converted to runtime files: .wav or .ivc formats.

In this chapter...

▼ Intervoice Master Format, 3-2


Intervoice Master FormatIntervoice uses the .wav format for recording and saving all voice files. A .wav file is a standard format in which messages are recorded on a computer. Almost any audio recording tool designed for computers can be used to create the .wav file. The Intervoice standard master format uses .wav files because this format supports high-quality recordings, and the equipment required to record and play .wav files is commonly available.

The Intervoice master .wav format is 11,025 samples per second, 16 bits per sample, linear encoding, which equates to a data rate of 22,000 bytes per second.

The Intervoice master format keeps the master files in a much higher quality format than the older compressed µ-law masters. The Intervoice format makes the file sizes somewhat larger, but the voice quality of subsequent copies of this master is much better.

Although higher sampling rates are available, Intervoice uses the 11.025 KHz (or 11,025 Hz) sample rate for master files because any advantage to using a higher sample rate is lost through bandwidth limitations of phone line transmission.

The higher the sample rate, the wider the audio bandwidth, and the larger the file size. Regardless of the master format used, the InterSoft IQTalk runtime .ivc format is compressed using ADPCM techniques to 8,000 samples per second and 4 bits per sample. This is also referred to as 8KHz ADPCM or 32KBps (8 KHz x 4 bits). The VoiceXML runtime .wav format is 64KBps (8 KHz x 8bits). KBps is kilobits per second.

Intervoice Standard Master Format

The Intervoice standard format for recording master voice files is a 11.025 KHz (or 11,025 Hz), 16-bit, monophonic, linear .wav file. This format is the master format for storage of Intervoice voice files.

60001198, Second Edition Defining Audio File Formats 3-3

Intervoice Master Format

Table 3-1 charts the differences between the InterSoft master format and the standard runtime format.

After messages are recorded, they must be compressed from master .wav files into ADPCM (.ivc) or A-law or µ-law PCM .wav files. Sound Forge provides a Batch Converter that enables you to easily modify, edit, and convert large groups of messages. See “Batch Converter Overview” on page 5-2 for more information.

Table 3-1. Format Differences

Format Name

Sample Rate

Sample Size (Mono)

Data Format

Size of 10 sec Message

Windows Master Format

11,025 smp/sec

16 bits/sample Linear .wav 220 KB

InterSoft IQTalk Runtime Format

8,000 smp/sec 4 bits/sample ADPCM (.ivc) G.721

40 KB

VoiceXML Runtime Format

8,000 smp/sec 8 bits/sample A-law or µ-law PCM .wav

80 KB


4

Using Sound Forge to Record MessagesThis chapter explains how to set up the automatic labeling functions and describes the voice recording process, the methods of recording voice messages, and the use of Sound Forge to record voice messages.

Sample Screens

This document contains representations of application screens. These screens are used as examples only and may not appear exactly as those on your system.

In this chapter...

▼ Setting Up Automatic Labeling, 4-2

▼ Setting Recording Attributes, 4-5

▼ Recording Process Checklist, 4-6

▼ Recording Methods, 4-7

▼ Recording Messages, 4-9

▼ Reviewing Messages, 4-17

▼ Removing Intervoice Batch Scripts, 4-18


Setting Up Automatic Labeling

Windows 2000 allows long filenames. Assigning long filenames (BANKABC.1stGreeting.1048, for example) to message files may help keep track of messages and associate them with the correct applications. Long filenames make it easier to identify message files to be renamed for use in runtime. (See “Filename Formats (.ivc to m.###)” on page 6-2 and .)

The automatic labeling function provided by Sound Forge can be used to assign descriptive filenames to master voice files. It is also used to automatically name and number regions or files during the recording process. Intervoice recommends that you set up the automatic labeling function before recording messages.

To set up Sound Forge’s automatic labeling function before recording your messages, follow these steps:

1 Open Sound Forge.

2 From the main Sound Forge window, open the Record window by

either clicking the Record button or selecting Special > Transport > Record.

Automatic versus Manual Labeling

If you do not set up the automatic labeling function before recording messages, you must manually number the regions or takes after recording.

60001198, Second Edition Using Sound Forge to Record Messages 4-3

Setting Up Automatic Labeling

The Record window appears.

3 Right-click the Record window’s title bar.

4 Select Automatic Labeling from the menu that appears.

The Automatic Labeling window opens.


5 If you are recording in Create a new window for each take mode, type the prefix to be assigned to all files in the New window title prefix field in the Sound windows area. Modify the fields immediately following it, as explained in step 6 and step 7.

If you are recording in Multiple takes creating regions mode, type the prefix to be assigned to all files in the New region prefix field in the Regions area. Modify the fields below it, as explained in step 6 and step 7.

6 Select the Use counter and start at (0 to 65,000) checkbox, and change the number in the field to the number at which the numbers should begin (500, for example). For information on Intervoice voice file naming conventions, see page 6-3.

7 Make sure to select the Insert leading zeros in field width of (2 to 24) checkbox, and set leading zeros to 3.

8 Click OK. Each region or window is numbered starting at the number entered in the Use counter and start at (0 to 65,000) field in step 5.

Do not include any numeric characteristics at the end of your filenames unless you want them to be included in the final message number.

The names assigned to the regions or files are converted to message names and numbers in the runtime system.


Setting Recording Attributes

Setting Recording Attributes

The recording attributes control how Sound Forge stores the master files. Sound Forge remembers these settings for new recordings, so you set attributes only once.

To set recording attributes, follow these steps:

1 Open Sound Forge.

2 From the main Sound Forge window, select File > New. The New Window dialog box appears:

3 Enter the appropriate recording attributes. The Intervoice standard settings are:

Sample rate: 11,025 HzSample size: 16-bitChannels: Mono

4 Click OK.

These settings are stored as the default. Therefore, you should create them the first time you record.


Recording Process Checklist

Use the checklist in Table 4-1 to ensure that you complete all the steps in the recording process.

Table 4-1. Recording Process Checklist

Recording Process Checklist

❏ Complete the noise floor test.

❏ Record the master file using the voice recording software.

❏ Use the Batch Converter to postprocess the master file (remove pause, adjust gain, and so on) and compress the file into applicable runtime format.

❏ Run the CRVoice2 utility (for the InterSoft IQTalk runtime system only).

❏ Move the files to the Intervoice system.


Recording Methods

Recording Methods

Two major methods of recording are used: • Continuous (recording continuously while reading a script) • Message-by-message (stopping the recording after each message)

In Sound Forge, the continuous recording mode is called Multiple takes (no Regions). The two modes of message-by-message recording are called Multiple takes creating Regions and Create a new window for each take.

Regardless of which recording method you use, you must separate all messages into individual files during the final process of message creation.

Figure 4-1 illustrates the recording process and recording methods.

Figure 4-1. Recording Process

* Renaming is required if the automatic naming or numbering function was not set up or if the messages are recorded out of sequence.

Continuous

Message

Manuallymark

regions

byMessage

Renameregions*

Extractregions

Extractregions

Recording Files Postprocessing

1. Review masters.

2. Archive masters.

3. Complete batch post processing.

4. Use CRVoice2 utility. (InterSoft IQTalk systems only)

5. Place files in runtime.

(Record all messages in one continuous session)

Regions

Files(Record each messageseparately, stopping after each)

(if required)

Renamefiles*

(if required)regionsExtract

Refer to the Sound Forge document for detailed information about recording modes.


Continuous Recording

The continuous recording method is appropriate when voice talents prefer to read continuously from a script or prefer not to operate the computer while recording. The voice talent records a group of all the messages at once. Then a subsequent editing process marks the individual recordings using region markers.

Multiple Takes (No Regions)

The Multiple takes (no Regions) mode in Sound Forge allows multiple takes to be recorded in one file, but avoids defining any regions in the Regions List.

Recording starts at the position shown in the Start field when the Record button is selected and continues until the Stop button is selected. Any data that is currently after the position in the Start field is replaced. When recording is stopped, the start position remains at the end of the take allowing the next take to be recorded immediately. After all messages are recorded, the editor must manually place the regions.

Message-by-Message Recording

The message-by-message method means that the user stops recording after each message, creating multiple regions or new files for each take.

Multiple Takes Creating Regions

The Multiple takes creating Regions mode allows multiple takes to be recorded with each take automatically defining a region in the Regions List. Recording starts at the position shown in the Start field when the Record button is selected and continues until the Stop button is selected. Any data that is currently after the position in the Start field is replaced. When recording is stopped, the start position remains at the end of the take allowing the next take to be recorded immediately. When the Record button is selected, the automatic naming function increments the region name.

Create a New Window for Each Take

The Create a new window for each take mode is similar to the Multiple takes creating Regions mode. However, instead of creating new regions with each take, the system creates a new file for each take. This proves helpful in recording data where each take is intended to be saved into its own file. This mode is useful if the messages in a script are not in sequence (such as m501, m504, m503, m502).


Recording Messages

Recording Messages

The recording procedures described here are divided into continuous and message-by-message recording.

Continuous Recording

The continuous recording method is appropriate when voice talents prefer to read continuously from a script or prefer not to operate the computer while recording. The voice talent records a group of all the messages at once. Then a subsequent editing process marks the individual recordings using region markers.

Multiple Takes (No Regions)

The Multiple takes (no Regions) mode in Sound Forge allows multiple takes to be recorded in one file, but no regions are defined in the Regions List. The editor must manually place the regions.

To record messages in the Multiple takes (no Regions) manner, follow these steps:

1 Open Sound Forge.

2 From the button bar, select the Record button.

The Record <Sound x> window appears.

3 Select Multiple takes (no Regions) in the Mode field.

4 In the Mode field, select the recording mode.

Note Make sure that DC adjust, in the lower right of the Record window, is NOT checked.

5 In Sound Forge 6, select the Monitor checkbox to display the level of the recording input on the meters. To prevent clipping, use the Monitor function to set the appropriate recording volume. When a recording is clipped, the word “Clip” appears above each meter. Clipping destroys a recording’s quality.


6 Set the recording volume using either the sound card’s or the operating system’s volume control application. The volume should be set at the highest level at which you can speak at normal volume without clipping.

7 Select the Record button in the Record <Sound x> window.

8 Record successive messages without stopping between them.

9 Click the Stop button.

The messages appear as one long interrupted set of wave formations.

10 Highlight the area of wave that you want to include in your first region.

11 From the grey bar directly above the waves, right click and select Insert Marker/Region.

Always record as loudly as possible without clipping. The volume of the master files can be adjusted after recording is complete.


Recording Messages

The Insert Marker/Region window appears.

12 Enter a name for the region in the Name field. (Alternately, you can set up automatic labeling for your windows if you want to give the same prefix to all of them. See “Setting Up Automatic Labeling” on page 4-2.)

13 Select the Region button.

14 Click OK.

15 Repeat step 10 through step 14 on page 4-11 for each region you wish to create.

Your waveforms are now divided by regions.


Message-by-Message Recording

The message-by-message method means that the user stops recording after each message, creating multiple regions or new files for each take.

Multiple Takes Creating Regions

The Multiple takes creating Regions mode allows multiple takes to be recorded with each take automatically defining a region in the Regions List.

If you have the Batch Converter plug-in, Multiple takes creating Regions is the easiest recording mode to use. This mode places all takes in a single window, marking the beginning and end of each new take with a region marker and a vertical black line. Later, using the Extract Regions tool (available only with the Batch Converter plug-in), automatically saves each region as a separate file. When recording large numbers of messages (over 50), you may want to divide the regions into several different files for easier manipulation.

To record messages in the Multiple takes creating Regions manner, follow these steps:

1 Open Sound Forge.



3 Select Multiple takes creating Regions in the Mode field.

Note Make sure that DC adjust, in the lower right of the Record window, is NOT checked.



Recording Messages

.


6 Record a first message, and click the Stop button after it.

7 Continue creating messages and clicking Stop between each.

Your messages are automatically labeled for you.

Checking for Clipping

After recording each region or take, check the input level meters. If the Clip indicator appears above them, you must rerecord the region or take.


Extracting Regions

If you have a file composed of multiple regions, you must extract the regions to use them in a runtime system. After recording is complete, each message is marked with its own region. They can be subsequently separated into different files through the Batch Converter plug-in in Sound Forge.

To extract regions in the Multiple takes creating Regions manner, follow these steps:

1 In Sound Forge, complete recording in the Multiple takes creating Regions manner.

2 From the menu bar, select Tools > Extract Regions.

The Extract Regions window appears.

3 Select the regions to extract in the Regions to extract field.

4 Click Extract.


Recording Messages

Create a New Window for Each Take

The Create a new window for each take mode is similar to the Multiple takes creating Regions mode. However, instead of creating new regions with each take, the system creates a new file for each take. This proves helpful in recording data where each take is intended to be saved into its own file. This mode is useful if the messages in a script are not in sequence (such as m501, m504, m503, m502).

If you do not have the Batch Converter plug-in, use New window for each take mode. This mode places each take in its own data window. To view all your takes, choose Window > Cascade > Tile Horizontally, or Window > Cascade > Tile Vertically.

To record messages in the Create a new window for each take manner, follow these steps:

1 Open Sound Forge.



3 Select Create a new window for each take in the Mode field.


5 Set the recording volume using either the sound card’s or the operating system’s volume control application. The volume should be set at the highest level at which you can speak at normal volume without clipping.

6 Click the Prepare button.

Checking for Clipping

After recording each region or take, check the input level meters. If the Clip indicator appears above them, you must rerecord the region or take.


This function allows time for you to create and rename the new window before Sound Forge begins recording. If you do not use the Prepare button, the beginning of each message may be cut off. It is not necessary to use this function if you are using the Multiple takes creating Regions mode. After you click Prepare, a flashing green box that reads “Prepare” appears to the left of the Sync button.


8 Record a first message, and click the Stop button after it.

9 Continue creating messages and clicking Stop between each.

Each message appears with its own window, as shown below:


Reviewing Messages

Reviewing Messages

Review all messages after recording and check for noises, clicks, pops, and clipping. After a message is recorded, the data window for that recording appears in the Sound Forge main window.

To review your messages, follow these steps:

1 To listen to your messages, click the Play button in the data window. (Or, place the cursor at the beginning of the message to be played, then press the spacebar. Press the spacebar again to stop.)

The data window’s buttons are labeled below:

2 If you recorded in Multiple takes creating Regions mode, you can manually adjust the region markers to remove leading and trailing silences. Or, use the Auto Trim/Crop process to trim the regions after they have been extracted.

3 After completing these steps, if you recorded in Multiple takes creating Regions mode or if you recorded in Create a new window for each take mode, proceed to “Batch Converter Overview” on page 5-2.

Play button

Stop button

Go to Start button

Go to End button

For instructions on manually adjusting region markers, refer to the Sound Forge 6.0 Document.


Removing Intervoice Batch ScriptsBefore installing the new sample batch scripts, remove the old batch scripts.

To remove the old sample batch scripts, follow these steps:

1 Open Sound Forge.

2 From the main Sound Forge window, select Tools > Batch Converter.

The Batch Converter window opens, and the list of batch scripts appears.

4 Select a script and right-click.

5 Click Remove to delete the batch script.

6 Repeat step 4 and step 5 to remove old batch scripts.


5

PostprocessingThis chapter provides the steps to convert multiple voice files to the voice file format using the Sonic Foundry Batch Converter 5.0.

The voice talent’s responsibility is to provide individual master voice files recorded with high quality. The voice talent may or may not have the expertise to postprocess audio files and troubleshoot problems.

Either the voice talent or the application engineer must postprocess the master .wav files to compress, enhance, and normalize volume levels to complete the message creation process.

In this chapter...

▼ Batch Converter Overview, 5-2

▼ Using the Batch Converter, 5-6

▼ Modifying the Sample Batch Script, 5-8


Batch Converter Overview

To prepare files for the InterSoft IQTalk runtime system, the Sonic Foundry Batch Converter 5.0 takes the master .wav format and converts it to the .ivc file ADPCM format.

Figure 5-1. .wav to .ivc Batch Conversion Process

To prepare files for the VoiceXML Browser runtime system, the Sonic Foundry Batch Converter 5.0 takes the master .wav format and converts it to the runtime .wav file format.

Figure 5-2. .wav to runtime .wav Batch Conversion Process

The batch script, Intervoice Example Batch Script, automates most of the postprocessing steps for voice files. The sample script shows how most of the steps can be automated in a single batch process using the Batch Converter. (The Intervoice Example Batch Script is for demonstration purposes. You must modify the preset parameters to suit the specific circumstance.)

Master Format11 KHz, 16-bit Sonic Foundry

Batch ConverterMono .wav

InterSoft IQTalk Format8 KHz, 4-bitADPCM .ivc

Master Format11 KHz, 16-bit Sonic Foundry

Batch ConverterMono .wav

VoiceXML Format8 KHz, 8-bit

A-law/µ-law .wav

60001198, Second Edition Postprocessing 5-3


Downloading the Batch Converter

Sony Pictures Digital Media offers periodic updates to the Batch Converter software. Intervoice recommends that you check their Web site. You can download the Batch Converter from Sony Pictures Digital Media at:

http://mediasoftware.sonypictures.com/download/step2.asp?DID=339&#Return/

Be sure to register your copy of the Batch Converter to obtain technical support. See “Installing the Sonic Foundry Batch Converter” on page 2-6.

Note Sound Forge 7.0 users should contact Intervoice Customer Support if the serial number supplied with their copy of Sound Forge 7.0 fails to enable the Batch Converter 5.0.

Using the Sample Batch Script

The batch script described below is available on the Intervoice Web site at www.intervoice.com/support/techinfo/voicerecording.

To use the sample batch script in the Batch Converter 5.0, follow these steps:

1 Open the Batch Converter.

2 Select File > Open Script. The Open dialog box appears.

3 Navigate to the Intervoice Example Batch Script file, and click Open to install the batch script.

The Intervoice Example Batch Script creates 32 KBps compressed messages for µ-law playback and –12 dB volume normalization. You may obtain other options, such as µ-law playback, different volumes, and 32 KB compression, by editing the script in the Sonic Foundry Batch Converter.

The Intervoice Example Batch Script is for demonstration purposes. You must modify the preset parameters to suit the specific circumstance.


Converting the .wav Files to Runtime Format Overview

A series of transformational steps is required to convert a .wav file into the runtime file format. All of these steps can be automated into a single batch process in the Batch Converter. The Batch Converter can apply this process to a large number of master .wav files in a single step, creating a set of runtime files, one for each master .wav file.

To convert a master .wav file into a runtime file, the Batch Converter must perform a specific set of processes in a specific order on the original .wav master file. In version 5.0, Sonic Foundry calls these steps plug-ins (presets in Sound Forge 4.5). The final script is called a plug-in chain. The basic recording process provides a clean audio signal digitized to the system’s hard drive in .wav file format. However, there are several other processing steps that must be applied to an audio file after it is recorded, but before it can be placed on an Intervoice system. The Intervoice sample Batch Converter postprocessing procedures are described below:

DC Offset removalRemoves any DC offset before messages are compressed. Most sound cards have some DC offset that can affect the compression process.

Front and Back silence trimming with 20-ms fadesRemoves excess silence on the front and back of messages to tighten the message delivery. Generally the messages are trimmed to 20 milliseconds or less of silence on both ends. Noise at the beginning or end of messages (such as mouse or keyboard clicks) causes the auto-trimming process to trim too early or too late, leaving extra silence in the message. These messages must be hand-trimmed.

Front and Back silence buffering with 50-ms buffersPrevents the destabilization of the ADPCM decoder when two messages are played back to back. Once the arbitrary silences are trimmed from the message, this step places a standard 50-millisecond silence buffer on the beginning and end of every message.

Message level normalization Converts all the messages in a single application to a single standard volume level, so that listeners hear every message at the same volume.

8 KHz resamplingConverts the 11.025 KHz sample rate of the master file to an 8 KHz sampling rate.



EqualizationPerforms the frequency manipulation required to make high-quality message playback over telephone lines.

ADPCM compression for A-law playback and Intervoice formattingCompresses the linear format to ADPCM to reduce the size of the message files on the Intervoice system. This step is where you specify whether a message is to be played as µ-law or A-law, or to be compressed to 32 KB.


Using the Batch Converter

To use the Batch Converter, follow these steps:

1 From the main Sonic Foundry Batch Converter window, select File > Open Script.

2 Navigate to and open the Intervoice Example Batch Script, IVBExampleBatchScriptV6.bcs.

The Sonic Foundry Batch Converter - IVBExampleBatchScriptV6.bcs window appears in the Script tab view.

3 Do not select any of the options on the window.

4 Select the directory where you want the output files saved.

5 In the File format area, set the Format and Template.

Note If you change any selection, save the changes as a new batch script and retain the original sample batch script.


Using the Batch Converter

6 To choose the files that are to be converted, go to the main Sonic Foundry Batch Converter window and select File > Add Media, or click Add Media on the toolbar.

7 In the Add Media dialog box, select all the .wav files to be converted and click Open to continue

The Files to Convert window tab view appears.

8 Select Convert Files. The status changes to complete.

Backup Master Files Before Converting

Be sure to make a backup copy of all master .wav files and store them in a separate directory away from any editing or converting activity. Do this to prevent losing master files by accidently converting the master files into the compressed .ivc or .wav format. The compressed files cannot be converted back to masters, due to quality loss during conversion.


Modifying the Sample Batch Script

The Intervoice Example Batch Script file provides the Batch Converter with an example of the appropriate presets to be used during the conversion of master voice files into ADPCM (.ivc) or A-law or µ-law PCM .wav files. The batch script described below is available on the Intervoice Web site at www.intervoice.com/support/techinfo/voicerecording.

This topic explains how to modify the sample batch script for different requirements. The basic script is designed to resample and compress .wav files into the runtime format. This sample script also includes settings to trim and normalize volume.

The settings that can be modified are message level normalization and equalization. See page 5-9 for the settings that cannot be modified.

To modify the batch script, follow these steps:

1 Open the Batch Converter.

2 From the main Sonic Foundry Batch Converter window, select File > Open Script.

3 Double-click the settings you want to change.

Note If you change any selection, save the changes as a new batch script and retain the original sample batch script.

4 Rename and save the changed batch script.

This is a general batch script explanation only. You need to change formats and volume levels for real runtime systems. For more information on batch scripts, refer to the Sound Forge document.


Modifying the Sample Batch Script

Batch Script Settings That Cannot Be Modified

Some batch script settings should not be modified, as they control critical parameters in the conversion process. Other batch script parameters can be modified to adjust the runtime messages for various use.

The settings that should not be modified are:• DC offset removal• Front and back silence trimming• Front and back silence buffering• 8 KHz resampling• ADPCM compress for A-law playback• Intervoice formatting


6

Naming and Converting .wav to .ivc to m.### FilesThis chapter describes the Intervoice InterSoft IQTalk voice file naming convention and explains how to use the CRVoice2 utility to rename .ivc files to m.###.

Most audio recording applications accept message files with long filenames such as BankABCgreeting503.wav. Messages can be given long filenames for easy identification. The Intervoice InterSoft IQTalk system only accepts messages in a specific format. The format is m.###. All InterSoft runtime messages must start with m, and end with three numeric digits. Message numbers larger than 999 are represented by placing more digits to the left of the suffix and to the right of the m. For example, message number 2046 is renamed to m2.048 and message number 48, 423 would be m48.423. The CRVoice2 utility can be used to quickly convert long filenames to the InterSoft runtime standard of m.###.

In this chapter...

▼ Filename Formats (.ivc to m.###), 6-2

▼ Using the CRVoice2 Utility, 6-4


Filename Formats (.ivc to m.###)

Intervoice master files are .wav files. They have filenames of naming convention xxxxx.wav, where xxxxx can be alphabetic, numeric, alphanumeric, or other characters except special characters (for example, slashes and colons). The filenames have a limit of 253 characters, plus 3 character extension.

Intervoice InterSoft IQTalk compressed files have two different naming conventions: xxxxx.ivc and m.###. Microsoft Windows programs identify the .ivc extension as the Intervoice file extension. The m.### naming convention is used to identify Intervoice messages placed on the Intervoice runtime system.

However, Windows does not correctly identify files named m.###, so these files are not used on the Windows development or recording systems. The m.### files are used only on the Intervoice system. Figure 6-1 shows examples of accepted filename formats, including .wav, .ivc, m.###, and m#.### formats.

Figure 6-1. Filename Formats (.ivc to m.###)

Original Master BankABC502.wav

Compress to Intervoice Format

BankABC502.ivc

Rename for Runtime (CRVoice2 Utility)

m.502

Original Master Firstmenu2046.wav

Compress to Intervoice Format

Firstmenu2046.ivc

Rename for Runtime (3-character Extension)

m2.046

60001198, Second Edition Naming and Converting .wav to .ivc to m.### Files

Filename Formats (.ivc to m.###)

Voice File Naming Convention

Before voice files can be used on an Intervoice system, the files must conform to Intervoice’s voice file naming conventions. InterSoft IQTalk runtime voice files must be named in the following format: m.###. The smallest message number allowed is m.000. Once m.999 is reached, begin renumbering at m1.000, m1.001, and so on. The highest message number supported in runtime is m2147483.647.

Windows 2000 also requires all media files to have three-character extensions to identify the file type. Numeric file extensions are not recognized by Microsoft Windows, so Intervoice created a new extension, .ivc, to identify Intervoice files on Windows systems. However, when .ivc files are transferred to an Intervoice system, the .ivc extensions must be replaced with numeric extensions. The CRVoice2 utility renames the .ivc files to the m.### format. For Intervoice purposes, the .wav file extension indicates a master voice file, while the .ivc file extension indicates a file that has been compressed for use on an Intervoice InterSoft IQTalk system. Once you have compressed the messages into .ivc files, use the CRVoice2 utility (see page 6-4) to copy and rename all the .ivc files to the m.### format. The CRVoice2 utility leaves the original long filenames on the .wav and .ivc files for easy reference later.

Figure 6-2 illustrates the progression from .wav files to .ivc files then to m.### files.

Figure 6-2. Transforming .wav Files to .ivc Files

BankABC501.wav BankABC501.ivc

m.501m.502

BankABC502.wavBankABC503.wav

BankABC502.ivcBankABC503.ivc

Voice Recording Software Batch Converter

CRVoice2 utility

(original masters) (trimmed, EQ’d,normalized, compressed)

m.503

Same files,same content,different names


Using the CRVoice2 Utility

Before the .ivc voice files can be used on an Intervoice InterSoft IQTalk system, they must be renamed to the Intervoice voice filename format, m.###. The CRVoice2 utility automatically copies and renames all the .ivc compressed voice files created in Sound Forge to the Intervoice m.### file naming convention. CRVoice2 leaves the original long filenames on the .wav and .ivc files for easy reference later. The CRVoice2 utility also checks and corrects file lengths to ensure that all the files are exact multiples of four bytes. The CRVoice2 utility does not change the contents of a file in any way.

If the .ivc filename includes the message number (for example, BankABCMainMenu503.ivc), the CRVoice2 utility copies and renames all message files quickly and correctly. This feature allows you to apply the full capabilities of long filenames to help identify message files before renaming them for placement on a runtime system.

To use the default operation of the CRVoice2 utility, place the CRVoice2.exe file in the directory that contains the .ivc files to be renamed.

The CRVoice2 utility renames only .ivc files in the directory where the utility is located.

CRVoice2 Utility Options

To view the list of the options available in the CRVoice2 utility, start a command prompt in Windows, then navigate to the directory where the CRVoice2 utility is located. Type CRVoice2 -? <Enter>. A list of all of the CRVoice2 utility options is displayed.

60001198, Second Edition Naming and Converting .wav to .ivc to m.### Files

Using the CRVoice2 Utility

To start the CRVoice2 utility from an icon, follow these steps:

1 Copy the CRVoice2.exe utility file from the Intervoice Web site to the directory containing the .ivc files to be copied and renamed. Make sure to copy the actual CRVoice2.exe file rather than a shortcut to it. (Hold down the <Ctrl> key to drag and drop files.)

2 Double-click the copy of the CRVoice2 icon.

The CRVoice2 utility copies all the .ivc files in the current directory and renames the copies with the m.### convention. CRVoice2 affects only the files in the local directory with the .ivc extension.

To start the CRVoice2 utility from a command prompt, type the following command in this format:

CRVoice2 <-rename> <source file spec> <target directory>

The CRVoice2 utility provides the following options:

-hDisplays the available parameters in the CRVoice2 utility.

<-rename> (optional)Renames the files instead of copying them. The <target directory> parameter is ignored when another one is specified. This renames the existing .ivc files to m.500 instead of duplicating them, so the original .ivc filenames are lost.

<source file spec>Indicates the files to be renamed or copied (wildcards are allowed) and the location of the files. The command copies and renames all .ivc files in the current directory.

<target directory> Indicates the directory in which the renamed copies are placed. The default is the current directory, if this parameter is not used.

Example:

The following sample CRVoice2 command copies all the .ivc files in the c:\audio directory that begin with the bankabc prefix and renames them to the appropriate m.### filenames. The copies are placed in the c:\audio\bankabc directory.

[c:\audio]Crvoice2 bankabc*.ivc c:\audio\bankabc


7

Maintaining Audio QualityThis chapter explains the purpose of master voice files and stresses the importance of editing the files before compressing them. It also stresses the importance of retaining original master voice files.

In this chapter...

▼ Voice File Quality Issues, 7-2


Voice File Quality Issues

Master .wav voice files are intended to store messages in a high-quality format for editing and archiving original messages. To prevent loss of quality, do not edit and reuse compressed messages. After an Intervoice master.wav file has been compressed into the .ivc, m.###, or runtime .wav format, it must not be changed or edited. If you need to change a voice file, edit the Intervoice .wav master file and then compress the new edited file into the .ivc, m.###, or runtime .wav format.

The Intervoice .wav master format is an 11.025 KHz (or 11,025 Hz), 16-bit per sample, linear format. It provides a high-quality method of voice storage for editing and manipulation before compression. All editing and volume control adjustments should be made on the 16-bit per sample master .wav file before compression into the .ivc or .wav runtime format.

After all editing of the master .wav voice file is complete, the file should be processed using the Batch Converter. No editing of any type should be done on the compressed file.

Note that the original masters of the .wav voice files should be retained for future use. After a .wav voice file is compressed into the .ivc format, some quality is lost. Even though the Batch Converter and Sound Forge can convert the .ivc or runtime .wav files back to master .wav file size, a substantial loss of quality occurs in the new “master” .wav files, and even more quality is lost in the second compression step. For this reason, all re-editing should be done on the original 16-bit master .wav file.

Keep two versions of the master file: one original and one for editing. Do not discard or edit the original master.

Loss of Voice Quality

Severe loss of voice quality occurs if you convert Intervoice .ivc files to .wav files, edit those .wav files, and try to convert the .wav files back to .ivc files.

To correctly edit files, save a copy of the original 16-bit master .wav file, edit the original 16-bit master .wav file, and then convert the newly edited .wav into .ivc and m.### (or 8-bit runtime .wav) for use on the Intervoice system.

60001198, Second Edition A-1

A

Frequently Asked QuestionsThis appendix provides some frequently asked message recording questions and the answers to those questions. Users should consult this chapter before calling for technical support. This chapter will be updated as new questions are encountered.

Contact Sound Forge technical support at:http://mediasoftware.sonypictures.com/support. (Intervoice recommends contacting Sonic Foundry first for any technical support issues, as Intervoice does not have specific support personnel for Sound Forge.)

For Intervoice technical support, see “Technical Support” on page xv.

For a list of message recording FAQs, visit the Intervoice Web site at www.intervoice.com/support/techinfo/voicerecording.

In this appendix...

▼ FAQs, A-2

A-2 Sound Forge and the Intervoice System Audio File Guide 60001198, Second Edition

FAQs

The following questions and answers pertain to recording messages.

Recording Steps What are the main steps to record a voice message?The primary steps in the recording process are as follows:

1 Set levels.

2 Perform the noise floor test (see page 2-12).

3 Record messages (Refer to the voice recording software).

4 Apply the batch script (see page 5-8).

5 Run the CRVoice2 utility (see page 6-4), for InterSoft IQTalk only.

6 Move the files to the Intervoice system.

Defining DC Offset What is the DC offset and what does it do?DC offset is the average vertical offset from 0 dB of the recorded wave form. This is normally visible in the Sound Forge window only in silent sections of the wave form. Most sound cards have some DC offset when you are recording. Usually you cannot see this offset in the Sound Forge window unless you expand the vertical gain nearly to the limit and look into a silent portion of the recorded wave form.

DC offset can cause problems in compressing voice files using the Intervoice ADPCM format. It also causes errors when you are trying to measure the noise floor of a recording. The Sound Forge Tools > Statistics command measures the average value or DC offset of a wave form along with several other important statistics about a wave form. This is the command used to check the noise floor of your recording setup.

Since virtually all sound cards have some DC offset, it should be routinely removed as part of the standard processing of 11.025 KHz master files for compression into Intervoice formatted messages. A preset to remove DC offset should be placed in the standard Intervoice batch script that is applied to all Sound Forge voice messages prepared for runtime systems.

60001198, Second Edition Frequently Asked Questions A-3

FAQs

Sound Forge already has a preset (Process/DC Offset) that removes the DC offset from any recording. Select Process > DC Offset in the Sound Forge main window. The DC Offset window opens. Ensure that the Automatically detect & remove option is selected and that the Compute from first 5 seconds only option is unchecked. Save this preset as Intervoice DC Removal Preset and place it in the standard batch script.

Producing 64-KB Messages

Is it possible to produce 64-KB µ-law or A-law messages with the Sound Forge Tool?

Yes, you can create 64-KB messages with Sound Forge, as follows:

Before you can save a Sound Forge file as an Intervoice 64-KB message, you must make sure the file is sampled at exactly 8.0 KHz. If the file is not sampled at 8 KHz, you must resample the message in Sound Forge to change the sample rate to 8 KHz. Remember that if you are down-sampling (for example, 22 KHz to 8 KHz), you must use the antialias filter in the resample preset. Up-sampling (for example, 6 KHz to 8 KHz) does not require the antialias filter. (See “Batch Converter Overview” on page 5-2 for more information.)

Once you have an 8-KHz message, select File > Save as. The Save As dialog box appears. In the Save as type field, select the Intervoice (*.ivc) format. In the Format selection field, select the G.711 µ-law or A-law format. Click the Save button. You can also put all of this into a batch converter script to convert several .wav voice files to Intervoice 64-KB master files in a single step.

Converting Messages

Can I convert Dialogic IVR messages to Intervoice messages?Yes. Just open the Dialogic messages in Sound Forge, resample to 8 KHz (if not already there), and save as an .ivc file. You can save the file as an Intervoice format 24-KB, 32-KB, or 64-KB, µ-law or A-law message, depending on the Save As format option you select. You can also put all of this in a batch converter script to convert many .wav or Dialogic voice files to Intervoice 64-KB master files in a single step.


Using the Gain Leveling Preset

When should I use the Intra-message Gain Leveling Preset? The Intra-message Gain Leveling Preset is intended to be used only for a specific poorly recorded single message where the voice talent’s dynamics are too wide. (That is why it is called intra-message.) This usually occurs if the voice talent leans too close and then too far away from the microphone while recording a single message. That is to say, if a specific single message has passages that are too soft, while other parts of the same message are too loud, then you can use this preset to level the audio within that message. This preset will not equalize the level of a group of messages.

The Normalize preset is included in the utilities package and it is the correction to adjust the levels of a group of messages to a consistent level. That is why it is included in the Sound Forge utilities batch scripts, and the Intra-message Gain Leveling Preset is not.

Installing the Audiophile 2496 Sound Card

When I installed the M-Audio Audiophile 2496 sound card, the software disabled the Windows volume controls on my system.

This is not an error or problem. The software is working correctly. Once the sound card is installed, the volume is controlled through the Midiman Audio Buddy preamplifier.


FAQs

Turning Off AGC How do I turn off the Automatic Gain Control (AGC) on my sound card?The AGC that is on many sound cards must be turned off for any Intervoice voice message recordings.

Unfortunately, Creative Labs turns on the Automatic Gain Control on all of their sound cards by default (except the AWE Gold). You can turn off AGC only by using the Creative mixer applet that comes with the Sound Blaster card in the application software package (ctmix32.exe). The standard Microsoft Windows 95 mixer applet (yellow speaker) does not have the ability to change this setting on Creative Labs cards.

To turn off the AGC, follow these steps:

1 Install Creative Labs Audio Mixer applet (ctmix32.exe).

2 Start ctmix32.exe.

3 Right-click the mixer face.

4 Select the Input/Output settings.

5 Uncheck Automatic Gain Control.

You should not make recordings with the Sound Card AGC enabled. It can cause background noise as well as clicks and pops in the final recording.

Normalizing the Message Volume

When is the best time to normalize the volume so that it matches the messages already on the system?

The messages should be saved in 11, 025 KHz, 16-bit, monophonic, linear .wav format and recorded as loud as possible without clipping. All the subsequent processing can be performed using the Batch Converter. Generally, reducing the volume of the message is one of the last steps before compressing the .wav file into the runtime format.


Filtering Messages When I am converting messages to the Intervoice ADPCM format, what filtering or optimization is necessary for the telephony network?

“Converting the .wav Files to Runtime Format Overview” on page 5-4 describes the steps for postprocessing a .wav file to InterSoft .ivc file format. The most critical step for voice clarity is equalization. By using the parametric equalization function in the Batch Converter to change the low frequency cutoff of the recorded voice, you can significantly change the clarity of the voice over the phone.

Most users find that they can achieve their voice quality requirements by modifying the low-shelf frequency of the paragraphic EQ step. Usually they raise the low-shelf to between 200 and 300 Mhz for improved clarity. You may find that a modification of the high-shelf parameter also will improve some types of voices. If you want, you can boost a small portion of the audio spectrum centered at 800 Hz up a few decibels to get extra clarity voice.

Do not improve the voice by boosting the bass. While this may seem improved on a hi-fi speaker, the voice quality does not work well over a phone handset. The primary information carrying bandwidth of the human voice is between 400 and 2000 Hz, and boosting frequencies outside of this range actually lowers the listener’s comprehension.

Reducing Pops and Clicks in Messages

I have heard that the CRVoice2 utility fixes the pops and clicks in messages. Is this a result of making sure the message is in 4-byte increments, or does something else resolve this problem? And what does the CRVoice2 utility do?

Sound Forge version 4.5 and later versions of the Batch Converter with the Example Batch Script Vx.0.bcs file create InterSoft .ivc files with the correct byte count lengths. If you convert your .wav message masters using Sound Forge 4.5 or later versions of the Batch Converter, which is the correct way to convert messages, you do not have problems with pops caused by message length.

Other wave editors do not process or compress the message files correctly and should not be used to create InterSoft .ivc files. Other editors can be used to record the .wav master files, but only the Sonic Foundry Batch Converter can convert them.


FAQs

The CRVoice2 utility checks the message length and corrects it when necessary. However, it should not be used for that purpose because it does not perform the other operations on the .wav file that the Batch Converter does. The CRVoice2 utility is used strictly to rename the .ivc files to m.### for Intervoice systems. That is all it is designed to do. Checking the message length was included in this utility to fix a bug in an earlier version of Sound Forge. This earlier version is no longer available.

60001198, Second Edition Glossary-1

Glossary

µ-law (pronounced mu-law) A Pulse Code Modulation (PCM) voice-encoding standard in which analog voice is band-limited to 4-KHz bandwidth and then sampled at twice its highest frequency (4000 x 2), with the sample then converted as an eight-bit binary number. Samples must be taken 8000 times per second, so each sample is 1/8000 of a second, or 125 microseconds. A standard telephone voice channel is transmitted at 64 KBps, or eight samples in 125 microseconds. The µ-law standard is used on T1 systems in the United States, Japan, Mexico, and Canada. See also A-law.

adaptive differential pulse code modulation (ADPCM) A method of encoding digitized voice that requires fewer bits than linear or µ-law encoding (voice compression). The Intervoice system has a standard file format for voice messages using this compression method at 32 KBps and 24 KBps.

ADPCMSee adaptive differential pulse code modulation.

A-law The method of encoding sampled audio waveforms used in the 2048-KB, 30-channel PCM primary system. A-law is the standard widely used outside of North America on E1 systems. See also µ-law.

automatic labelingThe function of Sound Forge that allows the user to assign descriptive filenames to master voice files. This function should be set up before messages are recorded.

bandwidthThe range of electrical frequencies a device can employ.

Glossary-2 Sound Forge and the Intervoice System Audio File Guide 60001198, Second Edition

Batch Converter The Sonic Foundry program that allows the processing of multiple audio files in a single process.

clickA sound, or noise, that occurs in voice recordings; a type of audible interference that must be corrected in voice recordings.

command prompt An operating system prompt that appears in an operating system window or full-screen session. Commands that run programs are entered at this prompt.

CRVoice2 utility A utility that assists in the naming of multiple voice files for use in InterSoft runtime.

dBmA decibel below 1 milliwatt (mW). See also decibel.

DC offsetThe average of all of the positive and negative integers in a waveform. This number can be positive or negative. Zero is referred to as no DC offset.

decibel (dB)A unit for measuring the power of a sound or the strength of a signal.

direct memory access See DMA.

DMA (direct memory access) A technique in which an adapter bypasses a computer’s CPU and handles the transfer of data between itself and the system’s memory directly.

equalization The frequency manipulation of a sound using adjustable filters.

filenameA name given by the user to electronically stored data for purposes of saving it and retrieving it at a later time. Filenames may be of varying lengths depending on the operating system used. The three-character extension (in the 8.3 format) designates the type of file: text (.txt), word processing (.doc), executable program (.exe), and so on. For multiple-operating system compatibility, the system requires the 8.3 format and recommends lowercase filenaming.


imsgmod4.exe utility A utility that truncates messages to a length divisible by 4 bytes. This truncation eliminates clicks and pops within messages used on the system. This utility is not recommended by Intervoice.

InterSoft systemAn Intervoice system that operates on the InterSoft runtime environment with IQTalk or the VoiceXML Browser.

InterSoft version 3The core software program used to bring applications into IQTalk voice browser runtime. InterSoft version 3 is designed to run on Windows NT and Windows 2000 operating systems.Intervoice IQ Environment [VoiceXML Browser System]A completely integrated environment that is bundled with the Intervoice InterSoft runtime system. However, it can also have speech recognition and text-to-speech servers. It offers a complete solution including design, development, and deployment tools as well as operation administration and management tools. It also includes data integration options to connect to legacy, enterprise server, and Web-based data sources. Browser options are IQTalk or VoiceXML, and operating system options are Windows NT or Windows 2000.

Intervoice runtime systemOne or more Intervoice voice browser systems and other hardware that perform call processing, audiotex, speech recognition, call directing, voice messaging, and interactive voice response. This system is the interface between telephone callers and a host computer, the Internet, a local area network (LAN), a wide area network (WAN), or telecommunication network, and local or remote databases. The Intervoice voice browser systems can be integrated in one box or distributed in multiple boxes. This can also be called IVR and was formerly called Omvia Media Server.

IQTalkThe InterSoft runtime engine and voice browser that interprets and processes the InVision application. It can also interface with user-defined programs written in C or Assembly.

KBpsKilobits per second. This is the number of bits passing a specific point per second in an audio file/voice message.

message An audio file that, combined with other messages, comprises the spoken information in an application.


Mu-law See µ-law.

normalization A process used in the Intervoice Example Batch V2.sfz file to adjust the volume of all the audio files to the same level.

Omvia Media ServerSee Intervoice runtime system.

popA sound, or noise, that occurs in voice recordings; a type of audible interference that must be corrected in voice recordings.

preset An audio processing function in Sound Forge.

RealCareThe remote diagnostic and troubleshooting service available to customers as a contractual option. RealCare operates a national WATS line at 1-800-955-4688.

runtimeThe mode in which the system answers customer calls and communicates with the host; the system in the production environment.

Sound ForgeA program, created by Sonic Foundry, and distributed by Sony Media Software, Inc., that performs postprocessing and complex editing. It can also be used to record and edit messages.

trimming The action of eliminating or cutting silence from the beginnings and ends of voice messages.

voice compressionA method for reducing the bandwidth required for storing or transmitting digital voice signals.

voice file set A set of individual voice files that contain the recorded messages for an application. These voice files are contained in a subdirectory of the application directory.


voice message A digital .wav file used when an individual message is changed in a voice file set (.vfs). While the message file is being changed, any application playing the current message continues to play it. The new message is played at the next message request.

voice message file numbering The method of naming voice message files m.###. They must begin with m to be recognized, for example:

m.001 – message number 1m.777 – message number 777m2.010 – message number 2010

Voice Pool Set A type of VoiceSet that contains a collection of voice messages for applications that record large amounts of voice, such as LeaveWord recording. Messages from a Voice Pool Set (as in an InVision application) are used in place of messages from the voice file set (.vfs). Voice Pool Sets are contained in a subdirectory of the application directory. See also VSUTIL utility.

Voice Set utilitySee VSUTIL utility.

voice talentThe person whose voice is used to produce voice recordings.

VSUTIL utility (VoiceSet utility) The program that creates VoiceSet directories based on the information it reads in the .app file (stored in the application directory). Actual pools are created in the Voice Pool Set for use with LeaveWord and Advanced LeaveWord, as well as in the Alternate Message table for use by YourVoice. It also creates any necessary indirection files. VSUTIL should be started from a command prompt in the application directory.

.wav A standard format for audio files. A file that contains Windows-compliant audio data. The file can be played on any Windows system with audio hardware. Intervoice has defined an 11.025 KHz, 16-bit .wav file as its master format.


Windows 2000A network operating system built from Microsoft Windows NT technology. Windows 2000 provides support for both peer-to-peer networking as a work group and centralized administration networking as a domain. Windows 2000 has two major versions: Windows 2000 Server and Windows 2000 Professional. For IVRs, Intervoice recommends Windows 2000 Server, which supports communication between a larger number of systems. Windows 2000 Professional can be used to run applications on business desktops and laptop computers.

60001198, Second Edition Index-1

Index

.ivc files 6-4, 6-5naming 6-3quality of 7-2

.wav files 6-4clipping 2-2conversion and reconversion 7-2converting 5-2defined Glossary-5described 3-2editing and compressing 7-2master format 7-2

keeping two copies 7-2renaming 6-3updating 7-2

µ-law compressed data format, explained 3-2

AAC-powered speakers with cables,

recommended hardware 2-3adaptive pulse code modulation (ADPCM) 5-2

.ivc data format 3-3, 5-8decoder 5-4

ADPCM. see adaptive differential pulse code modulation

A-lawADPCM compression 5-5defined Glossary-1

applet, ctmix32.exe A-5audience for this document ixaudio cable

recommended hardware 2-3Y-adapter 2-4

audio file formatInterSoft IQTalk 3-3VoiceXML 3-3

Automatic Gain Control (ACG) A-5automatic labeling

and Sound Forge 4-2defined Glossary-1setting up 4-2

Bbackground noise 2-12bandwidth, defined Glossary-1bass boosting A-6Batch Converter 3-3, 4-6

defined Glossary-2downloading 2-6installing 5-2modifying sample batch script 5-8parametric equalization function A-6postprocessing steps

8 KHz resampling 5-4DC Offset Removal 5-4Front and Back silence buffering 5-4Front and Back silence trimming 5-4Message level normalization 5-4

registering 2-6removing batch scripts 4-18using 5-6using sample batch script 5-3

batch scriptsmodifying 5-3, 5-8removing 4-18

buffering, silence 5-4

Index-2 Sound Forge and the Intervoice System Audio File Guide 60001198, Second Edition

buttonsRecord 2-8, 4-2Reset 2-8Stop 4-8

Ccaution note xivchecklists

recording process 4-6studio set up 2-2

clickdefined Glossary-2reducing A-6

clippingdescribed 2-9fixing 2-9preventing 2-10

command prompt 6-4defined Glossary-2

connecting the sound card and speakers 2-4conventions used in this document xivconverting .wav file to.ivc file 5-4CRVoice2 utility 4-6

and .ivc files 2-7command 6-5defined Glossary-2install directory 6-4options

<-rename> (optional) 6-5<source file spec> 6-5<target directory> 6-5-h 6-5

renaming files 2-7starting from a command prompt 6-5starting from an icon 6-5using 6-4

CRVoice2.exe file 6-4, 6-5ctmix32.exe A-5customer support xv

DdB(decibel), defined Glossary-2dBm, defined Glossary-2DC offset

defined Glossary-2removal 2-13, 5-4statistics, viewing 2-13window 2-12

diagramconversion process 5-2creating audio recordings 1-3from .wav to .ivc 6-3

DMA (direct memory access), defined Glossary-2

document reference xivdocumentation

access xiformats xiIntervoice xiOEM for installation instructions 2-2

downloading from Web siteBatch Converter 2-6

Eequalization 5-5

defined Glossary-2example Batch Script 5-2

FFAQs

converting messages A-3defining DC offset A-2filtering messages A-6installing the Audiophile 2496 card A-4normalizing the message volume A-5producing 64-KB messages A-3recording steps A-2reducing pops and clicks A-6turning off Automatic Gain Control (AGC)

A-5using the Gain Leveling Preset A-4

figures, list of vfilename

defined Glossary-2formats 6-2


files.ivc 6-5.wav

defined Glossary-5CRVoice2.exe 6-4, 6-5Intervoice Example Batch Script 5.0 5-3

filtering messages A-6formats, standard InterSoft runtime 3-3

GGain Leveling Preset, described A-4general note xiv

Hhardware, recommended 1-4

Iinformation symbols used in this document xivinstalling

Batch Converter 2-6CRVoice2 2-7hardware 2-3

InterSoftfile naming conventions 6-1master format, differences between

Windows master format 3-3runtime format 3-2runtime messages, format 6-1

InterSoft system, defined Glossary-3InterSoft version 3, defined Glossary-3Intervoice

documentation xiExample Batch Script 5.0 file 5-3Example Batch Script file

contents 5-3Example Batch V2.sfz file 5-2master format 3-2standard master format 3-2technical support xvvoice file, naming conventions 6-3Web site 1-1, 5-3, 5-8, 6-5, A-1

Intervoice systemdefined Glossary-3

IQTalk, defined Glossary-3

Jjack-to-jack shield adapter, recommended

hardware 2-3

KKBps

defined Glossary-3kilobits per second Glossary-3

Llist of figures vlist of tables vii

Mmaster format

.wav files 7-2Windows 3-3

M-audio sound card, recommended 2-3message

defined Glossary-3file, final volume levels 2-8filtering A-6volume, normalizing A-5

microphone 2-2cable, recommended hardware 2-3commercial and preamplifiers 2-9connecting 2-2level on preamplifier 2-9, 2-12placement 2-9recommended hardware 2-3recommended type, supercardioid 2-9Sennheiser, more information 2-9setting levels 2-8

modifying the example batch script 5-8mu law, defined Glossary-1

Nnoise floor test 2-2, 2-12, 4-6

failure causes 2-13normalization 5-4

defined Glossary-4Normalize preset A-4

Index-4 Sound Forge and the Intervoice System Audio File Guide 60001198, Second Edition

note boxescaution xivdocument reference xivgeneral xivtip xivwarning xiv

OOmvia Media Server

defined Glossary-4optimization of messages A-6

Pparametric equalization function A-6playback 5-5plug-in chain 5-4pop

defined Glossary-4reducing A-6

postprocessing 5-4power cord, recommended hardware 2-3preamplifier 2-2

and commercial microphones 2-9line-level input

aux 2-2line 2-2

recommended hardware 2-3preset

defined Glossary-4Gain Leveling A-4Normalize A-4

prompt, command, defined Glossary-2purpose of this document ix

RRealCare xv

defined Glossary-4recommended hardware

AC-powered speakers with cables 2-3, 2-4audio cable 2-3jack-to-jack shield adapter 2-3, 2-4M-audio sound card 2-3microphone 2-3microphone cable 2-3power cord 2-3preamplifier 2-3

sound card 2-4Y-adapter audio cable 2-3

Record button 2-8, 4-2recording

attributes 2-8Intervoice standard settings 4-5setting 4-5

continuous methods 4-7, 4-8, 4-9Multiple takes (no Regions) 4-8, 4-9

message-by-message methods 4-7, 4-8, 4-12Create a new window for each take 4-8,

4-15Multiple takes creating Regions 4-8,

4-12messages in Sound Forge 4-9process checklist 4-6

resampling 5-4Reset button 2-8RMS power 2-13runtime, defined Glossary-4

Ssample batch script, using into Batch Converter

5-3setting recording attributes 4-5setting up automatic labeling 4-2signal-to-noise ratio 2-8silence

buffering 5-4trimming 5-4

software, recommended 1-5sound card 2-2, 5-4

audio input 2-2Automatic Gain Control (ACG) A-5connecting to speakers 2-4installing A-4volume controls disabled upon installation

A-4Sound Forge

automatic labeling function 4-2Batch Converter 5-2defined Glossary-4importance of registering 2-6level monitors 4-9, 4-12, 4-15recording messages 4-9recording mode 4-9


technical support A-1speakers 2-2

connecting to sound card 2-4Stop button 4-8studio setup 2-2summary of changes 1-2system utilities

CRVoice2 utility 6-4VSUTIL utility (VoiceSet utility)

Glossary-5

Ttables, list of viitechnical support xv

Sound Forge A-1tip xivtrimming

defined Glossary-4silence 5-4

Uupdating the Batch Converter 2-6using

Example Batch Script 5-3utilities

batch scripts A-4CRVoice2 6-4

defined Glossary-2VSUTIL (VoiceSet utility) Glossary-5

defined Glossary-5

Vviewing DC offset statistics 2-13voice

compression, defined Glossary-4file set, defined Glossary-4files

converting Dialogic A-3quality issues 7-2

message file numbering, defined Glossary-5

message, defined Glossary-5quality 7-2talent A-4

and individual master files 5-1and microphone 2-9

defined Glossary-5reading preference 4-8, 4-9

Voice Pool Set, defined Glossary-5VSUTIL utility (VoiceSet utility)

defined Glossary-5

Wwarning note xivWeb sites

Intervoice A-1Windows 2000, defined Glossary-6

YY-adapter audio cable 2-4

recommended hardware 2-3