tools and techniques for speech and language processing · tools and techniques for speech and...

21
Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1 of 16 Logistics Course Overview Tools and Techniques for Speech and Language Processing Lane Schwartz

Upload: others

Post on 07-Aug-2020

8 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

Tools and Techniques forSpeech and Language Processing

Lane Schwartz

University of Illinois at Urbana-Champaign

Week 1 of 16

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 2: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

Lecturers and office hours

Yan Tang, Assistant Professor ([email protected])

Room ???, Foreign Languages Building

Office hours: 4:00-5:00pm Tuesday and Thursday

Chase Adams, Teaching Assistant ([email protected])

Room 2039, Foreign Languages Building

Office hours: 10:00-11:00 AM Mondays;1:00-2:00 PM Thursdays

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 3: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

Course web site

http://computational.linguistics.illinois.edu/

lingNNN/SSSYYYY

Replace NNN with the course number for this course

Replace SSS with the current semester (fall or spring)

Replace YYYY with the current four-digit year

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 4: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

DRES

If a student has a disability or condition that requires specialconsideration, the student is expected to present the requisiteletter from the University Division of Disability Resources andEducational Services (DRES) no later than the beginning of thesecond day of class.

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 5: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

Academic integrity

This course follows the University of Illinois Student Coderegarding Academic Integrity. The College of Liberal Arts andSciences also has an excellent web page on the topic. You areexpected to read these resources prior to the second day of class,and to understand your responsibilities with regard to AcademicIntegrity. All work submitted for this class must be solely yourown. Violations of Academic Integrity include, but are not limitedto, copying, cheating, and unapproved collaboration.

Violations will not be tolerated.

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 6: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

i>clicker

We will use i>clicker in this class.

You must have a physical i>clicker.

You must bring your i>clicker to every class session.

You must always have working batteries.

Your i>clicker must be registered.

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 7: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

Piazza

Course Piazza site linked off course web page

Verify that you are enrolled in the course Piazza site

Ask questions via Piazza

Do not ask questions via email

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 8: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

Github

https://github.com

Github username

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 9: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

Other business

http://computational.linguistics.illinois.edu/

ling402/SSSYYYY/syllabus.html

Student responsibilities

Absences and late work policy

etc...

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 10: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

What to learn?

Basics of command line tools

Scripting and executing (shell script, Python script)

Versioning (using git)

Basics of Python programming, e.g. data type, flow control,object-oriented programming and debugging

Applications: text processing, speech signal processing

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 11: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

Course Overview

Gain proficiency with command line tools, including git

Gain proficiency in shell scripting

Gain understanding of basic data structures

Gain proficiency with Python

Learn how to think like a programmer

Learn how to go from a problem statement to a solution

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 12: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

Evaluation

Lecture attendance: 5%

Daily quizzes: 5%

Daily homework: 20%

Weekly homework: 50%

Exam(s): 20%

100%

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 13: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

Homework Assignments

Daily homework:

https://2019-fall-uiuc-ling402.github.io/hw_daily.html

Weekly homework:

will be announced each week in due course

https://2019-fall-uiuc-ling402.github.io/schedule.html

or the “Homework” tab atop of the course website

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 14: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

Textbooks and Tools

Textbooks:

The Linux Command Line: A Complete Introduction. WilliamE. Shotts, Jr.

Bash Reference Manual. Chet Ramey & Brian Fox

The Python Language Reference. Guido van Rossum, et al.

Natural Language Processing with Python. Steven Bird, EwanKlein, and Edward Loper

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 15: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

Textbooks and Tools

Prerequisite software:

Mac: Termial.app (comes within the system)

Linux/Unix: GNOME Terminal (comes within the system)

Windows: PuTTY, Cygwin (needs separate installation andsetup)

Python installation (for your own machine):

Anaconda: https://www.anaconda.com/

Download: https://www.anaconda.com/distribution/

Python version: 3.7 (recommended!)

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 16: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

Hands-on: create a Github account

Go to http://github.com

Complete the registration

Remember your username!

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 17: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

Hands-on: First touch on command line

COMMAND [OPTION]... [ARGUMENT]...

e.g. ls -a /bin/etc

Some commands:

pwd - print current working directory

echo - display a line of text

cat - concatenate files, or copy standard input to standardoutput

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 18: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

Hands-on: Text editing using vi

vi - open the Vi Improved (VIM) text editore.g. vi #launch vim editor and create a new file

vi myfile #open “myfile” in vim

Two modes: command mode and insert mode

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 19: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

Hands-on: Text editing using vi

Command mode:

Arrow keys - change the position of the cursor

x - delete the character after or under the cursor, or cutselected text upon selecting

v - initialise selection

y - copy the selected text upon selecting

p - paste the selected text

i - enter insert mode

:w - write/save changes to the file

:q - quit vim editor

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 20: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

Hands-on: Text editing using vi

Insert mode:

The “ESC” key - return to the command mode

You can only quit/close the editor from the command mode

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz

Page 21: Tools and Techniques for Speech and Language Processing · Tools and Techniques for Speech and Language Processing Lane Schwartz University of Illinois at Urbana-Champaign Week 1

Homework

https://2019-fall-uiuc-ling402.github.io/hw00.html

Tasks:

to practice login to the CL server using SSH

to create and edit a file called “.who am I” under your homedirectory using vi editor

to practice some basic Shell commands, e.g. cat, tail and cut

Logistics Course Overview

Tools and Techniques for Speech and Language Processing Lane Schwartz