prerequisites - university of texas at arlington · electronic commerce both business-to-business...

9
Web Data Management and XML L1: Introduction 1 Introduction © Leonidas Fegaras University of Texas at Arlington Web Data Management and XML L1: Introduction 2 Information Class: TuTh 5:30-6:50pm Instructor: Leonidas Fegaras Office: ERB 653 (Engineering Research Bldg) Phone: 817-272-3629 Email: [email protected] Office hours: Tuesday and Thursday 3:30-5:00pm GTA: Upa Gupta [email protected] Class Web: https://lambda.uta.edu/cse5335/ Visit the class web page often. It will contain reading assignments, project description, class notes, grades, etc. Web Data Management and XML L1: Introduction 3 Description XML has become an important standardization for data representation and information exchange among Internet co- operative applications. This course provides an in depth study of the area of web data management with an emphasis on XML standards and technologies. The course primarily covers the state of the art in designing and building web applications and services, primarily focusing on issues and challenges that revolve around the management and processing of XML data. Web Data Management and XML L1: Introduction 4 Prerequisites Prerequisite: CSE 3330/CSE 5330 (Database Systems I) or equivalent Students are expected to have a working knowledge of Java SQL basic HTML Students without adequate preparation are at substantial risk of failing this course

Upload: others

Post on 12-Aug-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Prerequisites - University of Texas at Arlington · Electronic Commerce Both business-to-business (B2B) and business-to-consumer (B2C) interactions Focus on selling and buying: Order

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n1

Intr

od

uct

ion

©L

eon

idas

Feg

aras

Un

iver

sity

of

Tex

asat

Arl

ing

ton

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n2

Info

rmat

ion

Cla

ss:

Tu

Th

5:3

0-6

:50

pm

Inst

ruct

or:

Leo

nid

asF

egar

as

Off

ice:

ER

B 6

53

(E

ng

inee

rin

g R

esea

rch

Bld

g)

Ph

on

e:8

17

-27

2-3

629

Em

ail:

feg

aras

@cs

e.u

ta.e

du

Off

ice

ho

urs

:T

ues

day

an

d T

hu

rsd

ay3

:30

-5:0

0p

m

GT

A:

Up

a G

up

tau

pa.

gu

pta

@m

avs.

uta

.ed

u

Cla

ss W

eb:

htt

ps:

//la

mb

da.

uta

.ed

u/c

se5

33

5/

Vis

it t

he

clas

s w

eb p

age

oft

en.

It w

ill

con

tain

rea

din

g a

ssig

nm

ents

, p

roje

ct d

escr

ipti

on

, cl

ass

no

tes,

gra

des

, et

c.

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n3

Des

crip

tio

n

XM

L h

as b

eco

me

an i

mp

ort

ant

stan

dar

diz

atio

n f

or

dat

are

pre

sen

tati

on

an

d i

nfo

rmat

ion

exch

ang

e am

on

gIn

tern

et c

o-

op

erat

ive

app

lica

tio

ns.

This

co

urs

e p

rov

ides

an

in

dep

th s

tud

y o

f th

e ar

ea o

f w

eb d

ata

man

agem

ent

wit

h a

n e

mp

has

is o

n X

ML

sta

nd

ard

s an

d

tech

no

log

ies.

The

cou

rse

pri

mar

ily

co

ver

s th

e st

ate

of

the

art

in d

esig

nin

g a

nd

b

uil

din

gw

eb a

pp

lica

tio

ns

and

ser

vic

es,

pri

mar

ily

fo

cusi

ng

on

is

sues

an

dch

alle

ng

es t

hat

rev

olv

e ar

ou

nd

th

e m

anag

emen

t an

d

pro

cess

ing

of

XM

L d

ata.

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n4

Pre

requ

isit

es

Pre

requ

isit

e: C

SE

33

30

/CS

E 5

33

0 (

Dat

abas

eS

yst

ems

I) o

r eq

uiv

alen

t

Stu

den

ts a

re e

xp

ecte

d t

oh

ave

a w

ork

ing

kn

ow

led

ge

of

Jav

a

SQ

L

bas

icH

TM

L

Stu

den

ts w

ith

ou

t ad

equ

ate

pre

par

atio

n a

re a

t su

bst

anti

al r

isk

of

fail

ing

this

co

urs

e

Page 2: Prerequisites - University of Texas at Arlington · Electronic Commerce Both business-to-business (B2B) and business-to-consumer (B2C) interactions Focus on selling and buying: Order

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n5

Gra

din

g

The

fin

al g

rad

e w

ill

be

bas

ed o

n

50%

10

sm

all

pro

gra

mm

ing

ass

ign

men

ts

20%

mid

term

exam

30%

fin

alex

am (

com

pre

hen

siv

e)

Fin

al g

rades

wil

l b

e as

sig

ned

acc

ord

ing

to

the

foll

ow

ing

sca

le:

A:

sco

re >

= 9

0,

B:

80

<=

sco

re <

90

, C

: sc

ore

< 8

0

So

met

imes

, I

use

lo

wer

cu

toff

po

ints

, d

epen

din

gon

th

eov

eral

lp

erfo

rman

ce o

f th

e cl

ass

Yo

ur

gra

des

wil

l b

e av

aila

ble

on

-lin

e o

n t

he

cou

rse

web

pag

e

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n6

Rea

din

g M

ater

ial

Ther

e is

no

req

uir

ed t

extb

oo

k b

ut

yo

u a

re e

xp

ecte

d t

o r

ead

man

yo

nli

ne

tuto

rial

s an

d r

efer

ence

s

Lin

ks

wil

lbe

giv

en o

ut

in c

lass

Man

y g

ood

on

lin

etu

tori

als,

eg

htt

p:/

/ww

w.w

3sc

ho

ols

.co

m/d

efau

lt.a

sp

Man

y b

oo

ks

on

web

pro

gra

mm

ing

an

d X

ML

sta

nd

ard

s

Pro

gra

mm

ing

th

eW

orl

d W

ide

Web

, by R

ob

ert

W.

Seb

esta

(7th

Ed

itio

n)

cov

ers

the

firs

t par

t of

the

cours

e (w

eb p

rogra

mm

ing)

See

the

syll

abu

s fo

r m

ore

reco

mm

end

ed b

ook

s

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n7

Ex

ams

Bo

th e

xam

s ar

e o

pen

note

s

all

no

tes

mu

st b

ese

cure

ly b

oun

d i

n o

ne

no

teb

oo

k

no b

oo

ks

are

allo

wed

The

fin

al e

xam

wil

lco

ver

the

mat

eria

l fr

om

th

e fi

rst

lect

ure

up

to

an

d i

ncl

ud

ing

th

e la

st l

ectu

re

On

ce t

he

exam

gra

des

are

po

sted

, y

ou

wil

l h

ave

10

bu

sin

ess

day

s to

dis

pu

te y

ou

rg

rad

e an

d g

et y

ou

r ex

am r

e-ev

alu

ated

no r

e-ev

aluat

ion

wil

lbe

ente

rtai

ned

aft

er t

he

10

day

per

iod

No

mak

eup

ex

ams

wil

l b

e g

iven

un

less

ther

e is

a j

ust

ifia

ble

reas

on

(such

as

illn

ess,

sic

kn

ess

or

dea

th i

n t

he

fam

ily

)

Ify

ou

mis

s an

ex

am a

nd

yo

uca

n p

rov

e th

at y

ou

r re

aso

n i

sju

stif

iab

le,

yo

u s

ho

uld

arr

ang

e w

ith

th

e in

stru

cto

r to

tak

e th

em

akeu

p e

xam

wit

hin

a w

eek

fro

m t

he

reg

ula

r ex

am t

ime.

Fo

r an

yo

ther

cas

e, y

ou

wil

l g

et a

zer

o g

rad

e fo

rth

e m

isse

dex

am.

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n8

Pro

gra

mm

ing

Ass

ign

men

ts

Ther

e w

ill

be

ten

sm

all

wee

kly

pro

gra

mm

ing

ass

ign

men

ts

Eac

h a

ssig

nm

ent

mu

st b

e d

on

e in

div

idu

ally

Det

ails

wil

l b

e g

iven

ou

t in

cla

ss

Lat

e p

roje

ctw

ill

be

mar

ked

20

poin

tso

ff p

er d

ay (

ou

t o

f 1

00

m

ax) So

, th

ere

is n

o p

oin

t su

bm

itti

ng

apro

ject

rep

ort

mo

re t

han

4 d

ays

late

!

Th

is p

enal

ty c

anno

t b

ew

aiv

ed,

un

less

th

ere

was

a c

ase

of

illn

ess

or

oth

ersu

bst

anti

al i

mp

edim

ent

bey

on

d y

our

contr

ol,

wit

h p

roo

fin

do

cum

ents

fro

m t

he

sch

oo

l

Page 3: Prerequisites - University of Texas at Arlington · Electronic Commerce Both business-to-business (B2B) and business-to-consumer (B2C) interactions Focus on selling and buying: Order

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n9

So

ftw

are

Most

pro

ject

s w

ill

be

do

ne

inJa

va

(usi

ng

JD

K 6

)

but

som

e w

ill

be

do

ne

in J

avaS

crip

t, P

HP

, an

dX

Qu

ery

yo

u a

reex

pec

ted t

o h

ave

a w

ork

ing

kn

ow

led

ge

of

Jav

a, S

QL

, an

d H

TM

L

The

soft

war

e u

sed

for

the

pro

ject

s is

op

en-s

ou

rce,

fre

e, p

latf

orm

-in

dep

end

ent,

and

wel

l-su

ited

fo

r Ja

va:

Jav

a/w

eb d

evel

op

men

t p

latf

orm

: E

clip

se J

ava

EE

ID

E f

or

Web

Dev

elop

ers

(ver

sio

n:

Ind

igo)

Yo

u c

an d

o m

ost

of

the

pro

ject

s o

n y

our

PC

/lap

top

un

der

an

yp

latf

orm

Lin

ux

, M

AC

OS

X,

MS

Win

do

ws,

etc

dir

ecti

on

s o

f h

ow

to d

ow

nlo

ad t

he

req

uir

edso

ftw

are

wil

lbe

giv

eno

ut

incl

ass

Alt

ho

ug

h w

e w

ill

bri

efly

tal

kab

ou

tit

, w

ew

ill

no

t u

se M

icro

soft

AS

P.N

ET

(V

isu

al S

tud

io,C

#,

etc)

, si

nce

th

is f

ram

ework

is

pla

tfo

rm-d

epen

den

t (f

or

IIS

only

)

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n1

0

Ch

eati

ng

All

wo

rk i

n t

his

clas

s m

ust

be

do

ne

ind

ivid

ual

ly.

No

cop

yin

g i

sp

erm

itte

d

Ch

eati

ng

inv

olv

es g

ivin

g a

ssis

tan

ce t

o o

r re

ceiv

ing

ass

ista

nce

fro

m o

ther

stu

den

ts o

rfr

om

oth

er i

ndiv

idu

als,

cop

yin

g m

ater

ial

fro

m t

he

web

, et

c

I st

rict

ly a

dh

ere

to t

he

Un

iver

sity

of

Tex

as a

t A

rlin

gto

n r

ule

s an

d

gu

idel

ines

fo

r h

and

lin

gv

iola

tio

ns

of

acad

emic

dis

ho

nes

ty.

Ple

ase

refe

r to

th

e p

amp

hle

t "C

HE

AT

ING

: D

efin

itio

ns

and

C

on

seq

uen

ces"

fo

r ad

dit

ion

al i

nfo

rmat

ion

Yo

u a

re r

equ

ired

to

sig

n a

nd

ret

urn

th

e st

atem

ent

abo

ut

acad

emic

dis

ho

nes

ty

Ifan

y o

ne

is c

aug

ht

for

chea

tin

g,o

r in

du

lge

in p

lag

iari

sm o

rco

llu

sio

n o

n a

pro

gra

mm

ing

ass

ign

men

t o

r on

a e

xam

, th

e g

rad

efo

r th

e en

tire

co

urs

e w

ill

be

an a

uto

mat

ic F

ail

gra

de

(F)

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n1

1

Mis

cell

aneo

us

Dis

tance

Edu

cati

on

Stu

den

ts:

Th

e re

qu

irem

ents

fo

rd

ista

nce

ed

uca

tio

n s

tud

ents

are

the

sam

eas

fo

rre

gu

lar

stu

den

ts w

ith

the

poss

ible

exce

pti

on

of

the

exam

s

Ifyo

u a

rea

dis

tan

ceed

. st

ud

ent

and

wo

rk w

ith

in o

ne

ho

ur

dri

vin

g d

ista

nce

fro

m U

TA

, th

enyo

u n

eed t

o c

om

ean

d t

ake

the

exam

s in

per

son

. O

ther

wis

e, y

ou

wil

lh

ave

to f

ind

an

ex

am p

roct

or

on s

ite

to s

uper

vis

eth

eex

ams.

Th

ep

roct

or

can

no

tbe

anyo

ne

equ

alor

bel

ow

yo

ur

pay

gra

de

atyo

ur

off

ice,

un

less

it

is s

om

eone

in H

R t

hat

sp

ecia

lize

s in

pro

cto

ring

exam

s. T

he

pro

cto

r co

uld

be

som

eon

e fr

om

a l

oca

lsc

ho

ol,

tes

tin

g c

ente

r,

etc.

Th

e p

roct

or

mu

stb

e ap

pro

ved

by t

he

inst

ruct

or

and

ap

roct

or

agre

emen

t m

ust

be

sig

ned

. T

he

exam

wil

l b

e d

eliv

ered

to

a p

roct

or

in t

he

mo

rnin

g o

f th

e ex

amday

.

Sp

ecia

l A

cco

mm

od

atio

ns:

Ifyo

u r

equ

ire

an a

cco

mm

od

atio

n b

ased

on

dis

abil

ity,

Iw

ou

ld l

ike

to m

eet

wit

hyo

u i

nth

ep

riv

acy o

fm

y o

ffic

e, d

uri

ng

the

firs

tw

eek o

f th

e se

mes

ter,

to

mak

esu

reyo

u a

reap

pro

pri

atel

y a

ccom

mo

dat

ed.

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n1

2

Ten

tati

ve

Sch

edu

le

Intr

od

uct

ion

an

d m

oti

vat

ion

Web

ap

pli

cati

on

dev

elo

pm

ent

Dynam

ic w

eb p

ages

HT

TP

GE

T/P

OS

T r

equ

ests

HT

ML

fo

rms

Cli

ent-

sid

epro

gra

mm

ing

(Jav

aScr

ipt)

XH

TM

L a

nd C

SS

style

shee

ts

Th

e d

ocu

men

t o

bje

ct m

od

el (

DO

M)

and

dyn

amic

HT

ML

Asy

nch

ron

ous

serv

er r

eques

ts (

AJA

X)

and

Xm

lHtt

pR

eques

t

Ser

ver

-sid

epro

gra

mm

ing

:P

HP

scr

ipts

Co

okie

s an

d s

essi

on

s

web

mas

hu

ps

Ser

vle

ts (

To

mca

t)

Jav

aS

erv

er P

ages

(JS

P)

Dat

abas

eco

nn

ecti

vit

y (

JDB

C)

Page 4: Prerequisites - University of Texas at Arlington · Electronic Commerce Both business-to-business (B2B) and business-to-consumer (B2C) interactions Focus on selling and buying: Order

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n1

3

Ten

tati

ve

Sch

edu

le (

con

t.)

Clo

ud

com

pu

tin

g

Dis

trib

ute

dfi

le s

yst

ems

(HD

FS

, C

assa

nd

ra)

Th

e M

ap-R

edu

ce f

ram

ewo

rk (

Had

oo

p,

Hiv

e, P

ig)

Am

azo

n W

eb s

erv

ices

an

d E

last

icC

om

pu

te C

loud

(EC

2)

XM

L s

tan

dar

ds

DT

D a

nd

XM

L S

chem

a

XP

ath

XM

L p

rog

ram

min

g (

DO

M, S

AX

, S

tAX

)

XS

LT

XQ

uer

y

Jav

a/X

ML

dat

a b

ind

ing

(JA

XB

)

XM

L d

ata

mo

del

ing

Nat

ive

XM

Lst

ora

ge

man

agem

ent

Ind

exin

g t

ech

niq

ues

Xin

dic

ean

d B

erk

eley

DB

XM

L

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n1

4

Ten

tati

ve

Sch

edu

le (

con

t.)

Rel

atio

nal

dat

abas

es a

nd

XM

L

XM

L s

hre

dd

ing

XM

L p

ub

lish

ing

XM

L o

n c

om

mer

cial

dat

abas

es(O

racl

eX

ML

DB

, S

QL

Ser

ver

SQ

LX

ML

)

XM

L d

ata

man

agem

ent

Qu

ery p

roce

ssin

g a

nd

op

tim

izat

ion

Up

dat

es a

nd V

iew

mai

nte

nan

ce

Inte

gri

ty c

onst

rain

ts

XM

L s

earc

h e

ng

ines

Info

rmat

ion r

etri

eval

Web

sear

ch e

ng

ines

XM

L r

ank

ing

Web

ser

vic

es

RE

ST

ful

vs

SO

AP

-bas

edw

eb s

erv

ices

Sta

nd

ard

s:S

OA

P,

WS

DL

, U

DD

I

Ax

is a

nd

JA

X-W

S

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n1

5

Tra

dit

ion

alD

B A

pp

lica

tio

ns

Ty

pic

ally

bu

sin

ess

ori

ente

d

Lar

ge

amo

un

t o

f d

ata

Dat

a is

wel

l-st

ruct

ure

d,

no

rmal

ized

, w

ith

pre

def

ined

sch

ema

Lar

ge

nu

mb

er o

f co

ncu

rren

t u

sers

(tr

ansa

ctio

ns)

Sim

ple

dat

a, s

imp

le q

uer

ies,

an

d s

imple

upd

ates

Ty

pic

ally

up

dat

e in

ten

siv

e

Sm

all

tran

sact

ion

s

Hig

h p

erfo

rman

ce,

hig

h a

vai

lab

ilit

y,

scal

abil

ity

Dat

a in

teg

rity

an

dse

curi

ty a

re o

f m

ajo

r im

po

rtan

ce

Go

od

adm

inis

trat

ive

sup

po

rt,

nic

e G

UIs

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n1

6

Do

cum

ent

Ap

pli

cati

ons

Hu

man

fri

end

ly:

wh

at-

you

-see

-is-

wha

t-yo

u-g

et p

arad

igm

Fo

cus

on

pre

sen

tati

on

Info

rmat

ion

is

div

ided

in

to m

ult

iple

sm

all

do

cum

ents

Most

ly s

tati

c

Imp

lici

t st

ruct

ure

: se

ctio

n,

subse

ctio

n,

par

agra

ph

, et

c

Met

a-d

ata:

tit

le,

auth

or,

dat

e, i

nd

exin

g k

eyw

ord

s, e

tc

Co

nte

nt

stru

ctu

re:

form

/lay

ou

t,in

ter-

rela

tio

nsh

ips,

refe

ren

ces

Tag

gin

g:

eg, <

p>

fo

r n

ew p

arag

rap

h

mar

ku

p l

ang

uag

es:

HT

ML

, X

ML

, ..

.

Op

erat

ion

s: r

etri

evin

g,

edit

ing

, sp

ell-

chec

kin

g,

pri

nti

ng

, et

c

Info

rmat

ion

Ret

riev

al:

sim

ple

key

wo

rd s

earc

h

mo

stsu

cces

sfu

lin

web

sea

rch

en

gin

es (

eg,

Goo

gle

)

Page 5: Prerequisites - University of Texas at Arlington · Electronic Commerce Both business-to-business (B2B) and business-to-consumer (B2C) interactions Focus on selling and buying: Order

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n1

7

Inte

rnet

Ap

pli

cati

on

s

Inte

rnet

ap

pli

cati

on

s

use

het

ero

gen

eou

s, c

om

ple

x,

hie

rarc

hic

al,fa

st-e

vo

lvin

g,

un

stru

ctu

red

/sem

istr

uct

ure

d d

ata

acce

ss m

ost

ly r

ead

-on

ly d

ata

req

uir

e lo

ng

tran

sact

ions

(bu

sin

ess

pro

cess

es)

nee

d 1

00

% a

vai

lab

ilit

y

man

age

mil

lion

s o

f u

sers

wo

rld

-wid

e

hav

e h

igh

-per

form

ance

req

uir

emen

ts

are

conce

rned

wit

hse

curi

ty (

encr

yp

tio

n)

are

conce

rned

wit

hp

rese

nta

tio

n/i

nte

ract

ion

on

web

bro

wse

r

lik

e to

cu

sto

miz

e d

ata

in a

per

son

aliz

ed m

ann

er

exp

ect

togai

nuse

r’s

tru

st f

or

bu

sin

ess-

to-c

on

sum

ertr

ansa

ctio

ns

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n1

8

Ele

ctro

nic

Co

mm

erce

Bo

thb

usi

nes

s-to

-bu

sin

ess

(B2

B)

and

bu

sin

ess-

to-c

on

sum

er

(B2

C)

inte

ract

ion

s

Fo

cus

on

sel

lin

g a

nd

bu

yin

g:

Ord

er m

anag

emen

t

Pro

du

ct c

atal

og

s

Pro

du

ct c

on

fig

ura

tio

n

Sal

es a

nd

mar

ket

ing

Ed

uca

tio

n a

nd

tra

inin

g

Web

ser

vic

es

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n1

9

Oth

er W

eb A

ppli

cati

on

s

Web

Ser

vic

esR

esou

rce-

ori

ente

d:

RE

ST

ful

Web

Ser

vic

es

SO

AP

-bas

ed:

SO

AP

, W

SD

L,

UD

DI

Web

in

teg

rati

on

Het

ero

gen

eous

dat

aso

urc

es a

nd

typ

es

Th

ou

san

ds

of

web

-acc

essi

ble

dat

a so

urc

es

Dynam

ic d

ata

Dat

a w

areh

ouse

s

Web

pu

bli

shin

gA

cces

s d

iffe

ren

t ty

pes

of

con

ten

t fr

om

bro

wse

rs(P

DF

, H

TM

L,

XM

L)

Str

uct

ure

d,

dyn

amic

, cu

sto

miz

ed/p

erso

nal

ized

con

ten

t

Inte

gra

tio

n w

ith a

pp

lica

tio

n

Acc

essi

ble

via

maj

or

gat

eway

s an

d s

earc

h e

ng

ines

Ap

pli

cati

on

in

teg

rati

on

Tra

nsf

orm

atio

n b

etw

een d

iffe

ren

t d

ata

form

ats

(eg,

XM

L,

HT

ML

)

Inte

gra

tio

n o

f m

ult

iple

app

lica

tio

ns

Web

mas

hup

s W

eb D

ata

Man

agem

ent

and

XM

LL

1:

Intr

od

uct

ion

20

Cu

rren

t In

tern

et A

pp

lica

tio

n A

rch

itec

ture

s

Arc

hit

ectu

re:

Ser

ver-

Tie

r: r

elat

ion

ald

atab

ases

an

d g

atew

ays

tod

iver

se d

ata

sou

rces

, su

ch a

s, f

iles

, O

LE

/DB

etc

. U

se o

f en

terp

rise

ser

ver

s

Mid

dle

-Tie

r: p

rov

ides

dat

a in

teg

rati

on

& d

istr

ibu

tio

n,

qu

ery

, et

c.C

on

sist

s o

f a

web

ser

ver

or

an a

pp

lica

tio

n s

erv

er

Cli

ent-

Tie

r: m

ost

ly a

web

bro

wse

r, m

ay r

un

scr

ipts

Ch

arac

teri

stic

s:

Cu

sto

miz

atio

n i

s ac

hie

ved

at

the

serv

er s

ite

(cu

sto

mer

dat

a in

ad

atab

ase)

wit

h s

om

edat

a at

th

e cl

ien

t si

te (

coo

kie

s)

Lo

ad b

alan

cin

g i

sty

pic

ally

har

dw

are

bas

ed (

mu

ltip

le s

erv

ers,

D

NS

rou

ters

)

Page 6: Prerequisites - University of Texas at Arlington · Electronic Commerce Both business-to-business (B2B) and business-to-consumer (B2C) interactions Focus on selling and buying: Order

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n2

1

HT

ML

<html>

<head><title>My Web Page</title></head>

<body>

<h1>Introduction</h1>

Lookat<ahref=”http://lambda.uta.edu/index.html”>this document</a>

<img src=”image.jpg”width=100 height=50>

</body>

</html>

It's

am

ark

up

lan

gu

age:

tex

t (c

on

ten

t) +

tag

s (c

on

tro

l m

ark

s)

Itis

ver

y s

imple

: h

um

an r

ead

able

, ca

n b

e ed

ited

by

an

y e

dit

or

Itre

flec

tsdo

cum

ent

pre

sen

tati

on

(la

yo

ut)

, n

ot

the

sem

anti

cs o

r st

ruct

ure

of

dat

a

Un

iver

sal:

po

rtab

le t

oan

y p

latf

orm

HT

ML

pag

es a

re c

on

nec

ted

th

rou

gh

hy

per

tex

t li

nk

s

HT

ML

pag

es c

an b

e lo

cate

d u

sing

web

sea

rch

eng

ines

Gre

at f

or

hu

man

-to

-hu

man

an

dh

um

an-t

o-m

ach

ine

inte

ract

ion

s

attr

ibu

te n

ame

attr

ibu

te v

alu

e

open

ing t

ag

clo

sin

g t

ag

hyper

tex

t li

nk

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n2

2

Is H

TM

L A

pp

ropri

ate

for

Web

Inte

ract

ion

s?

Fo

r m

ach

ine-

to-m

ach

ine

inte

ract

ion

s, y

ou

wan

t to

ex

chan

ge

dat

a

not

inte

rest

ed i

ndat

apre

sen

tati

on

Nee

d t

ob

e ab

le t

o e

xtr

act

dat

a fr

agm

ents

an

d c

on

stru

ct n

ew o

nes

dif

ficu

lt t

o d

o t

his

in H

TM

L(s

eeX

HT

ML

and

Aja

x)

Nee

d t

ob

e ab

le t

o u

pd

ate

and

tra

nsf

orm

HT

ML

Nee

d a

un

iver

sal

dat

a re

pre

sen

tati

on

th

at i

s:

Go

od

for

dat

a ex

chan

ge

amo

ng

web

ap

pli

cati

on

s

sent

thro

ugh t

he

Inte

rnet

wit

hout

tran

sform

atio

n (

no d

ata

mar

shal

ing)

Po

wer

ful

eno

ug

h t

oca

ptu

re c

om

ple

xw

eb d

ata

Su

itab

le f

or

sto

rag

e o

n a

dat

abas

ese

rver

Am

enab

le t

oqu

eryin

g a

nd u

pd

atin

g

Des

crib

ed b

ypo

wer

ful

sch

ema

lan

gu

ages

requir

ed f

or

val

idat

ion o

f w

eb s

ervic

es

Ad

op

ted

by i

ndu

stry

support

edby

stan

dar

ds

pla

tform

an

dven

do

rin

dep

enden

t

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n2

3

XM

L

XM

L (

eXte

nsi

ble

Mar

ku

p L

angu

age)

is

a te

xtu

al l

ang

uag

e fo

rre

pre

sen

tin

g a

nd

exch

ang

ing

dat

a o

n t

he

web

It i

s d

esig

ned

to

im

pro

ve

the

fun

ctio

nal

ity

of

the

Web

by

pro

vid

ing

mo

refl

exib

le a

nd

ad

apta

ble

in

form

atio

n

iden

tifi

cati

on

Bas

edon

SG

ML

XM

Lw

as d

evel

op

ed a

roun

d 1

99

6

It i

s ca

lled

ext

ensi

ble

bec

ause

it i

s n

ot

afi

xed

form

atli

ke

HT

ML

(a

sin

gle

, p

red

efin

ed m

ark

up

lan

gu

age)

it i

s ac

tual

lya

met

ala

ng

uag

e (a

lan

gu

age

for

des

crib

ing

oth

er l

ang

uag

es)

wh

ich l

ets

yo

u d

esig

n y

ou

row

n c

ust

om

ized

mar

ku

p l

ang

uag

es f

or

lim

itle

ss d

iffe

ren

t ty

pes

of

do

cum

ents

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n2

4

XM

L (

con

t.)

XM

L c

an b

e u

nty

ped

(sem

istr

uct

ure

d),

bu

t th

ere

are

stan

dar

ds

for

sch

ema

confo

rman

ce

DT

D

XM

L S

chem

a

Wit

ho

ut

sch

ema,

an

XM

L d

ocu

men

t is

wel

l-fo

rmed

if

it s

atis

fies

sim

ple

sy

nta

ctic

con

stra

ints

:

tag

s al

way

s co

me

in m

atch

ing

sta

rt/e

nd

tags

pro

per

nes

tin

g o

f st

art

and e

nd

tags

a si

ng

le r

oo

tel

emen

t

Wit

h a

sch

ema,

an

XM

L d

ocu

men

t is

va

lid

if

its

stru

ctu

reco

nfo

rms

to a

DT

D o

r an

XM

L S

chem

a

Page 7: Prerequisites - University of Texas at Arlington · Electronic Commerce Both business-to-business (B2B) and business-to-consumer (B2C) interactions Focus on selling and buying: Order

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n2

5

Ex

amp

le

<p

eop

le>

<p

erso

n>

<n

ame>

Leo

nid

as F

egar

as <

/nam

e>

<te

l> (

81

7)

27

2-3

62

9<

/tel

>

<em

ail>

feg

aras

@cs

e.u

ta.e

du

</e

mai

l>

</p

erso

n>

<p

erso

n>

<n

ame>

Ram

ez E

lmas

ri <

/nam

e>

<te

l> (

81

7)

27

2-2

34

8<

/tel

>

<em

ail>

elm

asri

@cs

e.u

ta.e

du

</e

mai

l>

</p

erso

n>

</p

eop

le>

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n2

6

Wh

y X

ML

is

soP

op

ula

r?

Itlo

ok

s li

ke

HT

ML

sim

ple

, h

um

an-r

ead

able

, m

ach

ine-

read

able

, ea

syto

lea

rn,

univ

ersa

l

Fle

xib

le &

ex

ten

sib

le,

sin

ce y

ou

can

rep

rese

nt

any

kin

d o

f d

ata

unli

ke

HT

ML

HT

ML

des

crib

espre

sen

tati

on

wh

ile

XM

L d

escr

ibes

con

ten

t

Pre

cise

wel

l-fo

rmed

:pro

per

ly n

este

d X

ML

tag

s

vali

d:

its

stru

ctu

re m

ayco

nfo

rm t

o a

DT

D o

ran

XM

L S

chem

a

Su

pp

ort

edby

th

e W

3C

tru

sted

an

d a

dop

ted

by i

nd

ust

ry

Man

y s

tan

dar

ds

aro

un

d X

ML

: sc

hem

as,

qu

ery

lan

gu

ages

,et

c

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n2

7

Wh

ere

do

th

e X

ML

dat

a co

me

from

?

Most

ly g

ener

ated

... b

ut

few

han

d-w

ritt

enX

ML

do

cum

ents

Web

-Ser

vic

es (

SO

AP

mes

sag

es,

WS

DL

des

crip

tio

ns)

XH

TM

L

dum

ps

from

rel

atio

nal

dat

abas

es (

dat

a p

ub

lish

ing

)

fro

m d

esk

top

app

lica

tio

ns

(MS

Off

ice

XM

L f

orm

at–

do

cx,

pptx

, ..

.)

con

figu

rati

on

file

s fo

r v

ario

us

apps

(eg

, fo

r th

e G

NO

ME

des

kto

p)

met

adat

a(e

g, M

PE

G-7

met

adat

a)

log

s, b

log

s, R

SS

, st

ock

fee

ds,

new

s fe

eds

...

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n2

8

What

XM

Lhas

to

do

wit

h D

atab

ases

?

XM

L i

san

im

po

rtan

t st

andar

diz

atio

n f

or

dat

a re

pre

sen

tati

on

an

d

exch

ang

e, b

ut

we

stil

l n

eed

to s

tore

and

qu

ery l

arg

e re

po

sito

ries

of

XM

L d

ata

dat

am

od

els

and

sch

ema

repre

sen

tati

on

s

quer

y l

ang

uag

es,

dat

a in

dex

ing

, q

uer

yop

tim

izer

s

upd

ates

, vie

w m

ain

ten

ance

con

curr

ency

, d

istr

ibu

tion

, se

curi

ty, et

c

Nee

d b

oth

hea

vy-d

uty

dat

abas

es a

t th

ese

rver

-sid

efo

r st

ori

ng d

ata,

an

d

the

XM

L f

orm

atfo

r ex

chan

gin

g d

ata

bet

wee

n a

pp

lica

tio

ns

Page 8: Prerequisites - University of Texas at Arlington · Electronic Commerce Both business-to-business (B2B) and business-to-consumer (B2C) interactions Focus on selling and buying: Order

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n2

9

XM

LS

yn

tax

XM

Lco

nsi

sts

of

tag

s an

d t

ext

Tex

t is

bo

un

ded

by

tag

s. C

DA

TA

:ch

ara

cter

da

ta.

<ti

tle>

Th

e B

ig S

leep<

/tit

le>

<year>

19

35

</y

ear>

A C

DA

TA

sect

ion

is

tex

tw

itho

ut

mar

ku

p (

tag

s)

yo

u c

an't

use

<or&

but

yo

u m

ay u

se&

lt;

or&

am

p;

May

wra

pth

ete

xt

aro

un

d

<![

CD

ATA

[an

d

]]>

<![

CD

ATA

[<sen

der>

John

Sm

ith

</s

ender>

]]>

sam

e as

:&

lt;s

end

er&

gt;

Joh

nSm

ith&

lt;/

sen

der&

gt;

Co

mm

ents

:

<!-

- decla

rati

on

s f

or

<head>

& <

bod

y>

-->

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n3

0

XM

L S

yn

tax

(co

nt.

)

Sp

ecia

l ch

arac

ters

in C

DA

TA

:

&#x

xx

;w

her

e x

xx i

s th

eA

SC

IIoct

aln

um

ber

of

the

char

acte

r

eg,

&#

16

0;

fo

rsp

ace

Yo

u m

ay d

efin

e sp

ecia

l ch

arac

ters

as

enti

ties

(in

DT

D):

<!E

NTIT

Y n

bsp "

&#

160;"

>

Use

in

CD

AT

A:

&nb

sp;

So

me

pre

def

ined

en

titi

es:

&am

p;

&lt

;&

gt;

&ap

os;

&quot;

Pro

cess

ing

inst

ruct

ion

s:

<?applicati

on-n

am

e p

ara

m=

“valu

e”

...

?>

PC

DA

TA

(P

arse

d C

har

acte

r D

ata)

: ch

arac

ters

to b

ep

arse

d b

yth

eX

ML

par

ser

con

tain

s bo

thC

DA

TA

an

d m

arku

p t

hat

con

stit

ute

val

id X

ML

syn

tax

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n3

1

XM

L S

yn

tax

(co

nt.

)

Tag

s co

me

inm

atch

ing

pai

rs:

<date

>8

/25

/20

04

</d

ate

>

A t

agn

ame

mu

st s

tart

wit

h l

ette

r o

r u

nd

ersc

ore

an

d c

an c

on

tain

o

nly

lett

ers,

nu

mber

s, h

yp

hen

s, p

erio

ds,

an

d u

nd

ersc

ore

s

Nam

e::

=N

ameS

tart

Char

(Nam

eChar

)*

Nam

eSta

rtC

har

::=

":"

| [A

-Z]

| "_

"|[

a-z]

Nam

eChar

::=

Nam

eSta

rtC

har

| "-"

| "."

|[0

-9]

Fo

r ea

ch o

pen

ing

tag

th

ere

mu

st b

e a

mat

chin

g c

losi

ng

tag

Tag

s m

ust

be

pro

per

lyn

este

d:

val

id n

esti

ng:

<pers

on>

<n

am

e>

...<

/nam

e>

...<

/pers

on>

inv

alid

nes

tin

g:<

pers

on

><

nam

e>

...<

/pers

on>

...<

/nam

e>

An

XM

L d

oc

mu

st h

ave

exac

tly

on

eel

emen

t, c

alle

d t

he

roo

t

XM

L d

ecla

rati

on

:

op

tion

al,

at t

he

beg

innin

g o

f X

ML

do

c

<?xm

l vers

ion=

"1.0

" encodin

g=

"UTF-8

"?>

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n3

2

XM

LE

lem

ents

An

ele

men

t is

a s

egm

ent

of

an X

ML

do

cum

ent

bet

wee

n a

n

op

enin

g a

nd

th

e m

atch

ing

clo

sin

g t

ags

<p

erso

n>

<n

ame>

Ram

ezE

lmas

ri <

/nam

e>

<te

l> (

81

7)

27

2-2

348

</t

el>

<em

ail>

elm

asri

@cs

e.u

ta.e

du

</e

mai

l>

</p

erso

n>

An

ele

men

t m

ay c

on

tain

a m

ixtu

reof

sub

-ele

men

ts a

nd

CD

AT

A

<ti

tle>

An

<em

>el

emen

t</e

m>

is

ase

gm

ent<

/tit

le>

An

ab

bre

via

tio

n:

for

an e

lem

ent

wit

h e

mp

ty c

onte

nt,

we

can

use

:

<ta

gn

ame/

>

inst

ead

of:

<ta

gn

ame>

</t

agn

ame>

Page 9: Prerequisites - University of Texas at Arlington · Electronic Commerce Both business-to-business (B2B) and business-to-consumer (B2C) interactions Focus on selling and buying: Order

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n3

3

Rep

rese

nti

ng

Dat

a U

sin

g X

ML

●N

esti

ng

tag

s ca

nbe

use

d t

o e

xp

ress

var

iou

s st

ruct

ure

s, s

uch

as a

reco

rd:

<p

erso

n> <

nam

e> R

amez

Elm

asri

</n

ame>

<te

l> (

81

7)

27

2-2

348

</t

el>

<em

ail>

elm

asri

@cs

e.u

ta.e

du

</e

mai

l>

</p

erso

n>

We

can

rep

rese

nt

a li

stb

y u

sin

g t

he

sam

e ta

g r

epea

ted

ly:

<ad

dre

sses

>

<p

erso

n>

...

</p

erso

n>

<p

erso

n>

...

</p

erso

n>

<p

erso

n>

...

</p

erso

n>

...

</a

ddre

sses

>

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n3

4

XM

L s

tru

ctu

re

XM

L:

<p

erso

n>

<n

ame>

Ram

ez E

lmas

ri <

/nam

e>

<te

l>(8

17

) 2

72

-23

48

</t

el>

<em

ail>

elm

asri

@cs

e.u

ta.e

du

</e

mai

l>

</p

erso

n>

in L

isp

:

(per

son

(n

ame

“R

amez

Elm

asri

”)

(tel

“(8

17)

272-2

348

”)

(em

ail

“el

mas

ri@

cse.

uta

.edu

”))

as

a t

ree

data

str

uct

ure

:

per

son

nam

ete

lem

ail

Ram

ez E

lmas

ri(8

17

) 2

72

-23

48

elm

asri

@cs

e.u

ta.e

du

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n3

5

Att

ribu

tes

An

op

enin

g t

ag m

ay c

on

tain

att

rib

ute

s

typ

ical

ly u

sed

to

des

crib

e th

e co

nte

nt

of

an e

lem

ent

<auth

or

ssn=

"2787901">

<nam

e>

Ram

ez E

lmasri

</n

am

e>

<em

ail>

elm

asri

@cse.u

ta.e

du <

/em

ail>

</a

uth

or>

Yo

u m

ay h

ave

mu

ltip

le a

ttri

bu

tes

in a

n o

pen

ing

tag

bu

t ea

ch a

ttri

bu

te n

ame

mu

stb

ed

iffe

ren

t

It's

no

t al

way

scl

ear

wh

en t

o u

se a

ttri

bu

tes

<au

tho

r>

<ssn

>2

787

90

1<

/ssn>

<n

am

e>

Ra

mez E

lmasri

</n

am

e>

<e

mail>

elm

asri

@cse

.uta

.ed

u<

/em

ail>

</a

uth

or>

ID a

ttri

bu

tes

are

spec

ial:

mu

st b

e u

niq

ue

wit

hin

the

do

cum

ent

An

ID

ref

attr

ibu

te m

ust

ref

er t

o a

n e

xis

tin

g I

D i

n t

he

sam

e d

oc

Web

Dat

a M

anag

emen

t an

d X

ML

L1

:In

tro

du

ctio

n3

6

Ref

eren

cin

g E

lem

ents

Usi

ng I

Ds/

IDre

fs

<fa

mil

y>

<p

erso

n id

="j

ane"

moth

er=

"mar

y"

fath

er=

"jo

hn">

<n

ame>

Jan

e D

oe

</n

ame>

</p

erso

n>

<p

erso

n i

d=

"jo

hn

" ch

ild

ren

="j

ane

jack

">

<n

ame>

Jo

hn

Do

e <

/nam

e> <

mo

ther

/>

</p

erso

n>

<p

erso

n i

d=

"mar

y"

chil

dre

n=

"jan

e j

ack

">

<n

ame>

Mar

yD

oe

</n

ame>

</p

erso

n>

<p

erso

n

id=

"jac

k"

mo

ther

="m

ary"

fath

er=

"jo

hn

">

<n

ame>

Jac

k D

oe

</n

ame>

</p

erso

n>

</f

amil

y>