338 seminar4 keithlawrenz

Upload: society-for-scholarly-publishing

Post on 24-Jun-2015

TRANSCRIPT

Page 1: 338 seminar4 keithlawrenz

Los Angeles | London | New Delhi | Singapore | Washington DC

One approach to an online digital content repository

Automating Workflow from Acceptance to Publication

Page 2: 338 seminar4 keithlawrenz

Automating Workflows from Acceptance to Publication, SSP, 2010
Los Angeles | London | New Delhi | Singapore | Washington DC

SAGE Publications

● Books, journals and reference publishing programs for higher education
  • 560+ journals

● Publishing offices in
  • Los Angeles, CA
  • London, England
  • New Delhi, India
  • Washington DC

Page 3: 338 seminar4 keithlawrenz

My background

● Keith Lawrenz
  • Senior Business Analyst, Publishing Technologies
  • 4 years with SAGE

● Specialties
  • Business process engineering
  • Content modeling
  • XML, XQuery, XSLT, XProc, RELAX NG schema…

Page 4: 338 seminar4 keithlawrenz

Overview

● Why SAGE invested in a repository
● The SAGE repository workflow and how we implemented the phase one repository with RSuite
● Results and lessons learned

Page 5: 338 seminar4 keithlawrenz

The business opportunity

● Secure SAGE online digital assets
● Reliably deliver online content
● Provide a platform for content analytics
● Enable online product development

Page 6: 338 seminar4 keithlawrenz

The business landscape

● HighWire upgrade to H2O
● Migrating from proprietary DTD to NLM schema
● XML first workflow from back end XML conversion
● Online reference and book products that require archive support

Page 7: 338 seminar4 keithlawrenz

SAGE Online Content Repository

Page 8: 338 seminar4 keithlawrenz

Attributes in a repository solution

● Flexibility
  • Analyst implemented business rules
● Scale
  • To support SAGE journal content
  • To support all SAGE content ongoing
● Access
  • Deployed worldwide

Page 9: 338 seminar4 keithlawrenz

Implementation phases

1. Current journal content

2. Journal backfile

3. Encyclopedias and handbooks

4. Electronic books

Page 10: 338 seminar4 keithlawrenz

SAGE Journals Workflow

● Manuscript Management
● Production Management
● Online Content Repository

Page 11: 338 seminar4 keithlawrenz

The scale

● 560+ journals
● 770,000 articles since 1894
● 40,000 new articles / year
  • 80% PDF with header and references XML
  • 20% Full text

● ~70,000 unique issue deliveries / year to 50+ ftp targets

● 2 full-time headcount U.S. & UK offices
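
As a rough back-of-the-envelope reading of these figures, the sketch below derives the per-format article counts and an average daily delivery rate. The even spread over a 365-day year and all variable names are assumptions made for illustration; they do not come from the slides, and the delivery figure is itself approximate.

```python
# Back-of-the-envelope arithmetic on the scale figures quoted on this slide.
# Assumption (not from the slides): deliveries are spread evenly over 365 days.

NEW_ARTICLES_PER_YEAR = 40_000
PDF_SHARE_PERCENT = 80          # PDF with header and references XML
FULL_TEXT_SHARE_PERCENT = 20    # full-text XML
ISSUE_DELIVERIES_PER_YEAR = 70_000  # quoted as "~70,000" on the slide
DAYS_PER_YEAR = 365

pdf_articles = NEW_ARTICLES_PER_YEAR * PDF_SHARE_PERCENT // 100
full_text_articles = NEW_ARTICLES_PER_YEAR * FULL_TEXT_SHARE_PERCENT // 100
deliveries_per_day = ISSUE_DELIVERIES_PER_YEAR / DAYS_PER_YEAR

print(pdf_articles)               # 32000
print(full_text_articles)         # 8000
print(round(deliveries_per_day))  # 192
```

At roughly 190 issue deliveries a day handled by two full-time staff, manual delivery would plausibly be impractical, which is consistent with the emphasis on automation in the talk.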

Page 12: 338 seminar4 keithlawrenz

Overview of a journal issue

Unit of Content is a Journal Issue

[Swimlane workflow diagram]

Lanes:
● XML Conversion Vendor (Jouve)
● Online Content Editor
● Content Recipients
● SOCR
● Journal Production

Steps and connectors shown:
● Start
● Zip Final Print-Ready PDFs
● FTP (UK) or NFS (US)
● Ingest Unencoded Issue
● Store in Repository
● Deliver Unencoded Issue
● FTP
● Create SAGEMeta XML
● Normalize PDF Files
● Zip Issue
● FTP
● Ingest Encoded Issue
● Store in Repository
● Deliver HW Issue
● FTP – HighWire Express
● Process Issue for Hosting
● Online Preview
● Quality Check Issue
● Edit Articles
● Approve Issue
● Issue Online
● Deliver Full Issue
● Deliver PubMed Abstract XML
● Deliver XML Issue
● FTP and/or NFS sites
● End

Decision points (each with a "Yes" branch):
● Changes?
● OK to Host?
● Issue Online?

Page 13: 338 seminar4 keithlawrenz

Overview of a journal issue

[Workflow diagram repeated from Page 12]

Ingest print-ready article PDF

Page 14: 338 seminar4 keithlawrenz

Overview of a journal issue

[Workflow diagram repeated from Page 12]

Deliver to encoding vendor

Page 15: 338 seminar4 keithlawrenz

Overview of a journal issue

[Workflow diagram repeated from Page 12]

Ingest xml-encoded issue

Page 16: 338 seminar4 keithlawrenz

Overview of a journal issue

[Workflow diagram repeated from Page 12]

Deliver to hosting platform

Page 17: 338 seminar4 keithlawrenz

Overview of a journal issue

[Workflow diagram repeated from Page 12]

Support editorial approval process

Page 18: 338 seminar4 keithlawrenz

Overview of a journal issue

[Workflow diagram repeated from Page 12]

Track go-live on hosting platform

Page 19: 338 seminar4 keithlawrenz

Overview of a journal issue

[Workflow diagram repeated from Page 12]

Deliver to additional targets
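
The seven steps called out on Pages 13 to 19 can be read as a linear pipeline with one editorial loop. The sketch below is only an illustration of that reading: the step names are taken from the slides, but the class, the `approved` flag, and the history list are hypothetical and say nothing about how RSuite or the SAGE repository actually implements the workflow.

```python
from dataclasses import dataclass, field

# Step names are the callouts from the walkthrough slides (Pages 13-19).
# Everything else here (class name, approval flag, history) is hypothetical.
STEPS = [
    "Ingest print-ready article PDF",
    "Deliver to encoding vendor",
    "Ingest xml-encoded issue",
    "Deliver to hosting platform",
    "Support editorial approval process",
    "Track go-live on hosting platform",
    "Deliver to additional targets",
]
APPROVAL_STEP = "Support editorial approval process"


@dataclass
class IssueWorkflow:
    """Tracks one journal issue (the unit of content) through the steps."""
    issue_id: str
    position: int = 0
    history: list = field(default_factory=list)

    @property
    def current_step(self):
        # None once every step has been completed.
        return STEPS[self.position] if self.position < len(STEPS) else None

    def advance(self, approved=True):
        step = self.current_step
        if step is None:
            raise RuntimeError("workflow already finished")
        if step == APPROVAL_STEP and not approved:
            # Changes requested: record it and stay on the approval step.
            self.history.append(f"{step} (changes requested)")
            return step
        self.history.append(step)
        self.position += 1
        return step


wf = IssueWorkflow("example-issue")
for _ in range(4):
    wf.advance()                 # run the first four steps
wf.advance(approved=False)       # editor asks for changes; issue stays put
while wf.current_step is not None:
    wf.advance()                 # approve, then finish the remaining steps

for entry in wf.history:
    print(entry)
```

Keeping the approval loop as the only non-linear step mirrors the "Changes?" decision in the diagram; a real implementation would also need to model the "OK to Host?" and "Issue Online?" checks.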

Page 28: 338 seminar4 keithlawrenz

The results

● 100,000+ issue deliveries
● 99.5% plus uptime
● Aggressive expansion plans
  • NLM XML-first workflows
  • Ingesting back content ~770,000 journal articles
  • ~200 encyclopedias and handbooks
  • SAGE Research Methods Online ~600 books

Page 29: 338 seminar4 keithlawrenz

Important features

● Rich & accessible workflow
  • Human readable error messaging
  • SAGE configurable
● Supports key XML technologies
● CMS functions
● Web-delivered application
● MarkLogic inside
● Active Directory integration

Page 30: 338 seminar4 keithlawrenz

Lessons learned

● Understand and document your workflows
● Use proven software
● Start small
● Iterate
● Technologists must partner with users

Page 31: 338 seminar4 keithlawrenz

Keith Lawrenz

Senior Business Analyst, Publishing Technologies

[email protected]

RSuite – Booth #218