![Page 1: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/1.jpg)
Electronic Books
● Lecture 6 Ing. Miloslav Nič Ph.D.letní semestr 2010-2011BI-XML© Miloslav Nič, 2011
Evropský sociální fond Praha & EU: Investujeme do vaší budoucnosti
![Page 2: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/2.jpg)
E-book
● Wikipedia:
An electronic book (also e-book, ebook, digital book) is a text and image-based publication in digital form produced on, published by, and readable on computers or other digital devices.
![Page 3: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/3.jpg)
E-book formats
● TXT● HTML collection● PDF● Kindle (based on Mobipocket)● EPUB● ... and many more basd on similar
principles
![Page 4: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/4.jpg)
EPUB x PDF
● http://www.adobe.com/content/dam/Adobe/en/
devnet/digitalpublishing/pdfs/EPUB_datasheet.pdf● PDF: a fixed page - the publisher in complete control over
page layout and presentation
● EPUB: text reflow according to screen size
![Page 5: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/5.jpg)
International Digital Publishing Forum (IDPF)
● http://idpf.org/● a global trade and standards organization● develops and maintains the EPUB content
publication standard
![Page 6: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/6.jpg)
EPUB
● a distribution and interchange format standard for digital publications and documents
● latest stable version EPUB 2.0.1● EPUB 2 initially standardized in 2007● EPUB 3 in the process of being
standardized (2011?)
![Page 7: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/7.jpg)
Google and EPUB
![Page 8: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/8.jpg)
Project Gutenberg
![Page 9: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/9.jpg)
ePub Readers
● see e.g. http://www.jedisaber.com/eBooks/Readers.asp
● Some examples:
– Bookworm
– Calibre
– FB Reader
– Mobipocket
– Stanza
– ....
![Page 10: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/10.jpg)
ePUB and Kindle
● not direct support at this moment● several converters available
![Page 11: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/11.jpg)
Bookworm.oreilly.com
![Page 12: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/12.jpg)
FBReader
● my favourite reader (both Linux and Android in my case; installers for other versions - e.g. Windows, Mac also exists)
● http://www.fbreader.org/
![Page 13: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/13.jpg)
EPUB Standards
● Open Publication Structure (OPS)– book content in XHTML or DTBook
● Open Packaging Format (OPF)– book structure and metadata
● Open Container Format (OCF)– book file structure and compression to a
single file
![Page 14: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/14.jpg)
Open Publication Structure (OPS)
● XML files● Namespaces:
– XHTML: ● http://www.w3.org/1999/xhtml
– DAISY:● http://www.daisy.org/z3986/2005/dtbook/
– OPS:● http://www.idpf.org/2007/ops
![Page 15: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/15.jpg)
XHTML
● XHTML 1.1; only some modules are included
● a selection of supported elements:– html, head, title, body
– abbr, acronym, address, blockquote, br, cite, code, dfn, div, em, h1, h2, h3, h4, h5, h6, kbd, p, pre, q, samp, span, strong, var, dl, dt, dd, ol, ul, li, sub, sup
– a, img, caption, col, colgroup, table, tbody, td, tfoot, th, thead, tr
![Page 16: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/16.jpg)
CSS
● a subset of CSS 2 supported● must be supplied with the book (not via
web)● E-Book readers are very variable (screen
size, graphic capabilities) – CSS styleshets very usefull
![Page 17: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/17.jpg)
Images
● @alt of <img> required● core media types support of which is
required:– image/gif
– image/jpeg
– image/png
– image/svg+xml
![Page 18: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/18.jpg)
DTBook (Digital Talking Book)
● an XML vocabulary defined in ANSI/NISO Z39.86-2005 Standard (http://www.niso.org/workrooms/daisy/Z39-86-2005.html)
● recommended for more advanced applications (e.g. educatonal books)
● supports footnotes, sidebars, annotations, page numbers, etc.
![Page 19: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/19.jpg)
DTBook features
● hierarchical navigation● sequential reading with choices (e.g. skip
footnotes)● specific reading methods for different
components (e.g. tables)● time synchronization via SMIL
![Page 20: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/20.jpg)
Navigation Control File (NCX)
● http://www.niso.org/workrooms/daisy/Z39-86-2005.html#NCX
● exposes the hierarchical structure of a book
![Page 21: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/21.jpg)
Open Packaging Format (OPF)
● describes and references all components of the electronic publication (e.g. markup files, images, navigation structures)
● provides publication-level metadata
● specifies the linear reading-order of the publication
● provides fallback information to use when unsupported extensions to OPS are employed
● provides a mechanism to specify a declarative global navigation structure (the NCX)
![Page 22: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/22.jpg)
OPF File Structure
● Package:– Metadata
– Manifest
– Spine
– Guide
![Page 23: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/23.jpg)
<package>
● root element of OPF package● Attributes:
– xmlns=”http://www.idpf.org/2007/opf”
– version = “2.0”
– unique-identifier = “an-unique-id”● primary book identifier selected from a
collection of Dublin core identifier elements in <metadata>
● if not world-wide unique it may cause problems in libraries and catalogues
![Page 24: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/24.jpg)
<metadata>
● a required child of <package>● its children either elements from Dublin
core namespace and/or <meta> elements with same syntax as XHTML
![Page 25: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/25.jpg)
<dc:elements>
● Dublin core: http://dublincore.org/documents/dces/
● Elements: contributor, coverage, creator, date, description, format, identifier, language, publisher, relation, rights, source, subject, title, type
● e.g.:– <dc:title>A book</dc:title>
– <dc:identifier>uhf-232-dsds</dc:identifier>
![Page 26: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/26.jpg)
<dc:identifier>
● at least one <identifier> with attribute @id must be present inside <metadata>
● the value of an @id attribute must be equal to the @unique-identifier of <package> element
– content of the <identifier> element with such @id is used to uniquely identify the book in libraries and catalogues
![Page 27: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/27.jpg)
<manifest>
● the next required child of <package>● provides a list of all the files that are part of
the publication (xhtml, css, images, …)● each file listed in a child <item>● each file must be given precisely once but
the order of files is not significant
![Page 28: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/28.jpg)
<item> child of <manifest>● Attributes, all required:
– @id
– @href● relative paths interpreted relative to the
location of OPF file containing the <manifest>
– @media-type
● Optional attribute:
– @fallback ● provides an @id of another item to be
used if this item @media-type is not supported
![Page 29: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/29.jpg)
<spine>
● the next required element● collects “main eBook pages”● contains one or more <itemref> elements
– <itemref idref='anID'>● anID is @id of a <manifest>/<item>
● @toc of <spine>– contains a value of @id of an <item>
which provides a content for eBook, usually in NCX format
![Page 30: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/30.jpg)
Open Container Format (OCF)
● a general-purpose container technology● collects a related set of files into a single-
file container● the required format for a file containing an
EPUB book● a ZIP archive
![Page 31: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/31.jpg)
OCF file structure
● File mimetype
● Directory META-INF with files:
– container.xml (required)
– manifest.xml
– metadata.xml
– signatures.xml
– encryption.xml
– rights.xml
● Directory OEBPS with EPUB files (which may be in subdirectories)
● Other directories, e.g. PDF for alternative book versions
![Page 32: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/32.jpg)
file: mimetype
● in the root of ZIP archive● it must be the first file in the archive● must contain text:
application/epub+zip
● make sure there are no whitespaces around this text
● simplifies automatic recognition of the archive
![Page 33: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/33.jpg)
container.xml
● in directory META-INF● format:
<container version="1.0" xmlns="urn:oasis:names:tc:opendocument:xmlns:container">
<rootfiles>
<rootfile full-path="OEBPS/an_OPF_file.opf"
media-type="application/oebps-package+xml" />
</rootfiles>
</container>
![Page 34: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/34.jpg)
EPUB 3.0
● http://idpf.org/epub/30/spec/epub30-overview.html
● 4 specifications:– EPUB Publications 3.0
– EPUB Content Documents 3.0
– EPUB Open Container Format (OCF) 3.0
– EPUB Media Overlays 3.0
● in draft stage
![Page 35: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/35.jpg)
Some changes from v.2
● http://idpf.org/epub/30/spec/epub30-changes.html
● HTML5 syntax (DTBook no longer an alternative syntax to XHTML)
● NCX superseded by EPUB Navigation Document (uses <nav> from HTML5)
● text-to-speech facilities● multimedia support (via HTML5 <audio>
and <video>)
![Page 36: Ing. Miloslav Nič Ph.D. letní semestr 2010-2011 BI-XML - Zvon.orgzvon.org/BI-XML/L6-ePub.pdf · 2014. 2. 20. · Open Packaging Format (OPF) describes and references all components](https://reader033.vdocument.in/reader033/viewer/2022052008/601ca2293ad6f440535980ac/html5/thumbnails/36.jpg)
EPUB Media Overlays 3.0
● defines a usage of SMIL● a simplified subset of SMIL 3.0 that allow
sequencing of clips● <par> + <seq>● @clipBegin, @clipEnd