BBN–ANG–183 Typography
Lecture 5A: Breaking text
Zoltan Kiss & Peter Szigetvari
Dept of English Linguistics, Eotvos Lorand University
kz & szp (delg) typo/breaking (5A) 1 / 23
outline
probelms with WYSIWYGchanging appearancesWYSIWYG is dangerousthe importance of document structure
breaking textinto linesinto pagesfootnotes
conclusions
sample exam questions
kz & szp (delg) typo/breaking (5A) 2 / 23
breaking text into lines and pages is the central issue in typesetting(cf. Hungarian tordeles, tordelo) — today, we’re going to see
kz & szp (delg) typo/breaking (5A) 3 / 23
breaking text into lines and pages is the central issue in typesetting(cf. Hungarian tordeles, tordelo) — today, we’re going to see
◮ why this is so
kz & szp (delg) typo/breaking (5A) 3 / 23
breaking text into lines and pages is the central issue in typesetting(cf. Hungarian tordeles, tordelo) — today, we’re going to see
◮ why this is so
◮ how to do it smartly
kz & szp (delg) typo/breaking (5A) 3 / 23
probelms with WYSIWYG changing appearances
why does my document’s appearance change all the time?
a problem with size
kz & szp (delg) typo/breaking (5A) 4 / 23
probelms with WYSIWYG changing appearances
why does my document’s appearance change all the time?
a problem with size
◮ word processors like MS Word and OpenOffice Writer specify marginsize (not the width of the line)
kz & szp (delg) typo/breaking (5A) 4 / 23
probelms with WYSIWYG changing appearances
why does my document’s appearance change all the time?
a problem with size
◮ word processors like MS Word and OpenOffice Writer specify marginsize (not the width of the line)
◮ the size of the printer’s paper is often set for the whole system, notfor the word processor, or the document
kz & szp (delg) typo/breaking (5A) 4 / 23
probelms with WYSIWYG changing appearances
why does my document’s appearance change all the time?
a problem with size
◮ word processors like MS Word and OpenOffice Writer specify marginsize (not the width of the line)
◮ the size of the printer’s paper is often set for the whole system, notfor the word processor, or the document
◮ if the paper size changes (e.g., A4 vs. US Letter) the width of the linealso changes
kz & szp (delg) typo/breaking (5A) 4 / 23
probelms with WYSIWYG changing appearances
why does my document’s appearance change all the time?
a problem with size
◮ word processors like MS Word and OpenOffice Writer specify marginsize (not the width of the line)
◮ the size of the printer’s paper is often set for the whole system, notfor the word processor, or the document
◮ if the paper size changes (e.g., A4 vs. US Letter) the width of the linealso changes
the solution
kz & szp (delg) typo/breaking (5A) 4 / 23
probelms with WYSIWYG changing appearances
why does my document’s appearance change all the time?
a problem with size
◮ word processors like MS Word and OpenOffice Writer specify marginsize (not the width of the line)
◮ the size of the printer’s paper is often set for the whole system, notfor the word processor, or the document
◮ if the paper size changes (e.g., A4 vs. US Letter) the width of the linealso changes
the solution
◮ the size of the textblock (width and height of the area where text willbe set) should be specified
kz & szp (delg) typo/breaking (5A) 4 / 23
probelms with WYSIWYG changing appearances
why does my document’s appearance change all the time?
a problem with size
◮ word processors like MS Word and OpenOffice Writer specify marginsize (not the width of the line)
◮ the size of the printer’s paper is often set for the whole system, notfor the word processor, or the document
◮ if the paper size changes (e.g., A4 vs. US Letter) the width of the linealso changes
the solution
◮ the size of the textblock (width and height of the area where text willbe set) should be specified
◮ but this is less trivial, since one must calculate the necessary marginsizes from the given paper size beforehand
kz & szp (delg) typo/breaking (5A) 4 / 23
probelms with WYSIWYG changing appearances
why does my document’s appearance change all the time?
a problem with other text properties
kz & szp (delg) typo/breaking (5A) 5 / 23
probelms with WYSIWYG changing appearances
why does my document’s appearance change all the time?
a problem with other text properties
◮ by default, word processors store documents in a format where the contentand the form is unnecessarily merged:
kz & szp (delg) typo/breaking (5A) 5 / 23
probelms with WYSIWYG changing appearances
why does my document’s appearance change all the time?
a problem with other text properties
◮ by default, word processors store documents in a format where the contentand the form is unnecessarily merged:
◮ e.g., font face, size is part of the document, even when this is not necessary
kz & szp (delg) typo/breaking (5A) 5 / 23
probelms with WYSIWYG changing appearances
why does my document’s appearance change all the time?
a problem with other text properties
◮ by default, word processors store documents in a format where the contentand the form is unnecessarily merged:
◮ e.g., font face, size is part of the document, even when this is not necessary
a solution
do not use .doc/.odt when plain text (.txt) will do
kz & szp (delg) typo/breaking (5A) 5 / 23
probelms with WYSIWYG changing appearances
why does my document’s appearance change all the time?
a problem with other text properties
◮ by default, word processors store documents in a format where the contentand the form is unnecessarily merged:
◮ e.g., font face, size is part of the document, even when this is not necessary
a solution
do not use .doc/.odt when plain text (.txt) will do
a problem with this solution
some necessary properties of the text (e.g., italicization, special characters) willalso be lost in plain text
kz & szp (delg) typo/breaking (5A) 5 / 23
probelms with WYSIWYG changing appearances
why does my document’s appearance change all the time?
a problem with other text properties
◮ by default, word processors store documents in a format where the contentand the form is unnecessarily merged:
◮ e.g., font face, size is part of the document, even when this is not necessary
a solution
do not use .doc/.odt when plain text (.txt) will do
a problem with this solution
some necessary properties of the text (e.g., italicization, special characters) willalso be lost in plain text
the ultimate solution
do not use word processors (i.e., WYSIWYG technology)
kz & szp (delg) typo/breaking (5A) 5 / 23
probelms with WYSIWYG changing appearances
why does my document’s appearance change all the time?
a problem with other text properties
◮ by default, word processors store documents in a format where the contentand the form is unnecessarily merged:
◮ e.g., font face, size is part of the document, even when this is not necessary
a solution
do not use .doc/.odt when plain text (.txt) will do
a problem with this solution
some necessary properties of the text (e.g., italicization, special characters) willalso be lost in plain text
the ultimate solution
do not use word processors (i.e., WYSIWYG technology): use mark-up languages
kz & szp (delg) typo/breaking (5A) 5 / 23
probelms with WYSIWYG WYSIWYG is dangerous
the perils of WYSIWYG
WYSIWYG word processing is dangerous for two reasons
kz & szp (delg) typo/breaking (5A) 6 / 23
probelms with WYSIWYG WYSIWYG is dangerous
the perils of WYSIWYG
WYSIWYG word processing is dangerous for two reasons
1. the structure of the document will not be specified
a section title will not be specified as a section title, it will simply beset in a larger/emboldened/whatever font, with extra space aboveand below
kz & szp (delg) typo/breaking (5A) 6 / 23
probelms with WYSIWYG WYSIWYG is dangerous
the perils of WYSIWYG
WYSIWYG word processing is dangerous for two reasons
1. the structure of the document will not be specified
a section title will not be specified as a section title, it will simply beset in a larger/emboldened/whatever font, with extra space aboveand below
2. line and page breaks will be chosen during writing the text
as a consequence, any subsequent change (or simply reading the fileby another word processor) may result in undesired line and pagebreaks
kz & szp (delg) typo/breaking (5A) 6 / 23
probelms with WYSIWYG WYSIWYG is dangerous
the perils of WYSIWYG
WYSIWYG word processing is dangerous for two reasons
1. the structure of the document will not be specified
a section title will not be specified as a section title, it will simply beset in a larger/emboldened/whatever font, with extra space aboveand below
2. line and page breaks will be chosen during writing the text
as a consequence, any subsequent change (or simply reading the fileby another word processor) may result in undesired line and pagebreaks
both dangers can be avoided, but generally users of word processors fallinto these traps: WYSIWYG technology notoriously guides users into them
kz & szp (delg) typo/breaking (5A) 6 / 23
probelms with WYSIWYG the importance of document structure
the importance of document structure
why is it a problem if the structure of the document is not specified?
the main points of producing electronic documents are
kz & szp (delg) typo/breaking (5A) 7 / 23
probelms with WYSIWYG the importance of document structure
the importance of document structure
why is it a problem if the structure of the document is not specified?
the main points of producing electronic documents are
◮ reusability: (parts of) a document should be used on severaloccasions, it is rarely useful to create documents for the sake of asingle printing
kz & szp (delg) typo/breaking (5A) 7 / 23
probelms with WYSIWYG the importance of document structure
the importance of document structure
why is it a problem if the structure of the document is not specified?
the main points of producing electronic documents are
◮ reusability: (parts of) a document should be used on severaloccasions, it is rarely useful to create documents for the sake of asingle printing
◮ machine-parsability: the document should be correctly interpretableby artificial intelligence, too (e.g., blind people may want to have itread, we may want to convert it to another format — e.g., HTML forweb publishing)
kz & szp (delg) typo/breaking (5A) 7 / 23
probelms with WYSIWYG the importance of document structure
the importance of document structure
why is it a problem if the structure of the document is not specified?
the main points of producing electronic documents are
◮ reusability: (parts of) a document should be used on severaloccasions, it is rarely useful to create documents for the sake of asingle printing
◮ machine-parsability: the document should be correctly interpretableby artificial intelligence, too (e.g., blind people may want to have itread, we may want to convert it to another format — e.g., HTML forweb publishing)
both aims are easy/possible to achieve only if the structure of thedocument is specified
kz & szp (delg) typo/breaking (5A) 7 / 23
probelms with WYSIWYG the importance of document structure
breaks and reusability
where a line is broken depends on
kz & szp (delg) typo/breaking (5A) 8 / 23
probelms with WYSIWYG the importance of document structure
breaks and reusability
where a line is broken depends on
◮ the width of the column (line, textblock)
kz & szp (delg) typo/breaking (5A) 8 / 23
probelms with WYSIWYG the importance of document structure
breaks and reusability
where a line is broken depends on
◮ the width of the column (line, textblock)
◮ the font’s properties: face, weight, size
kz & szp (delg) typo/breaking (5A) 8 / 23
probelms with WYSIWYG the importance of document structure
breaks and reusability
where a line is broken depends on
◮ the width of the column (line, textblock)
◮ the font’s properties: face, weight, size
◮ the shape of the paragraph (e.g., justified or not)
kz & szp (delg) typo/breaking (5A) 8 / 23
probelms with WYSIWYG the importance of document structure
breaks and reusability
where a line is broken depends on
◮ the width of the column (line, textblock)
◮ the font’s properties: face, weight, size
◮ the shape of the paragraph (e.g., justified or not)
options
kz & szp (delg) typo/breaking (5A) 8 / 23
probelms with WYSIWYG the importance of document structure
breaks and reusability
where a line is broken depends on
◮ the width of the column (line, textblock)
◮ the font’s properties: face, weight, size
◮ the shape of the paragraph (e.g., justified or not)
options
◮ normally these are not inherent properties of the text, when any ofthem changes, the places of line breaks change
kz & szp (delg) typo/breaking (5A) 8 / 23
probelms with WYSIWYG the importance of document structure
breaks and reusability
where a line is broken depends on
◮ the width of the column (line, textblock)
◮ the font’s properties: face, weight, size
◮ the shape of the paragraph (e.g., justified or not)
options
◮ normally these are not inherent properties of the text, when any ofthem changes, the places of line breaks change
◮ if these properties are inherent in the text, create an image file ofyour text (e.g., a JPEG file) and store your text in that (you will notbe able to modify the text though!)
kz & szp (delg) typo/breaking (5A) 8 / 23
probelms with WYSIWYG the importance of document structure
breaks and reusability
where a line is broken depends on
◮ the width of the column (line, textblock)
◮ the font’s properties: face, weight, size
◮ the shape of the paragraph (e.g., justified or not)
options
◮ normally these are not inherent properties of the text, when any ofthem changes, the places of line breaks change
◮ if these properties are inherent in the text, create an image file ofyour text (e.g., a JPEG file) and store your text in that (you will notbe able to modify the text though!) — this is like printed text:lacking the versatility of electronic text
kz & szp (delg) typo/breaking (5A) 8 / 23
breaking text
the rule of breaking text
text
here means the flowing text of the document, not including illustrations(pictures, charts, tables, etc.), which are usually less rigidly positioned indocuments
kz & szp (delg) typo/breaking (5A) 9 / 23
breaking text
the rule of breaking text
text
here means the flowing text of the document, not including illustrations(pictures, charts, tables, etc.), which are usually less rigidly positioned indocuments
breaking text
must be done “blindly”, i.e., without knowing where the breaks willactually be in a specific version of the text, i.e., we must specify placeswhere text may and where it may not be broken
kz & szp (delg) typo/breaking (5A) 9 / 23
breaking text
the rule of breaking text
text
here means the flowing text of the document, not including illustrations(pictures, charts, tables, etc.), which are usually less rigidly positioned indocuments
breaking text
must be done “blindly”, i.e., without knowing where the breaks willactually be in a specific version of the text, i.e., we must specify placeswhere text may and where it may not be broken
while sentences and paragraphs are primarily logical units, lines and pagesare rigid typographical units, which text must fit
kz & szp (delg) typo/breaking (5A) 9 / 23
breaking text into lines
points of breaking text into lines
a line of text
is a string of two types of element:
kz & szp (delg) typo/breaking (5A) 10 / 23
breaking text into lines
points of breaking text into lines
a line of text
is a string of two types of element:
◮ characters
kz & szp (delg) typo/breaking (5A) 10 / 23
breaking text into lines
points of breaking text into lines
a line of text
is a string of two types of element:
◮ characters
◮ whitespace ( SPACE , TAB , NEWLINE )
kz & szp (delg) typo/breaking (5A) 10 / 23
breaking text into lines
points of breaking text into lines
a line of text
is a string of two types of element:
◮ characters
◮ whitespace ( SPACE , TAB , NEWLINE )
SPACE
is a universal breaking point: lines may be broken at a SPACE
kz & szp (delg) typo/breaking (5A) 10 / 23
breaking text into lines
points of breaking text into lines
a line of text
is a string of two types of element:
◮ characters
◮ whitespace ( SPACE , TAB , NEWLINE )
SPACE
is a universal breaking point: lines may be broken at a SPACE
TAB
is not uniform in its behaviour wrt line breaking, but it is not used in text
kz & szp (delg) typo/breaking (5A) 10 / 23
breaking text into lines
points of breaking text into lines
a line of text
is a string of two types of element:
◮ characters
◮ whitespace ( SPACE , TAB , NEWLINE )
SPACE
is a universal breaking point: lines may be broken at a SPACE
TAB
is not uniform in its behaviour wrt line breaking, but it is not used in text
NEWLINE
breaks the line obligatorily
kz & szp (delg) typo/breaking (5A) 10 / 23
breaking text into lines
points of breaking text into lines
a line of text
is a string of two types of element:
◮ characters
◮ whitespace ( SPACE , TAB , NEWLINE )
SPACE
is a universal breaking point: lines may be broken at a SPACE
TAB
is not uniform in its behaviour wrt line breaking, but it is not used in text
NEWLINE
breaks the line obligatorily; in word processors, it ends the paragraph, intext formatters two or more NEWLINE s represent a paragraph division
kz & szp (delg) typo/breaking (5A) 10 / 23
breaking text into lines
human intervention
explicit specification of breaking points
is necessary when automatic breaking points are not appropriate, i.e.,
kz & szp (delg) typo/breaking (5A) 11 / 23
breaking text into lines
human intervention
explicit specification of breaking points
is necessary when automatic breaking points are not appropriate, i.e.,
◮ a line should not be broken at an automatic breaking point
kz & szp (delg) typo/breaking (5A) 11 / 23
breaking text into lines
human intervention
explicit specification of breaking points
is necessary when automatic breaking points are not appropriate, i.e.,
◮ a line should not be broken at an automatic breaking point
◮ a line should be broken, but there is no automatic breaking point
kz & szp (delg) typo/breaking (5A) 11 / 23
breaking text into lines
human intervention
explicit specification of breaking points
is necessary when automatic breaking points are not appropriate, i.e.,
◮ a line should not be broken at an automatic breaking point
◮ a line should be broken, but there is no automatic breaking point
do not break at breaking point: nonbreaking space
a.k.a. hard space, is like SPACE except that it is not a breaking point
kz & szp (delg) typo/breaking (5A) 11 / 23
breaking text into lines
human intervention
explicit specification of breaking points
is necessary when automatic breaking points are not appropriate, i.e.,
◮ a line should not be broken at an automatic breaking point
◮ a line should be broken, but there is no automatic breaking point
do not break at breaking point: nonbreaking space
a.k.a. hard space, is like SPACE except that it is not a breaking point
◮ symbols: (HTML), ~ (TEX, LATEX), in word processors usually
shown as a grey space: ·
kz & szp (delg) typo/breaking (5A) 11 / 23
breaking text into lines
human intervention
explicit specification of breaking points
is necessary when automatic breaking points are not appropriate, i.e.,
◮ a line should not be broken at an automatic breaking point
◮ a line should be broken, but there is no automatic breaking point
do not break at breaking point: nonbreaking space
a.k.a. hard space, is like SPACE except that it is not a breaking point
◮ symbols: (HTML), ~ (TEX, LATEX), in word processors usually
shown as a grey space: ·
◮ examples of use: Henry ·VIII, VIII. ·Henrik, G. ·B. ·Shaw,
2 ·+ ·2 ·= ·4
kz & szp (delg) typo/breaking (5A) 11 / 23
breaking text into lines
human intervention
explicit specification of breaking points
is necessary when automatic breaking points are not appropriate, i.e.,
◮ a line should not be broken at an automatic breaking point
◮ a line should be broken, but there is no automatic breaking point
do not break at breaking point: nonbreaking space
a.k.a. hard space, is like SPACE except that it is not a breaking point
◮ symbols: (HTML), ~ (TEX, LATEX), in word processors usually
shown as a grey space: ·
◮ examples of use: Henry ·VIII, VIII. ·Henrik, G. ·B. ·Shaw,
2 ·+ ·2 ·= ·4
◮ longer stretches: <nobr>text not to be broken</nobr>
kz & szp (delg) typo/breaking (5A) 11 / 23
breaking text into lines
other breaking points in lines
other points where lines may automatically be broken
are software specific; they may include the following
kz & szp (delg) typo/breaking (5A) 12 / 23
breaking text into lines
other breaking points in lines
other points where lines may automatically be broken
are software specific; they may include the following
◮ hyphen (“-”)
kz & szp (delg) typo/breaking (5A) 12 / 23
breaking text into lines
other breaking points in lines
other points where lines may automatically be broken
are software specific; they may include the following
◮ hyphen (“-”)
◮ slash (“/”); closing parenthesis, bracket, brace (“)”, “]”, “}”)
kz & szp (delg) typo/breaking (5A) 12 / 23
breaking text into lines
other breaking points in lines
other points where lines may automatically be broken
are software specific; they may include the following
◮ hyphen (“-”)
◮ slash (“/”); closing parenthesis, bracket, brace (“)”, “]”, “}”)
it was not a very bright idea to let lines break automatically at a slash orclosing parens (e.g., they say that Michael Moore’s film, Fahrenheit 9/11, is rather biassed)
kz & szp (delg) typo/breaking (5A) 12 / 23
breaking text into lines
other breaking points in lines
other points where lines may automatically be broken
are software specific; they may include the following
◮ hyphen (“-”)
◮ slash (“/”); closing parenthesis, bracket, brace (“)”, “]”, “}”)
it was not a very bright idea to let lines break automatically at a slash orclosing parens (e.g., they say that Michael Moore’s film, Fahrenheit 9/11, is rather biassed)
inhibiting breaking
if the line is automatically broken at a hyphen, this can be inhibited by anonbreaking (a.k.a. hard) hyphen: - , e.g.,
kz & szp (delg) typo/breaking (5A) 12 / 23
breaking text into lines
other breaking points in lines
other points where lines may automatically be broken
are software specific; they may include the following
◮ hyphen (“-”)
◮ slash (“/”); closing parenthesis, bracket, brace (“)”, “]”, “}”)
it was not a very bright idea to let lines break automatically at a slash orclosing parens (e.g., they say that Michael Moore’s film, Fahrenheit 9/11, is rather biassed)
inhibiting breaking
if the line is automatically broken at a hyphen, this can be inhibited by anonbreaking (a.k.a. hard) hyphen: - , e.g.,
◮ talking about the suffix -ing, 1969-07-20
kz & szp (delg) typo/breaking (5A) 12 / 23
breaking text into lines
other breaking points in lines
other points where lines may automatically be broken
are software specific; they may include the following
◮ hyphen (“-”)
◮ slash (“/”); closing parenthesis, bracket, brace (“)”, “]”, “}”)
it was not a very bright idea to let lines break automatically at a slash orclosing parens (e.g., they say that Michael Moore’s film, Fahrenheit 9/11, is rather biassed)
inhibiting breaking
if the line is automatically broken at a hyphen, this can be inhibited by anonbreaking (a.k.a. hard) hyphen: - , e.g.,
◮ talking about the suffix -ing, 1969-07-20
◮ rajzolhatunk-e?
kz & szp (delg) typo/breaking (5A) 12 / 23
breaking text into lines
breaking lines elsewhere
if lines were broken only at spaces many lines would be too loose (largespaces between words); the solution is
kz & szp (delg) typo/breaking (5A) 13 / 23
breaking text into lines
breaking lines elsewhere
if lines were broken only at spaces many lines would be too loose (largespaces between words); the solution is
hyphenation
kz & szp (delg) typo/breaking (5A) 13 / 23
breaking text into lines
breaking lines elsewhere
if lines were broken only at spaces many lines would be too loose (largespaces between words); the solution is
hyphenation
◮ it is language specific, in fact, it is a typographical convention ofindividual publishers, cf. British Eng-lish vs. American En-glish
kz & szp (delg) typo/breaking (5A) 13 / 23
breaking text into lines
breaking lines elsewhere
if lines were broken only at spaces many lines would be too loose (largespaces between words); the solution is
hyphenation
◮ it is language specific, in fact, it is a typographical convention ofindividual publishers, cf. British Eng-lish vs. American En-glish
◮ some technically correct hyphen positions are
kz & szp (delg) typo/breaking (5A) 13 / 23
breaking text into lines
breaking lines elsewhere
if lines were broken only at spaces many lines would be too loose (largespaces between words); the solution is
hyphenation
◮ it is language specific, in fact, it is a typographical convention ofindividual publishers, cf. British Eng-lish vs. American En-glish
◮ some technically correct hyphen positions are◮ esthetically displeasing: e.g., a-roma, Indi-a, (small bits of words are
not hyphenated off)
kz & szp (delg) typo/breaking (5A) 13 / 23
breaking text into lines
breaking lines elsewhere
if lines were broken only at spaces many lines would be too loose (largespaces between words); the solution is
hyphenation
◮ it is language specific, in fact, it is a typographical convention ofindividual publishers, cf. British Eng-lish vs. American En-glish
◮ some technically correct hyphen positions are◮ esthetically displeasing: e.g., a-roma, Indi-a, (small bits of words are
not hyphenated off)◮ morphologically misleading: e.g., elo-dong vs. el-odong ‘saunter about’
(it is better to hyphenate between morphemes)
kz & szp (delg) typo/breaking (5A) 13 / 23
breaking text into lines
breaking lines elsewhere
if lines were broken only at spaces many lines would be too loose (largespaces between words); the solution is
hyphenation
◮ it is language specific, in fact, it is a typographical convention ofindividual publishers, cf. British Eng-lish vs. American En-glish
◮ some technically correct hyphen positions are◮ esthetically displeasing: e.g., a-roma, Indi-a, (small bits of words are
not hyphenated off)◮ morphologically misleading: e.g., elo-dong vs. el-odong ‘saunter about’
(it is better to hyphenate between morphemes)
◮ even the best hyphenation algorithms occasionally fail
kz & szp (delg) typo/breaking (5A) 13 / 23
breaking text into lines
soft hyphen
a soft/optional/discretionary hyphen
is manually inserted at positions where the hyphenation algorithm fails, itonly appears if the position occurs at the end of the line —­ (in HTML), \- (in TEX, LATEX), - -appendix (in word processors)
kz & szp (delg) typo/breaking (5A) 14 / 23
breaking text into lines
tricky hyphenation conventions
◮ Polish kung-fu → kung- -fu
kz & szp (delg) typo/breaking (5A) 15 / 23
breaking text into lines
tricky hyphenation conventions
◮ Polish kung-fu → kung- -fu
◮ Hungarian vissza → visz- sza
kz & szp (delg) typo/breaking (5A) 15 / 23
breaking text into lines
tricky hyphenation conventions
◮ Polish kung-fu → kung- -fu
◮ Hungarian vissza → visz- sza
◮ Norwegian bussjafør → buss- sjafør
kz & szp (delg) typo/breaking (5A) 15 / 23
breaking text into lines
tricky hyphenation conventions
◮ Polish kung-fu → kung- -fu
◮ Hungarian vissza → visz- sza
◮ Norwegian bussjafør → buss- sjafør
◮ Dutch omaatje → oma- tje
kz & szp (delg) typo/breaking (5A) 15 / 23
breaking text into lines
tricky hyphenation conventions
◮ Polish kung-fu → kung- -fu
◮ Hungarian vissza → visz- sza
◮ Norwegian bussjafør → buss- sjafør
◮ Dutch omaatje → oma- tje
◮ German Becker → Bek- ker (the new orthography dictates Be- cker)
kz & szp (delg) typo/breaking (5A) 15 / 23
breaking text into lines
tricky hyphenation conventions
◮ Polish kung-fu → kung- -fu
◮ Hungarian vissza → visz- sza
◮ Norwegian bussjafør → buss- sjafør
◮ Dutch omaatje → oma- tje
◮ German Becker → Bek- ker (the new orthography dictates Be- cker)
◮ Greek Maòou → Ma- �ou (cf. ma�a → mai- a)
kz & szp (delg) typo/breaking (5A) 15 / 23
breaking text into lines
tricky hyphenation conventions
◮ Polish kung-fu → kung- -fu
◮ Hungarian vissza → visz- sza
◮ Norwegian bussjafør → buss- sjafør
◮ Dutch omaatje → oma- tje
◮ German Becker → Bek- ker (the new orthography dictates Be- cker)
◮ Greek Maòou → Ma- �ou (cf. ma�a → mai- a)
discretionary
such patterns are difficult to deal with, but TEX/LATEX provide a way, e.g.,vi\discretionary{sz-}{sz}{ssz}a,oma\discretionary{-}{}{a}tje give the correct output, but still allsuch patterns must be marked
kz & szp (delg) typo/breaking (5A) 15 / 23
breaking text into lines
widow/orphan word
widow/orphan word
many consider it ugly to leave (part of) a single (especially short) word inthe last line of a paragraph (it is esthetically displeasing)
kz & szp (delg) typo/breaking (5A) 16 / 23
breaking text into lines
widow/orphan word
widow/orphan word
many consider it ugly to leave (part of) a single (especially short) word inthe last line of a paragraph (it is esthetically displeasing)
examples
Lorem ipsum dolor sit amet, consectetuer adipiscing elit. Quisque sed nunc eu-
ismod tortor ornare rutrum. Suspendisse sagittis odio non metus. Praesent sem
magna, pretium vel, elementum at, consectetuer et, mi. Donec aliquet tortor ut
ipsum. Vivamus et arcu sed lacus rhoncus vulputate. Ut condimentum, erat vel
lobortis faucibus, neque quam faucibus sapien, eget dapibus magna metus ve-
lit.
Phasellus non odio. Ut sollicitudin. Vestibulum blandit, massa at vehicula
varius, dui enim porttitor augue, id auctor est metus a nisi. Quisque ligula mi,
aliquam ac, consectetuer nonummy, fermentum eu, urna. Sed aliquam felis quis
quam. Pellentesque non metus. Nisi vestibulum nisi elementum amoris tutae
sit.
kz & szp (delg) typo/breaking (5A) 16 / 23
breaking text into pages
breaking text into pages
◮ text is broken into lines that form paragraphs
kz & szp (delg) typo/breaking (5A) 17 / 23
breaking text into pages
breaking text into pages
◮ text is broken into lines that form paragraphs
◮ paragraphs are broken into pages
kz & szp (delg) typo/breaking (5A) 17 / 23
breaking text into pages
breaking text into pages
◮ text is broken into lines that form paragraphs
◮ paragraphs are broken into pages
◮ page breaking is the vertical analogue of line breaking
kz & szp (delg) typo/breaking (5A) 17 / 23
breaking text into pages
breaking text into pages
◮ text is broken into lines that form paragraphs
◮ paragraphs are broken into pages
◮ page breaking is the vertical analogue of line breaking
default page breaks
the best place to break text into pages is between two paragraphs (theanalogue of SPACE )
kz & szp (delg) typo/breaking (5A) 17 / 23
breaking text into pages
breaking text into pages
◮ text is broken into lines that form paragraphs
◮ paragraphs are broken into pages
◮ page breaking is the vertical analogue of line breaking
default page breaks
the best place to break text into pages is between two paragraphs (theanalogue of SPACE )
except
between a secion title and the first paragraph of the section!
kz & szp (delg) typo/breaking (5A) 17 / 23
breaking text into pages
breaking text into pages
◮ text is broken into lines that form paragraphs
◮ paragraphs are broken into pages
◮ page breaking is the vertical analogue of line breaking
default page breaks
the best place to break text into pages is between two paragraphs (theanalogue of SPACE )
except
between a secion title and the first paragraph of the section!(most word processors and text formatters take care of this after paragraphsmarked as a heading: another advantage of marking document structure)
kz & szp (delg) typo/breaking (5A) 17 / 23
breaking text into pages
breaks within a paragraph
usually page breaks fall within a paragraph (the analogue of hyphenation)
paragraphs
in the default case, can be broken between any two lines
kz & szp (delg) typo/breaking (5A) 18 / 23
breaking text into pages
breaks within a paragraph
usually page breaks fall within a paragraph (the analogue of hyphenation)
paragraphs
in the default case, can be broken between any two lines
restrictions
avoid breaking a paragraph
kz & szp (delg) typo/breaking (5A) 18 / 23
breaking text into pages
breaks within a paragraph
usually page breaks fall within a paragraph (the analogue of hyphenation)
paragraphs
in the default case, can be broken between any two lines
restrictions
avoid breaking a paragraph
◮ after a line that ends in a hyphenated word, esp. on a recto page
kz & szp (delg) typo/breaking (5A) 18 / 23
breaking text into pages
breaks within a paragraph
usually page breaks fall within a paragraph (the analogue of hyphenation)
paragraphs
in the default case, can be broken between any two lines
restrictions
avoid breaking a paragraph
◮ after a line that ends in a hyphenated word, esp. on a recto page
◮ before the last line (creating a widow or orphan line)
kz & szp (delg) typo/breaking (5A) 18 / 23
breaking text into pages
breaks within a paragraph
usually page breaks fall within a paragraph (the analogue of hyphenation)
paragraphs
in the default case, can be broken between any two lines
restrictions
avoid breaking a paragraph
◮ after a line that ends in a hyphenated word, esp. on a recto page
◮ before the last line (creating a widow or orphan line)
◮ after the first line (creating a club line)
kz & szp (delg) typo/breaking (5A) 18 / 23
breaking text into pages
breaks within a paragraph
usually page breaks fall within a paragraph (the analogue of hyphenation)
paragraphs
in the default case, can be broken between any two lines
restrictions
avoid breaking a paragraph
◮ after a line that ends in a hyphenated word, esp. on a recto page
◮ before the last line (creating a widow or orphan line)
◮ after the first line (creating a club line)
(cf. the inhibition of hyphenating off small bits of words)
kz & szp (delg) typo/breaking (5A) 18 / 23
breaking text into pages
widow and club lines
a widow line
Lorem ipsum dolor sit amet, con-sectetuer adipiscing elit. Aliquamjusto quam, porta et, lacinia vel,molestie ac, arcu. Curabitur inurna. Donec eu sapien. Vestibu-lum suscipit. Etiam lorem. Donecpurus. Etiam odio nisi, sem-per nec, feugiat eget, faucibusvel, turpis. Nam tincidunt plac-erat felis. Phasellus ac ante necnunc lobortis malesuada. Donecac massa. Morbi id tellus. Crasvel felis. Pellentesque lacus pede,pretium at, elementum in, ul-lamcorper in, pede. Cras nequenunc, dapibus ut, sodales sit amet,cursus at, velit. In vitæ velitsit amet metus vulputate con-sectetuer. Fusce eu nibh. Nam id
tortor.Sed diam. Curabitur sapien
enim, rutrum id, iaculis sed, mat-tis vel, nisi. In fermentum pellen-tesque turpis. Mauris sed massaat mauris suscipit suscipit. Sedvel lectus in neque porttitor mat-tis. Suspendisse condimentum dic-tum arcu. Phasellus eget do-lor quis enim interdum tempor.Nam lobortis. Nam augue est,lacinia non, tempor in, placeratut, neque. Phasellus convallis do-lor vel velit. Nam nec enim eu felismalesuada feugiat. Donec turpis.Nulla vel tortor id felis aliquetpretium. Nullam sit amet erat infelis eleifend egestas.
Morbi vel tortor sed tortor con-
sectetuer mollis. Aliquam maurisnunc, fermentum sit amet, fermen-tum vitæ, venenatis et, tellus.
Cras et ipsum. Etiam so-dales tempus mauris. Ut fermen-tum fermentum nisi. Cras nonmassa. Sed nunc. Donec facili-sis diam non nunc. Aliquam fer-mentum bibendum dolor. Duismalesuada ultrices ipsum. Fuscediam. Suspendisse pulvinar do-lor iaculis magna. Vestibulum so-dales. Vestibulum erat. Sus-pendisse potenti. Sed imperdietpretium erat. Morbi molestie,leo nec pharetra facilisis, sapiensapien vulputate erat, id aliquamvelit nulla non eros. Nulla facilisi.Ænean nec diam. Proin sapien dui.
a club line
kz & szp (delg) typo/breaking (5A) 19 / 23
breaking text footnotes
breaking footnotes
◮ a footnote must appear on the page it is referred to
kz & szp (delg) typo/breaking (5A) 20 / 23
breaking text footnotes
breaking footnotes
◮ a footnote must appear on the page it is referred to
◮ if this is impossible, it must begin on the page it is referred to
kz & szp (delg) typo/breaking (5A) 20 / 23
breaking text footnotes
breaking footnotes
◮ a footnote must appear on the page it is referred to
◮ if this is impossible, it must begin on the page it is referred to
◮ in the latter case, the footnote has to be broken between pages
kz & szp (delg) typo/breaking (5A) 20 / 23
breaking text footnotes
breaking footnotes
◮ a footnote must appear on the page it is referred to
◮ if this is impossible, it must begin on the page it is referred to
◮ in the latter case, the footnote has to be broken between pages
◮ it is better to break a footnote on a verso page than on a recto page
kz & szp (delg) typo/breaking (5A) 20 / 23
breaking text footnotes
breaking footnotes
◮ a footnote must appear on the page it is referred to
◮ if this is impossible, it must begin on the page it is referred to
◮ in the latter case, the footnote has to be broken between pages
◮ it is better to break a footnote on a verso page than on a recto page;why?
kz & szp (delg) typo/breaking (5A) 20 / 23
breaking text footnotes
breaking footnotes
◮ a footnote must appear on the page it is referred to
◮ if this is impossible, it must begin on the page it is referred to
◮ in the latter case, the footnote has to be broken between pages
◮ it is better to break a footnote on a verso page than on a recto page;why? because the reader does not have to turn the page this way
kz & szp (delg) typo/breaking (5A) 20 / 23
breaking text footnotes
breaking footnotes
◮ a footnote must appear on the page it is referred to
◮ if this is impossible, it must begin on the page it is referred to
◮ in the latter case, the footnote has to be broken between pages
◮ it is better to break a footnote on a verso page than on a recto page;why? because the reader does not have to turn the page this way
◮ before the break in a footnote the text should end abruptly (by ahyphenated word, not by the end of a sentence), so that the reader iswarned that the footnote did not end
kz & szp (delg) typo/breaking (5A) 20 / 23
breaking text footnotes
breaking footnotes
◮ a footnote must appear on the page it is referred to
◮ if this is impossible, it must begin on the page it is referred to
◮ in the latter case, the footnote has to be broken between pages
◮ it is better to break a footnote on a verso page than on a recto page;why? because the reader does not have to turn the page this way
◮ before the break in a footnote the text should end abruptly (by ahyphenated word, not by the end of a sentence), so that the reader iswarned that the footnote did not end
◮ a footnote continued on a following page is sometimes marked by aline above it
kz & szp (delg) typo/breaking (5A) 20 / 23
breaking text footnotes
breaking footnotes
kz & szp (delg) typo/breaking (5A) 21 / 23
conclusions
conclusions
you must always think twice before
◮ breaking a line with an explicit line break
kz & szp (delg) typo/breaking (5A) 22 / 23
conclusions
conclusions
you must always think twice before
◮ breaking a line with an explicit line break
◮ hyphenating a word with an explicit hyphen
kz & szp (delg) typo/breaking (5A) 22 / 23
conclusions
conclusions
you must always think twice before
◮ breaking a line with an explicit line break
◮ hyphenating a word with an explicit hyphen
◮ breaking a paragraph with an explicit page break
kz & szp (delg) typo/breaking (5A) 22 / 23
sample exam questions
sample exam questions
Which of the following is an inherent property of a text?
1. the size of the letters
2. the width of the text block
3. the height of the text block
4. the italicization of words
kz & szp (delg) typo/breaking (5A) 23 / 23
sample exam questions
sample exam questions
Which of the following is an inherent property of a text?
1. the size of the letters
2. the width of the text block
3. the height of the text block
4. the italicization of words
kz & szp (delg) typo/breaking (5A) 23 / 23
sample exam questions
sample exam questions
Which of the following is an inherent property of a text?
1. the size of the letters
2. the width of the text block
3. the height of the text block
4. the italicization of words
What is a club line?
1. the last line of a paragraph with the rest on the preceding page
2. the first line of a paragraph with the rest on the following page
3. a thick line separating the text body and the footnotes
4. a line under the header of a page.
kz & szp (delg) typo/breaking (5A) 23 / 23
sample exam questions
sample exam questions
Which of the following is an inherent property of a text?
1. the size of the letters
2. the width of the text block
3. the height of the text block
4. the italicization of words
What is a club line?
1. the last line of a paragraph with the rest on the preceding page
2. the first line of a paragraph with the rest on the following page
3. a thick line separating the text body and the footnotes
4. a line under the header of a page.
kz & szp (delg) typo/breaking (5A) 23 / 23