bnc sampler: xml editionmonologue 167714 (16.92%) 7196 (9.42%) 18 dialogue 822990 (83.07%) 69122...

37
BNC Sampler: XML edition July 31, 2008 1 What is the BNC Sampler? The BNC Sampler is a subset of the British National Corpus (BNC). This document offers a short introduction to the BNC Sampler corpus, with an outline of the areas where the Sampler differs from the BNC XML Edition. Detailed information about the design and construction of the BNC can be found in the Reference Guide for the British National Corpus (XML Edition) at http://www.natcorp. ox.ac.uk/XMLedition/URG/. The BNC Sampler consists of two collections of written and spoken material of about one million words each, originally compiled to mirror the composition of the full BNC as far as possible. The BNC Sampler was initially used for a tagging enhancement project at Lancaster University. It was annotated with a more detailed set of part-of-speech tags than the BNC and the annotation was manually checked and post-edited. The results of this tagging enhancement project fed into the development of the second edition of the BNC (BNC World, 2001). The BNC Sampler is thus likely to be of interest to: those who wish to use a smaller corpus with equal amounts of written and spoken material ; those who want a resource with highly detailed and accurate part-of-speech annotation. 1.1 Composition of the BNC Sampler As noted above, one motivation for the original design of the BNC Sampler was that it should reflect as far as possible the full variety of text types in the full BNC, even though the BNC itself was not yet complete at the time the texts were selected. The texts were chosen and classified according to the full range of selection criteria defined at the start of the BNC project, and thus form an interesting example of ‘balanced’ corpus design. The following tables demonstrate the composition of the Sampler with respect to all of the original BNC design criteria. Size in Kbyte 43946 Size in w-units 1993525 Size in s-units 126976 Text type words s-units texts Spoken demographic 493852 (24.77%) 52144 (41.06%) 47 Spoken context-governed 496852 (24.92%) 24174 (19.03%) 51 Written books and periodicals 888522 (44.57%) 44852 (35.32%) 69 Written-to-be-spoken 18121 (0.90%) 1056 (0.83%) 3 Written miscellaneous 96178 (4.82%) 4750 (3.74%) 14 Total 496852 24174 184 1.1.1 Spoken Texts 1

Upload: others

Post on 24-Jul-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

BNC Sampler: XML edition

July 31, 2008

1 What is the BNC Sampler?The BNC Sampler is a subset of the British National Corpus (BNC). This document offers a shortintroduction to the BNC Sampler corpus, with an outline of the areas where the Sampler differs from theBNC XML Edition. Detailed information about the design and construction of the BNC can be foundin the Reference Guide for the British National Corpus (XML Edition) at http://www.natcorp.ox.ac.uk/XMLedition/URG/.

The BNC Sampler consists of two collections of written and spoken material of about one millionwords each, originally compiled to mirror the composition of the full BNC as far as possible. The BNCSampler was initially used for a tagging enhancement project at Lancaster University. It was annotatedwith a more detailed set of part-of-speech tags than the BNC and the annotation was manually checkedand post-edited. The results of this tagging enhancement project fed into the development of the secondedition of the BNC (BNC World, 2001).

The BNC Sampler is thus likely to be of interest to:

• those who wish to use a smaller corpus with equal amounts of written and spoken material ;

• those who want a resource with highly detailed and accurate part-of-speech annotation.

1.1 Composition of the BNC SamplerAs noted above, one motivation for the original design of the BNC Sampler was that it should reflectas far as possible the full variety of text types in the full BNC, even though the BNC itself was not yetcomplete at the time the texts were selected. The texts were chosen and classified according to the fullrange of selection criteria defined at the start of the BNC project, and thus form an interesting exampleof ‘balanced’ corpus design. The following tables demonstrate the composition of the Sampler withrespect to all of the original BNC design criteria.

Size in Kbyte 43946Size in w-units 1993525Size in s-units 126976

Text type words s-units textsSpoken demographic 493852 (24.77%) 52144 (41.06%) 47Spoken context-governed 496852 (24.92%) 24174 (19.03%) 51Written books and periodicals 888522 (44.57%) 44852 (35.32%) 69Written-to-be-spoken 18121 (0.90%) 1056 (0.83%) 3Written miscellaneous 96178 (4.82%) 4750 (3.74%) 14Total 496852 24174 184

1.1.1 Spoken Texts

1

Page 2: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

1.1 Composition of the BNC Sampler 1 WHAT IS THE BNC SAMPLER?

Table 3: Domain for context-coverned spoken material

words s-units textsEducational/Informative 80463 (16.19%) 7322 (30.28%) 9Business 134275 (27.02%) 5673 (23.46%) 13Public/Institutional 145508 (29.28%) 4816 (19.92%) 14Leisure 136606 (27.49%) 6363 (26.32%) 15Total 493852 52144 51

Table 4: Age band of demographic respondent

words s-units texts0-14 22387 (4.53%) 1254 (2.40%) 215-24 64652 (13.09%) 7471 (14.32%) 625-34 135973 (27.53%) 11640 (22.32%) 1235-44 97834 (19.81%) 13724 (26.31%) 1045-59 107112 (21.68%) 12619 (24.20%) 1160+ 65894 (13.34%) 5436 (10.42%) 6Total 493852 52144 47

Table 5: Social class of demographic repondent

words s-units textsAB 164933 (33.39%) 13383 (25.66%) 16C1 98700 (19.98%) 9641 (18.48%) 9C2 137686 (27.88%) 18619 (35.70%) 14DE 92533 (18.73%) 10501 (20.13%) 8Total 493852 52144 47

Table 6: Sex of demographic respondent

words s-units textsMale 241493 (48.89%) 24183 (46.37%) 23Female 252359 (51.10%) 27961 (53.62%) 24Total 990704 76318 47

Table 7: Spoken interaction type

words s-units textsMonologue 167714 (16.92%) 7196 (9.42%) 18Dialogue 822990 (83.07%) 69122 (90.57%) 80Total 990704 76318 98

Table 8: Region where spoken

words s-units textsUnknown 54129 (5.46%) 1164 (1.52%) 6

2

Page 3: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

1 WHAT IS THE BNC SAMPLER? 1.1 Composition of the BNC Sampler

Region where spoken(cont.)South 375312 (37.88%) 27688 (36.27%) 37Midlands 199666 (20.15%) 14988 (19.63%) 19North 361597 (36.49%) 32478 (42.55%) 36Total 1002821 50658 98

1.1.2 Written texts

Table 9: Author age band

words s-units textsUnknown 935786 (93.31%) 49128 (96.97%) 8135-44 26550 (2.64%) 0 (0%) 145-59 7629 (0.76%) 232 (0.45%) 260+ 32856 (3.27%) 1298 (2.56%) 2Total 1002821 50658 86

Table 10: Author sex

words s-units textsUnknown 405633 (40.44%) 20091 (39.66%) 39Male 396786 (39.56%) 22145 (43.71%) 35Female 195581 (19.50%) 8142 (16.07%) 11Unknown 4821 (0.48%) 280 (0.55%) 1Total 1002821 50658 86

Table 11: Author type

words s-units textsCorporate 79369 (7.91%) 4285 (8.45%) 9Multiple 368323 (36.72%) 17458 (34.46%) 32Sole 550136 (54.85%) 28446 (56.15%) 43Unknown 4993 (0.49%) 469 (0.92%) 2Total 1002821 50658 86

Table 12: Audience age

words s-units textsChild 23700 (2.36%) 2326 (4.59%) 3Teenager 30110 (3.00%) 3673 (7.25%) 4Adult 946106 (94.34%) 44449 (87.74%) 78Any 2905 (0.28%) 210 (0.41%) 1Total 1002821 50658 86

Table 13: Domain for written texts

words s-units textsImaginative 233774 (23.31%) 21332 (42.10%) 18natural & pure science 35456 (3.53%) 774 (1.52%) 5

3

Page 4: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

1.1 Composition of the BNC Sampler 1 WHAT IS THE BNC SAMPLER?

Domain for written texts(cont.)applied science 106193 (10.58%) 5494 (10.84%) 10social science 76211 (7.59%) 3438 (6.78%) 10world affairs 306921 (30.60%) 9201 (18.16%) 23commerce & finance 60270 (6.01%) 3613 (7.13%) 6arts 58318 (5.81%) 3049 (6.01%) 3belief & thought 43626 (4.35%) 1225 (2.41%) 4leisure 82052 (8.18%) 2532 (4.99%) 7Total 1002821 50658 86

Table 14: Audience level

words s-units textsUnknown 9505 (0.94%) 363 (0.71%) 1Low 172777 (17.22%) 11564 (22.82%) 22Medium 568876 (56.72%) 29136 (57.51%) 44High 251663 (25.09%) 9595 (18.94%) 19Total 1002821 50658 86

Table 15: Written Medium

words s-units textsBook 616213 (61.44%) 31927 (63.02%) 45Periodical 272309 (27.15%) 12925 (25.51%) 24Miscellaneous – published 59145 (5.89%) 3368 (6.64%) 8Miscellaneous – unpublished 37033 (3.69%) 1382 (2.72%) 6To-be-spoken 18121 (1.80%) 1056 (2.08%) 3Total 1002821 50658 86

Table 16: Place of publication

words s-units textsUnknown 81999 (8.17%) 3950 (7.79%) 11UK 258098 (25.73%) 11855 (23.40%) 23North 8580 (0.85%) 1493 (2.94%) 1Midland 18749 (1.86%) 0 (0%) 1South 635395 (63.36%) 33360 (65.85%) 50Total 1002821 50658 86

Table 17: Written sample type

words s-units textsUnknown 430623 (42.94%) 22146 (43.71%) 43Whole text 187955 (18.74%) 7987 (15.76%) 15Beginning sample 170767 (17.02%) 12078 (23.84%) 14Middle sample 151063 (15.06%) 7772 (15.34%) 11End sample 26550 (2.64%) 0 (0%) 1Composite 35863 (3.57%) 675 (1.33%) 2Total 1002821 50658 86

4

Page 5: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

2 FORMAT OF THE BNC SAMPLER

Table 18: Written reception status

words s-units textsUnknown 262460 (26.17%) 12646 (24.96%) 24Low 226382 (22.57%) 9537 (18.82%) 19Medium 256448 (25.57%) 13622 (26.89%) 19High 257531 (25.68%) 14853 (29.32%) 24Total 1002821 50658 86

Table 19: Target audience sex

words s-units textsUnknown 280387 (27.95%) 14950 (29.51%) 28Male 20002 (1.99%) 0 (0%) 1Female 40288 (4.01%) 2227 (4.39%) 3Mixed 662144 (66.02%) 33481 (66.09%) 54Total 1002821 50658 86

Table 20: Written text time period

words s-units texts1975-1993 1002821 (100%) 50658 (100%) 86

2 Format of the BNC SamplerThe first edition of the BNC Sampler (1997) was distributed in SGML format; this version has beenautomatically converted to XML but no other changes have been made to the files. The originaldocumentation for the SGML version is available from the BNC website at http://www.natcorp.ox.ac.uk/corpus/sampler/, and includes full information about the CLAWS part-of-speechtagging applied to the Sampler, including a description of the CLAWS system itself.

There are several differences between the format of the BNC Sampler (in its second edition) and theBNC XML Edition.

tagset for linguistic annotation As noted above, the BNC Sampler has been manually annotated witha more detailed tagset than the BNC XML Edition. More information about the tagset (CLAWS7) used in the sampler is available from the BNC website (http://www.natcorp.ox.ac.uk/corpus/sampler/guide_C7.htm).

lemmatization Unlike the BNC XML Edition (and BNC Baby), the BNC Sampler has not beenannotated with lemmatized forms of each word, nor does it include simplified part-of-speech tags(pos).

tokenization In the BNC Sampler, words forming multi-word units are tagged as one wordtogether. In the BNC XML Edition, multi-word units are marked using a <mw> elementto enclose sequences of orthographic words, which are also tagged individually. Forexample, in the BNC Sampler the sequence ‘of course’ is marked up as follows:

<w type="RR">of course </w>

The multi-word unit is analyzed as an adverb and given one part-of-speech tag (RR). Inthe BNC XML Edition, however, the same sequence would be marked up as follows:

5

Page 6: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

<mw c5="AV0"> <w c5="PRF"

hw="of" pos="PREP">of </w> <w c5="NN1" hw="course"

pos="SUBST">course</w> </mw>

The multi-word unit is analyzed as an adverb (AV0) and marked using a <mw> element. Its twocomponent parts are analyzed separately, and each is annotated with a detailed part-of-speech code(in the c5 attribute), a headword (lemma) ( in the hw attribute) and a simplified part-of-speech code(in the pos attribute).

structural markup There are several differences in the way XML markup has been applied between theBNC Sampler and the BNC XML Edition. The treatment of overlapping speech is different; theelements used to mark-up structural divisions are different; the value-ranges and names of someattributes have changed, many header elements have changed their names, etc. We do not itemizethese differences here, since the elements and attributes are fully documented in the complete UserReference Guides for the two corpora.

Most of the discussion of XML-specific matters in the documentation for BNC Baby appliesequally to the XML version of the BNC Sampler. Most of the sample scripts provided however needmodification.

3 Source TextsBibliographic details1 of the files included in the BNC Sampler are as follows:

[A7V] 8802 words from The Guardian, electronic edition of 1989-11-08: Foreign news pages. GuardianNewspapers Ltd London 1989

[A87] 11070 words from The Guardian, electronic edition of 1989-11-11: Foreign news pages.Guardian Newspapers Ltd London 1989

[A8J] 8071 words from The Guardian, electronic edition of 1989-11-23: Foreign news pages. GuardianNewspapers Ltd London 1989

[A8W] 10362 words from The Guardian, electronic edition of 1989-12-07: Foreign news pages.Guardian Newspapers Ltd London 1989

[A95] 10204 words from The Guardian, electronic edition of 1989-12-08: Foreign news pages.Guardian Newspapers Ltd London 1989

[A9E] 18288 words from The Guardian, electronic edition of 1989-12-10: Foreign news pages.Guardian Newspapers Ltd London 1989

[A9M] 11338 words from The Guardian, electronic edition of 1989-12-11: Foreign news pages.Guardian Newspapers Ltd London 1989

[A9V] 7203 words from The Guardian, electronic edition of 1989-12-13: Foreign news pages. GuardianNewspapers Ltd London 1989

[AA4] 8282 words from The Guardian, electronic edition of 1989-12-20: Foreign news pages. GuardianNewspapers Ltd London 1989

1The word counts given here are for the corresponding version of this text in the BNC XML Edition, and may thus varysomewhat from the count in the BNC Sampler itself.

6

Page 7: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

[AAB] 9884 words from The Guardian, electronic edition of 1989-12-21: Foreign news pages.Guardian Newspapers Ltd London 1989

[AAK] 9498 words from The Guardian, electronic edition of 1989-12-22: Foreign news pages.Guardian Newspapers Ltd London 1989

[AAT] 6977 words from The Guardian, electronic edition of 1989-12-31: Foreign news pages. GuardianNewspapers Ltd London 1989

[AEA] 26515 words from Tomorrow. Taylor, Elizabeth Russell Peter Owen Publishers London 199152-137

[ALS] 4149 words from Captain Pugwash and the huge reward. Ryan, John Gungarden Books Rye,East Sussex 1991 4-43

[AP6] 1879 words from Monster Raving Loony Party’s draft manifesto for General Election 1992. u.p.

[APJ] 3440 words from Report on visit to Peto Institute. Eccleshall, J Davis, J u.p.

[B2E] 25386 words from Oh! sister I saw the bells go down. Saunders-Veness, Frances The BookGuild Ltd Lewes, East Sussex 1989 7-73

[BMJ] 14268 words from Channel tunnel. Grayson, Leslie The British Library Board London 19901-103

[BP6] 6553 words from Welcome to Somerset. u.p.

[C9C] 14796 words from The Gardener. Maxwell Consumer Magazines London 1992-12, 1991-03

[CAA] 6295 words from New Millennium summer holidays. u.p.

[CBB] 18566 words from The myths and legends of Stamford in Lincolnshire. Smith, Martin PaulWatkins Stamford, Lincs 1991 15-108

[CCD] 39460 words from The child bride. Wiat, Philippa Robert Hale Ltd London 1990

[CDH] 9416 words from Hair Flair. Shaws Publications Ltd London 1992 4-58

[CEL] 20709 words from Today. News Group Newspapers Ltd London 1992-12

[CF5] 3393 words from East Anglian Daily Times. East Anglian Daily Times Company Ipswich 1993-03

[CF6] 5958 words from East Anglian Daily Times. East Anglian Daily Times Company Ipswich 1993-03

[CF7] 480 words from East Anglian Daily Times. East Anglian Daily Times Company Ipswich 1993-03

[CF8] 12806 words from East Anglian Daily Times. East Anglian Daily Times Company Ipswich 1993-03

[CF9] 41587 words from East Anglian Daily Times. East Anglian Daily Times Company Ipswich 1993-03

[CHP] 14040 words from Queen Mary’s dolls’ house. Stewart-Wilson, Mary The Bodley Head London1989 10-190

[CHR] 10339 words from Return of the red nose joke book. Green, Rod Boxtree London 1991

[CL8] 8431 words from Computergram international. u.p.

7

Page 8: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

[CN4] 21989 words from The Artist: a magazine giving instruction in all branches of art. The ArtistPublishing Company Ltd Tenterden 1992 7-49

[DCH] 15274 words from Amnesty International meeting

DCHPS000 unspecified

DCHPS001 unspecified

DCHPS002 unspecified

DCHPS003 unspecified

DCHPS004 unspecified

DCHPS005 unspecified

DCHPS006 unspecified

DCHPS007 unspecified

DCHPSUNK Unknown speaker, other

DCHPSUGP Group of unknown speakers, other

[EAP] 6292 words from [New Oxford English Dictionary procedures documents] u.p.

[EBK] 6533 words from Action. World Assoc for Christian Comm 1991-07/1993-02

[EVR] 11176 words from Egyptian gods and myths. Thomas, Angela P Shire Publications Ltd UK1989 6-60

[EVY] 16693 words from Manpower solutions. Dean, Derek J Scutari Projects Ltd UK 1987 1-60

[EW4] 27383 words from Proportional representation: which system? Sykes, Leslie The HornbeamPress Leicester 1990 1-76

[EX7] 3948 words from Dear Green Place [from Truth, dare or promise] Riley, Denise Virago PressLtd London 1985 237-248

[F71] 6772 words from A poet’s response to the pictures of Gauguin: [Picture appreciation lesson]

F71PS000 unspecified

F71PS001 unspecified

F71PS002 unspecified

F71PS003 unspecified

F71PS004 unspecified

F71PS005 unspecified

F71PS006 unspecified

F71PS007 unspecified

F71PSUNK Unknown speaker, other

F71PSUGP Group of unknown speakers, other

[F77] 4930 words from [Etching lesson]

PS1L3 46, Andrew, teacher

PS1L4 45, teacher

PS1L5 14, Kevin, student

F77PSUNK Unknown speaker, other

8

Page 9: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

F77PSUGP Group of unknown speakers, other

[F7G] 5736 words from [Teachers’ conference: discussing assessment procedures]

PS1M4 40+, Andrew, teacher

PS1M5 30+, Angela, teacher

PS1M6 30+, Paul, teacher

PS1M7 40+, Rod, teacher

PS1M8 50+, Don, teacher

PS1M9 40+, Alan, teacher

PS1MA 30+, Terry, teacher

F7GPSUNK Unknown speaker, other

F7GPSUGP Group of unknown speakers, other

[F7J] 11165 words from [COHSE/NALGO/NUPE/meeting]

F7JPS000 unspecified

F7JPS001 unspecified

F7JPS002 unspecified

F7JPSUNK Unknown speaker, other

F7JPSUGP Group of unknown speakers, other

[F86] 9128 words from [Church of Scotland: Meeting on rules and regulations]

PS1NE Hugh, moderator

PS1NF Mr Boyd

PS1NG Mr Torrence

PS1NH Mr Forrester

PS1NJ Mr McGilvery

F86PS000 unspecified

F86PS001 unspecified

F86PS002 unspecified

F86PS003 unspecified

F86PSUNK Unknown speaker, other

F86PSUGP Group of unknown speakers, other

[F98] 5602 words from Computers and the humanities. Kenny, A u.p.

[F9M] 8430 words from City psalms. Zephaniah, B Bloodaxe books ltd Newcastle upon Tyne 199211-64

[FA4] 7294 words from Further developments of the electronic book. Feldman, Tony BNBR London1991

[FB4] 21440 words from The history of Siberia: from Russian conquest to revolution. Wood, AlanRoutledge & Kegan Paul plc London 1991 1-91

[FCF] 3625 words from The Weekly Law Reports 1992 Volume 3. u.p.

9

Page 10: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

[FEJ] 17099 words from Model financial statements for public and private companies. Stoy HaywardButterworth & Company (pub) Ltd UK 1990 1-115

[FL6] 4863 words from Eating disorders: television discussion

FL6PS000 unspecified

FL6PS001 unspecified

FL6PS002 unspecified

FL6PS003 unspecified

FL6PS004 unspecified

FL6PS005 unspecified

FL6PS006 unspecified

FL6PS007 unspecified

FL6PS008 unspecified

FL6PSUNK Unknown speaker, other

FL6PSUGP Group of unknown speakers, other

[FLK] 5302 words from Young women in Scotland: television discussion

FLKPS000 unspecified

FLKPS001 unspecified

FLKPS002 unspecified

FLKPS003 unspecified

FLKPS004 unspecified

FLKPS005 unspecified

FLKPS006 unspecified

FLKPS007 unspecified

FLKPS008 unspecified

FLKPSUNK Unknown speaker, other

FLKPSUGP Group of unknown speakers, other

[FLS] 10830 words from General Portfolio health and safety meeting

PS1PT 38, Roger, first aid representative

PS1PU 47, Roger, first aid representative

PS1PV 36, Peter, first aid representative

PS1PW 32, Katie, first aid representative

PS1PX 24, Dianne, first aid representative

PS1PY 28, Suzanne, first aid representative

PS1R0 58, Norman, first aid representative

PS1R1 33, Carmel, first aid representative

PS1R2 26, Steve, first aid representative

FLSPSUNK Unknown speaker, other

FLSPSUGP Group of unknown speakers, other

[FLU] 4243 words from Albert Gunter: sermon

10

Page 11: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS1RD Albert, minister

[FLY] 6227 words from 11th year science lesson on chemistry of metal processing

PS1RS 43, Tony, teacher

FLYPSUNK Unknown speaker, other

FLYPSUGP Group of unknown speakers, other

[FM4] 11203 words from Tutorial lesson: GCSE maths tutoring session

PS1S9 50, John, tutor

PS1SA 16, Andrew, student

FM4PSUNK Unknown speaker, other

FM4PSUGP Group of unknown speakers, other

[FM7] 11058 words from Strangers - talk by PC Bruce: Talk/presentation

PS1SF pc bruce, police officer

FM7PSUNK Unknown speaker, other

FM7PSUGP Group of unknown speakers, other

[FMP] 15376 words from Planning and development in York: greenbelt planning - public enquiry

PS1TW 55, John, department of the environment adjudicator

PS1TX 58, Harry, deputy chairman

PS1TY 64, George, barrister

PS1U0 46, barrister

PS1U1 30, barrister

PS1U2 40, barrister

FMPPS000 unspecified

FMPPS001 unspecified

FMPPS002 unspecified

FMPPSUNK Unknown speaker, other

FMPPSUGP Group of unknown speakers, other

[FMS] 11933 words from Legal advice: pre-retirement course

PS1UD 50, solicitor

FMSPSUNK Unknown speaker, other

FMSPSUGP Group of unknown speakers, other

[FR2] 21677 words from An introduction to rural geography. Gilg, A Routledge & Kegan Paul plcLondon 1989 67-137

[FRY] 9266 words from The railway children: Oxford Bookworms edition. Nesbit, E Escott, JohnOxford University Press Oxford 1993

[FSB] 8817 words from The star zoo. Gilbert, H Oxford University Press Oxford 1992 1-55

[FU0] 16355 words from Dog-whelks: an introduction to the biology of nucella. Crothers, J H FieldStudies Council UK 1985

11

Page 12: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

[FU6] 23216 words from Rosencrantz and Guildenstern are dead. Stoppard, Tom Faber & Faber LtdLondon 1986 9-93

[FU7] 8185 words from Revolt in Roundhay [excerpt from Truth, Dare or Promise] Rowbotham, SheilaVirago Press Ltd London 1985

[FU9] 2961 words from Chaos. Muhamad, M A Holden, M V Manchester University Press Manchester1987 15-33

[FUG] 11104 words from Management training course

PS1U3 50+, Gordon, training manager

PS1U4 Brain

PS1U5 Mike

PS1U6 unspecified

PS1U7 Philip

PS1U8 Anthony

PS1U9 Thomas

PS1UA Jane

FUGPS000 unspecified

FUGPSUNK Unknown speaker, other

FUGPSUGP Group of unknown speakers, other

[FUH] 11792 words from Tutorial lesson: junior-level maths

PS1UE 50+, John, tutor

PS1UF 9, Kerry, student

FUHPSUNK Unknown speaker, other

FUHPSUGP Group of unknown speakers, other

[FUT] 8564 words from Presentation on consumer rights

PS1VF 60, retired trading standards officer

FUTPSUNK Unknown speaker, other

FUTPSUGP Group of unknown speakers, other

[FUU] 8033 words from Talk on fire prevention

PS1VG 55, Jack, retired fire prevention officer

FUUPS000 unspecified

FUUPSUNK Unknown speaker, other

FUUPSUGP Group of unknown speakers, other

[FX5] 9730 words from Radio Forth: radio broadcast

PS223 David, disc jockey, Other participants are radio listening phone ins.

FX5PS000 unspecified

FX5PS001 unspecified

FX5PS002 unspecified

FX5PS003 unspecified

12

Page 13: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

FX5PS004 unspecified

FX5PS005 unspecified

FX5PS006 unspecified

FX5PS007 unspecified

FX5PSUNK Unknown speaker, other

FX5PSUGP Group of unknown speakers, other

[FX6] 6824 words from Old Harlow W

PS224 Margaret, secretary

PS225 Vera

PS226 Gloria, Is the speaker.

PS227 Mildred

FX6PSUNK Unknown speaker, other

FX6PSUGP Group of unknown speakers, other

[FXR] 11419 words from Orchestral society committee meeting

PS23A 23, Greg, doctor

PS23B 49, John, civil servant

PS23C 47, Pauline, teacher

PS23D 50, John, retired teacher

PS23E 44, Julia, translator

PS23F 46, Liz, teacher

PS23G 58, Malcolm, retired school master

FXRPSUNK Unknown speaker, other

FXRPSUGP Group of unknown speakers, other

[FY8] 12735 words from Nottingham Oral History Project: interview

PS25M nun, social worker

PS25N interviewer

FY8PSUNK Unknown speaker, other

FY8PSUGP Group of unknown speakers, other

[FYJ] 11492 words from Nottinghamshire Oral History Project: talk

PS268 20, interviewer

PS269 70, retired, Ex. Political Activist

FYJPSUNK Unknown speaker, other

FYJPSUGP Group of unknown speakers, other

[G0A] 40300 words from The Crow Road. Banks, Iain Abacus London 1993 157-296

[G0C] 12384 words from A cure for unemployment. Kinnaird, Hugh Graham-Cameron PublishingCambridge 1985 1-40

[G0K] 15479 words from Design and management techniques for fire prevention in high bay ware-houses. National Materials Handling Centre Cranfield 1980 1-82

13

Page 14: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

[G11] 6131 words from Making cocoa for Kingsley Amis. Cope, Wendy Faber & Faber Ltd London1986 13-69

[G1V] 7524 words from The man with night sweats. Gunn, T Faber & Faber Ltd London 1992 1-75

[G22] 8733 words from Brownie. Girl Guides Association London 1991-12/1992-03

[G2R] 3170 words from [Vale of White Horse District Council: official leaflets] u.p.

[G3N] 15210 words from The laws of Occam programming. Hoare, C A R Roscoe, A W OxfordUniversity Computing Laboratory UK 1986 1-77

[G3U] 12798 words from Justice and Peace Group meeting

PS1VH 40, Charlie, traidcraft worker

PS1VJ 40, Moira, traidcraft worker

G3UPS000 unspecified

G3UPS001 unspecified

G3UPS002 unspecified

G3UPS003 unspecified

G3UPS004 unspecified

G3UPS005 unspecified

G3UPS006 unspecified

G3UPSUNK Unknown speaker, other

G3UPSUGP Group of unknown speakers, other

[G4K] 9381 words from Word-processing tutorial

PS1WS 50, John, tutor

PS1WT 45, Sue, secretary

PS1WU 16, Sarah, student

[G4N] 10262 words from Lockmaking in Willenhall: conversation with Dick Brown, lockmaker

PS21S m. glasson, interviewer

PS21T Dick, lockmaker

G4NPS000 unspecified

G4NPSUNK Unknown speaker, other

G4NPSUGP Group of unknown speakers, other

[G5A] 9269 words from Auction rooms

PS281 50, auctioneer

G5APSUNK Unknown speaker, other

G5APSUGP Group of unknown speakers, other

[G63] 8182 words from Oral history project interview

PS2CH 40+, Interviewing other three participants.

PS2CJ 60+, Hugh, retired shipbuilder

PS2CK 60+, Cathie, housewife

14

Page 15: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS2CL 60+, Paddy, retired shipbuilder

G63PSUNK Unknown speaker, other

G63PSUGP Group of unknown speakers, other

[GT9] 24405 words from The Dictionary of National Biography: Missing persons. Nicholls, C SOxford University Press Oxford 1993

[GUB] 22316 words from Curtains and blinds. Kittier, Eileen Ward Lock Ltd London 1990 6-70

[GUL] 6264 words from Taped. Barnes, Trevor Coronet Books London 1993 1-41

[GV1] 20066 words from Garden pools, waterfalls and fountains. Booth-Moores, Andrew Ward LockLtd London 1991 7-93

[GV9] 6021 words from The love of a king. Barnes, Trevor Dainty, Peter Oxford University PressOxford 1989 1-38

[GW5] 8765 words from Skyjack! Oxford Bookworms edition. Vicary, Tim Oxford University PressOxford 1989 1-54

[GWA] 5872 words from Voodoo Island. Duckworth, Michael Oxford University Press Oxford 19891-39

[GX0] 7193 words from [Miscellaneous prayers] Smith, David K u.p.

[GX4] 2488 words from [LASMO July 1992 – news releases] u.p.

[H0H] 6831 words from The initial destinations of 1984 information technology advanced coursestudents. Gordon, Alan Science & Eng Research Council UK 1984 1-37

[H0S] 14865 words from The online catalogue: developments & directions. Hildreth, Charles RLibrary Association Publishing Ltd London 1989 25-43

[H13] 1265 words from Excerpt from My favourite stories of Lakeland. Wyatt, John Lutterworth PressCambridge 1981 113-116

[H47] 11533 words from British Rail quality assurance seminar

PS1X2 45, Dennis, british rail employee

PS1X3 22, Clare, audio transcriber

PS1X4 45+, Trevor, british rail employee

PS1X5 55+, Norman, british rail employee

PS1X6 50+, Mike, consultant

H47PSUNK Unknown speaker, other

H47PSUGP Group of unknown speakers, other

[H4A] 9844 words from Labour Club public meeting

PS1XR 35, Steve, health worker

PS1XS 35, John, council worker

PS1XT 56, Jeff, retired

PS1XU 28, Ed, trade union officer

PS1XV 40, Graham, member of parliament

PS1XW 65+, Ronald, retired

15

Page 16: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS1XX 65+, Ida, pensioners’ movement representative

PS1XY 43, Alan, trade union representative

PS1Y0 35, Gwen, hosiery worker

H4APSUNK Unknown speaker, other

H4APSUGP Group of unknown speakers, other

[H5D] 9458 words from Careers Service: meeting

PS21B 30, Sue, careers advisor

PS21C 40, Keith, careers advisor

[H7C] 8034 words from Norther Ireland: the election of the Constitutional Convention May 1975.Knight, James The Arthur McDougall Fund UK 1975 7-17

[H8W] 14678 words from [Access course essays] Woodall, Julia R u.p.

[HDT] 13625 words from Trade Union Annual Congress: trade union congress

PS2FT trade unionist

PS2FU Alan, trade unionist

PS2FV Donald, trade unionist

PS2FW Dave, trade unionist

PS2FX Les, trade unionist

PS2FY Charlie, trade unionist

PS2G0 Steve, trade unionist

PS2G1 Sammy, trade unionist

PS2G2 Peter, trade unionist

HDTPSUNK Unknown speaker, other

HDTPSUGP Group of unknown speakers, other

[HE3] 5897 words from Panorama: the Branch Davidian sect: documentary

PS2R8 david koresh, religious leader, This is almost certainly David Koresh speaking, but

PS2R9 John, tv presenter

PS2RA Janet, Davidian

PS2RB Davidian

PS2RC Livingstone, Davidian

PS2RD Bonnie

PS2RE unspecified

PS2RF May well be an actor portraying a Davidian

PS2RG May well be an actor portraying a Davidian

HE3PSUNK Unknown speaker, other

HE3PSUGP Group of unknown speakers, other

[HE4] 4054 words from Assignment - the legacy of the Brazilian gold rush: documentary

PS2S7 40+, julian pettifer, tv presenter, Almost all his speech is scripted.

PS2S8 60+, Rubens, brazilian minister for the environment, First language is Brazilian Portuguese

16

Page 17: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS2S9 scientist, First language is probably Brazilian Portuguese

PS2SA Roberto, First language is Brazilian Portuguese

HE4PSUNK Unknown speaker, other

HE4PSUGP Group of unknown speakers, other

[HEM] 10019 words from Gwynedd County Council tape 4: interview for oral history project

PS2VR 67, John, farmer, Interviewee

PS2VS historian, Interviewer

[HLW] 11700 words from Trade Union Annual Congress

PS2K5 Dick, president of trade union

PS2K6 Bob, trade unionist

PS2K7 Bill, trade unionist

PS2K8 Avril, trade unionist

PS2K9 Mel, trade unionist

PS2KA Cliff, trade unionist

PS2KB John, trade unionist

HLWPS000 unspecified

HLWPS001 unspecified

HLWPSUNK Unknown speaker, other

HLWPSUGP Group of unknown speakers, other

[HM4] 12145 words from London Talkback Radio: radio broadcast

PS2N3 Douglas, radio presenter

PS2N4 Pam, sports presenter

PS2N5 Dave, radio presenter

PS2N6 Frances

PS2N7 Judy, astrologer

PS2N8 Humphrey

PS2N9 David

PS2NA Bob

PS2NB Madeline, writer

HM4PSUNK Unknown speaker, other

HM4PSUGP Group of unknown speakers, other

[HXN] 11358 words from European economic integration: Competition policy. Kemp, JohnMcDonald,FrankDearden, StephenLongman Group UK Ltd Harlow 1992 59-80

[HY1] 9678 words from Markets and dealers. Harrington, Richard Longman Group UK Ltd Harlow1992 61-81

[HYF] 6363 words from Pearson analyst presentation: analysts’ meeting

PS302 Michael

PS303 Frank, chief executive

17

Page 18: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS304 James, financial director

PS305 David

PS306 Mark

PS307 Mark

HYFPS000 unspecified

HYFPSUNK Unknown speaker, other

HYFPSUGP Group of unknown speakers, other

[J1L] 8137 words from Speeches by Tony Hall. Hall, Tony u.p.

[J1N] 2686 words from Scottish TV – sports news scripts. u.p.

[J24] 4845 words from Guinness brewing worldwide. u.p.

[J2G] 10482 words from Hombre. Leonard, Elmore Chivers Press UK 1989 1-45

[J2H] 6721 words from Chaos [2/2] Firth, W J Manchester University Press Manchester 1987 135-155

[J2J] 6869 words from Chaos [1/2] Sparrow, C Manchester University Press Manchester 1987 111-131

[J3W] 10494 words from Royal Yachting Association annual general meeting

PS3PC Rod, racing coach

PS3PD Peter, chairman

PS3PE John

PS3PF Chris

PS3PG unspecified

PS3PH Frank

PS3PJ Richard

PS3PK Bob

J3WPSUNK Unknown speaker, other

J3WPSUGP Group of unknown speakers, other

[J44] 9666 words from West Sussex Council: Highways Committee meeting

PS3S0 unspecified

J44PSUNK Unknown speaker, other

J44PSUGP Group of unknown speakers, other

[J55] 15608 words from Peter Grimes: Gloriana. Britten, Benjamin John Calder Ltd UK 1983 7-69

[J6W] 6950 words from Damages for personal injury and death: Damages on deathSaunt,ThomasKemp, DavidLongman Group UK Ltd Harlow 1993 52-68

[J8G] 11926 words from Life in Harlow: interview by Harlow Study Centre

PS1N3 unspecified

PS1N4 electrical retailer

PS1N5 unspecified

J8GPSUNK Unknown speaker, other

J8GPSUGP Group of unknown speakers, other

18

Page 19: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

[J97] 10965 words from British Market Research Bureau monthly meeting

PS3U2 John, managing director

PS3U3 Stella, director

PS3U4 John, departmental chairman

PS3U5 Richard, administrator

PS3U6 Alan, director

PS3U7 Rita, director

PS3UA Caroline, consultant

J97PSUNK Unknown speaker, other

J97PSUGP Group of unknown speakers, other

[JJA] 9634 words from Hertfordshire County Council: Environment Committee meeting

PS44R councillor

PS44S councillor

PS44T chairman

PS44U councillor

PS44V Tony, councillor

PS44W councillor

PS44X councillor

PS44Y York, councillor

PS450 councillor

JJAPSUNK Unknown speaker, other

JJAPSUGP Group of unknown speakers, other

[JJS] 10280 words from Bacons College: lesson

PS47D lecturer, no further information given

PS47E Berkam, pupil

PS47F teacher, no further information given

PS47G no further information given

PS47H David, pupil, no further information given

PS47J pupil

PS47K teacher

PS47L pupil

PS47M teacher

JJSPSUNK Unknown speaker, other

JJSPSUGP Group of unknown speakers, other

[JJV] 12394 words from O’Halloran v

PS480 Drake, judge

PS481 Cox

PS482 Bate

19

Page 20: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS483 Elaine

PS484 Clive, police inspector

PS485 Michael, detective constable

PS486 Denser

PS487 Michael, police superintendent

JJVPSUNK Unknown speaker, other

JJVPSUGP Group of unknown speakers, other

[JJW] 11591 words from O’Halloran v

PS488 Michael, police superintendent

PS489 Drake, judge

PS48A Denser, barrister

PS48B Isles, police sergeant

PS48C John, police sergeant

JJWPSUNK Unknown speaker, other

JJWPSUGP Group of unknown speakers, other

[JNG] 9997 words from Save The Children: meeting

PS4EG Sally, chair

PS4EH Janet, shop manageress

PS4EJ Kathy, area rep

PS4EK Jacquie, shop leader

PS4EL Elise, no further information given

PS4EM Ismar, student, no further information given

PS4EN Nicholas, director general

PS4EP Mike, chair

PS4ER Mark, no further information given

JNGPSUNK Unknown speaker, other

JNGPSUGP Group of unknown speakers, other

[JNM] 8318 words from House of Commons Select Committee for Defence: meeting

PS4F0 Perry

PS4F1 Michael

PS4F2 Gordon

PS4F3 John

PS4F4 Nicholas

JNMPSUNK Unknown speaker, other

JNMPSUGP Group of unknown speakers, other

[JXL] 4434 words from Nasser. Woodward, Peter Longman Group UK Ltd Harlow 1991 1-13

[KB1] 39285 words from 17 conversations recorded by ‘Albert’ (PS01A, R 57) between 1 and 6February 1992 with 9 interlocutors

20

Page 21: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS01A 53, Albert, unemployed, Central Northern England, self

PS01B 47, June, forecourt attendant, Central Northern England, wife

PS01C 70, Ada, retired, Central Northern England, mother-in-law

PS01D 18, Corrinne, student, Central Northern England, daughter

PS01E 39, Colin, unemployed, Central Northern England, brother-in-law

PS01F 38, Karen, housewife, Central Northern England, sister-in-law

PS01G 5, Sasha, student, Central Northern England, niece

PS01H 67, Rose, retired, Central Northern England, aunt-in-law

PS01J 46+, None, shopkeeper, Scottish, stranger

KB1PSUNK Unknown speaker, other

KB1PSUGP Group of unknown speakers, other

[KB2] 35089 words from 25 conversations recorded by ‘Alec’ (PS01T, R 60) between 31 January and7 February 1992 with 5 interlocutors

PS01T 68, Alec, retired, Central Northern England, self

PS01U 45, Ann, saleswoman, Central Northern England, friend

PS01V 63, Joyce, retired, Central Northern England, wife

PS01W Joan, retired, Central Northern England, friend

KB2PSUNK Unknown speaker, other

KB2PSUGP Group of unknown speakers, other

[KB3] 10492 words from 8 conversations recorded by ‘Alison’ (PS147, R 72) between 30 January and4 February 1992 with 4 interlocutors

PS147 43, Alison, housewife, Scottish, self

PS148 7, Grant, student (state primary), Scottish, son

PS149 53, Bob, deputy prison governor, Scottish, husband

PS14A 2, Annabel, student (private pre), Scottish, daughter

KB3PSUNK Unknown speaker, other

KB3PSUGP Group of unknown speakers, other

[KB8] 78798 words from 34 conversations recorded by ‘Ann2’ (PS14B, R 601) on 3 April 1992 with37 interlocutors

PS14B 53, Ann, registered childminder, Central Northern England, self

PS14C 57, James, charge nurse, North-east England, husband

PS14D 9, Rebecca, student, North-east England, friend

PS14E 8, Amy, student, North-east England, friend

PS14F 8, Matthew, student, North-east England, friend

PS14H 6, Richard, student, North-east England, friend

PS14K 6, Martin, student, Northern England, friend

PS14L 36, Dorothy, probation officer, Central Midlands, friend

PS14M 2, James, pre-school, North-east England, friend

PS14N 2, Bryony, pre-school, North-east England, friend

21

Page 22: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS14P 2, Richard, pre-school, North-east England, friend

PS14R 53, Isabelle, shop assistant, North-east England, neighbour

PS15B 10, Sally, student, North-east England, friend

PS15C 37, Andrea, teacher, North-east England, friend

PS15D 35+, Sid, teacher, North-east England, friend

PS15G 42, Marilyn, secretary, Northern England, friend

PS15S 55, Bill, clerk to town council, North-east England, friend

PS15U 58, Betty, housewife, North-east England, friend

PS15V 27, Paul, unemployed, North-east England, friend

PS166 38, Jackie, clerical officer, North-east England, neighbour

PS167 39, Pat, shop assistant, North-east England, husband’s niece

PS168 9, Chris, student, North-east England, husband’s great-niece

PS169 86, Margaret, retired, North-east England, mother-in-law

PS16F 7, Erin, student, North-east England, friend

PS16G 32, Kathy, housewife, Northern England, friend

PS16H 6, Martin, student, North-east England, friend

PS16S None, ?

PS172 Evelyn, ?

PS17G 30, Maggie, childminder, Northern England, friend

PS17L 70, Arthur, retired, Northern England, neighbour

PS17M 70, Tom, retired, Irish, neighbour

PS17N 35, Rosie, childminder, Northern England, friend

PS17S 60, Alan, baker, North-east England, acquaintance

PS18D 29, Louise, housewife, North-east England, friend

PS50S 35, Jean, housewife, Northern England, acquaintance

PS6NU 24, Alison, housewife, Northern England, neighbour

KB8PSUNK Unknown speaker, other

KB8PSUGP Group of unknown speakers, other

[KB9] 31762 words from 44 conversations recorded by ‘Annette’ (PS1CX, R 62) between 1 and ??February 1992 with 6 interlocutors

PS1CX 44, Annette, administrative assistant, Lancashire, self

PS1CY 19, Teresa, stable hand, Lancashire, daughter

PS1D0 26, David, engineer, Lancashire, son-in-law

PS1D1 24, Tracy, housewife, Lancashire, daughter

PS1D2 71, Donald, retired, Northern England, father

PS1KS 48+, Pat, Lancashire, neighbour

KB9PSUNK Unknown speaker, other

KB9PSUGP Group of unknown speakers, other

[KBF] 105262 words from 103 conversations recorded by ‘Brenda’ (PS04U, R 22) between 28November and 4 December 1991 with 6 interlocutors

22

Page 23: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS04U 34, Brenda, housewife, London, self

PS04V 37, Dave, hgv driver, London, husband

PS04W 15, Lee, student (state secondary), London, son

PS04X 11, Carla, student (state primary), London, daughter

PS04Y 62, Jean, housewife, Home Counties, friend

PS050 23, Nigel, plumber, London, plumber

KBFPSUNK Unknown speaker, other

KBFPSUGP Group of unknown speakers, other

[KBG] 28812 words from 10 conversations recorded by ‘Carl’ (PS051, R 104) between 21 and 27February 1992 with 9 interlocutors

PS051 34, Carl, teacher, North-east England, self

PS052 33, Susan, teacher (pt), Home Counties, wife

PS053 4, Christopher, pre-school, son

PS054 1, Rebecca, pre-school, daughter

PS055 7, Andrew, student (primary), son

PS056 9, Robert, student (secondary), son

PS057 58, Peter, lorry driver, East Anglia, neighbour

PS058 65, Alan, retired, East Anglia, neighbour

PS059 62, Pat, retired, Home Counties, father-in-law

KBGPSUNK Unknown speaker, other

KBGPSUGP Group of unknown speakers, other

[KBK] 53920 words from 228 conversations recorded by ‘Chris’ (PS05X, R 500) between 31 May and1 June 1991 with 8 interlocutors

PS05X 51, Chris, export merchant, Home Counties, self

PS101 54, Norrine, teacher, London, wife

PS102 44, Lynda, export assistant, London, colleague

PS103 30, Susan, cleaner, London, employee

PS104 30+, Bill, plastics company director, London, neighbour

PS106 60+, Tony, retired, London, neighbour

PS107 45, David, property manager, London, neighbour

PS108 60+, Evelyn, retired, London, stranger

KBKPSUNK Unknown speaker, other

KBKPSUGP Group of unknown speakers, other

[KBL] 27294 words from 40 conversations recorded by ‘Cherrilyn’ (PS06A, R 115) between 21 and26 February 1992 with 9 interlocutors

PS06A 43, Cherrilyn, nursing auxiliary, Lower South-west England, self

PS06B 13, Jessica, student (state secondary), Lower South-west England, daughter

PS06C None, ?

PS06D 21, Sophie, student, Lower South-west England, daughter

23

Page 24: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS06E 12, Kate, student, Lower South-west England, friend

PS06F 43, Brian, mortgage broker, Lower South-west England, husband

PS06G 33, Fiona, matron of nursing home, Central South-west England, colleague

PS06H 16, Lorraine, care assistant, Home Counties, colleague

PS06J 70, None, Lower South-west England, stranger

KBLPSUNK Unknown speaker, other

KBLPSUGP Group of unknown speakers, other

[KBU] 11022 words from 2 conversations recorded by ‘Denise’ (PS18E, R 604) between 6 and 7 April1992 with 8 interlocutors

PS18E 25, Denise, student nurse, Central Northern England, self

PS18H None, ?

PS18J None, ?

PS18K None, ?

PS18L None, ?

PS18M None, ?

PS18N None, ?

KBUPSUNK Unknown speaker, other

KBUPSUGP Group of unknown speakers, other

[KBX] 28273 words from 11 conversations recorded by ‘Donald2’ (PS1DW, R 43) between 17 and 10January 1992 with 6 interlocutors

KBXPS000 None, North-west Midlands, ?

KBXPS001 None, North-west Midlands, ?

PS1DW 56, Donald, taxi driver, North-west Midlands, self

PS1DX 60, Anna, housewife, European (Russian), friend

PS1DY 60, Mary, housewife, European (Russian), friend

PS1E0 50, Barry, North-west Midlands, friend

KBXPSUNK Unknown speaker, other

KBXPSUGP Group of unknown speakers, other

[KC0] 45900 words from 14 conversations recorded by ‘Enid’ (PS08Y, R 103) between 21 and 27February 1992 with 10 interlocutors

PS08Y 70, Enid, housewife, London, self

PS090 79, Noel, retired (master organ builder), London, husband

PS091 89, Mollie, retired (musician), London, friend

PS092 62, Ann, retired (teacher), friend

PS093 40, John, butcher, East Anglia, friend

PS094 46, Pam, shop assistant, East Anglia, friend

PS095 95, Stanley, retired (farmer), East Anglia, friend

PS096 50, Patsy, computer programmer, friend

PS097 34, Ian, advertising executive, friend

24

Page 25: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

KC0PSUNK Unknown speaker, other

KC0PSUGP Group of unknown speakers, other

[KC1] 20607 words from 32 conversations recorded by ‘Frank’ (PS09E, R 116) between 21 and 28February 1992 with 9 interlocutors

PS09E 50, Frank, tv engineer, Central South-west England, self

PS09F 48, Lyn, housewife, Lower South-west England, wife

PS09G 44, Steve, salesman, London, colleague

PS09H 46, Gerry, tv dealer, Central South-west England, customer

PS09K 60, Rex, tv dealer, Lower South-west England, customer

PS09L 58, Marge, tv dealer, Lower South-west England, customer

PS09M 4, Jonny, pre-school, Lower South-west England, son

PS09N 55, Brian, salesman, London, friend

KC1PSUNK Unknown speaker, other

KC1PSUGP Group of unknown speakers, other

[KC2] 44626 words from 28 conversations recorded by ‘Fred’ (PS09T, R 504) between 13 and 20March 1992 with 9 interlocutors

PS09T 77, Fred, retired, North-east Midlands, self

PS09U 71, Florence, retired, North-east Midlands, wife

PS09V 41, David, painter/decorator, North-east Midlands, son

PS09W 32, Clive, unemployed, North-east Midlands, son-in-law

PS09X 36, Marjorie, housewife, North-east Midlands, daughter

PS09Y 11, Gemma, student, North-east Midlands, granddaughter

PS0A0 37, Steven, office manager, North-east Midlands, son

PS0A1 7, Emily, student, North-east Midlands, granddaughter

PS0A2 39, Sandra, housewife, North-east Midlands, daughter-in-law

KC2PSUNK Unknown speaker, other

KC2PSUGP Group of unknown speakers, other

[KC3] 36931 words from 15 conversations recorded by ‘Frederick’ (PS0A8, R 11) between 10 and 15January 1992 with 10 interlocutors

PS0A8 66, Frederick, retired (social worker), Central Midlands, self

PS0A9 49, Janet, health visitor, Central Midlands, friend

PS0AA 49, Peter, chemist, Home Counties, friend

PS0AB 68, Joan, housewife, Central Midlands, friend

PS0AC 40, Vicki, teacher, South Midlands, friend

PS0AD 15, Heidi, schoolgirl, Central Midlands, friend

PS0AE 40, Yvonne, clerk, Central Midlands, colleague

PS0AF 30, Trish, physiotherapist, Central Midlands, colleague

PS0AG 61, Eileen, cleaner, Irish, colleague

KC3PSUNK Unknown speaker, other

25

Page 26: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

KC3PSUGP Group of unknown speakers, other

[KC4] 23715 words from 25 conversations recorded by ‘Fred2’ (PS1E4, R 333) between 31 May and5 June 1991 with 9 interlocutors

PS1E4 78, Fred, retired, North-west Midlands, self

PS1E5 None, ?

PS1E6 None, ?

PS1E7 None, ?

PS1E8 None, ?

PS1E9 None, ?

PS1EA None, ?

PS1EB None, ?

PS1EC None, ?

KC4PSUNK Unknown speaker, other

KC4PSUGP Group of unknown speakers, other

[KC7] 15279 words from 7 conversations recorded by ‘Gill’ (PS0BK, R 32) on 15 January 1992 with5 interlocutors

PS0BK 21, Gill, waitress, Home Counties, self

PS0BL 22, Jemma, unemployed, Home Counties, friend

PS0BS 18, Nancy, business student, London, friend

PS0BT 21, Jim, student, Home Counties, friend

KC7PSUNK Unknown speaker, other

KC7PSUGP Group of unknown speakers, other

[KC8] 18878 words from 11 conversations recorded by ‘Gillian’ (PS0BY, R 29) between 29 Novemberand 6 December 1991 with 6 interlocutors

PS0BY 32, Gillian, housewife, East Anglia, self

PS0C1 34, Robert, computer programmer, East Anglia, husband

PS0C4 None, ?

PS0C5 60, Dennis, retired, London, friend

PS0C6 61, Iris, retired, London, friend

PS0C7 40+, Bonnie, housewife, East Anglia, friend

KC8PSUNK Unknown speaker, other

KC8PSUGP Group of unknown speakers, other

[KCA] 21431 words from 22 conversations recorded by ‘Gordon’ (PS0DL, R 24) between 10 and 13January 1992 with 8 interlocutors

PS0DL 32, Gordon, unemployed, Welsh, self

PS0DM 29, Debbie, housewife, Welsh, wife

PS0DN 29, Hayley, housewife, Welsh, cousin

PS0DP 32, Lyn, housewife, Welsh, friend

PS0DR 56, Tom, factory worker, Welsh, friend

26

Page 27: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS0DT 6, Sean, student (state primary), Welsh, son

PS0DU 3, Kirsty, student (state pre), Welsh, daughter

KCAPSUNK Unknown speaker, other

KCAPSUGP Group of unknown speakers, other

[KCB] 15326 words from 13 conversations recorded by ‘Graeme’ (PS0DX, R 122) between 22 and 26February 1992 with 7 interlocutors

PS0DX 62, Graeme, company secretary, New Zealand, self

PS0DY 58, Ann, housewife, wife

PS0E2 23, Sarah, sales representative, daughter

PS0E3 25, Antony, car salesman, son-in-law

PS0E4 60+, Joyce, housewife, friend

PS0E5 53, Hazel, coffee shop assistant, friend

PS0E6 57, Zara, coffee shop owner, friend

KCBPSUNK Unknown speaker, other

KCBPSUGP Group of unknown speakers, other

[KCE] 50776 words from 24 conversations recorded by ‘Helena’ (PS0EB, R 2) between 12 and 20March 1992 with 9 interlocutors

PS0EB 16, Helena, student, North-east Midlands, self

PS0EC 16, Emma, student, Upper South-west England, friend

PS0ED 41, Sheila, driving instructor, North-east Midlands, mother

PS0EE 17, David, student, North-east Midlands, brother

PS0EF 13, Joanne, student, Upper South-west England, friend

PS0EG 18, Andy, insurance clerk, Upper South-west England, friend

PS0EH 19, Scott, student, Upper South-west England, friend

PS0EJ 18, mark/shrimpy, spring inspector, Upper South-west England, friend

PS0EK 16, Susan, student, Upper South-west England, friend

KCEPSUNK Unknown speaker, other

KCEPSUGP Group of unknown speakers, other

[KCG] 28227 words from 20 conversations recorded by ‘Jane’ (PS19L, R 609) between 2 and 9 April1992 with 10 interlocutors

PS19L 33, Jane, housewife, North-west Midlands, self

PS19M 38, John, hgv driver, North-west Midlands, husband

PS19N 7, Stacey, student (state primary), North-west Midlands, daughter

PS19P 41, Chris, barmaid, North-west Midlands, sister-in-law

PS19R 65+, Madge, housewife, North-west Midlands, sister-in-law’s mother

PS19S 65+, Margaret, housewife, North-west Midlands, friend

PS19T 65+, Bill, retired, North-west Midlands, sister-in-law’s father

PS19U 30, Lynn, housewife, North-west Midlands, friend

PS19V 64, Mary, housewife, North-west Midlands, mother-in-law

27

Page 28: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

KCGPSUNK Unknown speaker, other

KCGPSUGP Group of unknown speakers, other

[KCH] 37385 words from 19 conversations recorded by ‘Jane2’ (PS1BS, R 622) between 4 and 10April 1992 with 7 interlocutors

PS1BS 40, Jane, optician/student, Humberside, self

PS1BT 46, Phillip, chartered engineer, Humberside, husband

PS1BU 9, Christopher, student (state primary), Humberside, son

PS1BV 6, David, student (state primary), Humberside, son

PS1BW 27, Julie, administrative officer, Central Northern England, babysitter

PS1BX 24, Susan, teacher, Central Northern England, child’s teacher

PS6R1 42, Janet, clerk, Central Northern England, colleague

KCHPSUNK Unknown speaker, other

KCHPSUGP Group of unknown speakers, other

[KCL] 32729 words from 32 conversations recorded by ‘John’ (PS0F8, R 4) between 14 and ?? March1992 with 6 interlocutors

PS0F8 53, John, engineer, self

PS0F9 39, Mary, nurse, Irish, wife

PS0FA 18, Brian, apprentice engineer, son

PS0FB 16, Patricia, student, daughter

PS0FC 25, Ded, cashier, West Midlands, daughter

PS0FD 40, Lyn, housewife, West Midlands, friend

KCLPSUNK Unknown speaker, other

KCLPSUGP Group of unknown speakers, other

[KCN] 45451 words from 55 conversations recorded by ‘Josephine’ (PS0FF, R 512) between 12 and20 March 1992 with 8 interlocutors

PS0FF 54, Josephine, housewife, Welsh, self

PS0FG 84, Tina, retired, Welsh, mother

PS0FH 57, Danny, disabled unemployed, Welsh, husband

PS0FJ 32, Susan, housewife, Welsh, daughter

PS0FK 34, Steve, ship’s captain, North-east Midlands, son-in-law

PS0FL 25, Mark, assistant manager, Welsh, son

PS0FM 9, Ricky, student, Welsh, grandson

PS0FN 30, Sally, housewife, Welsh, daughter

KCNPSUNK Unknown speaker, other

KCNPSUGP Group of unknown speakers, other

[KCT] 97005 words from 29 conversations recorded by ‘June’ (PS0FP, R 30) between 29 Novemberand 5 December 1991 with 17 interlocutors

PS0FP 40, June, dinner lady (pt), East Anglia, self

PS0FR 10, Jonathan, student (state primary), East Anglia, son

28

Page 29: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS0FS 40, Geoffrey, chargehand, East Anglia, husband

PS0FU 30, Wendy, housewife, East Anglia, friend

PS0FX 40, Susan, dental nurse, East Anglia, sister

PS0G0 76, A., retired, East Anglia, mother-in-law

PS0G1 78, L., retired, East Anglia, father-in-law

PS0G2 70, Kathleen, retired, London, mother

PS0G3 67, Robert, retired, London, father

PS0G4 50, Brian, plumber, London, neighbour

PS0G5 47, Val, housewife, East Anglia, sister-in-law

PS0G7 23, Sarah, hairdresser, East Anglia, niece

PS0G8 44, Val, shopkeeper, London, stranger

PS0G9 43, Katt, unemployed, London, friend

PS0GA 47, Sue, housewife, East Anglia, friend

PS0GE 10, Ashley, student, East Anglia, son’s friend

KCTPSUNK Unknown speaker, other

KCTPSUGP Group of unknown speakers, other

[KCU] 49751 words from 9 conversations recorded by ‘Julie’ (PS0GF, R 114) between 20 and 22February 1992 with 6 interlocutors

PS0GF 24, Julie, housewife, Lower South-west England, self

PS0GG 25, Gary, storesperson, Lower South-west England, husband

PS0GH 2, Vicki, pre-school, Lower South-west England, daughter

PS0GJ 20, Shelly, housewife, Lower South-west England, cousin-in-law’s wife

PS0GK 25, Phil, cold store worker, Lower South-west England, cousin-in-law

PS0GL 4, Sammy, pre-school, Lower South-west England, cousin-in-law’s son

KCUPSUNK Unknown speaker, other

KCUPSUGP Group of unknown speakers, other

[KCV] 32714 words from 50 conversations recorded by ‘Katherine’ (PS0H7, R 444) between 2 and 5June 1991 with 3 interlocutors

PS0H7 57, Katherine, housewife, European (German), self

PS125 56, Patrick, company secretary, husband

PS126 25, Stefan, student, European (German), friend

KCVPSUNK Unknown speaker, other

KCVPSUGP Group of unknown speakers, other

[KCX] 60332 words from 25 conversations recorded by ‘Kathleen2’ (PS1FC, R 59) between 30January and 7 February 1992 with 9 interlocutors

PS1FC 37, Kathleen, crossing warden, Central Northern England, self

PS1FD 54, Flo, youth worker, North-east England, friend

PS1FE 42, Maggie, housewife, London, friend

PS1FF 40+, None, taxi driver, Central Northern England, stranger

29

Page 30: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS1FG 40+, Alice, smallholder, Central Northern England, friend

PS1FH 39, Steve, electrician, Central Northern England, husband

PS1FJ 2, Krista, pre-school, Central Northern England, friend’s granddaughter

PS1FK 30+, Enid, Central Northern England, colleague

KCXPSUNK Unknown speaker, other

KCXPSUGP Group of unknown speakers, other

[KCY] 24051 words from 11 conversations recorded by ‘Keith’ (PS0H9, R 13) between 10 and 15January 1992 with 10 interlocutors

PS0H9 36, Keith, telecommunication engineer, Central Midlands, self

PS0HA 33, Richard, telecommunication engineer, Central Midlands, colleague

PS0HB 34, Russell, bakery worker, Central Midlands, friend

PS0HC 35, Michael, engineer, Central Midlands, friend

PS0HD 42, Kathleen, nurse, Central Midlands, sister

PS0HE 42, Terence, engineer, Central Midlands, brother-in-law

PS0HF 72, Joseph, retired, Central Midlands, father

PS0HG 74, Eileen, housewife, Central Midlands, mother

PS0HH 45, Dave, engineer, North-east Midlands, colleague

KCYPSUNK Unknown speaker, other

KCYPSUGP Group of unknown speakers, other

[KD0] 77692 words from 106 conversations recorded by ‘Kevin’ (PS0HM, R 26) between 29 Novemberand 5 December 1991 with 14 interlocutors

PS0HM 41, Kevin, draughtsman, London, self

PS0HN 12, Paul, student (state secondary), London, son

PS0HP 40, Ruth, teacher, wife

PS0HR 29, Michelle, local government officer, European (French), sister-in-law

PS0HS 74, Eric, retired, London, father

PS0HT 40, Adrian, salesman, London, brother-in-law

PS0HU 41, Karen, secretary, London, sister

PS0HV 33, Andrew, local government officer, London, brother

PS0HW 13, Lisa, student, London, niece

PS0HX 70+, babs (aka mutty), retired, Lower South-west England, brother-in-law’s mother

PS0HY 70+, Joy, retired, London, mother

PS0J0 15, Michael, student, London, nephew

PS1KN None, ?

KD0PSUNK Unknown speaker, other

KD0PSUGP Group of unknown speakers, other

[KD1] 40487 words from 14 conversations recorded by ‘Larna’ (PS0JA, R 107) between 20 and 24February 1992 with 8 interlocutors

PS0JA 25, Larna, housewife, self

30

Page 31: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS0JB 4, Charlotte, student (state pre), daughter

PS0JC 29, Pauline, housewife, sister-in-law

PS0JD Vicky, ?

PS0JE 25, Alex, secretary, friend

PS0JF 3, Anthony, pre-school, son

PS0JG 45, David, lorry driver, stepfather

PS0JH 3, Aaron, nephew

KD1PSUNK Unknown speaker, other

KD1PSUGP Group of unknown speakers, other

[KD2] 19822 words from 22 conversations recorded by ‘Linda’ (PS0J1, R 61) between 30 January and2 February 1992 with 9 interlocutors

PS0J1 20, Linda, trainee typist, Central Northern England, self

PS0J2 60+, Alice, retired, Central Northern England, neighbour

PS0J3 45, Dave, disabled unemployed, Central Northern England, father

PS0J4 51, Ivy, housewife, Central Northern England, mother

PS0J5 21, David, plasterer, Central Northern England, brother

PS0J6 25, Brian, plasterer, Central Northern England, brother

PS0J7 23, Tracey, trainee typist, Central Northern England, sister

PS0J8 50, Peter, salesman, Central Northern England, acquaintance

PS0J9 45, Margaret, shop assistant, Central Northern England, acquaintance

KD2PSUNK Unknown speaker, other

KD2PSUGP Group of unknown speakers, other

[KD3] 33516 words from 40 conversations recorded by ‘Lisa’ (PS0JJ, R 502) between 12 and 20March 1992 with 7 interlocutors

PS0JJ 30, Lisa, housewife, South Midlands, self

PS0JK 0, Peter, pre-school, son

PS0JL 29, Melvin, self-employed panel beater, husband

PS0JP 27, Diane, housewife, South Midlands, sister-in-law

PS0JR 52, Anne, maggot farm manager, South Midlands, aunt

PS0JS 27, Cheryl, housewife, neighbour

PS0JT 2, Ben, pre-school, neighbour’s son

KD3PSUNK Unknown speaker, other

KD3PSUGP Group of unknown speakers, other

[KD5] 54036 words from 55 conversations recorded by ‘Mark’ (PS0JX, R 105) between [dateunknown] and 26 February 1992 with 25 interlocutors

PS0JX 27, Mark, technician, self

PS0JY 27, Sue, nurse, wife

PS0K0 60, Dad, retired, father

PS0K1 50, Mum, nurse, European (Dutch), mother

31

Page 32: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS0K2 29, Isobel, secretary, sister

PS0K3 60+, Derek, father-in-law

PS0K4 60+, Pauline, mother-in-law

PS0K5 30, Spencer, manager, brother

PS0K6 30, Sally, teacher, sister-in-law

PS0K7 28, Norman, technician, colleague

PS0K8 26, Bedge, engineer, colleague

PS0K9 30+, Yun, technician, Chinese, colleague

PS0KA 30+, Ian, engineer, colleague

PS0KB 20+, Justin, technician, colleague

PS0KC 30+, John, technician, colleague

PS0KD 20+, Paul, technician, colleague

PS0KE 28, Carol, clerk, colleague

PS0KF 25, Dave, technician, colleague

PS0KG 40+, Daphney, secretary, colleague

PS0KH 40+, Collin, engineer, colleague

PS0KJ 20+, Andy, engineer, Scottish, colleague

PS0KK 40+, Frank, manager, European (German), colleague

PS0KL 30+, Paul, technician, colleague

PS0KM 40+, George, engineer, colleague

KD5PSUNK Unknown speaker, other

KD5PSUGP Group of unknown speakers, other

[KD6] 39227 words from 19 conversations recorded by ‘Martin’ (PS0KN, R 4) between 11 and 16January 1992 with 8 interlocutors

PS0KN 17, Martin, student, Midlands, self

PS13K 37, Lynn, nurse (pt), Midlands, mother

PS13L 3, Robert, student (state pre), Midlands, brother

PS13M 40, Dave, unemployed, Central South-west England, step-father

PS13N 13, Geoff, student (state secondary), Midlands, brother

PS13P 17, Rich, student, Midlands, friend

PS13R 19, Scott, mechanic, Midlands, friend

PS13S 20, Sarah, clerk, Midlands, friend

KD6PSUNK Unknown speaker, other

KD6PSUGP Group of unknown speakers, other

[KD8] 76445 words from 31 conversations recorded by ‘Martine’ (PS0LK, R 524) between 12 and 20March 1992 with 10 interlocutors

PS0LK 25, Martine, senior technician, Welsh, self

PS0LL 28, Mike, construction worker, Welsh, husband

PS0LM 55, Merielle, housewife, Welsh, mother

PS0LN 45, None, pub landlord, Home Counties, stranger

32

Page 33: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS0LP 58, Harold, engineer, Welsh, father

PS0LR 76, Nora, housewife, Welsh, grandmother-in-law

PS0LS 45, Will, civil engineer, Merseyside, colleague

PS0LT 40, Michael, technical director, Home Counties, colleague

PS0LU 27, Jim, technician, Home Counties, colleague

KD8PSUNK Unknown speaker, other

KD8PSUGP Group of unknown speakers, other

[KDH] 8207 words from 13 conversations recorded by ‘Paul’ (PS0MX, R 807) between 10 and 16April 1992 with 5 interlocutors

PS0MX 32, Paul, baker, Irish, self

PS0MY 12, Francis, student (state secondary), Irish, son

PS0N0 8, Lemar, student (state primary), Irish, son

PS0N1 32, Nora, stitcher, Irish, wife

KDHPSUNK Unknown speaker, other

KDHPSUGP Group of unknown speakers, other

[KDM] 108739 words from 103 conversations recorded by ‘Raymond’ (PS0PN, R 523) between 13and 19 March 1992 with 46 interlocutors

PS0PN 57, Raymond, retired, North-west Midlands, self

PS0PP 55, Margaret, retired, London, wife

PS0PR 61, Brian, technician, North-west Midlands, colleague

PS0PS 50, Shirley, housewife, North-west Midlands, colleague

PS0PT 50, Brian, photographer, North-west Midlands, colleague

PS0PU 54, Eric, retired, North-west Midlands, colleague

PS0PV 45, Chris, housewife, North-west Midlands, colleague

PS0PW 46, John, driver, Merseyside, colleague

PS0PX 70, Paula, housewife, London, colleague

PS0PY 66, Gwyneth, housewife, North-west Midlands, colleague

PS0R0 35, Bruce, technician, Merseyside, colleague

PS0R1 57, Joan, tutor, North-west Midlands, colleague

PS0R2 31, Joanna, upholsterer, Home Counties, colleague

PS0R3 54, Arthur, driver, Merseyside, colleague

PS0R4 36, Paul, policeman, North-west Midlands, son

PS0R5 27, Cathy, laboratory technician, North-west Midlands, daughter-in-law

PS0R6 11, Nik, boys brigade, Welsh, stranger

PS0R7 60, Dorothy, machinist, Welsh, colleague

PS0R8 6, Laura, student, North-west Midlands, granddaughter

PS0R9 5, Scott, student, North-west Midlands, grandson

PS0RA 8, Emily, student, North-west Midlands, granddaughter

PS0RB 31, Carrie, housewife, North-west Midlands, daughter

PS0RC 61, Ken, retired, North-west Midlands, friend

33

Page 34: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS0RD 61, Norman, joiner, North-west Midlands, neighbour

PS0RE 37, Molly, housewife, North-west Midlands, colleague

PS0RF 60, John, schoolmaster, North-west Midlands, colleague

PS0RG 45, George, tutor, Central Midlands, colleague

PS0RH 43, Nanette, housewife, European (Dutch), colleague

PS0RJ 37, Victoria, housewife, United States, colleague

PS0RK 61, Arthur, storeman, South Midlands, friend

PS0RL 40, Glyn, auction worker, Welsh, colleague

PS0RM 63, Hughes, chemist, Central Midlands, stranger

PS0RN 27, Hayley, chemist, North-west Midlands, friend

PS0RP 62, Hilary, housewife, North-west Midlands, neighbour

PS0RR 42, Phyllis, secretary, North-west Midlands, friend

PS0RS 22, Louise, typist, North-west Midlands, friend

PS0RT 61, Tim, dentist, North-west Midlands, colleague

PS0RU 62, Ted, retired, Merseyside, colleague

PS0RV 57, Pat, housewife, London, colleague

PS0RW 48, Mike, restorer, North-west Midlands, colleague

PS0RX 27, Tony, remover, Merseyside, colleague

PS0RY 64, Alice, housewife, North-west Midlands, colleague

PS0S0 45, Brian, salesman, North-west Midlands, friend

PS0S1 61, Mary, housewife, North-west Midlands, colleague

PS0S2 42, Teresa, housewife, North-west Midlands, colleague

KDMPSUNK Unknown speaker, other

KDMPSUGP Group of unknown speakers, other

[KDN] 43638 words from 90 conversations recorded by ‘Raymond2’ (PS1HH, R 802) between 15 and17 April 1992 with 10 interlocutors

PS1HH 53, Raymond, unemployed, Irish, self

PS1HJ 45, Jean, housewife, Irish, wife

PS1HK 3, Kylie, pre-school, Irish, neighbour

PS1HL 34, Paul, window cleaner, Irish, neighbour

PS1HM 43, Mary, shop assistant, Irish, stranger

PS1HN 10, Kelly Ann, student, Irish, niece

PS1HP 11, Stephen, student, Irish, nephew

PS1HR 39, John, painter, Irish, brother-in-law

PS1HS 20, Raymond, painter, Irish, son

KDNPSUNK Unknown speaker, other

KDNPSUGP Group of unknown speakers, other

[KDU] 8038 words from 18 conversations recorded by ‘Sam’ (PS0S3, R 803) between 15 and 21 April1992 with 7 interlocutors

PS0S3 58, Sam, company director, Irish, self

34

Page 35: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS0S4 56, George, clerk, Irish, brother-in-law

PS0S5 60, Betty, Irish, sister-in-law

PS0S6 51, Diana, housewife, Irish, wife

PS0S8 26, Shirley, bank official, Irish, daughter

PS0S9 61, Bill, retired, Irish, friend

PS0SA 55, Margaret, typist (pt), Irish, friend

KDUPSUNK Unknown speaker, other

KDUPSUGP Group of unknown speakers, other

[KE3] 62395 words from 150 conversations recorded by ‘Tony’ (PS0V4, R 12) between 28 Novemberand 5 December 1991 with 9 interlocutors

PS0V4 37, Tony, civil service, Central South-west England, self

PS0V5 35, Jackie, civil servant (pt), Central South-west England, wife

PS0V6 11, Christine, student (state secondary), Central South-west England, daughter

PS0V7 60+, Rene, retired, Central South-west England, mother-in-law

PS0V8 59, Mum, shop assistant, Central South-west England, mother

PS0V9 29, Vicky, sales assistant, Central South-west England, friend

PS0VA 40, Dot, civil servant, Central South-west England, colleague

PS0VB 37, Martin, civil servant, Central South-west England, colleague

PS0VC 50, Margaret, civil servant, Central Northern England, colleague

KE3PSUNK Unknown speaker, other

KE3PSUGP Group of unknown speakers, other

[KP6] 33704 words from 10 conversations recorded by ‘Catriona’ (PS52C, R 733) on ?? ?? 1993 with9 interlocutors

PS52C 16, Catriona, student, London, self

PS52D 16, Phil, student, London, friend

PS52E 16, Sal, student, London, friend

PS52F 16, Jess, student, London, friend

PS52G 16, Lucy, student, London, friend

PS52H 16, Zoe, student, London, friend

PS52J 16, Duncan, student, London, friend

PS52K Father, artist, London, father

PS52L Mother, artist, London, mother

KP6PSUNK Unknown speaker, other

KP6PSUGP Group of unknown speakers, other

[KP8] 20442 words from 7 conversations recorded by ‘Christopher’ (PS52T, R 74) between 30 Januaryand 8 February 1992 with 6 interlocutors

PS52T 33, Christopher, civil servant, Scottish, self

PS52U 33, Wendy, nurse (pt), Scottish, wife

PS52V 5, Jonathan, student (state primary), Scottish, son

35

Page 36: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS52W 3, Michael, student (state pre), Scottish, son

PS52X 60, Norma, retired (physiotherapist), Scottish, mother

PS52Y 72, Hazel, housewife, London, mother-in-law

PS530 65+, None, watchtower representative, Scottish, stranger

KP8PSUNK Unknown speaker, other

KP8PSUGP Group of unknown speakers, other

[KPD] 4826 words from 5 conversations recorded by ‘Gary’ (PS549, R 70) on 30 January 1992 with 5interlocutors

PS549 36, Gary, shunter, Scottish, self

PS54A 11, Laura, student, Scottish, daughter

PS54B 8, Karen, student, Scottish, daughter

PS54C 33, Lilias, shop assistant, Scottish, wife

PS54D 73, Jake, retired, Scottish, father-in-law

PS54E 71, Lilias, retired, Scottish, mother-in-law

KPDPSUNK Unknown speaker, other

KPDPSUGP Group of unknown speakers, other

[KPG] 42330 words from 39 conversations recorded by ‘Josie’ (PS555, R 702) [dates unknown] with34 interlocutors

PS555 14, Josie, student, London, self

PS556 15, Shelley, student, London, friend

PS557 15, Kerry, student, London, friend

PS558 15, Grace, student, London, friend

PS559 16, Daniel, student, London, friend

PS55A 13, Truno, student, London, friend

PS55B 12, Sean, student, London, friend

PS55C 12, Petro, student, London, friend

PS55D 10, Raphael, student, London, friend

PS55E 17, Vergina, student, London, friend

PS55F 13, Tina, student, London, friend

PS55G Greta, housewife, friend

PS55H Jane, housewife, London, friend

PS55J Madigan, teacher, teacher

PS55K 15, Alice, student, London, friend

PS55L 14, Kate, student, London, friend

PS55M 15, James, student, London, friend

PS55N 15, Peter, student, London, friend

PS55P Chris, housewife, friend

PS55R 16, Ozzi, student, friend

PS55S 15, Wesley, student, London, friend

PS5AG None, teacher, London, teacher

36

Page 37: BNC Sampler: XML editionMonologue 167714 (16.92%) 7196 (9.42%) 18 Dialogue 822990 (83.07%) 69122 (90.57%) 80 Total 990704 76318 98 Table 8: Region where spoken words s-units texts

3 SOURCE TEXTS

PS5AH 15, Cassie, student, London, friend

PS5AJ None, ?

PS6R3 15, Warren, student, London, friend

PS6R4 12, Jesica, student, London, friend

PS6R5 15, Andrew, student, London, friend

PS6R6 16, Karen, student, London, friend

PS6R7 None, London, stranger

PS6TS None, ?

PS6TT None, ?

PS6TU None, ?

PS6TY None, ?

KPGPSUNK Unknown speaker, other

KPGPSUGP Group of unknown speakers, other

[KST] 31800 words from 12 conversations recorded by ‘Margaret2’ (PS6RG, R 120) between 20 and27 February 1992 with 7 interlocutors

PS6RG 50, Margaret, housewife, Central South-west England, self

PS6TD None, ?

PS6TE None, ?

PS6TF None, ?

PS6TG None, ?

PS6TH None, ?

PS6TJ None, ?

KSTPSUNK Unknown speaker, other

KSTPSUGP Group of unknown speakers, other

37