services digitisation & content management. 600 people – india
Post on 14-Dec-2015
216 Views
Preview:
TRANSCRIPT
& Content Management
600 People – India
Services
Digitisation Services
Bibliographic services
Content Management Services
Digitisation Services
Full text capture of historic manuscripts
16th Century Church Records
17 & 18th century Census Records
18th and 19th Century Life Event Records
Bibliographic Services
On site catalogue imaging Services
Retro Conversion of Catalogues
• MARC21• MODS
Finding Aids
• EAD
Digitisation of Large volume of Print content
The UK parliamentary debates – The Hansard
16th - 18th Century American Texts
Legal texts and publications
Historic Newspapers
AEL offers end-to-end solution fornewspaper digitisation.
Archival Newspaper Digitisation
A complete Solution for the historic newspaper Digitisation:
On-site / Off-site Scanning of Paper or microfilm
OCR and clean up
Article level Zoning
Quality Assurance
Hosting & Search solutions
Micrographic services
Micrographic lab that can scan and print 16mm or 35 mm microfilms, Microfiche or aperture cards to 600 DPI Tiff images.
Capable of scanning up to 40,000 frames / day
Reprographic services
Scanning for Newspapers & large format documents
Overhead non contact scanning with minimal damage to original pages
Capable of scanning up to 10,000 broadsheet pages /day
Colour scanning with 10,200 pixels
Image Enhancement:Cropping, de-skew, de-speckle, Lighting corrections, histogram adjustment, Filter, Geometrical corrections.
Scanning From Microfilm…
Advantages
Lower cost for scanning
Sometimes only microfilm is available
Scanning From Microfilm…
Disadvantages
Poor microfilmingProcess & material technology of 50s & 60s
Poor Filming Methods
Scanning from Paper…
Advantages
Excellent image &Text quality
Scanning from Paper…
Badly Stored/ damaged original paper copies.
Higher cost of scanning
Disadvantages
Content Segmentation – Page & article analyses
AEL uses third party software tools as well as own tools for article segmentation
Automatic zoning & article segmentation software is not perfect!
Manual correction of the segmentation is required for 20-40% of the articles.
OCR Problems
Most archival Newspapers yield low OCR accuracies.
Poor OCR Useless for OCR
AEL offers manual OCR clean-up options.• Headline re-key
• Proofread / re-key first few lines• Full page proofread
Customized search solutions for the digitised archive
Madras
Article level Metadata
METS ALTO MODS MIX
NewsML
Other metadata schemas
Newspaper Digitisation Process
QualityAssurance
Content Export
Images from Paper & Microfilm
Scanning
Conversion flowWEB Search
Content database
Content
Content formatting
Zoning & article segmentation
DatabaseServer
OCR Text Images
XML metadataJpeg 2000 Images
OCR / Cleanup
ContentInput
Web Hosting
Thank you
top related