text detection strategies

15
Welcome to our first Computer Vision Meetup Sponsored by

Upload: anyline

Post on 07-Jan-2017

1.102 views

Category:

Technology


0 download

TRANSCRIPT

Page 1: Text Detection Strategies

Welcome to our first

Computer Vision Meetup

Sponsored by

Page 2: Text Detection Strategies

Daniel Albertini Technical Director & Co-Founder

[email protected]

Anyline - a product of 9yards GmbH

Zirkusgasse 13/2b

1020 Wien

Page 3: Text Detection Strategies

Agenda

- Overview Talk about different text detection strategies. - Feedback about possible future Meetup topics. - Get-together, discuss and beer.

Page 4: Text Detection Strategies

Text Detection Strategies Overview

Page 5: Text Detection Strategies

SWT (Stroke Width Transformation)

Computes per pixel the most likely stroke width containing the pixel. Steps: - Compute Edge Map of image. - Compute X & Y Gradient Map. - Calculate Ray from every edge pixel with

the direction from the gradient maps. - Set the value of the pixels of the ray to

the min of current value and ray length. - Group neighbor pixels with similar

stroke width together to find letter candidates.

Page 6: Text Detection Strategies

SWT (Stroke Width Transformation)

Page 7: Text Detection Strategies

SWT Rejecting connected components strategies: - Variance of the stroke width. - Aspect ratio.

- Too large & too small components - Components which are clearly not part of a

word / text line

Page 8: Text Detection Strategies

SWT (Stroke Width Transformation)

Page 9: Text Detection Strategies

SWT (Stroke Width Transformation)

Advantages: - Is able to accurately detect

text in different sizes, styles, colors.

- Can detect text independent of perspective and rotation.

- First step of SWT is a good all-rounder thresholding method for images with text.

Disadvantages: - Relatively slow performance

(edge & gradient maps). - Needs information if text or

background is darker (in the grayscale image).

Page 10: Text Detection Strategies

MSER (Maximally Stable Extremal Regions)

Blob detection method suitable for detecting character features. This method detects regions which are considered stable over a large range of threshold values.

Page 11: Text Detection Strategies

MSER

Threshold value: 10 45 75

105 135 165

Page 12: Text Detection Strategies

MSER (Maximally Stable Extremal Regions)

Page 13: Text Detection Strategies

MSER (Maximally Stable Extremal Regions)

Advantages: - Is able to accurately detect

text in different sizes, styles, colors.

- Can detect text independent of perspective and rotation.

- Good performance.

Disadvantages: - Sensible against blur. - No binary image as an output

(thresholding for OCR still needed).

Page 14: Text Detection Strategies

ER Variation for text detection

Sequential classifier trained for character detection instead of maximum region Advantages: - Only Character regions will be found. No need for analyzing and rejecting

components.

Disadvantages: - Needs training for different font or character types - Slower performance

Page 15: Text Detection Strategies

The End

Sources

SWT: http://research.microsoft.com/pubs/149305/1509.pdf MSER: http://www.icg.tugraz.at/pub/pubobjects/docvpr2006