decomposing latex documents from geometrical and logical … · decomposing latex documents from...

1
Decomposing LaTeX Documents from Geometrical and Logical Layout Vincent Deo, Terry Kong, Maisy Wieman Department of Electrical Engineering, Stanford University Overview Methodology References Results LaTeX is a low-level markup and programming language that produces aesthetically pleasing PDF documents with diverse content. This project aims to analyze the layout of mobile images of LaTeX documents to simplify the generation of LaTeX source code. Input is captured from anAndroid phone, and the results are computed on a server. Preprocessing Find Maximal White Rectangles Classify Regions H.S. Baird, S.E. Jones and S.J. Fortune, "Image Segmentation by Shape-Directed Covers,"Proc. Int',l Conf. Pattern Recognition, pp. 820-825, 1990. Create and balance binary tree of white rectangles Scan Image for black CCs Adaptive Binarization Skew Correction Margin Removal Future Work 1. Use the results of layout decomposition to generate code for a LaTeX template for the given document 2. Add text recognition to generate the entire LaTeX document Determine CC Statistics Use Horizontal Projections 368 Input Largest 100 Rectangles Color Type Red Figures Yellow Text Green Page No. Blue No type Global and Local Feature Comparison

Upload: others

Post on 08-Oct-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Decomposing LaTeX Documents from Geometrical and Logical … · Decomposing LaTeX Documents from Geometrical and Logical Layout Vincent Deo, Terry Kong, Maisy Wieman Department of

Decomposing LaTeX Documents from Geometrical and Logical LayoutVincent Deo, Terry Kong, Maisy Wieman

Department of Electrical Engineering, Stanford University

Overview

Methodology

References

Results

LaTeX is a low-level markup and programming language that produces aesthetically pleasing PDF documents with

diverse content. This project aims to analyze the layout of mobile images of LaTeX documents to simplify the generation

of LaTeX source code. Input is captured from an Android phone, and the results are computed on a server.

Preprocessing

Find Maximal White

Rectangles

Classify RegionsH.S. Baird, S.E. Jones and S.J. Fortune, "Image Segmentation by

Shape-Directed Covers,"Proc. Int',l Conf. Pattern Recognition, pp.

820-825, 1990.

Create and balance

binary tree of white

rectangles

Scan Image

for black

CCs

Adaptive

Binarization

Skew

Correction

Margin

Removal

Future Work1. Use the results of layout decomposition to generate code for a

LaTeX template for the given document

2. Add text recognition to generate the entire LaTeX document

Determine CC

StatisticsUse Horizontal

Projections

368

Input

Largest 100

Rectangles

Color Type

Red Figures

Yellow Text

Green Page No.

Blue No type

Global and

Local Feature

Comparison