Can You Read Me Now? Content Aware Rectification using Angle Supervision

AI-generated keywords: CREASE OCR Rectification Angle Supervision Content

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • The ubiquity of smartphone cameras has revolutionized document capture
  • Photographed documents often have folds and crumples, causing local variance in text structure
  • OCR systems rely on rectifying geometric distortions for accurate recognition
  • Previous approaches to rectify document images focus on global features, overlooking content signals
  • CREASE is a learned approach for document rectification that leverages the document's content as hints
  • CREASE employs pixel-wise angle regression and curvature estimation to optimize the rectification model
  • CREASE outperforms previous approaches in OCR accuracy, geometric error, and visual similarity
  • This advancement improves OCR accuracy and usability of smartphone-captured documents.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Amir Markovitz, Inbal Lavi, Or Perel, Shai Mazor, Roee Litman

Presented in ECCV 2020

Abstract: The ubiquity of smartphone cameras has led to more and more documents being captured by cameras rather than scanned. Unlike flatbed scanners, photographed documents are often folded and crumpled, resulting in large local variance in text structure. The problem of document rectification is fundamental to the Optical Character Recognition (OCR) process on documents, and its ability to overcome geometric distortions significantly affects recognition accuracy. Despite the great progress in recent OCR systems, most still rely on a pre-process that ensures the text lines are straight and axis aligned. Recent works have tackled the problem of rectifying document images taken in-the-wild using various supervision signals and alignment means. However, they focused on global features that can be extracted from the document's boundaries, ignoring various signals that could be obtained from the document's content. We present CREASE: Content Aware Rectification using Angle Supervision, the first learned method for document rectification that relies on the document's content, the location of the words and specifically their orientation, as hints to assist in the rectification process. We utilize a novel pixel-wise angle regression approach and a curvature estimation side-task for optimizing our rectification model. Our method surpasses previous approaches in terms of OCR accuracy, geometric error and visual similarity.

Submitted to arXiv on 05 Aug. 2020

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2008.02231v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

The ubiquity of smartphone cameras has revolutionized the way documents are captured, with more and more people opting to photograph documents rather than scan them. However, unlike flatbed scanners, photographed documents often exhibit folds and crumples which lead to significant local variance in text structure. This poses a challenge for Optical Character Recognition (OCR) systems as the accuracy of recognition is heavily influenced by the ability to rectify geometric distortions in the document. While OCR systems have made great strides in recent years, most still rely on a pre-processing step that ensures straight and axis-aligned text lines. Previous works have attempted to rectify document images taken in real-world conditions using various supervision signals and alignment techniques; however these approaches primarily focus on global features extracted from the document's boundaries overlooking valuable signals that could be derived from the document's content. To address this limitation, we introduce CREASE: Content Aware Rectification using Angle Supervision. Our method is the first learned approach for document rectification that leverages the content of the document itself including word location and orientation as hints to assist in the rectification process. We employ a novel pixel-wise angle regression approach and incorporate a curvature estimation side-task to optimize our rectification model. Our method outperforms previous approaches in terms of OCR accuracy, geometric error and visual similarity. By considering both global features from the document's boundaries and local signals obtained from its content CREASE achieves superior performance in rectifying documents captured under challenging conditions. This advancement has significant implications for improving OCR accuracy and enhancing usability of smartphone-captured documents. The authors of this study include Amir Markovitz, Inbal Lavi, Or Perel, Shai Mazor and Roee Litman; it was presented at ECCV 2020 conference.
Created on 02 Aug. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.