Deskew is a tool for deskewing scanned text documents. Deskewing means straightening a scan so that the lines of text run horizontally: the image of the scanned page is rotated slightly to compensate for the paper having been misaligned in the scanner.


Deskew uses the Hough transform to detect 'text lines' in the image and their orientation. As output, you get an image rotated so that the detected lines are horizontal. If you are curious, you can find more theory in the introductory post Deskewing Scanned Documents.
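To make the idea concrete, here is a minimal sketch of Hough-style skew estimation in Python with NumPy. It is not Deskew's actual implementation (which is written in Object Pascal); the function name `estimate_skew_angle`, its parameters, and the sum-of-squared-votes scoring are assumptions made for illustration. For each candidate angle, every ink pixel votes for the line it would lie on at that angle, and the angle whose votes are concentrated on the fewest lines wins.

```python
import numpy as np

def estimate_skew_angle(binary, max_angle=10.0, step=0.1):
    """Estimate the skew (in degrees) of a binary image where True = ink.

    Hypothetical helper for illustration. A positive result means the text
    lines slope downwards to the right (image y axis pointing down).
    """
    ys, xs = np.nonzero(binary)
    if ys.size == 0:
        return 0.0  # nothing to measure on a blank page

    best_angle, best_score = 0.0, -1.0
    for angle in np.arange(-max_angle, max_angle + step / 2, step):
        theta = np.deg2rad(angle)
        # Pixels on one line tilted by `angle` share the same offset rho.
        rho = ys * np.cos(theta) - xs * np.sin(theta)
        bins = np.round(rho).astype(int)
        bins -= bins.min()
        votes = np.bincount(bins)  # one accumulator column of the Hough space
        # Squaring rewards angles where votes pile up in a few bins.
        score = float(np.sum(votes.astype(np.float64) ** 2))
        if score > best_score:
            best_angle, best_score = float(angle), score
    return best_angle
```

Rotating the page by the negative of the returned angle should then bring the detected lines back to horizontal. Restricting the search to a small range around zero (here an assumed ±10 degrees) keeps the cost low and avoids locking onto vertical strokes.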

Deskew is a command line tool for Windows, Linux, and macOS. A simple GUI frontend also exists. Based on user feedback and my own usage, I'd say it works quite well 🙂 On occasion, I think about things like a Deskew web app, GPU deskewing, a Deskew app for Android, and even a Deskew Photoshop plugin!


All Posts About Deskew


  • Deskew CLI Tool v1.30 Released (posted on June 19, 2019)
  • GUI Frontend for Deskew (posted on December 4, 2018)
  • Deskew Tool v1.25 Released (posted on May 23, 2018)
  • Deskew Tool v1.20 Released (posted on November 29, 2016)
  • Deskew Tool Version 1.10 (posted on March 7, 2014)
  • Deskew Tool Updated (posted on June 6, 2012)
  • Deskewing Scanned Documents (posted on January 5, 2011)

Downloads

Command line tool and GUI frontend downloads:

Deskew v1.30
» 4.3 MiB - 13,084 hits - June 19, 2019
Command line tool for deskewing scanned documents. Binaries for several platforms, test images, and Object Pascal source code included.

DeskewGui v0.90
» 4.1 MiB - 2,372 hits - March 18, 2019
GUI frontend for the Deskew command line tool. Prebuilt binaries for Windows, macOS, and Linux. The Windows and Linux versions also need the Deskew command line tool binaries.

Bug Reports & Feature Proposals

Please report bugs and propose enhancements and new features in Deskew's issue tracker:
https://github.com/galfar/deskew/issues


Source Code

Deskew's sources are hosted here:
https://github.com/galfar/deskew

Donation

If you want to donate to Deskew Tools, you can click the Sponsor button on Deskew's GitHub page.

My book scanner (see here) creates images like these:
Is there any way to crop the pages out automatically and possibly even deskew them, so that OCR (Tesseract) works better?


My current method, which is simple and crude but more or less works, is to compute the average RGB value of the first 20 or so lines of the image (assuming there is a border around the book itself). I then go down line by line, taking each line's average, and if a line's average is at least about 5% higher than that border average (assuming the paper is mostly white), everything above it gets cut.
I then do the same from left to right, right to left, and bottom to top.
The results are mostly OK (especially with images from a better camera than the one used for the example here), but they're not perfect.
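For reference, the approach described above can be sketched roughly like this in Python with NumPy. The function names `find_edge` and `crop_to_page` and the defaults of 20 border lines and a 5% increase are placeholders taken from the description, not a tested recipe; the sketch also assumes the outermost 20 lines on every side really are border.

```python
import numpy as np

def find_edge(profile, border_lines=20, rel_increase=0.05):
    """Index of the first entry that is brighter than the border reference.

    `profile` is a 1-D array of per-line average brightness. Returns 0 if
    no line exceeds the threshold, i.e. nothing is cut on that side.
    """
    reference = profile[:border_lines].mean()
    threshold = reference * (1.0 + rel_increase)
    brighter = np.nonzero(profile[border_lines:] > threshold)[0]
    if brighter.size == 0:
        return 0
    return border_lines + int(brighter[0])

def crop_to_page(image, border_lines=20, rel_increase=0.05):
    """Cut the dark border off an image given as an H x W or H x W x 3 array."""
    gray = image.mean(axis=2) if image.ndim == 3 else image.astype(float)
    rows = gray.mean(axis=1)  # average brightness of each row
    cols = gray.mean(axis=0)  # average brightness of each column

    top = find_edge(rows, border_lines, rel_increase)
    bottom = len(rows) - find_edge(rows[::-1], border_lines, rel_increase)
    left = find_edge(cols, border_lines, rel_increase)
    right = len(cols) - find_edge(cols[::-1], border_lines, rel_increase)
    return image[top:bottom, left:right]
```

Because each row or column average also includes border pixels from the other sides, a skewed or small page raises the average only gradually, which is probably one reason the cut is not always exact.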
How can I do this using only free software, without any manual work once the scanning process has started, while still getting good (or, preferably, very good) results?
Is there anything like that available?
It would be a great help to me!