[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

OCR



From Slashdot:
"An anonymous reader writes "Google recently released [0]Tesseract as open source. Originally developed at the HP Labs from 1985-1995, it has been touted as one of the most accurate Optical Character Recognition (OCR) programs available. Having sat on the shelf gathering dust for so many years, Google cleaned up some of the more outdated portions of the code and released it for general consumption. You can [1]download Tesseract over at Sourceforge."

JE: The business connection is clear. Improved OCR means more indexable text online, which plays to Google's core business. Smart move. It's interesting to see how some successful companies strive to build environments and not just products (See James Moore, "The Death of Competition").

Joe Esposito