Peter O'Kelly's Reality Check: Rosetta: Understanding text in images and videos with machine learning -- Facebook Code

Wednesday, September 12, 2018

Rosetta: Understanding text in images and videos with machine learning -- Facebook Code

Check the full post for details

"A significant number of the photos shared on Facebook and Instagram contain text in various forms. It might be overlaid on an image in a meme, or inlaid in a photo of a storefront, street sign, or restaurant menu. Taking into account the sheer volume of photos shared each day on Facebook and Instagram, the number of languages supported on our global platform, and the variations of the text, the problem of understanding text in images is quite different from those solved by traditional optical character recognition (OCR) systems, which recognize the characters but don’t understand the context of the associated image.

To address our specific needs, we built and deployed a large-scale machine learning system named Rosetta. It extracts text from more than a billion public Facebook and Instagram images and video frames (in a wide variety of languages), daily and in real time, and inputs it into a text recognition model that has been trained on classifiers to understand the context of the text and the image together."

Rosetta: Understanding text in images and videos with machine learning -- Facebook Code

Peter O'Kelly's Reality Check

Wednesday, September 12, 2018

Rosetta: Understanding text in images and videos with machine learning -- Facebook Code

No comments:

Blog Archive