video OCR

Episode X - A New Search (and a Forceful 2018!)

CLICK to watch how VideoSpace aid the Rebels in their search for the New Death Star...

In search of the new Death Star plans, the Rebels secretly planted thousands of cameras within the Empire in hope of finding clues. However, with thousands of videos and impending deadline, how can the rebels possibly search for information within videos?! Fortunately...

IN A GALAXY, NOT FAR AWAY... 

VideoSpace Search Engine is helping the Rebels searches inside videos for speech, text, objects, motion, faces, emotions (note: not for stormtroopers since they wear helmets) automatically! Thus, finding vital clues for the New Death Star plans...

In the meantime, as we help the Rebels fight the Evil Empire, we wish you a...

FORCEFUL 2018! 

From, 

All of us at Babbobox

MAY THE SEARCH BE WITH YOU!

What is a Video Search Engine? Part II – Searching Text

In Part I, we found out that there are 7099 living languages in the world. That includes both written and spoken only languages. According to Ethnologue (20th edition) out of that 7,099 living languages, 3,866 have a developed writing system.

Which leads us to this second part of our series – Searching Text inside a video. Besides Speech, Text is probably the second most important element where we can extract data from.

For example, in a presentation or talk given by a speaker. Besides speech, the speaker would augment the session with a set of slides. Therefore, besides his voice, text (in the slides) is another set of data that can be captured. This is important because what he says and what he present in the slides can be vastly different.

Text that can be OCRed during a presentation

Text that can be OCRed during a presentation

The technology to capture these text inside the video is called Video OCR (Optical Character Recognition). Video OCR is derived from OCR, a technology that has been around a long time.

By strict definition, Optical Character Recognition (OCR) is the mechanical or electronic conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo or from subtitle text superimposed on an image (source: Wikipedia). The first OCR machine that read characters and converted them into standard telegraph code was invented by Emanuel Goldberg in 1914!

Unfortunately, one hundred years on, OCR technology still has some ways to go, especially in the field of adding more language capabilities and recognizing handwriting. However, with more A.I. and Machine Learning, the hope is that researchers can add more capabilities to what OCR can do now.

However, Video OCR is giving OCR a new lease of life by simply adding another dimension – moving images. Given the amount of videos that has never been OCRed before and the amount of videos being generated every day, the potential for Video OCR is immerse.

To find out more about how you can search TEXT inside your videos, visit VideoSpace Video Search Engine or our Video-Search-as-a-Service.

VideoSpace to experiment with Video OCR (optical character recognition)

videospace

In the hunt for added capabilities for indexing and search, VideoSpace is happy to announce that we are currently looking into adding Video OCR (optical character recognition) technology as part of our enterprise offering. 

Video OCR, is a lot more advanced than mere OCR in documents and even images, because it has to go through each individual frame of video. 

For VideoSpace, this means that we are looking into the possibilities to incorporate 2 state-of-the-art technologies and techniques to enable search - Video Indexing and Video OCR. 

As for today, the Video OCR engine that we are experimenting with can recognize and support the following languages:

  • Arabic
  • Chinese Simplified
  • Chinese Traditional
  • Czech
  • Danish
  • Dutch
  • English
  • Finnish
  • French
  • German
  • Greek
  • Hungarian
  • Italian
  • Japanese
  • Korean
  • Norwegian
  • Polish
  • Portuguese
  • Romanian
  • Russian
  • Serbian Cyrillic
  • Serbian Latin
  • Slovak
  • Spanish
  • Swedish
  • Turkish

Go to VideoSpace for more exciting updates!