textav-event-2017
  • Introduction
  • Intro
    • Introduction
    • TODOS
  • Projects
    • hyperaudio
    • oTranscribe
    • WebAv
    • Opened Captions service
    • Opened Captions annotated articles
      • presentation at SRCCON
    • FrameTrail
    • Captions and TV Archives
    • Extending audiogram with automated transcriptions
    • Palestinian Remix
    • BBC Dialogger
    • autoEdit
  • Remote Presentations
    • Aeneas
    • Mercury
    • Captioning Workflow
      • Needs For Captioning Tool
    • Transcription Service at the FT
    • BBC Video Context
  • Problem Domains
    • Problem domain and component based design
    • Interactive Transcription
    • 🔪✅⬇️ (Annotations models)
    • Object-based Broadcasting
    • Tv Archive AI pipeline
    • The Problem with archives
    • From Spoken Word To Sheet Music
  • Services
    • PopUp Archive & Audiosear.ch
    • YouTube for Publishers (Europe) at the Guardian
    • Microsoft STT & Cognitive Services
  • Unconference Projects
    • TransProvenance
      • Architecture
      • Futures of the project
    • Transcript correction
      • webaligner
    • AI Pipeline
      • I learned what Tesseract can do (and so can you!)
    • Captioning Workflow System
    • removeTextTrack API
Powered by GitBook
On this page
  • I learned what Tesseract can do (and so can you!)
  • As google doc:
  1. Unconference Projects
  2. AI Pipeline

I learned what Tesseract can do (and so can you!)

PreviousAI PipelineNextCaptioning Workflow System

Last updated 6 years ago

I learned what Tesseract can do (and so can you!)

Made at the Text AV conference at NYU

July 19-21

Corbin Ordel

Using a couple programs that are downloadable and easy to use (they have desktop icons and everything), you can create your own OCR (text recognition) program!

Step 1:

Step 2:

Step 3***:

(***This step is optional, but if you want to see your collected data in your terminal, do this! Also why not do it anyway! it doesn’t take long.)

~And I quote, “The objective of osc-web is to make possible to send and receive OSC messages on the Web browser.”~ This is very useful for all sorts of cool things.

Step(‘s) 4 (Set up of all components): NOTE: If you want to just watch a video walkthrough, click here --->

  • Open VDXM5 and make a new project (file --> new)

  • Drag your video into your media bin

  • Go to “Plugin” and select “Syphon Output” and select “Main OutPut”

  • Click “Layers” and find the “Crop” button - use the cropping feature to find your desired text, in the case of the attached demo, it is the ticker at the bottom of the hyper relevant cnn video.

  • After your video is cropped, Open OCR-OSC

  • Go back to VDXM5 and go to file → preferences → OSC

  • Click out “Output Ports” and make sure you have something that looks like this:

  • Go back to OCR-OSC and click the drop down menu - you should be able to select “VDMX5 - Main Output”

  • It should be streaming the video from VDMX5 when you select the correct source

  • Now what if we want information in our terminal?

  • Go to terminal and change your directory to osc-web (command is most likely [ cd osc-web ]

  • But wait, no information? Where is it?

  • Go to you osc-web folder and find the web-side folder and click on app.html

  • But wait? Still no information? Go to preferences in OCR-OSC and change your port directed at 3333

  • You should now be set up and watching your very own OCR (text recognition) program!

Let me know if you need more info!!!

Email me @ corbinordel@gmail.com

As google doc:

Programs will use: , ,

Download and Install this program called , (Or any program that can send video through a syphon as communication)

Download and Install this program called . These programs were created by a very cool person named . He has made some great projects including this !

Go to and follow the directions for install and usage: for your enjoyment and terminal ease - here is a picture of what to do (if you have git and npm package manager installed on your machine)!

Thanks to all the awesome software that makes and the awesome people who came to !

VDXM5
OCR-OSC
OSC-WEB
VDXM5
OCR
OCR-OSC
David Lublin
one
https://github.com/automata/osc-web
@DavidLublin
TextAV