> For the complete documentation index, see [llms.txt](https://textav.gitbook.io/textav-event/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://textav.gitbook.io/textav-event/projects/captions-and-tv-archives.md).

# Captions and TV Archives

{% embed url="<https://www.youtube.com/embed/H3hrwyE9sSc>" %}

## Notes:

* Archive.org/TV&#x20;
* 2 million news shows online, searchable captions
* The "Third Eye" -- reading and analyzing the "lower third" of the screen -- What are they reporting, what and how are they summarizing?
* Uses tesseract-ocr and simhash to pull lines from multiple news channels
* [www.twitter.com/tvThirdEye](http://www.twitter.com/tvThirdEye) -- watches CNN, tweets headlines.
* CLIPS -- little JSON annotations to set start/end - points. Using JSONPatch.
* Popcorn.js at IA: <http://archive.org/pop/>&#x20;
  * Popcorn
    * Popcorn
      * Popcorn
        * 🍿