textav-components
  • Introduction
  • Guidelines for this board
    • How to add a new library/Modules/components
    • Trello --> Github --> gitbook (programmatically)
    • Plan Roadmap taxonomy
    • github README template
  • Media processing
    • ffmpeg and electron - example boilerplate
    • convert video to audio [Open Source]
    • Generate list of ffmpeg supported file formats [Open Source]
    • Detect silence [Open Source]
    • Youtube Video downloader module (?) [Open Source]
    • Module: Open source STT //Gentle refactor [Open Source]
    • cc extraction // OCR of captions [Open Source]
    • Module: Video format converter [Open Source]
    • Module: Video metadata reader [Open Source]
    • Banpass filter module
    • Tesseract - OCR
  • Transcriptions - utils
    • Transcriber module
    • Sample material for testing STT services [Open Source / CC]
    • Create word accurate time codes from line accurate time-coded transcript (eg srt)
    • Language codes ISO-639-1 Code
    • Module: Timecode conversion [Open Source]
    • UI Utilities for timecode representation
    • Sanitise string for file path
  • Transcription STT Sdk
    • Web Speech API
    • Pocket Sphinx STT [Open Source]
    • IBM Watson STT [Proprietary]
    • Google Cloud Speech API [Proprietary]
    • Microsoft Bing STT [Proprietary]
    • Baidu STT SDK [Proprietary]
    • Speechmatics STT SDK [Proprietary]
    • Spoken Data STT SDKs [Proprietary]
    • Gentle (Server) STT node SDK [Open Source]
    • Temi.com/rev.com [Proprietary]
    • Latvian Kaldi [open source]
    • Mod9
    • Movi - arduino component, offline
    • deepgram
    • Mozilla deep speech
    • AWS Transcriber
  • Transcription UI
    • Transcription text editor with Draft.js Editor [Open Source]
    • Overtyper
  • Alignement
    • Alignement
    • Module: to align partially scripted speeches
  • Captions
    • Module: captions composer (with text pre-segmentation) [Open Source]
    • Module: Captions burner [Open Source]
    • Srt parser composer // Pietro [Open Source]
    • TTML Parser // Gary, Brightcove [Open Source]
  • Annotations
    • Annotation model atjson
  • Paper-editing & remixing UI
    • Front end component: video preview of JSON Edl
  • Cognitive insights
    • LIUM Speaker Diarization BBC - [Open Source]
    • Module: open source summarization module [Open Source]
    • Module: punctuation and capitalisation. [Open Source]
  • Translation SDK
    • Deep L - Translation SDK node
  • Export & remix & video editing
    • Parse EDL (plain text) to JSON [Open Source]
    • Module: Post to facebook [Open Source]
    • EDL composer from JSON EDL [Open Source]
    • Module: Post to Twitter Video [Open Source]
    • edit video EDL (JSON) - ffmpeg-remix (super fast video editing of mp4 videos) // Laurian [Open Sourc
    • EDL Json to XML FCP7 (compatible with premiere) [Open Source]
    • Popcorn Js // Mozilla/Internet Archive [Open Source]
  • unsorted
    • NWJS boilerplate
    • QCTool
    • VRecord
    • Electron travis CI automated build: OSX, Linux, Windows
Powered by GitBook
On this page
  1. Cognitive insights

Module: open source summarization module [Open Source]

PreviousLIUM Speaker Diarization BBC - [Open Source]NextModule: punctuation and capitalisation. [Open Source]

Last updated 6 years ago

These algo they generally tend to work by comparing / ranking sentences, so need punctuation. STT and live captioning generally does not return punctuation.

Labels

Node module, Nice to have,

Link to trello card: Module: open source summarization module [Open Source]