IIIF Interactive Transcript - Parliamentary Debates

One of the demos on Day 1 of BBCTextAV was the BBC's FastForward, which synchronises scrubbing of video with the text transcript (navigating through either video or text changes your position in the other).

This would be useful functionality to have available for IIIFarrow-up-right AV content, where the text transcript comprises textual annotations on the time dimension of the canvas. So we decided to build a proof of concept viewer.

We had access to metadata for ~3500 German parliamentary debates. We transformed this metadata to IIIF Manifests, ending up with one Manifestarrow-up-right per debate, each with one Canvasarrow-up-right representing the duration of the debate, annotated with a video file to provide the AV content. Other sources of textual data were available, but for this MVP we used the WEBVTT files as source for the transcripts.

Sample Manifest:

https://tomcrane.github.io/bbctextav/iiif/ID191002001.jsonarrow-up-right

This links to an annotation list for the text content:

https://tomcrane.github.io/bbctextav/iiif/ID191002001-transcript.jsonarrow-up-right

The parliamentary source data is hacked into IIIF here: https://github.com/tomcrane/bbctextav/blob/master/converter/make_iiif.pyarrow-up-right

We produce a IIIF Collection of the manifests, so the viewer software knows what's available: https://tomcrane.github.io/bbctextav/iiif/collection.jsonarrow-up-right

The manifests are then shown using this viewer:

https://openhypervideo.github.io/iiif-interactive-transcript/arrow-up-right

https://openhypervideo.github.io/iiif-interactive-transcript/

Team

  • Joscha Jaeger

  • Tom Crane

Last updated