Edit: timing values, caption text (with support for multiline text), speaker name (autofill & dropdown menu).
Add: new captions to the list (with timing and speaker initialised to dynamically computed default values).
Delete: existing captions from the list.
Handle: simultaneous captions (when speakers talk at the same time). A transcription service working from single-channel audio is unlikely to generate these, but they may occur with: a) manual edits and b) multi-channel audio.
Set: a maximum caption length (maybe a column width slider?). Captions that exceed this length will be automatically split.
Split: one caption into two
Merge: multiple captions into one (must enforce a reasonable max word limit)
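
A minimal sketch of how the split, merge, max-length and simultaneous-caption items above might fit together. Everything here is an assumption for illustration: the `Caption` fields, the function names, the 40-word default limit, and splitting the timing in proportion to character count (a rough stand-in when no word-level timestamps are available).

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Caption:
    # Hypothetical caption record; times are in seconds.
    start: float
    end: float
    text: str
    speaker: str = ""


def split_caption(cap: Caption, word_index: int) -> Tuple[Caption, Caption]:
    """Split one caption into two at a word boundary.

    Timing is divided in proportion to the character count of each half,
    which is only an approximation without word-level timestamps.
    """
    words = cap.text.split()
    if not 0 < word_index < len(words):
        raise ValueError("word_index must fall strictly inside the caption")
    left = " ".join(words[:word_index])
    right = " ".join(words[word_index:])
    ratio = len(left) / (len(left) + len(right))
    mid = cap.start + (cap.end - cap.start) * ratio
    return (Caption(cap.start, mid, left, cap.speaker),
            Caption(mid, cap.end, right, cap.speaker))


def merge_captions(caps: List[Caption], max_words: int = 40) -> Caption:
    """Merge captions into one, refusing if the word limit is exceeded.

    The merged caption keeps the speaker of the earliest caption (an assumption).
    """
    if not caps:
        raise ValueError("nothing to merge")
    ordered = sorted(caps, key=lambda c: c.start)
    text = " ".join(c.text.strip() for c in ordered)
    if len(text.split()) > max_words:
        raise ValueError(f"merged caption would exceed {max_words} words")
    return Caption(ordered[0].start, max(c.end for c in ordered),
                   text, ordered[0].speaker)


def enforce_max_length(cap: Caption, max_chars: int) -> List[Caption]:
    """Recursively halve a caption until every piece fits within max_chars.

    A single word longer than max_chars is left as-is rather than broken.
    """
    words = cap.text.split()
    if len(cap.text) <= max_chars or len(words) < 2:
        return [cap]
    left, right = split_caption(cap, len(words) // 2)
    return enforce_max_length(left, max_chars) + enforce_max_length(right, max_chars)


def find_overlaps(caps: List[Caption]) -> List[Tuple[Caption, Caption]]:
    """Return pairs of captions whose time ranges overlap (simultaneous speech)."""
    ordered = sorted(caps, key=lambda c: c.start)
    pairs = []
    for i, a in enumerate(ordered):
        for b in ordered[i + 1:]:
            if b.start >= a.end:
                break  # sorted by start, so no later caption can overlap a
            pairs.append((a, b))
    return pairs
```

Raising an error when a merge exceeds the word limit is one possible way to enforce the limit; an editor UI could instead disable the merge action or warn the user. A maximum-length control such as the suggested slider could call `enforce_max_length` on each caption whenever its value changes.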
Need to create the capability to: