AudioAlign is a research tool to investigate automatic synchronization of audio and video recordings that have either been recorded in parallel at the same event or contain the same aural information. It is designed as a GUI for the Aurio library.
To see what AudioAlign can do, check the demo videos linked below.
AudioAlign has been developed for a research project with the goal to automatically synchronize audio and video recordings, recorded at the same time at the same event, e.g. a speech or a music concert. The idea was to synchronize all those videos taken from the crowd and combine them to
- generate multicamera cuts by switching between perspectives (video),
- create videos with full or at least better event coverage,
- replace bad quality audio or video tracks with better ones, or
- detect interesting moments (where many recordings have been captured at the same time).
AudioAlign can be used for a lot more uses cases though, including
- video mashups (video)
- comparison of live performances (video)
- synchronization of different cover interpretations (video)
- voice dubbing (video)
- ground truth creation (website)
- evaluation of fingerprinting algorithms
- ...
See CHANGELOG.
- Visual Studio 2022
- .NET SDK 6.0
- Audio and video files can be drag & dropped into the timeline
- Multiple files will be added as multiple tracks
- Hold
SHIFT
to add the files as a single concatenated track
- Navigating the multitrack view
- Click anywhere or drag the caret in the time scale to set the current (playback) position
- Press
SPACE
to start/pause playback - Vertically drag the bottom of a track to resize its height
- Scroll the mouse wheel to scale the time resolution (zoom into/out of the timeline) at the current position
- Hold
CTRL
to smoothly scroll the timeline - Hold
CTRL + SHIFT
to page through the timeline
- Hold
- Select a track by clicking its title bar
- Hold
CTRL
to select multiple tracks - Hold
SHIFT
to select a range of tracks
- Hold
- Move tracks on the timeline by dragging them horizontally
Mario Guggenberger, Mathias Lux, and Laszlo Böszörmenyi. 2012. AudioAlign – Synchronization of A/V-Streams Based on Audio Data. 2012 IEEE International Symposium on Multimedia. Irvine, CA, USA, 2012, pp. 382-383. DOI=http://dx.doi.org/10.1109/ISM.2012.79
Mario Guggenberger. 2015. Aurio: Audio Processing, Analysis and Retrieval. In Proceedings of the 23rd ACM international conference on Multimedia (MM '15). ACM, New York, NY, USA, 705-708. DOI=http://dx.doi.org/10.1145/2733373.2807408
For questions and issues, please open an issue on the issue tracker. Commercial support, development and consultation is available through Protyposis Multimedia Solutions.
Copyright (C) 2010-2023 Mario Guggenberger [email protected].
This project is released under the terms of the GNU Affero General Public License. See LICENSE
for details.