Speech And Music Segmenter and Annotator
The targeted users and customers are the multimedia industry actors, and all academic or industrial laboratories interested in audio document processing.
As shown on Figure below, the SAMuSA module takes an audio file or stream as an input, and returns a text file containing detected segments of: speech, music and silence.
To perform segmentation, SAMuSA uses audio class models as external resources. It also calls external tools for audio feature extraction (Spro software ), and for audio segmentation and classification (Audioseg software ). These tools are included in the SAMuSA package.
Trained on hours of various TV and radio programs, this module provides efficient results: 95% of speech and 90% of music are correctly detected.
One hour of audio can be computed in approximately one minute on standard computers.
SAMuSA was developed in Irisa/INRIA Rennes by the Metiss team.
SAMuSA is a software that has been developed at Irisa in Rennes and is the property of CNRS and Inria.
SAMuSA is currently available as a prototype only. It can be released and supplied under license on a case-by-case basis.